NafoA.19254.a

histidine ammonia-lyase

CENTER ID: NafoA.19254.a
ORGANISM: Naegleria fowleri ATCC 30863
ASSOCIATED DISEASE:
CURRENT STATUS: purified
COMMUNITY REQUEST: True
NIH RISK GROUP: 2
SELECT AGENT: False
NIH PRIORITY PATHOGENS CATEGORY: IIB

Ordering Clones & Proteins

If materials are available for this target, they are listed below. Materials can be ordered from SSGCID using the button in the "order material" column; clicking it adds the material to a virtual cart. You may order multiple materials at a time at no cost to you, as this contract is funded by NIAID. When you are ready to place your order, click the "Place Order" link, which appears in the top right corner of the page once the first item is in your cart.

Clones*

CENTER REFERENCE ID       DOMAIN/REGION DESCRIPTION     INFO  AA START  AA STOP  ORDER MATERIAL
NafoA.19254.a.B1.GE40638  Full length (NafoA.19254.a)         1         574
* SSGCID clones represent un-induced expression constructs which have been verified by sequencing from vector primers. Clones may contain a tag, such as N-term 6xHis. Get sequence information using the button in the "info" column.

Proteins

CENTER REFERENCE ID       DOMAIN/REGION DESCRIPTION     INFO  AA START  AA STOP  ORDER MATERIAL
NafoA.19254.a.B1.PW38417  Full length (NafoA.19254.a)         1         574

External Resources

RESOURCE            REFERENCE ID
EuPathDB: AmoebaDB  NF0108100

Sequences

These are the native gene sequences; the sequences of constructs derived from them may differ due to codon optimization or other protocols. To find the specific sequence of any material you have ordered, click the "info" button next to the name of that material.
AA Sequence
MQDNLCESIS SSMILSDHYL YLDGESLTCE DLYRIGYEPH LKIALTQQAK DRIIEARKIV DNIIHTNQVK YGINTGFGHF ATTVIAKEDI EDLQKNLIRS HAAGVGEYLP LERAKMLLAL RINVLAKGFS GIRLETVERL VKAFNAGCIS AVPCKGSVGA SGDLAPLAHM ALGMMGEGMM YNPSTKTYED AGKVLKEHGL EPIEVHAKEG LALINGTQFI CALGTDALVR SINLVRMADV VGAMTLEALR GSFKAFDARI HTARRHGGQQ KVAGRMRHLL FAEKVENEKR VFEISEISAS HWNCSRVQDA YTLRCIPQVH GVVNDTVEFC RSVMENELNA ATDNPMVFID PSGSKVPNSR HIHFEDHEDF HSHLHHYTTA DENDEQIVSG GNFHGEYPAK IMDYLGIAIT ELANISERRV ARLIDGNLSG LPAFLVKEGG LNSGFMIAHC TASALTSENK VLAHPSSNDT LSTSAAKEDH VSMGPFASMK CLDIVKNVEY VLAIELMCAC QGIDLLRPLK TTPILEKVYE LVRSEVPPYE KDRFLSPDIE KICHLIRTGK VWQVIKGDIP PELH
NT Sequence
ATGCAAGATA ATTTGTGCGA ATCTATTTCA TCCTCCATGA TTCTCTCCGA TCATTACCTC TATCTCGACG GTGAGAGTCT CACCTGTGAG GATTTATACA GAATCGGCTA TGAACCTCAT TTGAAAATTG CTCTCACACA ACAAGCCAAA GACCGTATCA TAGAGGCTCG AAAAATTGTG GATAACATCA TCCACACGAA TCAAGTCAAG TACGGAATCA ATACAGGATT TGGTCATTTT GCTACCACTG TGATTGCAAA GGAAGATATT GAAGATTTAC AAAAGAATTT AATTCGCTCG CATGCAGCTG GTGTTGGAGA GTATTTGCCT TTGGAGAGAG CGAAAATGTT GTTGGCGTTG CGGATCAATG TGCTGGCAAA AGGCTTCTCG GGAATTCGAT TGGAAACGGT TGAAAGATTG GTGAAGGCTT TTAATGCTGG ATGCATTTCA GCAGTTCCTT GCAAGGGAAG TGTCGGAGCA AGTGGAGATT TGGCACCTCT CGCCCACATG GCACTCGGAA TGATGGGTGA AGGAATGATG TACAATCCTT CCACCAAGAC CTATGAAGAT GCAGGCAAAG TTCTCAAAGA ACACGGATTG GAACCTATTG AAGTGCATGC CAAGGAAGGA TTGGCACTCA TTAATGGTAC ACAATTTATT TGTGCATTGG GAACAGATGC CTTGGTTCGT TCCATTAATT TAGTACGAAT GGCTGATGTT GTGGGAGCAA TGACTTTGGA GGCACTTAGA GGTTCGTTTA AAGCCTTTGA TGCGAGAATT CACACAGCAA GACGTCACGG TGGACAACAA AAAGTTGCTG GAAGAATGAG ACATTTGCTC TTCGCTGAAA AGGTGGAAAA TGAGAAGAGA GTTTTTGAAA TTTCAGAGAT TTCAGCCAGT CATTGGAATT GTTCTCGAGT GCAAGATGCC TATACTTTGA GATGTATTCC ACAAGTGCAT GGTGTTGTCA ATGACACTGT TGAATTTTGC AGAAGTGTTA TGGAAAATGA GTTGAATGCA GCCACAGATA ATCCAATGGT ATTTATTGAT CCAAGTGGAA GTAAAGTTCC AAACTCACGA CACATCCATT TTGAAGACCA CGAAGATTTC CATTCGCATC TGCATCACTA CACAACAGCT GACGAAAATG ACGAACAAAT TGTGAGCGGA GGTAATTTCC ACGGCGAATA TCCAGCCAAG ATTATGGATT ATTTGGGAAT TGCAATTACT GAACTTGCAA ACATTAGTGA GAGACGAGTT GCTCGTTTGA TTGATGGTAA CTTGAGTGGT TTACCTGCAT TTTTGGTTAA AGAAGGTGGT CTAAACAGTG GATTCATGAT TGCTCATTGT ACAGCCAGTG CCTTGACAAG TGAAAACAAA GTTTTGGCTC ACCCAAGTTC CAATGACACT CTTTCCACAT CTGCTGCCAA AGAAGATCAC GTCTCTATGG GACCATTTGC AAGTATGAAG TGTTTAGACA TTGTTAAAAA TGTCGAATAT GTTTTGGCCA TTGAATTGAT GTGTGCTTGT CAAGGTATCG ATTTATTACG TCCACTCAAG ACCACACCAA TTCTCGAAAA GGTCTACGAA CTTGTGAGAT CAGAGGTTCC ACCCTATGAG AAGGACAGAT TTTTGTCGCC AGATATTGAG AAAATTTGTC ATTTAATTAG AACTGGTAAG GTTTGGCAAG TCATTAAGGG TGACATTCCT CCTGAACTGC AT
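The AA and NT listings above are split into 10-letter groups, so a consistency check has to strip the spaces before translating. The following is a minimal Python sketch, assuming the standard genetic code and reading frame 1; the helper names `clean` and `translate` are illustrative and not part of any SSGCID tooling. It translates the first 30 nucleotides of the native NT sequence and compares the result with the first 10 residues of the native AA sequence.

```python
from itertools import product

# Standard genetic code, enumerated in TCAG order (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace that splits a listing into 10-letter groups."""
    return "".join(seq.split()).upper()


def translate(nt: str) -> str:
    """Translate in frame 1; codons not in the table (e.g. with N) become X."""
    nt = clean(nt)
    return "".join(
        CODON_TABLE.get(nt[i:i + 3], "X") for i in range(0, len(nt) - 2, 3)
    )


# First three 10-base groups of the native NT sequence from this record.
nt_start = "ATGCAAGATA ATTTGTGCGA ATCTATTTCA"
# First 10 residues of the native AA sequence from this record.
aa_start = "MQDNLCESIS"

protein = translate(nt_start)
print(protein, protein == clean(aa_start))
```

The same two helpers can be applied to the full listings; only short fragments are used here to keep the example readable.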
Details for NafoA.19254.a.B1.GE40638
HARVESTED ON: 10/18/2016
SEQUENCED ON: 10/19/2016
EXPECTED MW: 64 kDa
OBSERVED MW: 64 kDa
ANTIBIOTIC MARKER: ampicillin
COUNT OF EXPRESSION COLONIES: Many (50-100)
TOTAL EXPRESSION LEVEL: Moderate Expression
SOLUBLE EXPRESSION LEVEL: Moderate Expression
EXPRESSION HOST: BL21(DE3) Rosetta
SEQUENCING RESULT:
PERCENT IDENTITY: 96
PERCENT COVERAGE: 49
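As a rough plausibility check on the expected MW listed above, a protein mass can be approximated from its length. This is a sketch only: the figure of about 110 Da per residue is a common rule of thumb, not a value taken from this record, and the page does not say how its own number was calculated. The lengths used are the 574-residue full-length target plus the 8-residue N-terminal extension (MAHHHHHH) seen in the construct sequences of this record.

```python
# Rule-of-thumb average residue mass; an assumption, not an SSGCID value.
AVG_RESIDUE_MASS_DA = 110.0


def estimate_mw_kda(n_residues: int) -> float:
    """Approximate molecular weight in kDa from the residue count."""
    return n_residues * AVG_RESIDUE_MASS_DA / 1000.0


native_len = 574              # AA STOP of the full-length target
tag_len = len("MAHHHHHH")     # N-terminal extension in the construct
estimate = estimate_mw_kda(native_len + tag_len)
print(f"{estimate:.1f} kDa")
```

An exact value would require summing per-residue masses for the actual sequence; the estimate is only meant to show that the listed figure is in the expected range for a protein of this length.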
Validated AA Sequence
MAHHHHHHMQ DNLCESISSS MILSDHYLYL DGESLTCEDL YRIGYEPHLK IALTQQAKDR IIEARKIVDN IIHTNQVKYG INTGFGHFAT TVIAKEDIED LQKNLIRSHA AGVGEYLPLE RAKMLLALRI NVLAKGFSGI RLETVERLVK AFNAGCISAV PCKGSVGASG DLAPLAHMAL GMMGEGMMYN PSTKTYEDAG KVLKEHGLEP IEVHAKEGLA LINGTQFICA LGTDALVRSI NLVRMADVVG AMTLXALRGS FKXFDARIHT ARRHGGQQKV AXXXXXXAXX
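The record reports a percent identity and a percent coverage for the validated sequence, but does not state how they are computed. The sketch below shows one plausible definition, assuming an ungapped position-by-position comparison starting at the first residue, with X (unresolved) counted as a mismatch. The function name is hypothetical, and the inputs are short fragments copied from this record (the last 40 validated residues and the corresponding 50 expected residues), so the printed numbers describe the fragments only, not the whole protein.

```python
def clean(seq: str) -> str:
    """Remove the whitespace that splits a listing into 10-letter groups."""
    return "".join(seq.split()).upper()


def identity_and_coverage(validated: str, expected: str) -> tuple[float, float]:
    """Ungapped comparison from position 1.

    identity = matching positions / compared positions
    coverage = compared positions / length of the expected sequence
    """
    validated, expected = clean(validated), clean(expected)
    compared = min(len(validated), len(expected))
    matches = sum(1 for v, e in zip(validated, expected) if v == e)
    return 100.0 * matches / compared, 100.0 * compared / len(expected)


# Fragments taken from the validated and expected sequences in this record.
validated_fragment = "AMTLXALRGS FKXFDARIHT ARRHGGQQKV AXXXXXXAXX"
expected_fragment = "AMTLEALRGS FKAFDARIHT ARRHGGQQKV AGRMRHLLFA EKVENEKRVF"

identity, coverage = identity_and_coverage(validated_fragment, expected_fragment)
print(f"identity {identity:.1f}%, coverage {coverage:.1f}%")
```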
Validated NT Sequence
ccttaatgac ttgccaaacc ttaccagttc taattaaatg acaaattttc tcaatatctg gcgacaaaaa tctgtccttc tcatagggtg gaacctctga tctcacaagt tcgtagacct tttcgagaat tggtgtggtc ttgagtggac gtaataaatc gataccttga caagcacaca tcaattcaat ggccaaaaca tattcgacat ttttaacaat gtctaaacac ttcatacttg caaatggtcc catagagacg tgatcttctt tggcagcaga tgtggaaaga gtgtcattgg aacttgggtg agccaaaact ttgttttcac ttgtcaaggc actggctgta caatgagcaa tcatgaatcc actgtttaga ccaccttctt taaccaaaaa tgcaggtaaa ccactcaagt taccatcaat caaacgagca actcgtctct cactaatgtt tgcaagttca gtaattgcaa ttcccaaata atccataatc ttggctggat attcgccgtg gaaattacct ccgctcacaa tttgttcgtc attttcgtca gctgttgtgt agtgatgcag atgcgaatgg aaatcttcgt ggtcttcaaa atggatgtgt cgtgagtttg gaactttact tccacttgga tcaataaaaa ccattggatt atctgtggct gcattcaact cattttccat aacacttctg caaaattcaa cagtgtcatt gacaacacca tgcacttgtg gaatacatct caaagtatag gcatcttgca ctcgagaann nnntccaatg actgncnnnn nnnnnntgaa nnnttcnaaa cnnnnnnctc attttnnnnn ntttcannnn ngagcnnntn nnnnannnnn nnnngcaact ttttgttgtc caccgtgacg tcttgctgtg tgaattctcg catcaaagnc tttaaacgaa cctctaagtg cnnccaaagt cattgctccc acaacatcag ccattcgtac taaattaatg gaacgaacca aggcatctgt tcccaatgca caaataaatt gtgtaccatt aatgagtgcc aatccttcct tggcatgcac ttcaataggt tccaatccgt gttctttgag aactttgcct gcatcttcat aggtcttggt ggaaggattg tacatcattc cttcacccat cattccgagt gccatgtggg cgagaggtgc caaatctcca cttgctccga cacttccctt gcaaggaact gctgaaatgc atccagcatt aaaagccttc accaatcttt caaccgtttc caatcgaatt cccgagaagc cttttgccag cacattgatc cgcaacgcca acaacatttt cgctctctcc aaaggcaaat actctccaac accagctgca tgcgagcgaa ttaaattctt ttgtaaatct tcaatatctt cctttgcaat cacagtggta gcaaaatgac caaatcctgt attgattccg tacttgactt gattcgtgtg gatgatgtta tccacaattt ttcgagcctc tatgatacgg tctttggctt gttgtgtgag agcaattttc aaatgaggtt catagccgat tctgtataaa tcctcacagg tgagactctc accgtcgaga tagaggtaat gatcggagag aatcatggag gatgaaatag attcgcacaa attatcttgc atatggtggt ggtggtggtg agccat
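The validated NT read above is listed in the opposite orientation to the insert: its last bases are the reverse complement of the start of the tagged open reading frame in the expression vector. A minimal Python sketch of that comparison follows, using only the final 36 bases of the read and the first 36 bases of the coding region as they appear in this record; the helper name is illustrative.

```python
# Complement map for DNA, including N for unresolved bases.
COMPLEMENT = str.maketrans("acgtnACGTN", "tgcanTGCAN")


def reverse_complement(seq: str) -> str:
    """Strip grouping whitespace, complement each base, then reverse."""
    return "".join(seq.split()).translate(COMPLEMENT)[::-1]


# Last 36 bases of the validated read.
read_end = "attatcttgc atatggtggt ggtggtggtg agccat"
# First 36 bases of the coding region in the vector + insert sequence.
orf_start = "atggctcacc accaccacca ccatatgcaa gataat"

flipped = reverse_complement(read_end)
print(flipped)
print(flipped == "".join(orf_start.split()))
```

Reverse-complementing the whole read in the same way puts it in the orientation of the expected insert, which is the form needed before translating or aligning it.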
Expected Protein Sequence
MAHHHHHHMQ DNLCESISSS MILSDHYLYL DGESLTCEDL YRIGYEPHLK IALTQQAKDR IIEARKIVDN IIHTNQVKYG INTGFGHFAT TVIAKEDIED LQKNLIRSHA AGVGEYLPLE RAKMLLALRI NVLAKGFSGI RLETVERLVK AFNAGCISAV PCKGSVGASG DLAPLAHMAL GMMGEGMMYN PSTKTYEDAG KVLKEHGLEP IEVHAKEGLA LINGTQFICA LGTDALVRSI NLVRMADVVG AMTLEALRGS FKAFDARIHT ARRHGGQQKV AGRMRHLLFA EKVENEKRVF EISEISASHW NCSRVQDAYT LRCIPQVHGV VNDTVEFCRS VMENELNAAT DNPMVFIDPS GSKVPNSRHI HFEDHEDFHS HLHHYTTADE NDEQIVSGGN FHGEYPAKIM DYLGIAITEL ANISERRVAR LIDGNLSGLP AFLVKEGGLN SGFMIAHCTA SALTSENKVL AHPSSNDTLS TSAAKEDHVS MGPFASMKCL DIVKNVEYVL AIELMCACQG IDLLRPLKTT PILEKVYELV RSEVPPYEKD RFLSPDIEKI CHLIRTGKVW QVIKGDIPPE LH
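The expected protein sequence differs from the native AA sequence by an N-terminal extension carrying the 6xHis tag. The following sketch, with a hypothetical helper name, recovers that extension by checking that the construct ends with the native sequence. Only the first 30 construct residues and the first 22 native residues from this record are used, which is enough to show the relationship.

```python
def clean(seq: str) -> str:
    """Remove the whitespace that splits a listing into 10-letter groups."""
    return "".join(seq.split()).upper()


def n_terminal_extension(construct: str, native: str) -> str:
    """Return the residues that precede the native sequence in the construct."""
    construct, native = clean(construct), clean(native)
    if not construct.endswith(native):
        raise ValueError("construct does not end with the native sequence")
    return construct[: len(construct) - len(native)]


# Matching fragments from the expected protein and native AA sequences.
construct_start = "MAHHHHHHMQ DNLCESISSS MILSDHYLYL"
native_start = "MQDNLCESIS SSMILSDHYL YL"

tag = n_terminal_extension(construct_start, native_start)
print(tag, len(tag))
```

With the full sequences as input, the same function can be used to confirm that a construct is the native protein plus the tag and nothing else.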
Full NT Sequence (Expression Vector + Insert)
taatacgact cactataggg agaccacaac ggtttccctc tagaaataat tttgtttaac tttaagaagg agatatacca tggctcacca ccaccaccac catatgcaag ataatttgtg cgaatctatt tcatcctcca tgattctctc cgatcattac ctctatctcg acggtgagag tctcacctgt gaggatttat acagaatcgg ctatgaacct catttgaaaa ttgctctcac acaacaagcc aaagaccgta tcatagaggc tcgaaaaatt gtggataaca tcatccacac gaatcaagtc aagtacggaa tcaatacagg atttggtcat tttgctacca ctgtgattgc aaaggaagat attgaagatt tacaaaagaa tttaattcgc tcgcatgcag ctggtgttgg agagtatttg cctttggaga gagcgaaaat gttgttggcg ttgcggatca atgtgctggc aaaaggcttc tcgggaattc gattggaaac ggttgaaaga ttggtgaagg cttttaatgc tggatgcatt tcagcagttc cttgcaaggg aagtgtcgga gcaagtggag atttggcacc tctcgcccac atggcactcg gaatgatggg tgaaggaatg atgtacaatc cttccaccaa gacctatgaa gatgcaggca aagttctcaa agaacacgga ttggaaccta ttgaagtgca tgccaaggaa ggattggcac tcattaatgg tacacaattt atttgtgcat tgggaacaga tgccttggtt cgttccatta atttagtacg aatggctgat gttgtgggag caatgacttt ggaggcactt agaggttcgt ttaaagcctt tgatgcgaga attcacacag caagacgtca cggtggacaa caaaaagttg ctggaagaat gagacatttg ctcttcgctg aaaaggtgga aaatgagaag agagtttttg aaatttcaga gatttcagcc agtcattgga attgttctcg agtgcaagat gcctatactt tgagatgtat tccacaagtg catggtgttg tcaatgacac tgttgaattt tgcagaagtg ttatggaaaa tgagttgaat gcagccacag ataatccaat ggtatttatt gatccaagtg gaagtaaagt tccaaactca cgacacatcc attttgaaga ccacgaagat ttccattcgc atctgcatca ctacacaaca gctgacgaaa atgacgaaca aattgtgagc ggaggtaatt tccacggcga atatccagcc aagattatgg attatttggg aattgcaatt actgaacttg caaacattag tgagagacga gttgctcgtt tgattgatgg taacttgagt ggtttacctg catttttggt taaagaaggt ggtctaaaca gtggattcat gattgctcat tgtacagcca gtgccttgac aagtgaaaac aaagttttgg ctcacccaag ttccaatgac actctttcca catctgctgc caaagaagat cacgtctcta tgggaccatt tgcaagtatg aagtgtttag acattgttaa aaatgtcgaa tatgttttgg ccattgaatt gatgtgtgct tgtcaaggta tcgatttatt acgtccactc aagaccacac caattctcga aaaggtctac gaacttgtga gatcagaggt tccaccctat gagaaggaca gatttttgtc gccagatatt gagaaaattt gtcatttaat tagaactggt aaggtttggc aagtcattaa gggtgacatt 
cctcctgaac tgcattgagt aagataggat ccggctgcta acaaagcccg aaaggaagct gagttggctg ctgccaccgc tgagcaataa ctagcataac cccttggggc ctctaaacgg gtcttgaggg gttttttgct gaaaggagga actatatccg gatatccaca ggacgggtgt ggtcgccatg atcgcgtagt cgatagtggc tccaagtagc gaagcgagca ggactgggcg gcggccaaag cggtcggaca gtgctccgag aacgggtgcg catagaaatt gcatcaacgc atatagcgct agcagcacgc catagtgact ggcgatgctg tcggaatgga cgatatcccg caagaggccc ggcagtaccg gcataaccaa gcctatgcct acagcatcca gggtgacggt gccgaggatg acgatgagcg cattgttaga tttcatacac ggtgcctgac tgcgttagca atttaactgt gataaactac cgcattaaag cttatcgatg ataagctgtc aaacatgaga attcttgaag acgaaagggc ctcgtgatac gcctattttt ataggttaat gtcatgataa taatggtttc ttagacgtca ggtggcactt ttcggggaaa tgtgcgcgga acccctattt gtttattttt ctaaatacat tcaaatatgt atccgctcat gagacaataa ccctgataaa tgcttcaata atattgaaaa aggaagagta tgagtattca acatttccgt gtcgccctta ttcccttttt tgcggcattt tgccttcctg tttttgctca cccagaaacg ctggtgaaag taaaagatgc tgaagatcag ttgggtgcac gagtgggtta catcgaactg gatctcaaca gcggtaagat ccttgagagt tttcgccccg aagaacgttt tccaatgatg agcactttta aagttctgct atgtggcgcg gtattatccc gtgttgacgc cgggcaagag caactcggtc gccgcataca ctattctcag aatgacttgg ttgagtactc accagtcaca gaaaagcatc ttacggatgg catgacagta agagaattat gcagtgctgc cataaccatg agtgataaca ctgcggccaa cttacttctg acaacgatcg gaggaccgaa ggagctaacc gcttttttgc acaacatggg ggatcatgta actcgccttg atcgttggga accggagctg aatgaagcca taccaaacga cgagcgtgac accacgatgc ctgcagcaat ggcaacaacg ttgcgcaaac tattaactgg cgaactactt actctagctt cccggcaaca attaatagac tggatggagg cggataaagt tgcaggacca cttctgcgct cggcccttcc ggctggctgg tttattgctg ataaatctgg agccggtgag cgtgggtctc gcggtatcat tgcagcactg gggccagatg gtaagccctc ccgtatcgta gttatctaca cgacggggag tcaggcaact atggatgaac gaaatagaca gatcgctgag ataggtgcct cactgattaa gcattggtaa ctgtcagacc aagtttactc atatatactt tagattgatt taaaacttca tttttaattt aaaaggatct aggtgaagat cctttttgat aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc agaccccgta gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg 
ctgcttgcaa acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt tttccgaagg taactggctt cagcagagcg cagataccaa atactgtcct tctagtgtag ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct cgctctgcta atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg gttggactca agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg ctggcctttt gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat taccgccttt gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag gaagcggaag agcgcctgat gcggtatttt ctccttacgc atctgtgcgg tatttcacac cgcatatatg gtgcactctc agtacaatct gctctgatgc cgcatagtta agccagtata cactccgcta tcgctacgtg actgggtcat ggctgcgccc cgacacccgc caacacccgc tgacgcgccc tgacgggctt gtctgctccc ggcatccgct tacagacaag ctgtgaccgt ctccgggagc tgcatgtgtc agaggttttc accgtcatca ccgaaacgcg cgaggcagct gcggtaaagc tcatcagcgt ggtcgtgaag cgattcacag atgtctgcct gttcatccgc gtccagctcg ttgagtttct ccagaagcgt taatgtctgg cttctgataa agcgggccat gttaagggcg gttttttcct gtttggtcac tgatgcctcc gtgtaagggg gatttctgtt catgggggta atgataccga tgaaacgaga gaggatgctc acgatacggg ttactgatga tgaacatgcc cggttactgg aacgttgtga gggtaaacaa ctggcggtat ggatgcggcg ggaccagaga aaaatcactc agggtcaatg ccagcgcttc gttaatacag atgtaggtgt tccacagggt agccagcagc atcctgcgat gcagatccgg aacataatgg tgcagggcgc tgacttccgc gtttccagac tttacgaaac acggaaaccg aagaccattc atgttgttgc tcaggtcgca gacgttttgc agcagcagtc gcttcacgtt cgctcgcgta tcggtgattc attctgctaa ccagtaaggc aaccccgcca gcctagccgg gtcctcaacg acaggagcac gatcatgcgc acccgtggcc aggacccaac gctgcccgag atgcgccgcg tgcggctgct ggagatggcg gacgcgatgg atatgttctg ccaagggttg gtttgcgcat tcacagttct ccgcaagaat tgattggctc caattcttgg agtggtgaat ccgttagcga ggtgccgccg 
gcttccattc aggtcgaggt ggcccggctc catgcaccgc gacgcaacgc ggggaggcag acaaggtata gggcggcgcc tacaatccat gccaacccgt tccatgtgct cgccgaggcg gcataaatcg ccgtgacgat cagcggtcca gtgatcgaag ttaggctggt aagagccgcg agcgatcctt gaagctgtcc ctgatggtcg tcatctacct gcctggacag catggcctgc aacgcgggca tcccgatgcc gccggaagcg agaagaatca taatggggaa ggccatccag cctcgcgtcg cgaacgccag caagacgtag cccagcgcgt cggccgccat gccggcgata atggcctgct tctcgccgaa acgtttggtg gcgggaccag tgacgaaggc ttgagcgagg gcgtgcaaga ttccgaatac cgcaagcgac aggccgatca tcgtcgcgct ccagcgaaag cggtcctcgc cgaaaatgac ccagagcgct gccggcacct gtcctacgag ttgcatgata aagaagacag tcataagtgc ggcgacgata gtcatgcccc gcgcccaccg gaaggagctg actgggttga aggctctcaa gggcatcggt cgacgctctc ccttatgcga ctcctgcatt aggaagcagc ccagtagtag gttgaggccg ttgagcaccg ccgccgcaag gaatggtgca tgcaaggaga tggcgcccaa cagtcccccg gccacggggc ctgccaccat acccacgccg aaacaagcgc tcatgagccc gaagtggcga gcccgatctt ccccatcggt gatgtcggcg atataggcgc cagcaaccgc acctgtggcg ccggtgatgc cggccacgat gcgtccggcg tagaggatcg agatctcgat cccgcgaaat
Details for NafoA.19254.a.B1.PW38417
PURIFICATION DATE: 1/16/2018
CONCENTRATION: 13.63 mg/ml
OBSERVED MW: 70 kDa
EXPRESSION LEVEL: Low Expression
PROTEIN PURIFICATION BUFFER: 25 mM HEPES pH 7.0, 500 mM NaCl, 5% glycerol, 2 mM DTT, 0.025% azide
EXPRESSION HOST: BL21(DE3) Rosetta
VIAL COUNT (approx.): 8
VIAL VOLUME: 200 µl
PERCENT IDENTITY: 96
PERCENT COVERAGE: 49
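The concentration, vial volume and vial count listed above are enough to estimate how much protein each vial holds. The arithmetic is sketched below in Python; the molar concentration additionally assumes the 64 kDa expected MW given for the clone in this record, and the vial count is stated on the page as approximate, so the total is an estimate.

```python
# Values taken from the protein details in this record.
concentration_mg_per_ml = 13.63
vial_volume_ul = 200.0
vial_count = 8                 # listed as approximate

# Assumption: the 64 kDa expected MW from the clone details.
expected_mw_da = 64_000.0

mg_per_vial = concentration_mg_per_ml * vial_volume_ul / 1000.0
total_mg = mg_per_vial * vial_count

# mg/ml equals g/L, so dividing by g/mol gives mol/L; scale to micromolar.
micromolar = concentration_mg_per_ml / expected_mw_da * 1e6

print(f"{mg_per_vial:.3f} mg per vial, about {total_mg:.1f} mg in total")
print(f"about {micromolar:.0f} uM")
```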
Protocol Notes
notes unavailable
Validated AA Sequence
MAHHHHHHMQ DNLCESISSS MILSDHYLYL DGESLTCEDL YRIGYEPHLK IALTQQAKDR IIEARKIVDN IIHTNQVKYG INTGFGHFAT TVIAKEDIED LQKNLIRSHA AGVGEYLPLE RAKMLLALRI NVLAKGFSGI RLETVERLVK AFNAGCISAV PCKGSVGASG DLAPLAHMAL GMMGEGMMYN PSTKTYEDAG KVLKEHGLEP IEVHAKEGLA LINGTQFICA LGTDALVRSI NLVRMADVVG AMTLXALRGS FKXFDARIHT ARRHGGQQKV AXXXXXXAXX
Validated NT Sequence
ccttaatgac ttgccaaacc ttaccagttc taattaaatg acaaattttc tcaatatctg gcgacaaaaa tctgtccttc tcatagggtg gaacctctga tctcacaagt tcgtagacct tttcgagaat tggtgtggtc ttgagtggac gtaataaatc gataccttga caagcacaca tcaattcaat ggccaaaaca tattcgacat ttttaacaat gtctaaacac ttcatacttg caaatggtcc catagagacg tgatcttctt tggcagcaga tgtggaaaga gtgtcattgg aacttgggtg agccaaaact ttgttttcac ttgtcaaggc actggctgta caatgagcaa tcatgaatcc actgtttaga ccaccttctt taaccaaaaa tgcaggtaaa ccactcaagt taccatcaat caaacgagca actcgtctct cactaatgtt tgcaagttca gtaattgcaa ttcccaaata atccataatc ttggctggat attcgccgtg gaaattacct ccgctcacaa tttgttcgtc attttcgtca gctgttgtgt agtgatgcag atgcgaatgg aaatcttcgt ggtcttcaaa atggatgtgt cgtgagtttg gaactttact tccacttgga tcaataaaaa ccattggatt atctgtggct gcattcaact cattttccat aacacttctg caaaattcaa cagtgtcatt gacaacacca tgcacttgtg gaatacatct caaagtatag gcatcttgca ctcgagaann nnntccaatg actgncnnnn nnnnnntgaa nnnttcnaaa cnnnnnnctc attttnnnnn ntttcannnn ngagcnnntn nnnnannnnn nnnngcaact ttttgttgtc caccgtgacg tcttgctgtg tgaattctcg catcaaagnc tttaaacgaa cctctaagtg cnnccaaagt cattgctccc acaacatcag ccattcgtac taaattaatg gaacgaacca aggcatctgt tcccaatgca caaataaatt gtgtaccatt aatgagtgcc aatccttcct tggcatgcac ttcaataggt tccaatccgt gttctttgag aactttgcct gcatcttcat aggtcttggt ggaaggattg tacatcattc cttcacccat cattccgagt gccatgtggg cgagaggtgc caaatctcca cttgctccga cacttccctt gcaaggaact gctgaaatgc atccagcatt aaaagccttc accaatcttt caaccgtttc caatcgaatt cccgagaagc cttttgccag cacattgatc cgcaacgcca acaacatttt cgctctctcc aaaggcaaat actctccaac accagctgca tgcgagcgaa ttaaattctt ttgtaaatct tcaatatctt cctttgcaat cacagtggta gcaaaatgac caaatcctgt attgattccg tacttgactt gattcgtgtg gatgatgtta tccacaattt ttcgagcctc tatgatacgg tctttggctt gttgtgtgag agcaattttc aaatgaggtt catagccgat tctgtataaa tcctcacagg tgagactctc accgtcgaga tagaggtaat gatcggagag aatcatggag gatgaaatag attcgcacaa attatcttgc atatggtggt ggtggtggtg agccat
Expressed Protein Sequence
MAHHHHHHMQ DNLCESISSS MILSDHYLYL DGESLTCEDL YRIGYEPHLK IALTQQAKDR IIEARKIVDN IIHTNQVKYG INTGFGHFAT TVIAKEDIED LQKNLIRSHA AGVGEYLPLE RAKMLLALRI NVLAKGFSGI RLETVERLVK AFNAGCISAV PCKGSVGASG DLAPLAHMAL GMMGEGMMYN PSTKTYEDAG KVLKEHGLEP IEVHAKEGLA LINGTQFICA LGTDALVRSI NLVRMADVVG AMTLEALRGS FKAFDARIHT ARRHGGQQKV AGRMRHLLFA EKVENEKRVF EISEISASHW NCSRVQDAYT LRCIPQVHGV VNDTVEFCRS VMENELNAAT DNPMVFIDPS GSKVPNSRHI HFEDHEDFHS HLHHYTTADE NDEQIVSGGN FHGEYPAKIM DYLGIAITEL ANISERRVAR LIDGNLSGLP AFLVKEGGLN SGFMIAHCTA SALTSENKVL AHPSSNDTLS TSAAKEDHVS MGPFASMKCL DIVKNVEYVL AIELMCACQG IDLLRPLKTT PILEKVYELV RSEVPPYEKD RFLSPDIEKI CHLIRTGKVW QVIKGDIPPE LH
Full NT Sequence (Expression Vector + Insert)
TAATACGACT CACTATAGGG AGACCACAAC GGTTTCCCTC TAGAAATAAT TTTGTTTAAC TTTAAGAAGG AGATATACCA TGGCTCACCA CCACCACCAC CATATGCAAG ATAATTTGTG CGAATCTATT TCATCCTCCA TGATTCTCTC CGATCATTAC CTCTATCTCG ACGGTGAGAG TCTCACCTGT GAGGATTTAT ACAGAATCGG CTATGAACCT CATTTGAAAA TTGCTCTCAC ACAACAAGCC AAAGACCGTA TCATAGAGGC TCGAAAAATT GTGGATAACA TCATCCACAC GAATCAAGTC AAGTACGGAA TCAATACAGG ATTTGGTCAT TTTGCTACCA CTGTGATTGC AAAGGAAGAT ATTGAAGATT TACAAAAGAA TTTAATTCGC TCGCATGCAG CTGGTGTTGG AGAGTATTTG CCTTTGGAGA GAGCGAAAAT GTTGTTGGCG TTGCGGATCA ATGTGCTGGC AAAAGGCTTC TCGGGAATTC GATTGGAAAC GGTTGAAAGA TTGGTGAAGG CTTTTAATGC TGGATGCATT TCAGCAGTTC CTTGCAAGGG AAGTGTCGGA GCAAGTGGAG ATTTGGCACC TCTCGCCCAC ATGGCACTCG GAATGATGGG TGAAGGAATG ATGTACAATC CTTCCACCAA GACCTATGAA GATGCAGGCA AAGTTCTCAA AGAACACGGA TTGGAACCTA TTGAAGTGCA TGCCAAGGAA GGATTGGCAC TCATTAATGG TACACAATTT ATTTGTGCAT TGGGAACAGA TGCCTTGGTT CGTTCCATTA ATTTAGTACG AATGGCTGAT GTTGTGGGAG CAATGACTTT GGAGGCACTT AGAGGTTCGT TTAAAGCCTT TGATGCGAGA ATTCACACAG CAAGACGTCA CGGTGGACAA CAAAAAGTTG CTGGAAGAAT GAGACATTTG CTCTTCGCTG AAAAGGTGGA AAATGAGAAG AGAGTTTTTG AAATTTCAGA GATTTCAGCC AGTCATTGGA ATTGTTCTCG AGTGCAAGAT GCCTATACTT TGAGATGTAT TCCACAAGTG CATGGTGTTG TCAATGACAC TGTTGAATTT TGCAGAAGTG TTATGGAAAA TGAGTTGAAT GCAGCCACAG ATAATCCAAT GGTATTTATT GATCCAAGTG GAAGTAAAGT TCCAAACTCA CGACACATCC ATTTTGAAGA CCACGAAGAT TTCCATTCGC ATCTGCATCA CTACACAACA GCTGACGAAA ATGACGAACA AATTGTGAGC GGAGGTAATT TCCACGGCGA ATATCCAGCC AAGATTATGG ATTATTTGGG AATTGCAATT ACTGAACTTG CAAACATTAG TGAGAGACGA GTTGCTCGTT TGATTGATGG TAACTTGAGT GGTTTACCTG CATTTTTGGT TAAAGAAGGT GGTCTAAACA GTGGATTCAT GATTGCTCAT TGTACAGCCA GTGCCTTGAC AAGTGAAAAC AAAGTTTTGG CTCACCCAAG TTCCAATGAC ACTCTTTCCA CATCTGCTGC CAAAGAAGAT CACGTCTCTA TGGGACCATT TGCAAGTATG AAGTGTTTAG ACATTGTTAA AAATGTCGAA TATGTTTTGG CCATTGAATT GATGTGTGCT TGTCAAGGTA TCGATTTATT ACGTCCACTC AAGACCACAC CAATTCTCGA AAAGGTCTAC GAACTTGTGA GATCAGAGGT TCCACCCTAT GAGAAGGACA GATTTTTGTC GCCAGATATT GAGAAAATTT GTCATTTAAT TAGAACTGGT AAGGTTTGGC AAGTCATTAA GGGTGACATT 
CCTCCTGAAC TGCATTGAGT AAGATAGGAT CCGGCTGCTA ACAAAGCCCG AAAGGAAGCT GAGTTGGCTG CTGCCACCGC TGAGCAATAA CTAGCATAAC CCCTTGGGGC CTCTAAACGG GTCTTGAGGG GTTTTTTGCT GAAAGGAGGA ACTATATCCG GATATCCACA GGACGGGTGT GGTCGCCATG ATCGCGTAGT CGATAGTGGC TCCAAGTAGC GAAGCGAGCA GGACTGGGCG GCGGCCAAAG CGGTCGGACA GTGCTCCGAG AACGGGTGCG CATAGAAATT GCATCAACGC ATATAGCGCT AGCAGCACGC CATAGTGACT GGCGATGCTG TCGGAATGGA CGATATCCCG CAAGAGGCCC GGCAGTACCG GCATAACCAA GCCTATGCCT ACAGCATCCA GGGTGACGGT GCCGAGGATG ACGATGAGCG CATTGTTAGA TTTCATACAC GGTGCCTGAC TGCGTTAGCA ATTTAACTGT GATAAACTAC CGCATTAAAG CTTATCGATG ATAAGCTGTC AAACATGAGA ATTCTTGAAG ACGAAAGGGC CTCGTGATAC GCCTATTTTT ATAGGTTAAT GTCATGATAA TAATGGTTTC TTAGACGTCA GGTGGCACTT TTCGGGGAAA TGTGCGCGGA ACCCCTATTT GTTTATTTTT CTAAATACAT TCAAATATGT ATCCGCTCAT GAGACAATAA CCCTGATAAA TGCTTCAATA ATATTGAAAA AGGAAGAGTA TGAGTATTCA ACATTTCCGT GTCGCCCTTA TTCCCTTTTT TGCGGCATTT TGCCTTCCTG TTTTTGCTCA CCCAGAAACG CTGGTGAAAG TAAAAGATGC TGAAGATCAG TTGGGTGCAC GAGTGGGTTA CATCGAACTG GATCTCAACA GCGGTAAGAT CCTTGAGAGT TTTCGCCCCG AAGAACGTTT TCCAATGATG AGCACTTTTA AAGTTCTGCT ATGTGGCGCG GTATTATCCC GTGTTGACGC CGGGCAAGAG CAACTCGGTC GCCGCATACA CTATTCTCAG AATGACTTGG TTGAGTACTC ACCAGTCACA GAAAAGCATC TTACGGATGG CATGACAGTA AGAGAATTAT GCAGTGCTGC CATAACCATG AGTGATAACA CTGCGGCCAA CTTACTTCTG ACAACGATCG GAGGACCGAA GGAGCTAACC GCTTTTTTGC ACAACATGGG GGATCATGTA ACTCGCCTTG ATCGTTGGGA ACCGGAGCTG AATGAAGCCA TACCAAACGA CGAGCGTGAC ACCACGATGC CTGCAGCAAT GGCAACAACG TTGCGCAAAC TATTAACTGG CGAACTACTT ACTCTAGCTT CCCGGCAACA ATTAATAGAC TGGATGGAGG CGGATAAAGT TGCAGGACCA CTTCTGCGCT CGGCCCTTCC GGCTGGCTGG TTTATTGCTG ATAAATCTGG AGCCGGTGAG CGTGGGTCTC GCGGTATCAT TGCAGCACTG GGGCCAGATG GTAAGCCCTC CCGTATCGTA GTTATCTACA CGACGGGGAG TCAGGCAACT ATGGATGAAC GAAATAGACA GATCGCTGAG ATAGGTGCCT CACTGATTAA GCATTGGTAA CTGTCAGACC AAGTTTACTC ATATATACTT TAGATTGATT TAAAACTTCA TTTTTAATTT AAAAGGATCT AGGTGAAGAT CCTTTTTGAT AATCTCATGA CCAAAATCCC TTAACGTGAG TTTTCGTTCC ACTGAGCGTC AGACCCCGTA GAAAAGATCA AAGGATCTTC TTGAGATCCT TTTTTTCTGC GCGTAATCTG 
CTGCTTGCAA ACAAAAAAAC CACCGCTACC AGCGGTGGTT TGTTTGCCGG ATCAAGAGCT ACCAACTCTT TTTCCGAAGG TAACTGGCTT CAGCAGAGCG CAGATACCAA ATACTGTCCT TCTAGTGTAG CCGTAGTTAG GCCACCACTT CAAGAACTCT GTAGCACCGC CTACATACCT CGCTCTGCTA ATCCTGTTAC CAGTGGCTGC TGCCAGTGGC GATAAGTCGT GTCTTACCGG GTTGGACTCA AGACGATAGT TACCGGATAA GGCGCAGCGG TCGGGCTGAA CGGGGGGTTC GTGCACACAG CCCAGCTTGG AGCGAACGAC CTACACCGAA CTGAGATACC TACAGCGTGA GCTATGAGAA AGCGCCACGC TTCCCGAAGG GAGAAAGGCG GACAGGTATC CGGTAAGCGG CAGGGTCGGA ACAGGAGAGC GCACGAGGGA GCTTCCAGGG GGAAACGCCT GGTATCTTTA TAGTCCTGTC GGGTTTCGCC ACCTCTGACT TGAGCGTCGA TTTTTGTGAT GCTCGTCAGG GGGGCGGAGC CTATGGAAAA ACGCCAGCAA CGCGGCCTTT TTACGGTTCC TGGCCTTTTG CTGGCCTTTT GCTCACATGT TCTTTCCTGC GTTATCCCCT GATTCTGTGG ATAACCGTAT TACCGCCTTT GAGTGAGCTG ATACCGCTCG CCGCAGCCGA ACGACCGAGC GCAGCGAGTC AGTGAGCGAG GAAGCGGAAG AGCGCCTGAT GCGGTATTTT CTCCTTACGC ATCTGTGCGG TATTTCACAC CGCATATATG GTGCACTCTC AGTACAATCT GCTCTGATGC CGCATAGTTA AGCCAGTATA CACTCCGCTA TCGCTACGTG ACTGGGTCAT GGCTGCGCCC CGACACCCGC CAACACCCGC TGACGCGCCC TGACGGGCTT GTCTGCTCCC GGCATCCGCT TACAGACAAG CTGTGACCGT CTCCGGGAGC TGCATGTGTC AGAGGTTTTC ACCGTCATCA CCGAAACGCG CGAGGCAGCT GCGGTAAAGC TCATCAGCGT GGTCGTGAAG CGATTCACAG ATGTCTGCCT GTTCATCCGC GTCCAGCTCG TTGAGTTTCT CCAGAAGCGT TAATGTCTGG CTTCTGATAA AGCGGGCCAT GTTAAGGGCG GTTTTTTCCT GTTTGGTCAC TGATGCCTCC GTGTAAGGGG GATTTCTGTT CATGGGGGTA ATGATACCGA TGAAACGAGA GAGGATGCTC ACGATACGGG TTACTGATGA TGAACATGCC CGGTTACTGG AACGTTGTGA GGGTAAACAA CTGGCGGTAT GGATGCGGCG GGACCAGAGA AAAATCACTC AGGGTCAATG CCAGCGCTTC GTTAATACAG ATGTAGGTGT TCCACAGGGT AGCCAGCAGC ATCCTGCGAT GCAGATCCGG AACATAATGG TGCAGGGCGC TGACTTCCGC GTTTCCAGAC TTTACGAAAC ACGGAAACCG AAGACCATTC ATGTTGTTGC TCAGGTCGCA GACGTTTTGC AGCAGCAGTC GCTTCACGTT CGCTCGCGTA TCGGTGATTC ATTCTGCTAA CCAGTAAGGC AACCCCGCCA GCCTAGCCGG GTCCTCAACG ACAGGAGCAC GATCATGCGC ACCCGTGGCC AGGACCCAAC GCTGCCCGAG ATGCGCCGCG TGCGGCTGCT GGAGATGGCG GACGCGATGG ATATGTTCTG CCAAGGGTTG GTTTGCGCAT TCACAGTTCT CCGCAAGAAT TGATTGGCTC CAATTCTTGG AGTGGTGAAT CCGTTAGCGA GGTGCCGCCG 
GCTTCCATTC AGGTCGAGGT GGCCCGGCTC CATGCACCGC GACGCAACGC GGGGAGGCAG ACAAGGTATA GGGCGGCGCC TACAATCCAT GCCAACCCGT TCCATGTGCT CGCCGAGGCG GCATAAATCG CCGTGACGAT CAGCGGTCCA GTGATCGAAG TTAGGCTGGT AAGAGCCGCG AGCGATCCTT GAAGCTGTCC CTGATGGTCG TCATCTACCT GCCTGGACAG CATGGCCTGC AACGCGGGCA TCCCGATGCC GCCGGAAGCG AGAAGAATCA TAATGGGGAA GGCCATCCAG CCTCGCGTCG CGAACGCCAG CAAGACGTAG CCCAGCGCGT CGGCCGCCAT GCCGGCGATA ATGGCCTGCT TCTCGCCGAA ACGTTTGGTG GCGGGACCAG TGACGAAGGC TTGAGCGAGG GCGTGCAAGA TTCCGAATAC CGCAAGCGAC AGGCCGATCA TCGTCGCGCT CCAGCGAAAG CGGTCCTCGC CGAAAATGAC CCAGAGCGCT GCCGGCACCT GTCCTACGAG TTGCATGATA AAGAAGACAG TCATAAGTGC GGCGACGATA GTCATGCCCC GCGCCCACCG GAAGGAGCTG ACTGGGTTGA AGGCTCTCAA GGGCATCGGT CGACGCTCTC CCTTATGCGA CTCCTGCATT AGGAAGCAGC CCAGTAGTAG GTTGAGGCCG TTGAGCACCG CCGCCGCAAG GAATGGTGCA TGCAAGGAGA TGGCGCCCAA CAGTCCCCCG GCCACGGGGC CTGCCACCAT ACCCACGCCG AAACAAGCGC TCATGAGCCC GAAGTGGCGA GCCCGATCTT CCCCATCGGT GATGTCGGCG ATATAGGCGC CAGCAACCGC ACCTGTGGCG CCGGTGATGC CGGCCACGAT GCGTCCGGCG TAGAGGATCG AGATCTCGAT CCCGCGAAAT