MythA.10611.a

Alcohol dehydrogenase B (A0R4K5 homolog)

CENTER ID: MythA.10611.a
ORGANISM: Mycobacterium thermoresistibile ATCC 19527 / NCTC 10409
ASSOCIATED DISEASE: Respiratory Infection
CURRENT STATUS: diffraction
COMMUNITY REQUEST: False
NIH RISK GROUP: 2
SELECT AGENT: False
NIH PRIORITY PATHOGENS CATEGORY: IIIC

Ordering Clones & Proteins

If materials are available for this target, they are listed below. Materials can be ordered from SSGCID using the button in the "order material" column; clicking the button adds the material to a virtual cart. You may order multiple materials at a time at no cost to you, as this contract is funded by NIAID. When you are ready to place your order, click the "Place Order" link, which appears in the top right corner of the page after you place your first item in your cart.

Clones*

CENTER REFERENCE ID          DOMAIN/REGION DESCRIPTION      INFO   AA START   AA STOP   ORDER MATERIAL
MythA.10611.a.A1.GE29591     Full length (MythA.10611.a)           1          387
MythA.10611.a.AE1.GE43246    Full length (MythA.10611.a)           1          387
* SSGCID clones represent un-induced expression constructs which have been verified by sequencing from vector primers. Clones may contain a tag, such as N-term 6xHis. Get sequence information using the button in the "info" column.
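Because a clone may carry a tag, it can help to see which residues of a construct fall outside the native protein. The following is a minimal sketch, not SSGCID's own procedure: the function names `clean` and `locate_native` are hypothetical, and the two fragments are truncated N-terminal copies of the native AA sequence and of the expected protein sequence for MythA.10611.a.A1.GE29591 as listed on this page.

```python
from typing import Tuple


def clean(seq: str) -> str:
    """Remove the spaces/newlines used to group residues in blocks of ten."""
    return "".join(seq.split()).upper()


def locate_native(native: str, construct: str) -> Tuple[str, str]:
    """Return the residues of `construct` that flank the native sequence.

    The first native residue is dropped before searching, because a
    construct may put a tag in front of it or encode it differently.
    """
    core = clean(native)[1:]
    full = clean(construct)
    start = full.find(core)
    if start == -1:
        raise ValueError("native sequence not found in construct")
    return full[:start], full[start + len(core):]


# Truncated fragments copied from this page, for illustration only.
native_fragment = "MSAEVIGGRQ VAMKTKGALI"
construct_fragment = "MAHHHHHHMG TLEAQTQGPG SMLSAEVIGG RQVAMKTKGA LI"

n_extra, c_extra = locate_native(native_fragment, construct_fragment)
print("N-terminal extra residues:", n_extra)
print("C-terminal extra residues:", c_extra or "(none)")
```

With the full-length sequences pasted in place of the fragments, the same call would also report any C-terminal tag, such as the one in the expected protein sequence of MythA.10611.a.AE1.GE43246.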

Proteins

CENTER REFERENCE ID          DOMAIN/REGION DESCRIPTION      INFO   AA START   AA STOP   ORDER MATERIAL
MythA.10611.a.A1.PS00775     Full length (MythA.10611.a)           1          387
External Resources
RESOURCE REFERENCE ID
PATRIC ID: fig|1078020.3.peg.2754
RefSeq: WP_003926285.1
UniProt: G7CGR1
Sequences
These sequences are the native gene sequence; sequences of constructs derived from these sequences may differ due to codon optimization or other protocols. To find the specific sequence of any material you may have ordered, click on the "info" button next to the name of that material.
AA Sequence
MSAEVIGGRQ VAMKTKGALI WEFNQPWSVE EIEIGDPVKD EVKIRMEASG MCHSDHHLVT GGIPMGGFPV LGGHEGAGVV EEVGPGVEEV QPGDHVVLSF IPSCGQCPSC QAGLRNLCDL GAGLLNGAAV SDGTFRITAR GQNVYPMTLL GTFSPYMVVH KSSVVKIDKD IPFEVACLVG CGVTTGYGSA VRSGDVRPGD DVAIVGVGGV GMGALQGAVN AGARYIFAID PVEWKRDQAL KFGATHVYPD IESAMAGIAE VTWGLMAKKV IVTVGELKGE DIDAYVNLTA KGGTCVLTAI GSLLDTQVNL NLSMLTLLQK NLQGTIFGGG NPHHDIPQLL SMYKAGKLNL DDMVTRQYKL EQINEGYQDM LEGRNIRGVI RYTDDDR
NT Sequence
CTGTCGGCAG AGGTCATCGG AGGAAGGCAG GTTGCGATGA AGACAAAAGG CGCCCTCATC TGGGAGTTCA ATCAGCCGTG GTCGGTCGAG GAGATCGAGA TCGGTGACCC CGTCAAGGAT GAGGTCAAGA TCCGGATGGA AGCCTCGGGC ATGTGCCACT CCGACCACCA TCTGGTGACC GGCGGCATCC CGATGGGCGG GTTCCCGGTG CTCGGCGGGC ATGAAGGCGC CGGGGTGGTC GAGGAGGTCG GCCCGGGGGT CGAGGAGGTG CAGCCGGGCG ACCACGTGGT GTTGTCGTTC ATCCCGTCGT GCGGGCAGTG CCCGTCGTGT CAGGCCGGGC TGCGCAACCT CTGCGATCTC GGGGCCGGTC TGCTCAACGG CGCCGCGGTG TCCGACGGGA CCTTCCGCAT CACCGCCCGC GGGCAGAACG TCTACCCGAT GACGCTGCTG GGCACCTTCT CGCCGTACAT GGTGGTGCAC AAGAGCTCGG TGGTGAAGAT CGACAAGGAC ATCCCGTTCG AGGTGGCCTG CCTGGTCGGC TGCGGTGTCA CCACCGGCTA CGGCTCGGCG GTGCGCAGCG GCGACGTCCG CCCGGGCGAC GACGTCGCGA TCGTCGGGGT CGGCGGCGTC GGCATGGGTG CGCTGCAGGG GGCGGTCAAC GCCGGGGCCC GCTACATCTT CGCGATCGAC CCGGTGGAGT GGAAGCGCGA TCAGGCGCTG AAATTCGGTG CCACCCACGT CTATCCGGAC ATCGAGTCGG CGATGGCGGG CATCGCGGAG GTCACCTGGG GTCTGATGGC CAAGAAGGTC ATCGTCACCG TCGGTGAGCT CAAGGGTGAG GACATCGACG CTTACGTCAA CCTCACCGCC AAGGGCGGAA CCTGCGTGCT GACCGCGATC GGCAGCCTGC TGGACACCCA GGTCAACCTG AATCTGTCCA TGCTGACGCT GCTGCAGAAG AACCTGCAGG GCACCATCTT CGGCGGCGGC AACCCGCATC ACGACATCCC GCAGCTGCTG TCGATGTACA AGGCCGGCAA GCTCAACCTC GACGACATGG TCACCCGGCA GTACAAGCTG GAGCAGATCA ACGAGGGCTA CCAGGACATG CTCGAGGGCC GCAACATCCG CGGCGTCATC CGCTACACCG ACGACGACCG G
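To relate the NT sequence to the AA sequence, the nucleotides can be translated codon by codon. The sketch below assumes the standard genetic code and reading frame 1; the `translate` helper is hypothetical, and only the first 30 nucleotides listed above are used. Note that the listed NT sequence begins with CTG, which the standard table reads as leucine, while the listed AA sequence begins with M. This is presumably because a non-ATG start codon is decoded as methionine at translation initiation in bacteria, which a plain table lookup does not model.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(nt: str) -> str:
    """Translate a nucleotide string in frame 1; spaces are ignored.

    Trailing bases that do not complete a codon are skipped.
    """
    seq = "".join(nt.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 nucleotides of the native NT sequence on this page.
fragment = "CTGTCGGCAG AGGTCATCGG AGGAAGGCAG"
protein = translate(fragment)
print(protein)
```

The translated residues after the first position match the start of the listed AA sequence (SAEVIGGRQ); only the initiator residue differs, for the reason given above.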
Details for MythA.10611.a.A1.GE29591
HARVESTED ON: 6/22/2010
SEQUENCED ON: 7/1/2010
EXPECTED MW: 43kDa
OBSERVED MW: 43kDa
ANTIBIOTIC MARKER: ampicillin
COUNT OF EXPRESSION COLONIES: Many (50-100)
TOTAL EXPRESSION LEVEL: Moderate Expression
SOLUBLE EXPRESSION LEVEL: Moderate Expression
EXPRESSION HOST: data unavailable
SEQUENCING RESULT: pass with incomplete coverage
PERCENT IDENTITY: 100
PERCENT COVERAGE: 94
Validated AA Sequence
MLSAEVIGGR QVAMKTKGAL IWEFNQPWSV EEIEIGDPVK DEVKIRMEAS GMCHSDHHLV TGGIPMGGFP VLGGHEGAGV VEEVGPGVEE VQPGDHVVLS FIPSCGQCPS CQAGLRNLCD LGAGLLNGAA VSDGTFRITA RGQNVYPMTL LGTFSPYMVV HKSSVVKIDK DIPFEVACLV GCGVTTGYGS AVRSGDVRPG DDVAIVGVGG VGMGALQGAV NAGARYIFAI DPVEWKRDQA LKFGATHVYP DIESAMAGIA EVTWGLMAKK VIVTVGELKG EDIDAYVNLT AKGGTCVLTA IGSLLDTQVN LNLSMLTLLQ KNLQGTIFGG GNPHHDIPQL LSMYKAGKLN LDDMVTRQYK LEQINEGYQD MLEGRNIRGV IRYTDD
Validated NT Sequence
cgtcgtcggt gtagcggatg acgccgcgga tgttgcggcc ctcgagcatg tcctggtagc cctcgttgat ctgctccagc ttgtactgcc gggtgaccat gtcgtcgagg ttgagcttgc cggccttgta catcgacagc agctgcggga tgtcgtgatg cgggttgccg ccgccgaaga tggtgccctg caggttcttc tgcagcagcg tcagcatgga cagattcagg ttgacctggg tgtccagcag gctgccgatc gcggtcagca cgcaggttcc gcccttggcg gtgaggttga cgtaagcgtc gatgtcctca cccttgagct caccgacggt gacgatgacc ttcttggcca tcagacccca ggtgacctcc gcgatgcccg ccatcgccga ctcgatgtcc ggatagacgt gggtggcacc gaatttcagc gcctgatcgc gcttccactc caccgggtcg atcgcgaaga tgtagcgggc cccggcgttg accgccccct gcagcgcacc catgccgacg ccgccgaccc cgacgatcgc gacgtcgtcg cccgggcgga cgtcgccgct gcgcaccgcc gagccgtagc cggtggtgac accgcagccg accaggcagg ccacctcgaa cgggatgtcc ttgtcgatct tcaccaccga gctcttgtgc accaccatgt acggcgagaa ggtgcccagc agcgtcatcg ggtagacgtt ctgcccgcgg gcggtgatgc ggaaggtccc gtcggacacc gcggcgccgt tgagcagacc ggccccgaga tcgcagaggt tgcgcagccc ggcctgacac gacgggcact gcccgcacga cgggatgaac gacaacacca cgtggtcgcc cggctgcacc tcctcgaccc ccgggccgac ctcctcgacc accccggcgc cttcatgccc gccgagcacc gggaacccgc ccatcgggat gccgccggtc accagatggt ggtcggagtg gcacatgccc gaggcttcca tccggatctt gacctcatcc ttgacggggt caccgatctc gatctcctcg accgaccacg gctgattgaa ctcccagatg agggcgcctt ttgtcttcat cgcaacctgc cttcctccga tgacctctgc cgacagcatc gaaccaggac cctgggtctg agcttccagg gtacc
Expected Protein Sequence
MAHHHHHHMG TLEAQTQGPG SMLSAEVIGG RQVAMKTKGA LIWEFNQPWS VEEIEIGDPV KDEVKIRMEA SGMCHSDHHL VTGGIPMGGF PVLGGHEGAG VVEEVGPGVE EVQPGDHVVL SFIPSCGQCP SCQAGLRNLC DLGAGLLNGA AVSDGTFRIT ARGQNVYPMT LLGTFSPYMV VHKSSVVKID KDIPFEVACL VGCGVTTGYG SAVRSGDVRP GDDVAIVGVG GVGMGALQGA VNAGARYIFA IDPVEWKRDQ ALKFGATHVY PDIESAMAGI AEVTWGLMAK KVIVTVGELK GEDIDAYVNL TAKGGTCVLT AIGSLLDTQV NLNLSMLTLL QKNLQGTIFG GGNPHHDIPQ LLSMYKAGKL NLDDMVTRQY KLEQINEGYQ DMLEGRNIRG VIRYTDDDR
Full NT Sequence (Expression Vector + Insert)
ttcttgaaga cgaaagggcc tcgtgatacg cctattttta taggttaatg tcatgataat aatggtttct tagacgtcag gtggcacttt tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg tgttgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc tgcagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc actgattaag cattggtaac tgtcagacca agtttactca tatatacttt agattgattt aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg 
gggcggagcc tatggaaaaa cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc gcatatatgg tgcactctca gtacaatctg ctctgatgcc gcatagttaa gccagtatac actccgctat cgctacgtga ctgggtcatg gctgcgcccc gacacccgcc aacacccgct gacgcgccct gacgggcttg tctgctcccg gcatccgctt acagacaagc tgtgaccgtc tccgggagct gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc gaggcagctg cggtaaagct catcagcgtg gtcgtgaagc gattcacaga tgtctgcctg ttcatccgcg tccagctcgt tgagtttctc cagaagcgtt aatgtctggc ttctgataaa gcgggccatg ttaagggcgg ttttttcctg tttggtcact gatgcctccg tgtaaggggg atttctgttc atgggggtaa tgataccgat gaaacgagag aggatgctca cgatacgggt tactgatgat gaacatgccc ggttactgga acgttgtgag ggtaaacaac tggcggtatg gatgcggcgg gaccagagaa aaatcactca gggtcaatgc cagcgcttcg ttaatacaga tgtaggtgtt ccacagggta gccagcagca tcctgcgatg cagatccgga acataatggt gcagggcgct gacttccgcg tttccagact ttacgaaaca cggaaaccga agaccattca tgttgttgct caggtcgcag acgttttgca gcagcagtcg cttcacgttc gctcgcgtat cggtgattca ttctgctaac cagtaaggca accccgccag cctagccggg tcctcaacga caggagcacg atcatgcgca cccgtggcca ggacccaacg ctgcccgaga tgcgccgcgt gcggctgctg gagatggcgg acgcgatgga tatgttctgc caagggttgg tttgcgcatt cacagttctc cgcaagaatt gattggctcc aattcttgga gtggtgaatc cgttagcgag gtgccgccgg cttccattca ggtcgaggtg gcccggctcc atgcaccgcg acgcaacgcg gggaggcaga caaggtatag ggcggcgcct acaatccatg ccaacccgtt ccatgtgctc gccgaggcgg cataaatcgc cgtgacgatc agcggtccag tgatcgaagt taggctggta agagccgcga gcgatccttg aagctgtccc tgatggtcgt catctacctg cctggacagc atggcctgca acgcgggcat cccgatgccg ccggaagcga gaagaatcat aatggggaag gccatccagc ctcgcgtcgt gaacgccagc aagacgtagc ccagcgcgtc ggccgtaaca acaccattta aatggagtgg ttacaaatgg agtggttaat taacaacacc atttgtcgac gctctccctt atgcgactcc tgcattagga agcagcccag tagtaggttg aggccgttga gcaccgccgc cgcaaggaat ggtgcatgca aggagatggc gcccaacagt cccccggcca cggggcctgc 
caccataccc acgccgaaac aagcgctcat gagcccgaag tggcgagccc gatcttcccc atcggtgatg tcggcgatat aggcgccagc aaccgcacct gtggcgccgg tgatgccggc cacgatgcgt ccggcgtaga ggatcgagat ctcgatcccg cgaaattaat acgactcact atagggagac cacaacggtt tccctctaga aataattttg tttaacttta agaaggagat ataccatggc tcatcaccat caccatcata tgggtaccct ggaagctcag acccagggtc ctggttcgat gctgtcggca gaggtcatcg gaggaaggca ggttgcgatg aagacaaaag gcgccctcat ctgggagttc aatcagccgt ggtcggtcga ggagatcgag atcggtgacc ccgtcaagga tgaggtcaag atccggatgg aagcctcggg catgtgccac tccgaccacc atctggtgac cggcggcatc ccgatgggcg ggttcccggt gctcggcggg catgaaggcg ccggggtggt cgaggaggtc ggcccggggg tcgaggaggt gcagccgggc gaccacgtgg tgttgtcgtt catcccgtcg tgcgggcagt gcccgtcgtg tcaggccggg ctgcgcaacc tctgcgatct cggggccggt ctgctcaacg gcgccgcggt gtccgacggg accttccgca tcaccgcccg cgggcagaac gtctacccga tgacgctgct gggcaccttc tcgccgtaca tggtggtgca caagagctcg gtggtgaaga tcgacaagga catcccgttc gaggtggcct gcctggtcgg ctgcggtgtc accaccggct acggctcggc ggtgcgcagc ggcgacgtcc gcccgggcga cgacgtcgcg atcgtcgggg tcggcggcgt cggcatgggt gcgctgcagg gggcggtcaa cgccggggcc cgctacatct tcgcgatcga cccggtggag tggaagcgcg atcaggcgct gaaattcggt gccacccacg tctatccgga catcgagtcg gcgatggcgg gcatcgcgga ggtcacctgg ggtctgatgg ccaagaaggt catcgtcacc gtcggtgagc tcaagggtga ggacatcgac gcttacgtca acctcaccgc caagggcgga acctgcgtgc tgaccgcgat cggcagcctg ctggacaccc aggtcaacct gaatctgtcc atgctgacgc tgctgcagaa gaacctgcag ggcaccatct tcggcggcgg caacccgcat cacgacatcc cgcagctgct gtcgatgtac aaggccggca agctcaacct cgacgacatg gtcacccggc agtacaagct ggagcagatc aacgagggct accaggacat gctcgagggc cgcaacatcc gcggcgtcat ccgctacacc gacgacgacc ggaaacagca cgaacaagtt ctgcagccaa gcttctcgag gatccggctg ctaacaaagc ccgaaaggaa gctgagttgg ctgctgccac cgctgagcaa taactagcat aaccccttgg ggcctctaaa cgggtcttga ggggtttttt gctgaaagga ggaactatat ccggatatcc acaggacggg tgtggtcgcc atgatcgcgt agtcgatagt ggctccaagt agcgaagcga gcaggactgg gcggcggcca aagcggtcgg acagtgctcc gagaacgggt gcgcatagaa attgcatcaa cgcatatagc gctagcagca cgccatagtg 
actggcgatg ctgtcggaat ggacgatatc ccgcaagagg cccggcagta ccggcataac caagcctatg cctacagcat ccagggtgac ggtgccgagg atgacgatga gcgcattgtt agatttcata cacggtgcct gactgcgtta gcaatttaac tgtgataaac taccgcatta aagcttatcg atgataagct gtcaaacatg agaa
Details for MythA.10611.a.AE1.GE43246
HARVESTED ON: 6/12/2019
SEQUENCED ON: 6/18/2019
EXPECTED MW: 42kDa
OBSERVED MW: 42kDa
ANTIBIOTIC MARKER: ampicillin
COUNT OF EXPRESSION COLONIES: Good (10-50)
TOTAL EXPRESSION LEVEL: Low Expression
SOLUBLE EXPRESSION LEVEL: Low Expression
EXPRESSION HOST: BL21(DE3) Rosetta
SEQUENCING RESULT: data unavailable
PERCENT IDENTITY: data unavailable
PERCENT COVERAGE: data unavailable
Validated AA Sequence
sequence unavailable
Validated NT Sequence
sequence unavailable
Expected Protein Sequence
MSAEVIGGRQ VAMKTKGALI WEFNQPWSVE EIEIGDPVKD EVKIRMEASG MCHSDHHLVT GGIPMGGFPV LGGHEGAGVV EEVGPGVEEV QPGDHVVLSF IPSCGQCPSC QAGLRNLCDL GAGLLNGAAV SDGTFRITAR GQNVYPMTLL GTFSPYMVVH KSSVVKIDKD IPFEVACLVG CGVTTGYGSA VRSGDVRPGD DVAIVGVGGV GMGALQGAVN AGARYIFAID PVEWKRDQAL KFGATHVYPD IESAMAGIAE VTWGLMAKKV IVTVGELKGE DIDAYVNLTA KGGTCVLTAI GSLLDTQVNL NLSMLTLLQK NLQGTIFGGG NPHHDIPQLL SMYKAGKLNL DDMVTRQYKL EQINEGYQDM LEGRNIRGVI RYTDDDRGHH HHHH
Full NT Sequence (Expression Vector + Insert)
tggcgaatgg gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc acattaacgc ttacaattta ggtggcactt ttcggggaaa tgtgcgcgga acccctattt gtttattttt ctaaatacat tcaaatatgt atccgctcat gagacaataa ccctgataaa tgcttcaata atttgaaaaa ggaagagtat gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg tattgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc tgcagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc actgattaag cattggtaac tgtcagacca agtttactca tatatacttt agattgattt aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga gcgaacgacc tacaccgaac tgagatacct acagcgtgag 
ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc gcatatatgg tgcactctca gtacaatctg ctctgatgcc gcatagttaa gccagtatac actccgctat cgctacgtga ctgggtcatg gctgcgcccc gacacccgcc aacacccgct gacgcgccct gacgggcttg tctgctcccg gcatccgctt acagacaagc tgtgaccgtc tccgggagct gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc gaggcagctg cggtaaagct catcagcgtg gtcgtgaagc gattcacaga tgtctgcctg ttcatccgcg tccagctcgt tgagtttctc cagaagcgtt aatgtctggc ttctgataaa gcgggccatg ttaagggcgg ttttttcctg tttggtcact gatgcctccg tgtaaggggg atttctgttc atgggggtaa tgataccgat gaaacgagag aggatgctca cgatacgggt tactgatgat gaacatgccc ggttactgga acgttgtgag ggtaaacaac tggcggtatg gatgcggcgg gaccagagaa aaatcactca gggtcaatgc cagcgcttcg ttaatacaga tgtaggtgtt ccacagggta gccagcagca tcctgcgatg cagatccgga acataatggt gcagggcgct gacttccgcg tttccagact ttacgaaaca cggaaaccga agaccattca tgttgttgct caggtcgcag acgttttgca gcagcagtcg cttcacgttc gctcgcgtat cggtgattca ttctgctaac cagtaaggca accccgccag cctagccggg tcctcaacga caggagcacg atcatgcgca cccgtggggc cgccatgccg gcgataatgg cctgcttctc gccgaaacgt ttggtggcgg gaccagtgac gaaggcttga gcgagggcgt gcaagattcc gaataccgca agcgacaggc cgatcatcgt cgcgctccag cgaaagcggt cctcgccgaa aatgacccag agcgctgccg gcacctgtcc tacgagttgc atgataaaga agacagtcat aagtgcggcg acgatagtca tgccccgcgc ccaccggaag gagctgactg ggttgaaggc tctcaagggc atcggtcgag atcccggtgc ctaatgagtg agctaactta cattaattgc gttgcgctca ctgcccgctt tccagtcggg aaacctgtcg tgccagctgc attaatgaat cggccaacgc gcggggagag gcggtttgcg tattgggcgc cagggtggtt tttcttttca ccagtgagac gggcaacagc tgattgccct tcaccgcctg gccctgagag agttgcagca agcggtccac 
gctggtttgc cccagcaggc gaaaatcctg tttgatggtg gttaacggcg ggatataaca tgagctgtct tcggtatcgt cgtatcccac taccgagata tccgcaccaa cgcgcagccc ggactcggta atggcgcgca ttgcgcccag cgccatctga tcgttggcaa ccagcatcgc agtgggaacg atgccctcat tcagcatttg catggtttgt tgaaaaccgg acatggcact ccagtcgcct tcccgttccg ctatcggctg aatttgattg cgagtgagat atttatgcca gccagccaga cgcagacgcg ccgagacaga acttaatggg cccgctaaca gcgcgatttg ctggtgaccc aatgcgacca gatgctccac gcccagtcgc gtaccgtctt catgggagaa aataatactg ttgatgggtg tctggtcaga gacatcaaga aataacgccg gaacattagt gcaggcagct tccacagcaa tggcatcctg gtcatccagc ggatagttaa tgatcagccc actgacgcgt tgcgcgagaa gattgtgcac cgccgcttta caggcttcga cgccgcttcg ttctaccatc gacaccacca cgctggcacc cagttgatcg gcgcgagatt taatcgccgc gacaatttgc gacggcgcgt gcagggccag actggaggtg gcaacgccaa tcagcaacga ctgtttgccc gccagttgtt gtgccacgcg gttgggaatg taattcagct ccgccatcgc cgcttccact ttttcccgcg ttttcgcaga aacgtggctg gcctggttca ccacgcggga aacggtctga taagagacac cggcatactc tgcgacatcg tataacgtta ctggtttcac attcaccacc ctgaattgac tctcttccgg gcgctatcat gccataccgc gaaaggtttt gcgccattcg atggtgtccg ggatctcgac gctctccctt atgcgactcc tgcattagga agcagcccag tagtaggttg aggccgttga gcaccgccgc cgcaaggaat ggtgcatgca aggagatggc gcccaacagt cccccggcca cggggcctgc caccataccc acgccgaaac aagcgctcat gagcccgaag tggcgagccc gatcttcccc atcggtgatg tcggcgatat aggcgccagc aaccgcacct gtggcgccgg tgatgccggc cacgatgcgt ccggcgtaga ggatcgagat cgatctcgat cccgcgaaat taatacgact cactataggg gaattgtgag cggataacaa ttcccctcta gaaataattt tgtttaactt taagaaggag tctctcccct gtcggcagag gtcatcggag gaaggcaggt tgcgatgaag acaaaaggcg ccctcatctg ggagttcaat cagccgtggt cggtcgagga gatcgagatc ggtgaccccg tcaaggatga ggtcaagatc cggatggaag cctcgggcat gtgccactcc gaccaccatc tggtgaccgg cggcatcccg atgggcgggt tcccggtgct cggcgggcat gaaggcgccg gggtggtcga ggaggtcggc ccgggggtcg aggaggtgca gccgggcgac cacgtggtgt tgtcgttcat cccgtcgtgc gggcagtgcc cgtcgtgtca ggccgggctg cgcaacctct gcgatctcgg ggccggtctg ctcaacggcg ccgcggtgtc cgacgggacc ttccgcatca ccgcccgcgg gcagaacgtc 
tacccgatga cgctgctggg caccttctcg ccgtacatgg tggtgcacaa gagctcggtg gtgaagatcg acaaggacat cccgttcgag gtggcctgcc tggtcggctg cggtgtcacc accggctacg gctcggcggt gcgcagcggc gacgtccgcc cgggcgacga cgtcgcgatc gtcggggtcg gcggcgtcgg catgggtgcg ctgcaggggg cggtcaacgc cggggcccgc tacatcttcg cgatcgaccc ggtggagtgg aagcgcgatc aggcgctgaa attcggtgcc acccacgtct atccggacat cgagtcggcg atggcgggca tcgcggaggt cacctggggt ctgatggcca agaaggtcat cgtcaccgtc ggtgagctca agggtgagga catcgacgct tacgtcaacc tcaccgccaa gggcggaacc tgcgtgctga ccgcgatcgg cagcctgctg gacacccagg tcaacctgaa tctgtccatg ctgacgctgc tgcagaagaa cctgcagggc accatcttcg gcggcggcaa cccgcatcac gacatcccgc agctgctgtc gatgtacaag gccggcaagc tcaacctcga cgacatggtc acccggcagt acaagctgga gcagatcaac gagggctacc aggacatgct cgagggccgc aacatccgcg gcgtcatccg ctacaccgac gacgaccggg ggcaccacca tcatcatcat taacggatcc gaattcgagc tccgtcgaca agcttgcggc cgcactcgag caccaccacc accaccactg agatccggct gctaacaaag cccgaaagga agctgagttg gctgctgcca ccgctgagca ataactagca taaccccttg gggcctctaa acgggtcttg aggggttttt tgctgaaagg aggaactata tccggat
Details for MythA.10611.a.A1.PS00775
PURIFICATION DATE: 9/21/2010
CONCENTRATION: 23.67mg/ml
OBSERVED MW: data unavailable
EXPRESSION LEVEL: Moderate Expression
PROTEIN PURIFICATION BUFFER: 20 mM HEPES, pH 7.0, 300 mM NaCl, 5% glycerol and 1 mM TCEP
EXPRESSION HOST: data unavailable
VIAL COUNT (approx.): 3
VIAL VOLUME: data unavailable
PERCENT IDENTITY: 100
PERCENT COVERAGE: 94
Protocol Notes
notes unavailable
Validated AA Sequence
MLSAEVIGGR QVAMKTKGAL IWEFNQPWSV EEIEIGDPVK DEVKIRMEAS GMCHSDHHLV TGGIPMGGFP VLGGHEGAGV VEEVGPGVEE VQPGDHVVLS FIPSCGQCPS CQAGLRNLCD LGAGLLNGAA VSDGTFRITA RGQNVYPMTL LGTFSPYMVV HKSSVVKIDK DIPFEVACLV GCGVTTGYGS AVRSGDVRPG DDVAIVGVGG VGMGALQGAV NAGARYIFAI DPVEWKRDQA LKFGATHVYP DIESAMAGIA EVTWGLMAKK VIVTVGELKG EDIDAYVNLT AKGGTCVLTA IGSLLDTQVN LNLSMLTLLQ KNLQGTIFGG GNPHHDIPQL LSMYKAGKLN LDDMVTRQYK LEQINEGYQD MLEGRNIRGV IRYTDD
Validated NT Sequence
cgtcgtcggt gtagcggatg acgccgcgga tgttgcggcc ctcgagcatg tcctggtagc cctcgttgat ctgctccagc ttgtactgcc gggtgaccat gtcgtcgagg ttgagcttgc cggccttgta catcgacagc agctgcggga tgtcgtgatg cgggttgccg ccgccgaaga tggtgccctg caggttcttc tgcagcagcg tcagcatgga cagattcagg ttgacctggg tgtccagcag gctgccgatc gcggtcagca cgcaggttcc gcccttggcg gtgaggttga cgtaagcgtc gatgtcctca cccttgagct caccgacggt gacgatgacc ttcttggcca tcagacccca ggtgacctcc gcgatgcccg ccatcgccga ctcgatgtcc ggatagacgt gggtggcacc gaatttcagc gcctgatcgc gcttccactc caccgggtcg atcgcgaaga tgtagcgggc cccggcgttg accgccccct gcagcgcacc catgccgacg ccgccgaccc cgacgatcgc gacgtcgtcg cccgggcgga cgtcgccgct gcgcaccgcc gagccgtagc cggtggtgac accgcagccg accaggcagg ccacctcgaa cgggatgtcc ttgtcgatct tcaccaccga gctcttgtgc accaccatgt acggcgagaa ggtgcccagc agcgtcatcg ggtagacgtt ctgcccgcgg gcggtgatgc ggaaggtccc gtcggacacc gcggcgccgt tgagcagacc ggccccgaga tcgcagaggt tgcgcagccc ggcctgacac gacgggcact gcccgcacga cgggatgaac gacaacacca cgtggtcgcc cggctgcacc tcctcgaccc ccgggccgac ctcctcgacc accccggcgc cttcatgccc gccgagcacc gggaacccgc ccatcgggat gccgccggtc accagatggt ggtcggagtg gcacatgccc gaggcttcca tccggatctt gacctcatcc ttgacggggt caccgatctc gatctcctcg accgaccacg gctgattgaa ctcccagatg agggcgcctt ttgtcttcat cgcaacctgc cttcctccga tgacctctgc cgacagcatc gaaccaggac cctgggtctg agcttccagg gtacc
Expressed Protein Sequence
MAHHHHHHMG TLEAQTQGPG SMLSAEVIGG RQVAMKTKGA LIWEFNQPWS VEEIEIGDPV KDEVKIRMEA SGMCHSDHHL VTGGIPMGGF PVLGGHEGAG VVEEVGPGVE EVQPGDHVVL SFIPSCGQCP SCQAGLRNLC DLGAGLLNGA AVSDGTFRIT ARGQNVYPMT LLGTFSPYMV VHKSSVVKID KDIPFEVACL VGCGVTTGYG SAVRSGDVRP GDDVAIVGVG GVGMGALQGA VNAGARYIFA IDPVEWKRDQ ALKFGATHVY PDIESAMAGI AEVTWGLMAK KVIVTVGELK GEDIDAYVNL TAKGGTCVLT AIGSLLDTQV NLNLSMLTLL QKNLQGTIFG GGNPHHDIPQ LLSMYKAGKL NLDDMVTRQY KLEQINEGYQ DMLEGRNIRG VIRYTDDDR
Full NT Sequence (Expression Vector + Insert)
TTCTTGAAGA CGAAAGGGCC TCGTGATACG CCTATTTTTA TAGGTTAATG TCATGATAAT AATGGTTTCT TAGACGTCAG GTGGCACTTT TCGGGGAAAT GTGCGCGGAA CCCCTATTTG TTTATTTTTC TAAATACATT CAAATATGTA TCCGCTCATG AGACAATAAC CCTGATAAAT GCTTCAATAA TATTGAAAAA GGAAGAGTAT GAGTATTCAA CATTTCCGTG TCGCCCTTAT TCCCTTTTTT GCGGCATTTT GCCTTCCTGT TTTTGCTCAC CCAGAAACGC TGGTGAAAGT AAAAGATGCT GAAGATCAGT TGGGTGCACG AGTGGGTTAC ATCGAACTGG ATCTCAACAG CGGTAAGATC CTTGAGAGTT TTCGCCCCGA AGAACGTTTT CCAATGATGA GCACTTTTAA AGTTCTGCTA TGTGGCGCGG TATTATCCCG TGTTGACGCC GGGCAAGAGC AACTCGGTCG CCGCATACAC TATTCTCAGA ATGACTTGGT TGAGTACTCA CCAGTCACAG AAAAGCATCT TACGGATGGC ATGACAGTAA GAGAATTATG CAGTGCTGCC ATAACCATGA GTGATAACAC TGCGGCCAAC TTACTTCTGA CAACGATCGG AGGACCGAAG GAGCTAACCG CTTTTTTGCA CAACATGGGG GATCATGTAA CTCGCCTTGA TCGTTGGGAA CCGGAGCTGA ATGAAGCCAT ACCAAACGAC GAGCGTGACA CCACGATGCC TGCAGCAATG GCAACAACGT TGCGCAAACT ATTAACTGGC GAACTACTTA CTCTAGCTTC CCGGCAACAA TTAATAGACT GGATGGAGGC GGATAAAGTT GCAGGACCAC TTCTGCGCTC GGCCCTTCCG GCTGGCTGGT TTATTGCTGA TAAATCTGGA GCCGGTGAGC GTGGGTCTCG CGGTATCATT GCAGCACTGG GGCCAGATGG TAAGCCCTCC CGTATCGTAG TTATCTACAC GACGGGGAGT CAGGCAACTA TGGATGAACG AAATAGACAG ATCGCTGAGA TAGGTGCCTC ACTGATTAAG CATTGGTAAC TGTCAGACCA AGTTTACTCA TATATACTTT AGATTGATTT AAAACTTCAT TTTTAATTTA AAAGGATCTA GGTGAAGATC CTTTTTGATA ATCTCATGAC CAAAATCCCT TAACGTGAGT TTTCGTTCCA CTGAGCGTCA GACCCCGTAG AAAAGATCAA AGGATCTTCT TGAGATCCTT TTTTTCTGCG CGTAATCTGC TGCTTGCAAA CAAAAAAACC ACCGCTACCA GCGGTGGTTT GTTTGCCGGA TCAAGAGCTA CCAACTCTTT TTCCGAAGGT AACTGGCTTC AGCAGAGCGC AGATACCAAA TACTGTCCTT CTAGTGTAGC CGTAGTTAGG CCACCACTTC AAGAACTCTG TAGCACCGCC TACATACCTC GCTCTGCTAA TCCTGTTACC AGTGGCTGCT GCCAGTGGCG ATAAGTCGTG TCTTACCGGG TTGGACTCAA GACGATAGTT ACCGGATAAG GCGCAGCGGT CGGGCTGAAC GGGGGGTTCG TGCACACAGC CCAGCTTGGA GCGAACGACC TACACCGAAC TGAGATACCT ACAGCGTGAG CTATGAGAAA GCGCCACGCT TCCCGAAGGG AGAAAGGCGG ACAGGTATCC GGTAAGCGGC AGGGTCGGAA CAGGAGAGCG CACGAGGGAG CTTCCAGGGG GAAACGCCTG GTATCTTTAT AGTCCTGTCG GGTTTCGCCA CCTCTGACTT GAGCGTCGAT TTTTGTGATG CTCGTCAGGG 
GGGCGGAGCC TATGGAAAAA CGCCAGCAAC GCGGCCTTTT TACGGTTCCT GGCCTTTTGC TGGCCTTTTG CTCACATGTT CTTTCCTGCG TTATCCCCTG ATTCTGTGGA TAACCGTATT ACCGCCTTTG AGTGAGCTGA TACCGCTCGC CGCAGCCGAA CGACCGAGCG CAGCGAGTCA GTGAGCGAGG AAGCGGAAGA GCGCCTGATG CGGTATTTTC TCCTTACGCA TCTGTGCGGT ATTTCACACC GCATATATGG TGCACTCTCA GTACAATCTG CTCTGATGCC GCATAGTTAA GCCAGTATAC ACTCCGCTAT CGCTACGTGA CTGGGTCATG GCTGCGCCCC GACACCCGCC AACACCCGCT GACGCGCCCT GACGGGCTTG TCTGCTCCCG GCATCCGCTT ACAGACAAGC TGTGACCGTC TCCGGGAGCT GCATGTGTCA GAGGTTTTCA CCGTCATCAC CGAAACGCGC GAGGCAGCTG CGGTAAAGCT CATCAGCGTG GTCGTGAAGC GATTCACAGA TGTCTGCCTG TTCATCCGCG TCCAGCTCGT TGAGTTTCTC CAGAAGCGTT AATGTCTGGC TTCTGATAAA GCGGGCCATG TTAAGGGCGG TTTTTTCCTG TTTGGTCACT GATGCCTCCG TGTAAGGGGG ATTTCTGTTC ATGGGGGTAA TGATACCGAT GAAACGAGAG AGGATGCTCA CGATACGGGT TACTGATGAT GAACATGCCC GGTTACTGGA ACGTTGTGAG GGTAAACAAC TGGCGGTATG GATGCGGCGG GACCAGAGAA AAATCACTCA GGGTCAATGC CAGCGCTTCG TTAATACAGA TGTAGGTGTT CCACAGGGTA GCCAGCAGCA TCCTGCGATG CAGATCCGGA ACATAATGGT GCAGGGCGCT GACTTCCGCG TTTCCAGACT TTACGAAACA CGGAAACCGA AGACCATTCA TGTTGTTGCT CAGGTCGCAG ACGTTTTGCA GCAGCAGTCG CTTCACGTTC GCTCGCGTAT CGGTGATTCA TTCTGCTAAC CAGTAAGGCA ACCCCGCCAG CCTAGCCGGG TCCTCAACGA CAGGAGCACG ATCATGCGCA CCCGTGGCCA GGACCCAACG CTGCCCGAGA TGCGCCGCGT GCGGCTGCTG GAGATGGCGG ACGCGATGGA TATGTTCTGC CAAGGGTTGG TTTGCGCATT CACAGTTCTC CGCAAGAATT GATTGGCTCC AATTCTTGGA GTGGTGAATC CGTTAGCGAG GTGCCGCCGG CTTCCATTCA GGTCGAGGTG GCCCGGCTCC ATGCACCGCG ACGCAACGCG GGGAGGCAGA CAAGGTATAG GGCGGCGCCT ACAATCCATG CCAACCCGTT CCATGTGCTC GCCGAGGCGG CATAAATCGC CGTGACGATC AGCGGTCCAG TGATCGAAGT TAGGCTGGTA AGAGCCGCGA GCGATCCTTG AAGCTGTCCC TGATGGTCGT CATCTACCTG CCTGGACAGC ATGGCCTGCA ACGCGGGCAT CCCGATGCCG CCGGAAGCGA GAAGAATCAT AATGGGGAAG GCCATCCAGC CTCGCGTCGT GAACGCCAGC AAGACGTAGC CCAGCGCGTC GGCCGTAACA ACACCATTTA AATGGAGTGG TTACAAATGG AGTGGTTAAT TAACAACACC ATTTGTCGAC GCTCTCCCTT ATGCGACTCC TGCATTAGGA AGCAGCCCAG TAGTAGGTTG AGGCCGTTGA GCACCGCCGC CGCAAGGAAT GGTGCATGCA AGGAGATGGC GCCCAACAGT CCCCCGGCCA CGGGGCCTGC 
CACCATACCC ACGCCGAAAC AAGCGCTCAT GAGCCCGAAG TGGCGAGCCC GATCTTCCCC ATCGGTGATG TCGGCGATAT AGGCGCCAGC AACCGCACCT GTGGCGCCGG TGATGCCGGC CACGATGCGT CCGGCGTAGA GGATCGAGAT CTCGATCCCG CGAAATTAAT ACGACTCACT ATAGGGAGAC CACAACGGTT TCCCTCTAGA AATAATTTTG TTTAACTTTA AGAAGGAGAT ATACCATGGC TCATCACCAT CACCATCATA TGGGTACCCT GGAAGCTCAG ACCCAGGGTC CTGGTTCGAT GCTGTCGGCA GAGGTCATCG GAGGAAGGCA GGTTGCGATG AAGACAAAAG GCGCCCTCAT CTGGGAGTTC AATCAGCCGT GGTCGGTCGA GGAGATCGAG ATCGGTGACC CCGTCAAGGA TGAGGTCAAG ATCCGGATGG AAGCCTCGGG CATGTGCCAC TCCGACCACC ATCTGGTGAC CGGCGGCATC CCGATGGGCG GGTTCCCGGT GCTCGGCGGG CATGAAGGCG CCGGGGTGGT CGAGGAGGTC GGCCCGGGGG TCGAGGAGGT GCAGCCGGGC GACCACGTGG TGTTGTCGTT CATCCCGTCG TGCGGGCAGT GCCCGTCGTG TCAGGCCGGG CTGCGCAACC TCTGCGATCT CGGGGCCGGT CTGCTCAACG GCGCCGCGGT GTCCGACGGG ACCTTCCGCA TCACCGCCCG CGGGCAGAAC GTCTACCCGA TGACGCTGCT GGGCACCTTC TCGCCGTACA TGGTGGTGCA CAAGAGCTCG GTGGTGAAGA TCGACAAGGA CATCCCGTTC GAGGTGGCCT GCCTGGTCGG CTGCGGTGTC ACCACCGGCT ACGGCTCGGC GGTGCGCAGC GGCGACGTCC GCCCGGGCGA CGACGTCGCG ATCGTCGGGG TCGGCGGCGT CGGCATGGGT GCGCTGCAGG GGGCGGTCAA CGCCGGGGCC CGCTACATCT TCGCGATCGA CCCGGTGGAG TGGAAGCGCG ATCAGGCGCT GAAATTCGGT GCCACCCACG TCTATCCGGA CATCGAGTCG GCGATGGCGG GCATCGCGGA GGTCACCTGG GGTCTGATGG CCAAGAAGGT CATCGTCACC GTCGGTGAGC TCAAGGGTGA GGACATCGAC GCTTACGTCA ACCTCACCGC CAAGGGCGGA ACCTGCGTGC TGACCGCGAT CGGCAGCCTG CTGGACACCC AGGTCAACCT GAATCTGTCC ATGCTGACGC TGCTGCAGAA GAACCTGCAG GGCACCATCT TCGGCGGCGG CAACCCGCAT CACGACATCC CGCAGCTGCT GTCGATGTAC AAGGCCGGCA AGCTCAACCT CGACGACATG GTCACCCGGC AGTACAAGCT GGAGCAGATC AACGAGGGCT ACCAGGACAT GCTCGAGGGC CGCAACATCC GCGGCGTCAT CCGCTACACC GACGACGACC GGAAACAGCA CGAACAAGTT CTGCAGCCAA GCTTCTCGAG GATCCGGCTG CTAACAAAGC CCGAAAGGAA GCTGAGTTGG CTGCTGCCAC CGCTGAGCAA TAACTAGCAT AACCCCTTGG GGCCTCTAAA CGGGTCTTGA GGGGTTTTTT GCTGAAAGGA GGAACTATAT CCGGATATCC ACAGGACGGG TGTGGTCGCC ATGATCGCGT AGTCGATAGT GGCTCCAAGT AGCGAAGCGA GCAGGACTGG GCGGCGGCCA AAGCGGTCGG ACAGTGCTCC GAGAACGGGT GCGCATAGAA ATTGCATCAA CGCATATAGC GCTAGCAGCA CGCCATAGTG 
ACTGGCGATG CTGTCGGAAT GGACGATATC CCGCAAGAGG CCCGGCAGTA CCGGCATAAC CAAGCCTATG CCTACAGCAT CCAGGGTGAC GGTGCCGAGG ATGACGATGA GCGCATTGTT AGATTTCATA CACGGTGCCT GACTGCGTTA GCAATTTAAC TGTGATAAAC TACCGCATTA AAGCTTATCG ATGATAAGCT GTCAAACATG AGAA