MytuD.18813.a

Uncharacterized glycosyl hydrolase Rv3401/MT3509 (EC 3.2.1.-)

CENTER ID: MytuD.18813.a
ORGANISM: Mycobacterium tuberculosis H37Rv
ASSOCIATED DISEASE: Tuberculosis
CURRENT STATUS: expressed
COMMUNITY REQUEST: True
NIH RISK GROUP: 2
SELECT AGENT: False
NIH PRIORITY
pathogens category:
IIIC

Ordering Clones & Proteins

If there are materials available for this target, they will be listed below. Materials can be ordered from SSGCID using the button in the "order material" column. Clicking the button will add the material to a virtual cart. You may order multiple materials at a time at no cost to you, as this contract is funded by NIAID. When you are ready to place your order, click the "Place Order" link which will appear in the top right corner of the page after you place your first item in your cart.

Clones*

CENTER
REFERENCE ID
DOMAIN/REGION
DESCRIPTION
INFO AA
START
AA
STOP
ORDER
MATERIAL
MytuD.18813.a.B1.GE37843 full length 1 786
* SSGCID clones represent un-induced expression constructs which have been verified by sequencing from vector primers. Clones may contain a tag, such as N-term 6xHis. Get sequence information using the button in the "info" column.

External Resources

RESOURCE REFERENCE ID
BV-BRC: fig|83332.12.peg.3798
RefSeq: NP_217918.1
RefSeq: YP_006516886.1
TubercuList: Rv3401
MTB Network Portal: Rv3401
UniProt: P9WN13

Sequences

These sequences are the native gene sequence; sequences of constructs derived from these sequences may differ due to codon optimization or other protocols. To find the specific sequence of any material you may have ordered, click on the "info" button next to the name of that material.
AA Sequence
MITEDAFPVE PWQVRETKLN LNLLAQSESL FALSNGHIGL RGNLDEGEPF GLPGTYLNSF YEIRPLPYAE AGYGYPEAGQ TVVDVTNGKI FRLLVGDEPF DVRYGELISH ERILDLRAGT LTRRAHWRSP AGKQVKVTST RLVSLAHRSV AAIEYVVEAI EEFVRVTVQS ELVTNEDVPE TSADPRVSAI LDRPLQAVEH ERTERGALLM HRTRASALMM AAGMEHEVEV PGRVEITTDA RPDLARTTVI CGLRPGQKLR IVKYLAYGWS SLRSRPALRD QAAGALHGAR YSGWQGLLDA QRAYLDDFWD SADVEVEGDP ECQQAVRFGL FHLLQASARA ERRAIPSKGL TGTGYDGHAF WDTEGFVLPV LTYTAPHAVA DALRWRASTL DLAKERAAEL GLEGAAFPWR TIRGQESSAY WPAGTAAWHI NADIAMAFER YRIVTGDGSL EEECGLAVLI ETARLWLSLG HHDRHGVWHL DGVTGPDEYT AVVRDNVFTN LMAAHNLHTA ADACLRHPEA AEAMGVTTEE MAAWRDAADA ANIPYDEELG VHQQCEGFTT LAEWDFEANT TYPLLLHEAY VRLYPAQVIK QADLVLAMQW QSHAFTPEQK ARNVDYYERR MVRDSSLSAC TQAVMCAEVG HLELAHDYAY EAALIDLRDL HRNTRDGLHM ASLAGAWTAL VVGFGGLRDD EGILSIDPQL PDGISRLRFR LRWRGFRLIV DANHTDVTFI LGDGPGTQLT MRHAGQDLTL HTDTPSTIAV RTRKPLLPPP PQPPGREPVH RRALAR
NT Sequence
atgatcaccg aggacgcctt ccccgtcgaa ccgtggcagg tccgcgagac caagctcaac ctgaacctgc tggcccagtc cgaatcccta ttcgccttgt ccaacgggca cattggatta cgcggcaacc tcgacgaggg cgaacccttc ggactgccgg gcacctacct gaactctttc tacgaaatcc ggccgctgcc gtacgccgag gccggttatg gatatccgga ggccggccag accgttgtcg acgtcaccaa cggcaagatc tttcgcctgt tggtcggcga cgagccgttc gacgtccggt atggcgaatt gatctcccac gaacggatcc tcgacctgcg cgccgggacg ctgacccgcc gcgcgcactg gcgctcaccg gcgggcaagc aagtcaaagt gacgtccacc cggctggtgt cgctggccca ccgcagcgtc gcggcgatcg agtacgtcgt cgaggcaatc gaggaattcg ttcgcgtgac cgtgcagtcc gaactcgtca ccaacgagga cgtaccggag acctcggccg acccgcgggt gtcggccatc ctggacaggc cgctacaggc cgtcgagcac gaacgcaccg agcggggtgc acttctcatg caccgcaccc gagccagcgc gctgatgatg gccgcaggga tggaacacga ggtcgaggtt cccgggcggg tcgagatcac caccgacgcc cgcccggacc tggcccgaac caccgtgatc tgcgggctgc gcccgggaca gaagctgcgc atcgtcaaat acctggccta tggctggtcc agcctgcgct cccgcccggc gctgcgcgac caggccgccg gcgcgctgca cggtgcccgc tacagcggct ggcaggggct gctggacgcg caacgcgcct acctcgacga cttctgggac agcgcggacg tggaggtcga gggcgacccg gaatgtcagc aagcggtgcg tttcgggtta tttcacctgt tgcaggccag cgcgcgcgcc gaacgccgcg cgatccccag caaggggctc accggaaccg ggtatgacgg ccacgccttt tgggacaccg aaggtttcgt gctaccggtg ctcacctaca ccgcaccgca tgcggtcgcc gacgcgctgc ggtggcgggc gtcgacgttg gacctggcca aggagcgggc ggccgagctc ggcctggaag gtgccgcctt tccctggcgg accatccgcg gacaggagtc ctcggcctac tggccggccg gcacggcggc ctggcacatc aacgccgaca tcgcgatggc gttcgagcgg taccgcatcg tcaccggcga cggttcgctg gaggaggaat gcggccttgc ggtgctgatc gagaccgccc ggctgtggct ctcgctcggg caccacgacc gccacggcgt ctggcacctc gacggggtca ccggtcccga cgagtacacg gcggtcgtcc gcgacaacgt gttcacgaat ctgatggcgg cgcacaatct gcacaccgcc gccgatgctt gcttgcgcca ccccgaggcg gcggaggcca tgggtgtcac caccgaggag atggccgcct ggcgcgacgc ggccgacgcc gccaacattc cctacgacga ggaactcggt gtccaccagc agtgtgaagg gttcaccacc cttgcggagt gggatttcga agccaacacc acttatccgt tgctactgca cgaggcctac gtgcgcttgt atcccgcaca ggtgatcaag caggccgacc tggtgctggc gatgcagtgg cagagtcacg cgttcacgcc cgagcagaag gcgcgcaacg tcgactacta cgaacggcgc atggtgcgcg actcgtcgtt gtcggcctgc actcaggcgg tgatgtgcgc cgaggtcggc catctcgagt tggcccacga ctatgcctac gaagccgccc tgatcgacct gcgcgacctg caccgcaaca cccgtgacgg cctacacatg gcttcgctgg ccggagcctg gacggcgctg gtcgtaggct tcggcggcct acgcgacgac gagggcatcc tgtccatcga tccgcagctg cccgacggca tctcgcggct gcggttccgg ctgcgatggc gcggcttccg gctgatcgtc gacgccaacc acaccgacgt caccttcatc cttggcgacg gtcccggcac ccagctgacc atgcgccacg ccggccaaga tctgacgctg cacacggaca caccgtccac catcgccgtg cgcacccgta agccgctgct gccgccacca ccgcagccgc caggccgcga gccagtgcac cgccgggctt tagcccggtg a
Details for MytuD.18813.a.B1.GE37843
HARVESTED ON: 3/6/2014
SEQUENCED ON: 3/13/2014
EXPECTED MW: 88kDa
OBSERVED MW: 88kDa
ANTIBIOTIC MARKER: ampicillin
COUNT OF EXPRESSION COLONIES: Too many to count (100+)
TOTAL EXPRESSION LEVEL: Moderate Expression
SOLUBLE EXPRESSION LEVEL Insoluble
EXPRESSION HOST: data unavailable
SEQUENCING RESULT: pass with incomplete coverage
PERCENT IDENTITY: 85
PERCENT COVERAGE: 48
Validated AA Sequence
MAHHHHHHMI TEDAFPVEPW QVRETKLNLN LLAQSESLFA LSNGHIGLRG NLDEGEPFGL PGTYLNSFYE IRPLPYAEAG YGYPEAGQTV VDVTNGKIFR LLVGDEPFDV RYGELISHER ILDLRAGTLT RRAHWRSPAG KQVKVTSTRL VSLAHRSVAA IEYVVEAIEE FVRVTVQSEL VTNEDVPETS ADPRVSAILD RPLQAVEHER TERGALLMHR TRASALMMAA GMEHEVEVPG RVEITTDARP DLARTTVICG LRPGQKLRIV KYLAYGWSSL RSRPALRDQA AGALHGARYS GWQGCWTRNA PTSTTSGTAR TWRSRATXNV SSGAFRVISP XQASARADAX XIPARGSPXX XXXRLXXXXX LRXSLXXXHA XPTPPAXX
Validated NT Sequence
cnctanannn ttttgtttac tttaagaagg agatatacca tggctcacca ccaccaccac catatgatca ccgaggacgc cttccccgtc gaaccgtggc aggtccgcga gaccaagctc aacctgaacc tgctggccca gtccgaatcc ctattcgcct tgtccaacgg gcacattgga ttacgcggca acctcgacga gggcgaaccc ttcggactgc cgggcaccta cctgaactct ttctacgaaa tccggccgct gccgtacgcc gaggccggtt atggatatcc ggaggccggc cagaccgttg tcgacgtcac caacggcaag atctttcgcc tgttggtcgg cgacgagccg ttcgacgtcc ggtatggcga attgatctcc cacgaacgga tcctcgacct gcgcgccggg acgctgaccc gccgcgcgca ctggcgctca ccggcgggca agcaagtcaa agtgacgtcc acccggctgg tgtcgctggc ccaccgcagc gtcgcggcga tcgagtacgt cgtcgaggca atcgaggaat tcgttcgcgt gaccgtgcag tccgaactcg tcaccaacga ggacgtaccg gagacctcgg ccgacccgcg ggtgtcggcc atcctggaca ggccgctaca ggccgtcgag cacgaacgca ccgagcgggg tgcacttctc atgcaccgca cccgagccag cgcgctgatg atggccgcag ggatggaaca cgaggtcgag gttcccgggc gggtcgagat caccaccgac gcccgcccgg acctggcccg aaccaccgtg atctgcgggc tgcgcccggg acagaagctg cgcatcgtca aatacctggc ctatggctgg tccagcctgc gctcccgccc ggcgctgcgc gaccaggccg ccggcgcgct gcacggtgcc cgctacagcg gctggcaggg ctgctggacg cgcaacgcgc ctacctcgac gacttctggg acagcgcgga cgtggaggtc gagggcgacc cngaatgtca gcagcggtgc gtttcgggtt atttcacctn ngcaggccag cgcgcgcgcc gacgccnncn cgatcccagc aaggggctca ccggnnnnnn tgangncacg ccttttngan nnnannnnnc tncggnnntc actnnccnna ncncatgcgn cgccnacccc ccctgccnnn nnnng
Expected Protein Sequence
MAHHHHHHMI TEDAFPVEPW QVRETKLNLN LLAQSESLFA LSNGHIGLRG NLDEGEPFGL PGTYLNSFYE IRPLPYAEAG YGYPEAGQTV VDVTNGKIFR LLVGDEPFDV RYGELISHER ILDLRAGTLT RRAHWRSPAG KQVKVTSTRL VSLAHRSVAA IEYVVEAIEE FVRVTVQSEL VTNEDVPETS ADPRVSAILD RPLQAVEHER TERGALLMHR TRASALMMAA GMEHEVEVPG RVEITTDARP DLARTTVICG LRPGQKLRIV KYLAYGWSSL RSRPALRDQA AGALHGARYS GWQGLLDAQR AYLDDFWDSA DVEVEGDPEC QQAVRFGLFH LLQASARAER RAIPSKGLTG TGYDGHAFWD TEGFVLPVLT YTAPHAVADA LRWRASTLDL AKERAAELGL EGAAFPWRTI RGQESSAYWP AGTAAWHINA DIAMAFERYR IVTGDGSLEE ECGLAVLIET ARLWLSLGHH DRHGVWHLDG VTGPDEYTAV VRDNVFTNLM AAHNLHTAAD ACLRHPEAAE AMGVTTEEMA AWRDAADAAN IPYDEELGVH QQCEGFTTLA EWDFEANTTY PLLLHEAYVR LYPAQVIKQA DLVLAMQWQS HAFTPEQKAR NVDYYERRMV RDSSLSACTQ AVMCAEVGHL ELAHDYAYEA ALIDLRDLHR NTRDGLHMAS LAGAWTALVV GFGGLRDDEG ILSIDPQLPD GISRLRFRLR WRGFRLIVDA NHTDVTFILG DGPGTQLTMR HAGQDLTLHT DTPSTIAVRT RKPLLPPPPQ PPGREPVHRR ALAR
Full NT Sequence (Expression Vector + Insert)
taatacgact cactataggg agaccacaac ggtttccctc tagaaataat tttgtttaac tttaagaagg agatatacca tggctcacca ccaccaccac catatgatca ccgaggacgc cttccccgtc gaaccgtggc aggtccgcga gaccaagctc aacctgaacc tgctggccca gtccgaatcc ctattcgcct tgtccaacgg gcacattgga ttacgcggca acctcgacga gggcgaaccc ttcggactgc cgggcaccta cctgaactct ttctacgaaa tccggccgct gccgtacgcc gaggccggtt atggatatcc ggaggccggc cagaccgttg tcgacgtcac caacggcaag atctttcgcc tgttggtcgg cgacgagccg ttcgacgtcc ggtatggcga attgatctcc cacgaacgga tcctcgacct gcgcgccggg acgctgaccc gccgcgcgca ctggcgctca ccggcgggca agcaagtcaa agtgacgtcc acccggctgg tgtcgctggc ccaccgcagc gtcgcggcga tcgagtacgt cgtcgaggca atcgaggaat tcgttcgcgt gaccgtgcag tccgaactcg tcaccaacga ggacgtaccg gagacctcgg ccgacccgcg ggtgtcggcc atcctggaca ggccgctaca ggccgtcgag cacgaacgca ccgagcgggg tgcacttctc atgcaccgca cccgagccag cgcgctgatg atggccgcag ggatggaaca cgaggtcgag gttcccgggc gggtcgagat caccaccgac gcccgcccgg acctggcccg aaccaccgtg atctgcgggc tgcgcccggg acagaagctg cgcatcgtca aatacctggc ctatggctgg tccagcctgc gctcccgccc ggcgctgcgc gaccaggccg ccggcgcgct gcacggtgcc cgctacagcg gctggcaggg gctgctggac gcgcaacgcg cctacctcga cgacttctgg gacagcgcgg acgtggaggt cgagggcgac ccggaatgtc agcaagcggt gcgtttcggg ttatttcacc tgttgcaggc cagcgcgcgc gccgaacgcc gcgcgatccc cagcaagggg ctcaccggaa ccgggtatga cggccacgcc ttttgggaca ccgaaggttt cgtgctaccg gtgctcacct acaccgcacc gcatgcggtc gccgacgcgc tgcggtggcg ggcgtcgacg ttggacctgg ccaaggagcg ggcggccgag ctcggcctgg aaggtgccgc ctttccctgg cggaccatcc gcggacagga gtcctcggcc tactggccgg ccggcacggc ggcctggcac atcaacgccg acatcgcgat ggcgttcgag cggtaccgca tcgtcaccgg cgacggttcg ctggaggagg aatgcggcct tgcggtgctg atcgagaccg cccggctgtg gctctcgctc gggcaccacg accgccacgg cgtctggcac ctcgacgggg tcaccggtcc cgacgagtac acggcggtcg tccgcgacaa cgtgttcacg aatctgatgg cggcgcacaa tctgcacacc gccgccgatg cttgcttgcg ccaccccgag gcggcggagg ccatgggtgt caccaccgag gagatggccg cctggcgcga cgcggccgac gccgccaaca ttccctacga cgaggaactc ggtgtccacc agcagtgtga agggttcacc acccttgcgg agtgggattt cgaagccaac accacttatc cgttgctact gcacgaggcc tacgtgcgct tgtatcccgc acaggtgatc aagcaggccg acctggtgct ggcgatgcag tggcagagtc acgcgttcac gcccgagcag aaggcgcgca acgtcgacta ctacgaacgg cgcatggtgc gcgactcgtc gttgtcggcc tgcactcagg cggtgatgtg cgccgaggtc ggccatctcg agttggccca cgactatgcc tacgaagccg ccctgatcga cctgcgcgac ctgcaccgca acacccgtga cggcctacac atggcttcgc tggccggagc ctggacggcg ctggtcgtag gcttcggcgg cctacgcgac gacgagggca tcctgtccat cgatccgcag ctgcccgacg gcatctcgcg gctgcggttc cggctgcgat ggcgcggctt ccggctgatc gtcgacgcca accacaccga cgtcaccttc atccttggcg acggtcccgg cacccagctg accatgcgcc acgccggcca agatctgacg ctgcacacgg acacaccgtc caccatcgcc gtgcgcaccc gtaagccgct gctgccgcca ccaccgcagc cgccaggccg cgagccagtg caccgccggg ctttagcccg gtgagtaaga taggatccgg ctgctaacaa agcccgaaag gaagctgagt tggctgctgc caccgctgag caataactag cataacccct tggggcctct aaacgggtct tgaggggttt tttgctgaaa ggaggaacta tatccggata tccacaggac gggtgtggtc gccatgatcg cgtagtcgat agtggctcca agtagcgaag cgagcaggac tgggcggcgg ccaaagcggt cggacagtgc tccgagaacg ggtgcgcata gaaattgcat caacgcatat agcgctagca gcacgccata gtgactggcg atgctgtcgg aatggacgat atcccgcaag aggcccggca gtaccggcat aaccaagcct atgcctacag catccagggt gacggtgccg aggatgacga tgagcgcatt gttagatttc atacacggtg cctgactgcg ttagcaattt aactgtgata aactaccgca ttaaagctta tcgatgataa gctgtcaaac atgagaattc ttgaagacga aagggcctcg tgatacgcct atttttatag gttaatgtca tgataataat ggtttcttag acgtcaggtg gcacttttcg gggaaatgtg cgcggaaccc ctatttgttt atttttctaa atacattcaa atatgtatcc gctcatgaga caataaccct gataaatgct tcaataatat tgaaaaagga agagtatgag tattcaacat ttccgtgtcg cccttattcc cttttttgcg gcattttgcc ttcctgtttt tgctcaccca gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg gtgcacgagt gggttacatc gaactggatc tcaacagcgg taagatcctt gagagttttc gccccgaaga acgttttcca atgatgagca cttttaaagt tctgctatgt ggcgcggtat tatcccgtgt tgacgccggg caagagcaac tcggtcgccg catacactat tctcagaatg acttggttga gtactcacca gtcacagaaa agcatcttac ggatggcatg acagtaagag aattatgcag tgctgccata accatgagtg ataacactgc ggccaactta cttctgacaa cgatcggagg accgaaggag ctaaccgctt ttttgcacaa catgggggat catgtaactc gccttgatcg ttgggaaccg gagctgaatg aagccatacc aaacgacgag cgtgacacca cgatgcctgc agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc acatgttctt tcctgcgtta tcccctgatt ctgtggataa ccgtattacc gcctttgagt gagctgatac cgctcgccgc agccgaacga ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg cctgatgcgg tattttctcc ttacgcatct gtgcggtatt tcacaccgca tatatggtgc actctcagta caatctgctc tgatgccgca tagttaagcc agtatacact ccgctatcgc tacgtgactg ggtcatggct gcgccccgac acccgccaac acccgctgac gcgccctgac gggcttgtct gctcccggca tccgcttaca gacaagctgt gaccgtctcc gggagctgca tgtgtcagag gttttcaccg tcatcaccga aacgcgcgag gcagctgcgg taaagctcat cagcgtggtc gtgaagcgat tcacagatgt ctgcctgttc atccgcgtcc agctcgttga gtttctccag aagcgttaat gtctggcttc tgataaagcg ggccatgtta agggcggttt tttcctgttt ggtcactgat gcctccgtgt aagggggatt tctgttcatg ggggtaatga taccgatgaa acgagagagg atgctcacga tacgggttac tgatgatgaa catgcccggt tactggaacg ttgtgagggt aaacaactgg cggtatggat gcggcgggac cagagaaaaa tcactcaggg tcaatgccag cgcttcgtta atacagatgt aggtgttcca cagggtagcc agcagcatcc tgcgatgcag atccggaaca taatggtgca gggcgctgac ttccgcgttt ccagacttta cgaaacacgg aaaccgaaga ccattcatgt tgttgctcag gtcgcagacg ttttgcagca gcagtcgctt cacgttcgct cgcgtatcgg tgattcattc tgctaaccag taaggcaacc ccgccagcct agccgggtcc tcaacgacag gagcacgatc atgcgcaccc gtggccagga cccaacgctg cccgagatgc gccgcgtgcg gctgctggag atggcggacg cgatggatat gttctgccaa gggttggttt gcgcattcac agttctccgc aagaattgat tggctccaat tcttggagtg gtgaatccgt tagcgaggtg ccgccggctt ccattcaggt cgaggtggcc cggctccatg caccgcgacg caacgcgggg aggcagacaa ggtatagggc ggcgcctaca atccatgcca acccgttcca tgtgctcgcc gaggcggcat aaatcgccgt gacgatcagc ggtccagtga tcgaagttag gctggtaaga gccgcgagcg atccttgaag ctgtccctga tggtcgtcat ctacctgcct ggacagcatg gcctgcaacg cgggcatccc gatgccgccg gaagcgagaa gaatcataat ggggaaggcc atccagcctc gcgtcgcgaa cgccagcaag acgtagccca gcgcgtcggc cgccatgccg gcgataatgg cctgcttctc gccgaaacgt ttggtggcgg gaccagtgac gaaggcttga gcgagggcgt gcaagattcc gaataccgca agcgacaggc cgatcatcgt cgcgctccag cgaaagcggt cctcgccgaa aatgacccag agcgctgccg gcacctgtcc tacgagttgc atgataaaga agacagtcat aagtgcggcg acgatagtca tgccccgcgc ccaccggaag gagctgactg ggttgaaggc tctcaagggc atcggtcgac gctctccctt atgcgactcc tgcattagga agcagcccag tagtaggttg aggccgttga gcaccgccgc cgcaaggaat ggtgcatgca aggagatggc gcccaacagt cccccggcca cggggcctgc caccataccc acgccgaaac aagcgctcat gagcccgaag tggcgagccc gatcttcccc atcggtgatg tcggcgatat aggcgccagc aaccgcacct gtggcgccgg tgatgccggc cacgatgcgt ccggcgtaga ggatcgagat ctcgatcccg cgaaat