BrsuA.18178.a

Lon protease (EC 3.4.21.53) (ATP-dependent protease La)

CENTER ID: BrsuA.18178.a
ORGANISM: Brucella suis 1330
ASSOCIATED DISEASE: Brucellosis
CURRENT STATUS: soluble
COMMUNITY REQUEST: True
NIH RISK GROUP: 3
SELECT AGENT: True
NIH PRIORITY PATHOGENS CATEGORY: IIIB

Ordering Clones & Proteins

Any materials available for this target are listed below. Materials can be ordered from SSGCID using the button in the "order material" column; clicking the button adds the material to a virtual cart. You may order multiple materials at a time at no cost to you, as this contract is funded by NIAID. When you are ready to place your order, click the "Place Order" link, which appears in the top right corner of the page after you add your first item to the cart.

Clones*

CENTER REFERENCE ID | DOMAIN/REGION DESCRIPTION | INFO | AA START | AA STOP | ORDER MATERIAL
BrsuA.18178.a.A1.GE33969 | full length | | 1 | 812 |
* SSGCID clones represent un-induced expression constructs which have been verified by sequencing from vector primers. Clones may contain a tag, such as N-term 6xHis. Get sequence information using the button in the "info" column.

External Resources

RESOURCE REFERENCE ID
BV-BRC: fig|204722.5.peg.1127
RefSeq: NP_698111.1
UniProt: Q8G0I7

Sequences

The sequences below are the native gene sequences; sequences of constructs derived from them may differ due to codon optimization or other protocols. To find the specific sequence of any material you may have ordered, click the "info" button next to the name of that material.
AA Sequence
MTGIEQKTPV GGSETGGADG LYAVLPLRDI VVFPHMIVPL FVGREKSIRA LEEVMGVDKQ ILLATQKNAA DDDPAPDAIY EIGTIANVLQ LLKLPDGTVK VLVEGTARAK ISKFTDREDY HEAYAAALQE PEEDAVEIEA LARSVVSDFE NYVKLNKKIS PEVVGAASQI DDYSKLADTV ASHLAIKIPE KQEMLSVLSV RERLEKALSF MEAEISVLQV EKRIRSRVKR QMEKTQREYY LNEQMKAIQK ELGDSEDGRD EVAEIEERIT KTKLSKEARE KALAELKKLR SMSPMSAEAT VVRNYLDWLL SIPWGKKSKV KQDLNFAQEV LDAEHFGLGK VKERIVGYLA VQARSTKIKG PILCLVGPPG VGKTSLARSI AKATGREYVR MSLGGVRDEA EIRGHRRTYI GSMPGKVIQS MKKAKKSNPL FLLDEIDKMG QDFRGDPSSA MLEVLDPEQN ATFMDHYLEV EYDLSNVMFV TTANTMNIPG PLLDRMEIIR IAGYTEDEKL EIAKRHLLPK AIKDHALQPK EFSVTEDALR NVIRHYTREA GVRSLEREVM TLARKAVTEI LKTKKKSVKI TDKNLSDYLG VEKFRFGQID GEDQVGVVTG LAWTEVGGEL LTIEGVMMPG KGRMTVTGNL RDVMKESISA AASYVRSRAI DFGIEPPLFD KRDIHVHVPE GATPKDGPSA GIAMVTAIVS VLTGIPVRKD IAMTGEVTLR GRVLPIGGLK EKLLAALRGG IKKVLIPEEN AKDLAEIPDN VKNNLEIVPV SRVGEVLKHT LVRQPEPIEW TEQENPTAVP PVEDEAGASL AH
NT Sequence
atgacgggta ttgaacagaa aacaccggtt ggtggttctg aaacgggtgg tgcggacggg ctctatgctg tcttgccgct gcgtgacatc gtcgtctttc cccatatgat cgtcccgctt tttgtgggcc gcgaaaaatc cattcgcgcg cttgaagaag tgatgggcgt tgacaagcag atattgcttg ccacccagaa gaacgctgcc gatgacgatc cggcgccgga cgcgatctat gagattggta cgatcgccaa tgtgttgcag cttctgaaac tgccggacgg caccgtcaag gtgctggtcg aaggcacggc acgcgccaaa atttccaagt ttaccgatcg tgaagattat cacgaggctt atgctgcggc tctgcaggag ccggaggaag atgctgtcga gatcgaggca ctggcccgct cggtggtttc cgactttgaa aattacgtca agctgaacaa gaagatttcg ccggaagtgg ttggcgcggc cagccagatc gacgattatt ccaagcttgc cgatacggtt gcctcgcacc ttgcgatcaa aatccctgaa aagcaggaaa tgctgtcggt tctttcggtg cgcgagcgcc ttgagaaagc cctttccttc atggaagccg aaatttctgt tctacaggtt gagaagcgta ttcgcagccg cgtcaagcgc cagatggaga agacgcagcg cgaatattat ctcaatgagc agatgaaggc gatccagaag gagcttggcg acagtgagga cggccgtgat gaagtggccg aaatcgaaga gcgcatcacc aagaccaagc tcagcaagga agcgcgcgaa aaagctctgg ccgagctgaa gaaactgcgc agcatgagcc cgatgtccgc tgaagcgacg gtggttcgca attatctcga ctggttgctc tccattccat ggggcaagaa gtcgaaggtg aagcaggatc tgaactttgc gcaggaagtg ctggatgcgg agcatttcgg ccttggcaag gtcaaggaac gcatcgtcgg atatctggcg gtgcaggccc gttcgaccaa gatcaagggt ccgatcctct gcctcgttgg ccctcccggc gtcggcaaga cctcgcttgc gcgctcgatt gccaaggcaa cgggccgcga atatgtccgc atgtcgcttg gcggcgtacg cgacgaggct gaaatccgcg gtcatcgccg cacctatatc ggctcgatgc ccggcaaggt catccagtcg atgaagaagg cgaagaagtc caatccgctt ttcctgctcg atgaaatcga caagatgggc caggatttcc gcggcgatcc gtcttcggcc atgctggagg tgctggaccc ggaacagaac gcgaccttca tggatcacta ccttgaagtt gagtatgatc tatcgaacgt catgttcgtg acgaccgcca atacgatgaa tattcccggt ccacttctgg atcgtatgga gatcatccgt atcgccggtt acacggaaga cgaaaagctg gagatcgcca agcggcacct gttgccgaag gccatcaagg accatgccct gcaaccgaag gagttttcgg ttacggaaga tgcgctgcgc aacgttatcc gccattatac gcgggaagcg ggcgtgcgta gccttgaacg cgaggtgatg acccttgcgc gcaaggccgt gacggaaatc ctgaagacga agaagaagtc ggtaaagatt accgacaaga acctctccga ttaccttggt gtggagaagt tccgcttcgg tcagatcgac ggtgaagatc 
aggtgggtgt cgtgactggc cttgcctgga ccgaagtcgg cggtgagctt ttgaccatcg aaggcgtcat gatgccgggt aagggccgca tgacggttac gggtaatctc cgtgacgtga tgaaggaatc gatttcggcc gcggcatcct atgtccgctc gcgcgcgatc gatttcggca tcgagcctcc gctgttcgac aagcgcgata tccacgtgca cgtgccggaa ggcgcgacgc cgaaggatgg tccttctgcc ggtatcgcca tggttacggc catcgtctcc gtgctgacgg gtattcccgt tcgcaaggac atcgccatga cgggtgaagt cacgttgcgc ggtcgggttc tgccaatcgg cgggttgaag gaaaagctgc ttgcggcctt gcgcggcggt atcaagaagg ttctgatccc ggaagagaac gccaaggatc tggcggaaat cccggacaat gtgaagaaca atcttgagat cgttccggta tcccgcgtcg gtgaagtgct gaagcacacg ctcgtgcgcc agcctgaacc gattgaatgg accgagcagg agaatcccac tgccgtgcct ccggtggagg atgaagcagg ggcttcgctg gcccattaa
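The AA and NT sequences above are printed in space-separated groups of ten. The following is a minimal sketch, not part of the SSGCID tooling, of how one might strip that grouping and translate the native NT sequence to check it against the AA sequence. It assumes the standard genetic code; the function names `clean_sequence` and `translate` are illustrative, and codons containing an ambiguous base are reported as `X`.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(grouped: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(grouped.split()).upper()


def translate(nt: str) -> str:
    """Translate an NT sequence in frame 1, stopping at the first stop codon.

    Codons with an ambiguous base (for example 'n') become 'X'.
    """
    nt = clean_sequence(nt)
    protein = []
    for pos in range(0, len(nt) - 2, 3):
        amino_acid = CODON_TABLE.get(nt[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three groups of the native NT sequence listed above.
native_nt_prefix = "atgacgggta ttgaacagaa aacaccggtt"
print(translate(native_nt_prefix))  # MTGIEQKTPV
```

The printed residues match the first group of the AA sequence; pasting the full NT sequence into `translate` would allow the same comparison over the whole gene.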
Details for BrsuA.18178.a.A1.GE33969
HARVESTED ON: 11/21/2011
SEQUENCED ON: 11/29/2011
EXPECTED MW: 92 kDa
OBSERVED MW: 92 kDa
ANTIBIOTIC MARKER: ampicillin
COUNT OF EXPRESSION COLONIES: Too many to count (100+)
TOTAL EXPRESSION LEVEL: Low Expression
SOLUBLE EXPRESSION LEVEL: Low Expression
EXPRESSION HOST: data unavailable
SEQUENCING RESULT: pass with incomplete coverage
PERCENT IDENTITY: 77
PERCENT COVERAGE: 43
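This page does not say how PERCENT IDENTITY and PERCENT COVERAGE are calculated. The sketch below shows one plausible definition, offered as an assumption: identity is the share of read positions that match the expected protein sequence (with `X` counted as a non-match), and coverage is the read length relative to the expected length. The function name `identity_and_coverage` is illustrative. A hand count on this page's sequences (about 277 matches over 361 read positions, against an 833-residue expected sequence) lands close to the reported 77 and 43, but the actual SSGCID pipeline may define these values differently.

```python
def identity_and_coverage(validated: str, expected: str, unknown: str = "X"):
    """Return (percent identity, percent coverage), both rounded to integers.

    Assumed definitions (not confirmed by the source):
      identity = matching positions / length of the validated read
      coverage = length of the validated read / length of the expected sequence
    Sequences are compared position by position from the N-terminus, no gaps.
    """
    validated = "".join(validated.split()).upper()
    expected = "".join(expected.split()).upper()
    if not validated or not expected:
        raise ValueError("both sequences must be non-empty")

    matches = sum(
        1
        for read_residue, expected_residue in zip(validated, expected)
        if read_residue == expected_residue and read_residue != unknown
    )
    identity = 100.0 * matches / len(validated)
    coverage = 100.0 * len(validated) / len(expected)
    return round(identity), round(coverage)


# Toy example: 3 of 5 read positions match, and the read spans half the target.
print(identity_and_coverage("ABXDZ", "ABCDEFGHIJ"))  # (60, 50)
```

To apply it to this clone, pass the Validated AA Sequence and the Expected Protein Sequence as the two arguments; the display spacing is removed automatically.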
Validated AA Sequence
MAHHHHHHMG TLEAQTQGPG SMTGIEQKTP VGGSETGGAD GLYAVLPLRD IVVFPHMIVP LFVGREKSIR ALEEVMGVDK QILLATQKNA ADDDPAPDAI YEIGTIANVL QLLKLPDGTV KVLVEGTARA KISKFTDRED YHEAYAAALQ EPEEDAVEIE ALARSVVSDF ENYVKLNKKI SPEVVGAASQ IDDYSKLADT VASHLAIXIP EKQEMLSVLS VRERLEKALS FMEAEISVLQ VEKRIRSRVK RQMEKTQREY XLNEQMXAIQ XXLGXSEXGR XEXAEIXXXS XTXSXGXXEX XXXXXXXXXX AXXXXXXXXX XXXXXXXXXX GXXSXXXXXX XXXXXXXXXX XXXXXXXXXX X
Validated NT Sequence
annnattttn tttactttaa gaaggagana taccatggct catcaccatc accatcatat gggtaccctg gaagctcaga cccagggtcc tggttcgatg acgggtattg aacagaaaac accggttggt ggttctgaaa cgggtggtgc ggacgggctc tatgctgtct tgccgctgcg tgacatcgtc gtctttcccc atatgatcgt cccgcttttt gtgggccgcg aaaaatccat tcgcgcgctt gaagaagtga tgggcgttga caagcagata ttgcttgcca cccagaagaa cgctgccgat gacgatccgg cgccggacgc gatctatgag attggtacga tcgccaatgt gttgcagctt ctgaaactgc cggacggcac cgtcaaggtg ctggtcgaag gcacggcacg cgccaaaatt tccaagttta ccgatcgtga agattatcac gaggcttatg ctgcggctct gcaggagccg gaggaagatg ctgtcgagat cgaggcactg gcccgctcgg tggtttccga ctttgaaaat tacgtcaagc tgaacaagaa gatttcgccg gaagtggttg gcgcggccag ccagatcgac gattattcca agcttgccga tacggttgcc tcgcaccttg cgatcnaaat ccctgaaaag caggaaatgc tgtcggttct ttcggtgcgc gagcgccttg agaaagccct ttccttcatg gaagccgaaa tttctgttct acaggttgag aagcgtattc gcagccgcgt caagcgccag atggagaaga cncagcgcga atattntctc aatgagcaga tgaangcgat ccagaannan cttggnnaca gtgaggangg ccgtgangaa ntngccgaaa tcganagnnc atcncnnacc anntcannag gannncnnga aaancnctng cnanctgnnn acnnnnnanc angagcccna nnncnnngnn nngannnnnc nnannannnn nnncngnnnn nnnnncnnnn cntggggnan nnatcnnann nnnnnnnnnn nnngnanttn nnnnnnannt nnncnnnnnn nnnnnnnnng nnngannnnn nnnnnncnnn nnnnnnng
Expected Protein Sequence
MAHHHHHHMG TLEAQTQGPG SMTGIEQKTP VGGSETGGAD GLYAVLPLRD IVVFPHMIVP LFVGREKSIR ALEEVMGVDK QILLATQKNA ADDDPAPDAI YEIGTIANVL QLLKLPDGTV KVLVEGTARA KISKFTDRED YHEAYAAALQ EPEEDAVEIE ALARSVVSDF ENYVKLNKKI SPEVVGAASQ IDDYSKLADT VASHLAIKIP EKQEMLSVLS VRERLEKALS FMEAEISVLQ VEKRIRSRVK RQMEKTQREY YLNEQMKAIQ KELGDSEDGR DEVAEIEERI TKTKLSKEAR EKALAELKKL RSMSPMSAEA TVVRNYLDWL LSIPWGKKSK VKQDLNFAQE VLDAEHFGLG KVKERIVGYL AVQARSTKIK GPILCLVGPP GVGKTSLARS IAKATGREYV RMSLGGVRDE AEIRGHRRTY IGSMPGKVIQ SMKKAKKSNP LFLLDEIDKM GQDFRGDPSS AMLEVLDPEQ NATFMDHYLE VEYDLSNVMF VTTANTMNIP GPLLDRMEII RIAGYTEDEK LEIAKRHLLP KAIKDHALQP KEFSVTEDAL RNVIRHYTRE AGVRSLEREV MTLARKAVTE ILKTKKKSVK ITDKNLSDYL GVEKFRFGQI DGEDQVGVVT GLAWTEVGGE LLTIEGVMMP GKGRMTVTGN LRDVMKESIS AAASYVRSRA IDFGIEPPLF DKRDIHVHVP EGATPKDGPS AGIAMVTAIV SVLTGIPVRK DIAMTGEVTL RGRVLPIGGL KEKLLAALRG GIKKVLIPEE NAKDLAEIPD NVKNNLEIVP VSRVGEVLKH TLVRQPEPIE WTEQENPTAV PPVEDEAGAS LAH
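The Expected Protein Sequence is the native AA sequence with extra residues at the N-terminus, including a 6xHis run. The sketch below, with illustrative function names, recovers that prefix by comparing the two sequences and then estimates the construct's molecular weight. The weight uses a rule-of-thumb average of 110 Da per residue, which is an assumption and not how SSGCID derived the EXPECTED MW value; the native length of 812 is taken from the AA START/STOP values in the clone table.

```python
AVG_RESIDUE_MASS_DA = 110.0  # assumption: rough average mass of one residue


def split_tag(expected: str, native: str) -> str:
    """Return the N-terminal residues present in `expected` but not in `native`."""
    expected = "".join(expected.split()).upper()
    native = "".join(native.split()).upper()
    if not expected.endswith(native):
        raise ValueError("expected sequence does not end with the native sequence")
    return expected[: len(expected) - len(native)]


def estimate_mw_kda(n_residues: int, avg_mass_da: float = AVG_RESIDUE_MASS_DA) -> float:
    """Rough molecular weight in kDa from residue count alone."""
    return n_residues * avg_mass_da / 1000.0


# Only the first residues are needed to find the tag, so truncated copies of
# the two sequences from this page are used here.
native_start = "MTGIEQKTPV"
expected_start = "MAHHHHHHMG TLEAQTQGPG SMTGIEQKTP V"

tag = split_tag(expected_start, native_start)
construct_length = len(tag) + 812  # 812 = native length from the clone table
print(tag)                                       # MAHHHHHHMGTLEAQTQGPGS
print(construct_length)                          # 833
print(round(estimate_mw_kda(construct_length)))  # 92
```

The rough estimate agrees with the 92 kDa listed for this clone; a per-residue mass table would give a more precise figure.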
Full NT Sequence (Expression Vector + Insert)
ttcttgaaga cgaaagggcc tcgtgatacg cctattttta taggttaatg tcatgataat aatggtttct tagacgtcag gtggcacttt tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg tgttgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc tgcagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc actgattaag cattggtaac tgtcagacca agtttactca tatatacttt agattgattt aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg 
gggcggagcc tatggaaaaa cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc gcatatatgg tgcactctca gtacaatctg ctctgatgcc gcatagttaa gccagtatac actccgctat cgctacgtga ctgggtcatg gctgcgcccc gacacccgcc aacacccgct gacgcgccct gacgggcttg tctgctcccg gcatccgctt acagacaagc tgtgaccgtc tccgggagct gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc gaggcagctg cggtaaagct catcagcgtg gtcgtgaagc gattcacaga tgtctgcctg ttcatccgcg tccagctcgt tgagtttctc cagaagcgtt aatgtctggc ttctgataaa gcgggccatg ttaagggcgg ttttttcctg tttggtcact gatgcctccg tgtaaggggg atttctgttc atgggggtaa tgataccgat gaaacgagag aggatgctca cgatacgggt tactgatgat gaacatgccc ggttactgga acgttgtgag ggtaaacaac tggcggtatg gatgcggcgg gaccagagaa aaatcactca gggtcaatgc cagcgcttcg ttaatacaga tgtaggtgtt ccacagggta gccagcagca tcctgcgatg cagatccgga acataatggt gcagggcgct gacttccgcg tttccagact ttacgaaaca cggaaaccga agaccattca tgttgttgct caggtcgcag acgttttgca gcagcagtcg cttcacgttc gctcgcgtat cggtgattca ttctgctaac cagtaaggca accccgccag cctagccggg tcctcaacga caggagcacg atcatgcgca cccgtggcca ggacccaacg ctgcccgaga tgcgccgcgt gcggctgctg gagatggcgg acgcgatgga tatgttctgc caagggttgg tttgcgcatt cacagttctc cgcaagaatt gattggctcc aattcttgga gtggtgaatc cgttagcgag gtgccgccgg cttccattca ggtcgaggtg gcccggctcc atgcaccgcg acgcaacgcg gggaggcaga caaggtatag ggcggcgcct acaatccatg ccaacccgtt ccatgtgctc gccgaggcgg cataaatcgc cgtgacgatc agcggtccag tgatcgaagt taggctggta agagccgcga gcgatccttg aagctgtccc tgatggtcgt catctacctg cctggacagc atggcctgca acgcgggcat cccgatgccg ccggaagcga gaagaatcat aatggggaag gccatccagc ctcgcgtcgt gaacgccagc aagacgtagc ccagcgcgtc ggccgtaaca acaccattta aatggagtgg ttacaaatgg agtggttaat taacaacacc atttgtcgac gctctccctt atgcgactcc tgcattagga agcagcccag tagtaggttg aggccgttga gcaccgccgc cgcaaggaat ggtgcatgca aggagatggc gcccaacagt cccccggcca cggggcctgc 
caccataccc acgccgaaac aagcgctcat gagcccgaag tggcgagccc gatcttcccc atcggtgatg tcggcgatat aggcgccagc aaccgcacct gtggcgccgg tgatgccggc cacgatgcgt ccggcgtaga ggatcgagat ctcgatcccg cgaaattaat acgactcact atagggagac cacaacggtt tccctctaga aataattttg tttaacttta agaaggagat ataccatggc tcatcaccat caccatcata tgggtaccct ggaagctcag acccagggtc ctggttcgat gacgggtatt gaacagaaaa caccggttgg tggttctgaa acgggtggtg cggacgggct ctatgctgtc ttgccgctgc gtgacatcgt cgtctttccc catatgatcg tcccgctttt tgtgggccgc gaaaaatcca ttcgcgcgct tgaagaagtg atgggcgttg acaagcagat attgcttgcc acccagaaga acgctgccga tgacgatccg gcgccggacg cgatctatga gattggtacg atcgccaatg tgttgcagct tctgaaactg ccggacggca ccgtcaaggt gctggtcgaa ggcacggcac gcgccaaaat ttccaagttt accgatcgtg aagattatca cgaggcttat gctgcggctc tgcaggagcc ggaggaagat gctgtcgaga tcgaggcact ggcccgctcg gtggtttccg actttgaaaa ttacgtcaag ctgaacaaga agatttcgcc ggaagtggtt ggcgcggcca gccagatcga cgattattcc aagcttgccg atacggttgc ctcgcacctt gcgatcaaaa tccctgaaaa gcaggaaatg ctgtcggttc tttcggtgcg cgagcgcctt gagaaagccc tttccttcat ggaagccgaa atttctgttc tacaggttga gaagcgtatt cgcagccgcg tcaagcgcca gatggagaag acgcagcgcg aatattatct caatgagcag atgaaggcga tccagaagga gcttggcgac agtgaggacg gccgtgatga agtggccgaa atcgaagagc gcatcaccaa gaccaagctc agcaaggaag cgcgcgaaaa agctctggcc gagctgaaga aactgcgcag catgagcccg atgtccgctg aagcgacggt ggttcgcaat tatctcgact ggttgctctc cattccatgg ggcaagaagt cgaaggtgaa gcaggatctg aactttgcgc aggaagtgct ggatgcggag catttcggcc ttggcaaggt caaggaacgc atcgtcggat atctggcggt gcaggcccgt tcgaccaaga tcaagggtcc gatcctctgc ctcgttggcc ctcccggcgt cggcaagacc tcgcttgcgc gctcgattgc caaggcaacg ggccgcgaat atgtccgcat gtcgcttggc ggcgtacgcg acgaggctga aatccgcggt catcgccgca cctatatcgg ctcgatgccc ggcaaggtca tccagtcgat gaagaaggcg aagaagtcca atccgctttt cctgctcgat gaaatcgaca agatgggcca ggatttccgc ggcgatccgt cttcggccat gctggaggtg ctggacccgg aacagaacgc gaccttcatg gatcactacc ttgaagttga gtatgatcta tcgaacgtca tgttcgtgac gaccgccaat acgatgaata ttcccggtcc acttctggat cgtatggaga tcatccgtat 
cgccggttac acggaagacg aaaagctgga gatcgccaag cggcacctgt tgccgaaggc catcaaggac catgccctgc aaccgaagga gttttcggtt acggaagatg cgctgcgcaa cgttatccgc cattatacgc gggaagcggg cgtgcgtagc cttgaacgcg aggtgatgac ccttgcgcgc aaggccgtga cggaaatcct gaagacgaag aagaagtcgg taaagattac cgacaagaac ctctccgatt accttggtgt ggagaagttc cgcttcggtc agatcgacgg tgaagatcag gtgggtgtcg tgactggcct tgcctggacc gaagtcggcg gtgagctttt gaccatcgaa ggcgtcatga tgccgggtaa gggccgcatg acggttacgg gtaatctccg tgacgtgatg aaggaatcga tttcggccgc ggcatcctat gtccgctcgc gcgcgatcga tttcggcatc gagcctccgc tgttcgacaa gcgcgatatc cacgtgcacg tgccggaagg cgcgacgccg aaggatggtc cttctgccgg tatcgccatg gttacggcca tcgtctccgt gctgacgggt attcccgttc gcaaggacat cgccatgacg ggtgaagtca cgttgcgcgg tcgggttctg ccaatcggcg ggttgaagga aaagctgctt gcggccttgc gcggcggtat caagaaggtt ctgatcccgg aagagaacgc caaggatctg gcggaaatcc cggacaatgt gaagaacaat cttgagatcg ttccggtatc ccgcgtcggt gaagtgctga agcacacgct cgtgcgccag cctgaaccga ttgaatggac cgagcaggag aatcccactg ccgtgcctcc ggtggaggat gaagcagggg cttcgctggc ccattaaaca gcacgaacaa gttctgcagc caagcttctc gaggatccgg ctgctaacaa agcccgaaag gaagctgagt tggctgctgc caccgctgag caataactag cataacccct tggggcctct aaacgggtct tgaggggttt tttgctgaaa ggaggaacta tatccggata tccacaggac gggtgtggtc gccatgatcg cgtagtcgat agtggctcca agtagcgaag cgagcaggac tgggcggcgg ccaaagcggt cggacagtgc tccgagaacg ggtgcgcata gaaattgcat caacgcatat agcgctagca gcacgccata gtgactggcg atgctgtcgg aatggacgat atcccgcaag aggcccggca gtaccggcat aaccaagcct atgcctacag catccagggt gacggtgccg aggatgacga tgagcgcatt gttagatttc atacacggtg cctgactgcg ttagcaattt aactgtgata aactaccgca ttaaagctta tcgatgataa gctgtcaaac atgagaa
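The full sequence above contains the expression vector with the tagged insert embedded in it. As a minimal sketch, assuming an exact, ungapped match on the listed strand, the insert's position could be located with a plain substring search after removing the display spacing. The helper name `locate_insert` is illustrative and the example uses toy sequences, not coordinates from this clone.

```python
from typing import Optional, Tuple


def locate_insert(vector: str, insert: str) -> Optional[Tuple[int, int]]:
    """Return 0-based (start, end) of `insert` within `vector`, or None if absent.

    `end` is exclusive, so vector[start:end] equals the insert.
    Whitespace and letter case are ignored; only the given strand is searched.
    """
    vector = "".join(vector.split()).upper()
    insert = "".join(insert.split()).upper()
    start = vector.find(insert)
    if start == -1:
        return None
    return start, start + len(insert)


# Toy example: a 9-base insert sitting 4 bases into a 16-base "vector".
print(locate_insert("ccgg atgaaa taa ttt", "atgaaataa"))  # (4, 13)
```

For this clone, passing the Full NT Sequence as `vector` and the native NT sequence as `insert` would give the span occupied by the gene, provided the construct was not codon-optimized.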