MymaA.01156.a

O-acetylhomoserine sulfhydrylase MetC

CENTER ID: MymaA.01156.a
ORGANISM: Mycobacterium marinum ATCC BAA-535 / M
ASSOCIATED DISEASE: Cutaneous ulcerations, deep abscesses
CURRENT STATUS: in PDB
COMMUNITY REQUEST: False
NIH RISK GROUP: 2
SELECT AGENT: False
NIH PRIORITY PATHOGENS CATEGORY: IIIC

Ordering Clones & Proteins

Any materials available for this target are listed below. To order from SSGCID, use the button in the "order material" column, which adds the material to a virtual cart. You may order multiple materials at a time at no cost to you, as this contract is funded by NIAID. When you are ready to place your order, click the "Place Order" link, which appears in the top right corner of the page once the first item is in your cart.

Clones*

CENTER REFERENCE ID        DOMAIN/REGION DESCRIPTION      INFO   AA START   AA STOP   ORDER MATERIAL
MymaA.01156.a.A1.GE30145   Full length (MymaA.01156.a)           1          449
* SSGCID clones represent un-induced expression constructs which have been verified by sequencing from vector primers. Clones may contain a tag, such as N-term 6xHis. Get sequence information using the button in the "info" column.

Proteins

CENTER REFERENCE ID        DOMAIN/REGION DESCRIPTION      INFO   AA START   AA STOP   ORDER MATERIAL
MymaA.01156.a.A1.PS00936   Full length (MymaA.01156.a)           1          449

Structures

4KAM
DEPOSITED: 4/22/2013
DETERMINATION: XRay
CLONE: MymaA.01156.a.A1.GE30145
PROTEIN: MymaA.01156.a.A1.PS00936

Publications by SSGCID

Increasing the structural coverage of tuberculosis drug targets.
Abendroth J, Abramov A, Armour B, Barrett LK, Baugh L, Begley DW, Buchko GW, Choi R, Clifton MC, Dieterich SH, Dranow DM, Edwards TE, Fairman JW, Ferrell M, Fox D, Gardberg AS, Hewitt SN, Lorimer D, Lyons-Abbott S, Mundt E, Muruthi MM, Myers J, Myler PJ, Napuli AJ, Phan I, Sekar A, Serbzhinskiy D, Stacy R, Staker BL, Stewart LJ, Taylor BM, Thompkins K, Tran N, Van Voorhis WC, Zhang Y
Tuberculosis (Edinb) - 2014
volume 95, issue 2, pages 142-8
PMID: 25613812; PMCID: PMC4361283

External Resources

RESOURCE: REFERENCE ID
BV-BRC: fig|216594.6.peg.1265
RefSeq: YP_001849498.1
UniProt: B2HDS7

Sequences

These are the native gene sequences; the sequences of constructs derived from them may differ due to codon optimization or other protocols. To find the specific sequence of any material you have ordered, click the "info" button next to the name of that material.
AA Sequence
MSAENTSTDA DPTAHWSFET KQIHAGQQPD SATNARALPI YQTTSYTFEN TAHAAALFGL EVPGNIYTRL GNPTTDVVEQ RIAALEGGVA ALFLSSGQAA ETFAILNLAG AGDHIVSSPR LYGGTYNLFH YSLAKLGIEV SFVDDPDNLD SWQAAVRPNT KAFFGETISN PQIDLLDTPG VAEVAHRNGI PLIVDNTIAT PYLIRPFTQG ADIVVHSATK YLGGHGAAIA GVIVDGGTFD WTQGRFPEFT TPDPSYHGVV FAELGAPAYA LKARVQLLRD LGSAASPFNA FLVAQGLETL SLRIERHVSN AQRVAEFLAD REDVVTVNYA GLPGSPWHER AKKLSPKGTG AVLSFELAGG VEAGKAFVNA LKLHSHVANI GDVRSLVIHP ASTTHAQLSP AEQLSTGVSP GLVRLAVGIE GIEDILADLE LGFAAARKFS GDSQAVAAI
NT Sequence
ATGAGCGCCG AAAACACCAG CACTGACGCA GATCCGACCG CGCATTGGTC ATTTGAAACC AAGCAGATCC ACGCTGGCCA GCAGCCCGAT TCCGCCACCA ACGCGCGGGC GCTGCCGATC TACCAAACCA CCTCGTACAC CTTCGAAAAC ACGGCGCACG CTGCCGCTTT GTTCGGCCTG GAGGTTCCCG GCAACATCTA CACGCGGCTG GGCAACCCCA CTACCGATGT GGTCGAGCAG CGCATCGCCG CGCTCGAAGG TGGGGTCGCC GCGCTGTTCC TGTCCTCCGG TCAGGCCGCG GAAACCTTCG CCATCCTCAA CCTGGCCGGC GCGGGCGATC ACATCGTGTC CAGCCCCCGC CTCTACGGCG GCACCTACAA CCTGTTCCAT TACTCGCTAG CAAAGCTGGG GATCGAGGTC AGCTTCGTCG ACGACCCCGA CAACCTGGAC TCGTGGCAGG CGGCGGTGCG GCCCAACACC AAGGCGTTTT TTGGTGAGAC CATCTCCAAC CCGCAGATCG ACCTGCTCGA CACCCCCGGG GTTGCTGAAG TCGCCCACCG CAACGGGATA CCGCTGATCG TCGACAACAC CATCGCCACG CCGTACCTGA TTCGGCCGTT CACGCAGGGC GCCGACATCG TCGTGCACTC GGCCACCAAG TACCTGGGCG GGCACGGCGC CGCGATCGCG GGTGTGATCG TCGATGGCGG CACATTCGAC TGGACGCAGG GCCGTTTTCC CGAATTCACC ACACCGGACC CCAGCTACCA CGGCGTGGTG TTTGCCGAGT TGGGCGCGCC GGCCTATGCC CTCAAGGCAC GCGTCCAGCT GCTGCGCGAC TTGGGGTCGG CGGCCTCGCC GTTCAACGCC TTCCTGGTGG CCCAGGGCCT GGAAACCCTG AGCCTGCGGA TCGAACGGCA CGTTTCCAAC GCGCAGCGCG TCGCAGAGTT CCTGGCCGAC CGCGAGGACG TCGTCACGGT CAACTACGCC GGACTACCCG GTTCGCCGTG GCATGAGCGG GCAAAGAAGC TGTCCCCCAA GGGAACCGGG GCGGTGCTGT CGTTCGAGTT GGCCGGCGGC GTCGAGGCCG GTAAAGCATT CGTGAATGCG CTCAAACTGC ACAGCCACGT CGCCAACATC GGCGATGTGC GCTCGCTGGT GATCCATCCG GCATCGACCA CCCACGCCCA GCTGAGCCCG GCCGAGCAGC TGTCCACCGG CGTCAGTCCG GGACTGGTGC GTTTAGCAGT GGGCATCGAG GGTATAGAAG ACATCCTGGC CGACCTGGAG CTCGGGTTCG CCGCGGCACG CAAGTTCAGC GGCGATTCAC AGGCAGTAGC AGCCATC
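The AA and NT sequences above are displayed in space-separated groups of ten, so they need light cleaning before any comparison. The following is a minimal sketch, not an SSGCID tool, showing one way to strip the display spacing and check that an NT sequence translates to the listed AA sequence. It assumes the standard genetic code and an in-frame sequence; the helper names `clean` and `translate` are hypothetical.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(nt: str) -> str:
    """Translate an in-frame nucleotide sequence.

    A trailing partial codon is ignored; codons containing
    characters other than A, C, G, T are reported as 'X'.
    """
    nt = clean(nt)
    usable = len(nt) - len(nt) % 3
    return "".join(CODON_TABLE.get(nt[i:i + 3], "X") for i in range(0, usable, 3))


# First three 10-base groups of the native NT sequence and the
# first ten residues of the native AA sequence from this record.
nt_start = "ATGAGCGCCG AAAACACCAG CACTGACGCA"
aa_start = "MSAENTSTDA"
print(translate(nt_start) == clean(aa_start))
```

The same functions can be applied to the full sequences by pasting them in place of the short excerpts used here.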
Details for MymaA.01156.a.A1.GE30145
HARVESTED ON: 8/15/2010
SEQUENCED ON: 8/25/2010
EXPECTED MW: 49 kDa
OBSERVED MW: 50 kDa
ANTIBIOTIC MARKER: ampicillin
COUNT OF EXPRESSION COLONIES: Many (50-100)
TOTAL EXPRESSION LEVEL: Low Expression
SOLUBLE EXPRESSION LEVEL: Moderate Expression
EXPRESSION HOST: data unavailable
SEQUENCING RESULT: pass with incomplete coverage
PERCENT IDENTITY: 100
PERCENT COVERAGE: 99
Validated AA Sequence
MAHHHHHHMG TLEAQTQGPG SMSAENTSTD ADPTAHWSFE TKQIHAGQQP DSATNARALP IYQTTSYTFE NTAHAAALFG LEVPGNIYTR LGNPTTDVVE QRIAALEGGV AALFLSSGQA AETFAILNLA GAGDHIVSSP RLYGGTYNLF HYSLAKLGIE VSFVDDPDNL DSWQAAVRPN TKAFFGETIS NPQIDLLDTP GVAEVAHRNG IPLIVDNTIA TPYLIRPFTQ GADIVVHSAT KYLGGHGAAI AGVIVDGGTF DWTQGRFPEF TTPDPSYHGV VFAELGAPAY ALKARVQLLR DLGSAASPFN AFLVAQGLET LSLRIERHVS NAQRVAEFLA DREDVVTVNY AGLPGSPWHE RAKKLSPKGT GAVLSFELAG GVEAGKAFVN ALKLHSHVAN IGDVRSLVIH PASTTHAQLS PAEQLSTGVS PGLVRLAVGI EGIEDILADL ELGFAAARKF SGDSQAVAA
Validated NT Sequence
ttaagaagga gatataccat ggctcatcac catcaccatc atatgggtac cctggaagct cagacccagg gtcctggttc gatgagcgcc gaaaacacca gcactgacgc agatccgacc gcgcattggt catttgaaac caagcagatc cacgctggcc agcagcccga ttccgccacc aacgcgcggg cgctgccgat ctaccaaacc acctcgtaca ccttcgaaaa cacggcgcac gctgccgctt tgttcggcct ggaggttccc ggcaacatct acacgcggct gggcaacccc actaccgatg tggtcgagca gcgcatcgcc gcgctcgaag gtggggtcgc cgcgctgttc ctgtcctccg gtcaggccgc ggaaaccttc gccatcctca acctggccgg cgcgggcgat cacatcgtgt ccagcccccg cctctacggc ggcacctaca acctgttcca ttactcgcta gcaaagctgg ggatcgaggt cagcttcgtc gacgaccccg acaacctgga ctcgtggcag gcggcggtgc ggcccaacac caaggcgttt tttggtgaga ccatctccaa cccgcagatc gacctgctcg acacccccgg ggttgctgaa gtcgcccacc gcaacgggat accgctgatc gtcgacaaca ccatcgccac gccgtacctg attcggccgt tcacgcaggg cgccgacatc gtcgtgcact cggccaccaa gtacctgggc gggcacggcg ccgcgatcgc gggtgtgatc gtcgatggcg gcacattcga ctggacgcag ggccgttttc ccgaattcac cacaccggac cccagctacc acggcgtggt gtttgccgag ttgggcgcgc cggcctatgc cctcaaggca cgcgtccagc tgctgcgcga cttggggtcg gcggcctcgc cgttcaacgc cttcctggtg gcccagggcc tggaaaccct gagcctgcgg atcgaacggc acgtttccaa cgcgcagcgc gtcgcagagt tcctggccga ccgcgaggac gtcgtcacgg tcaactacgc cggactaccc ggttcgccgt ggcatgagcg ggcaaagaag ctgtccccca agggaaccgg ggcggtgctg tcgttcgagt tggccggcgg cgtcgaggcc ggtaaagcat tcgtgaatgc gctcaaactg cacagccacg tcgccaacat cggcgatgtg cgctcgctgg tgatccatcc ggcatcgacc acccacgccc agctgagccc ggccgagcag ctgtccaccg gcgtcagtcc gggactggtg cgtttagcag tgggcatcga gggtatagaa gacatcctgg ccgacctgga gctcgggttc gccgcggcac gcaagttcag cggcgattca caggcagtag cagccat
Expected Protein Sequence
MAHHHHHHMG TLEAQTQGPG SMSAENTSTD ADPTAHWSFE TKQIHAGQQP DSATNARALP IYQTTSYTFE NTAHAAALFG LEVPGNIYTR LGNPTTDVVE QRIAALEGGV AALFLSSGQA AETFAILNLA GAGDHIVSSP RLYGGTYNLF HYSLAKLGIE VSFVDDPDNL DSWQAAVRPN TKAFFGETIS NPQIDLLDTP GVAEVAHRNG IPLIVDNTIA TPYLIRPFTQ GADIVVHSAT KYLGGHGAAI AGVIVDGGTF DWTQGRFPEF TTPDPSYHGV VFAELGAPAY ALKARVQLLR DLGSAASPFN AFLVAQGLET LSLRIERHVS NAQRVAEFLA DREDVVTVNY AGLPGSPWHE RAKKLSPKGT GAVLSFELAG GVEAGKAFVN ALKLHSHVAN IGDVRSLVIH PASTTHAQLS PAEQLSTGVS PGLVRLAVGI EGIEDILADL ELGFAAARKF SGDSQAVAAI
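The validated AA sequence for this clone ends in ...AVAA, while the expected protein sequence ends in ...AVAAI, which is consistent with the "pass with incomplete coverage" sequencing result. The sketch below shows one simple way such a comparison could be made. It is an illustration only: how SSGCID actually computes PERCENT COVERAGE is not stated in this record, and the function name `coverage_percent` and the truncating whole-number percentage are assumptions.

```python
def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def coverage_percent(validated: str, expected: str) -> int:
    """Whole-number percentage of the expected sequence spanned by the validated read.

    Assumes the validated read is a contiguous, exact substring of the
    expected sequence; returns 0 if it is not found or if expected is empty.
    The percentage is truncated, not rounded (an assumption).
    """
    validated, expected = clean(validated), clean(expected)
    if not expected or expected.find(validated) < 0:
        return 0
    return 100 * len(validated) // len(expected)


# Final groups of the validated and expected sequences from this record.
print(coverage_percent("SGDSQAVAA", "SGDSQAVAAI"))
```

Passing the full validated and expected sequences instead of the final groups gives the coverage over the whole construct under the same assumptions.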
Full NT Sequence (Expression Vector + Insert)
ttcttgaaga cgaaagggcc tcgtgatacg cctattttta taggttaatg tcatgataat aatggtttct tagacgtcag gtggcacttt tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg tgttgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc tgcagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc actgattaag cattggtaac tgtcagacca agtttactca tatatacttt agattgattt aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg 
gggcggagcc tatggaaaaa cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc gcatatatgg tgcactctca gtacaatctg ctctgatgcc gcatagttaa gccagtatac actccgctat cgctacgtga ctgggtcatg gctgcgcccc gacacccgcc aacacccgct gacgcgccct gacgggcttg tctgctcccg gcatccgctt acagacaagc tgtgaccgtc tccgggagct gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc gaggcagctg cggtaaagct catcagcgtg gtcgtgaagc gattcacaga tgtctgcctg ttcatccgcg tccagctcgt tgagtttctc cagaagcgtt aatgtctggc ttctgataaa gcgggccatg ttaagggcgg ttttttcctg tttggtcact gatgcctccg tgtaaggggg atttctgttc atgggggtaa tgataccgat gaaacgagag aggatgctca cgatacgggt tactgatgat gaacatgccc ggttactgga acgttgtgag ggtaaacaac tggcggtatg gatgcggcgg gaccagagaa aaatcactca gggtcaatgc cagcgcttcg ttaatacaga tgtaggtgtt ccacagggta gccagcagca tcctgcgatg cagatccgga acataatggt gcagggcgct gacttccgcg tttccagact ttacgaaaca cggaaaccga agaccattca tgttgttgct caggtcgcag acgttttgca gcagcagtcg cttcacgttc gctcgcgtat cggtgattca ttctgctaac cagtaaggca accccgccag cctagccggg tcctcaacga caggagcacg atcatgcgca cccgtggcca ggacccaacg ctgcccgaga tgcgccgcgt gcggctgctg gagatggcgg acgcgatgga tatgttctgc caagggttgg tttgcgcatt cacagttctc cgcaagaatt gattggctcc aattcttgga gtggtgaatc cgttagcgag gtgccgccgg cttccattca ggtcgaggtg gcccggctcc atgcaccgcg acgcaacgcg gggaggcaga caaggtatag ggcggcgcct acaatccatg ccaacccgtt ccatgtgctc gccgaggcgg cataaatcgc cgtgacgatc agcggtccag tgatcgaagt taggctggta agagccgcga gcgatccttg aagctgtccc tgatggtcgt catctacctg cctggacagc atggcctgca acgcgggcat cccgatgccg ccggaagcga gaagaatcat aatggggaag gccatccagc ctcgcgtcgt gaacgccagc aagacgtagc ccagcgcgtc ggccgtaaca acaccattta aatggagtgg ttacaaatgg agtggttaat taacaacacc atttgtcgac gctctccctt atgcgactcc tgcattagga agcagcccag tagtaggttg aggccgttga gcaccgccgc cgcaaggaat ggtgcatgca aggagatggc gcccaacagt cccccggcca cggggcctgc 
caccataccc acgccgaaac aagcgctcat gagcccgaag tggcgagccc gatcttcccc atcggtgatg tcggcgatat aggcgccagc aaccgcacct gtggcgccgg tgatgccggc cacgatgcgt ccggcgtaga ggatcgagat ctcgatcccg cgaaattaat acgactcact atagggagac cacaacggtt tccctctaga aataattttg tttaacttta agaaggagat ataccatggc tcatcaccat caccatcata tgggtaccct ggaagctcag acccagggtc ctggttcgat gagcgccgaa aacaccagca ctgacgcaga tccgaccgcg cattggtcat ttgaaaccaa gcagatccac gctggccagc agcccgattc cgccaccaac gcgcgggcgc tgccgatcta ccaaaccacc tcgtacacct tcgaaaacac ggcgcacgct gccgctttgt tcggcctgga ggttcccggc aacatctaca cgcggctggg caaccccact accgatgtgg tcgagcagcg catcgccgcg ctcgaaggtg gggtcgccgc gctgttcctg tcctccggtc aggccgcgga aaccttcgcc atcctcaacc tggccggcgc gggcgatcac atcgtgtcca gcccccgcct ctacggcggc acctacaacc tgttccatta ctcgctagca aagctgggga tcgaggtcag cttcgtcgac gaccccgaca acctggactc gtggcaggcg gcggtgcggc ccaacaccaa ggcgtttttt ggtgagacca tctccaaccc gcagatcgac ctgctcgaca cccccggggt tgctgaagtc gcccaccgca acgggatacc gctgatcgtc gacaacacca tcgccacgcc gtacctgatt cggccgttca cgcagggcgc cgacatcgtc gtgcactcgg ccaccaagta cctgggcggg cacggcgccg cgatcgcggg tgtgatcgtc gatggcggca cattcgactg gacgcagggc cgttttcccg aattcaccac accggacccc agctaccacg gcgtggtgtt tgccgagttg ggcgcgccgg cctatgccct caaggcacgc gtccagctgc tgcgcgactt ggggtcggcg gcctcgccgt tcaacgcctt cctggtggcc cagggcctgg aaaccctgag cctgcggatc gaacggcacg tttccaacgc gcagcgcgtc gcagagttcc tggccgaccg cgaggacgtc gtcacggtca actacgccgg actacccggt tcgccgtggc atgagcgggc aaagaagctg tcccccaagg gaaccggggc ggtgctgtcg ttcgagttgg ccggcggcgt cgaggccggt aaagcattcg tgaatgcgct caaactgcac agccacgtcg ccaacatcgg cgatgtgcgc tcgctggtga tccatccggc atcgaccacc cacgcccagc tgagcccggc cgagcagctg tccaccggcg tcagtccggg actggtgcgt ttagcagtgg gcatcgaggg tatagaagac atcctggccg acctggagct cgggttcgcc gcggcacgca agttcagcgg cgattcacag gcagtagcag ccatcaaaca gcacgaacaa gttctgcagc caagcttctc gaggatccgg ctgctaacaa agcccgaaag gaagctgagt tggctgctgc caccgctgag caataactag cataacccct tggggcctct aaacgggtct tgaggggttt tttgctgaaa 
ggaggaacta tatccggata tccacaggac gggtgtggtc gccatgatcg cgtagtcgat agtggctcca agtagcgaag cgagcaggac tgggcggcgg ccaaagcggt cggacagtgc tccgagaacg ggtgcgcata gaaattgcat caacgcatat agcgctagca gcacgccata gtgactggcg atgctgtcgg aatggacgat atcccgcaag aggcccggca gtaccggcat aaccaagcct atgcctacag catccagggt gacggtgccg aggatgacga tgagcgcatt gttagatttc atacacggtg cctgactgcg ttagcaattt aactgtgata aactaccgca ttaaagctta tcgatgataa gctgtcaaac atgagaa
Details for MymaA.01156.a.A1.PS00936
PURIFICATION DATE: 1/12/2011
CONCENTRATION: 54.6 mg/ml
OBSERVED MW: data unavailable
EXPRESSION LEVEL: Moderate Expression
PROTEIN PURIFICATION BUFFER: 20 mM HEPES, pH 7.0, 300 mM NaCl, 5% glycerol and 1 mM TCEP
EXPRESSION HOST: data unavailable
VIAL COUNT (approx.): 6
VIAL VOLUME: 200 µl
PERCENT IDENTITY: 100
PERCENT COVERAGE: 99
Protocol Notes
notes unavailable
Validated AA Sequence
MAHHHHHHMG TLEAQTQGPG SMSAENTSTD ADPTAHWSFE TKQIHAGQQP DSATNARALP IYQTTSYTFE NTAHAAALFG LEVPGNIYTR LGNPTTDVVE QRIAALEGGV AALFLSSGQA AETFAILNLA GAGDHIVSSP RLYGGTYNLF HYSLAKLGIE VSFVDDPDNL DSWQAAVRPN TKAFFGETIS NPQIDLLDTP GVAEVAHRNG IPLIVDNTIA TPYLIRPFTQ GADIVVHSAT KYLGGHGAAI AGVIVDGGTF DWTQGRFPEF TTPDPSYHGV VFAELGAPAY ALKARVQLLR DLGSAASPFN AFLVAQGLET LSLRIERHVS NAQRVAEFLA DREDVVTVNY AGLPGSPWHE RAKKLSPKGT GAVLSFELAG GVEAGKAFVN ALKLHSHVAN IGDVRSLVIH PASTTHAQLS PAEQLSTGVS PGLVRLAVGI EGIEDILADL ELGFAAARKF SGDSQAVAA
Validated NT Sequence
ttaagaagga gatataccat ggctcatcac catcaccatc atatgggtac cctggaagct cagacccagg gtcctggttc gatgagcgcc gaaaacacca gcactgacgc agatccgacc gcgcattggt catttgaaac caagcagatc cacgctggcc agcagcccga ttccgccacc aacgcgcggg cgctgccgat ctaccaaacc acctcgtaca ccttcgaaaa cacggcgcac gctgccgctt tgttcggcct ggaggttccc ggcaacatct acacgcggct gggcaacccc actaccgatg tggtcgagca gcgcatcgcc gcgctcgaag gtggggtcgc cgcgctgttc ctgtcctccg gtcaggccgc ggaaaccttc gccatcctca acctggccgg cgcgggcgat cacatcgtgt ccagcccccg cctctacggc ggcacctaca acctgttcca ttactcgcta gcaaagctgg ggatcgaggt cagcttcgtc gacgaccccg acaacctgga ctcgtggcag gcggcggtgc ggcccaacac caaggcgttt tttggtgaga ccatctccaa cccgcagatc gacctgctcg acacccccgg ggttgctgaa gtcgcccacc gcaacgggat accgctgatc gtcgacaaca ccatcgccac gccgtacctg attcggccgt tcacgcaggg cgccgacatc gtcgtgcact cggccaccaa gtacctgggc gggcacggcg ccgcgatcgc gggtgtgatc gtcgatggcg gcacattcga ctggacgcag ggccgttttc ccgaattcac cacaccggac cccagctacc acggcgtggt gtttgccgag ttgggcgcgc cggcctatgc cctcaaggca cgcgtccagc tgctgcgcga cttggggtcg gcggcctcgc cgttcaacgc cttcctggtg gcccagggcc tggaaaccct gagcctgcgg atcgaacggc acgtttccaa cgcgcagcgc gtcgcagagt tcctggccga ccgcgaggac gtcgtcacgg tcaactacgc cggactaccc ggttcgccgt ggcatgagcg ggcaaagaag ctgtccccca agggaaccgg ggcggtgctg tcgttcgagt tggccggcgg cgtcgaggcc ggtaaagcat tcgtgaatgc gctcaaactg cacagccacg tcgccaacat cggcgatgtg cgctcgctgg tgatccatcc ggcatcgacc acccacgccc agctgagccc ggccgagcag ctgtccaccg gcgtcagtcc gggactggtg cgtttagcag tgggcatcga gggtatagaa gacatcctgg ccgacctgga gctcgggttc gccgcggcac gcaagttcag cggcgattca caggcagtag cagccat
Expressed Protein Sequence
MAHHHHHHMG TLEAQTQGPG SMSAENTSTD ADPTAHWSFE TKQIHAGQQP DSATNARALP IYQTTSYTFE NTAHAAALFG LEVPGNIYTR LGNPTTDVVE QRIAALEGGV AALFLSSGQA AETFAILNLA GAGDHIVSSP RLYGGTYNLF HYSLAKLGIE VSFVDDPDNL DSWQAAVRPN TKAFFGETIS NPQIDLLDTP GVAEVAHRNG IPLIVDNTIA TPYLIRPFTQ GADIVVHSAT KYLGGHGAAI AGVIVDGGTF DWTQGRFPEF TTPDPSYHGV VFAELGAPAY ALKARVQLLR DLGSAASPFN AFLVAQGLET LSLRIERHVS NAQRVAEFLA DREDVVTVNY AGLPGSPWHE RAKKLSPKGT GAVLSFELAG GVEAGKAFVN ALKLHSHVAN IGDVRSLVIH PASTTHAQLS PAEQLSTGVS PGLVRLAVGI EGIEDILADL ELGFAAARKF SGDSQAVAAI
Full NT Sequence (Expression Vector + Insert)
TTCTTGAAGA CGAAAGGGCC TCGTGATACG CCTATTTTTA TAGGTTAATG TCATGATAAT AATGGTTTCT TAGACGTCAG GTGGCACTTT TCGGGGAAAT GTGCGCGGAA CCCCTATTTG TTTATTTTTC TAAATACATT CAAATATGTA TCCGCTCATG AGACAATAAC CCTGATAAAT GCTTCAATAA TATTGAAAAA GGAAGAGTAT GAGTATTCAA CATTTCCGTG TCGCCCTTAT TCCCTTTTTT GCGGCATTTT GCCTTCCTGT TTTTGCTCAC CCAGAAACGC TGGTGAAAGT AAAAGATGCT GAAGATCAGT TGGGTGCACG AGTGGGTTAC ATCGAACTGG ATCTCAACAG CGGTAAGATC CTTGAGAGTT TTCGCCCCGA AGAACGTTTT CCAATGATGA GCACTTTTAA AGTTCTGCTA TGTGGCGCGG TATTATCCCG TGTTGACGCC GGGCAAGAGC AACTCGGTCG CCGCATACAC TATTCTCAGA ATGACTTGGT TGAGTACTCA CCAGTCACAG AAAAGCATCT TACGGATGGC ATGACAGTAA GAGAATTATG CAGTGCTGCC ATAACCATGA GTGATAACAC TGCGGCCAAC TTACTTCTGA CAACGATCGG AGGACCGAAG GAGCTAACCG CTTTTTTGCA CAACATGGGG GATCATGTAA CTCGCCTTGA TCGTTGGGAA CCGGAGCTGA ATGAAGCCAT ACCAAACGAC GAGCGTGACA CCACGATGCC TGCAGCAATG GCAACAACGT TGCGCAAACT ATTAACTGGC GAACTACTTA CTCTAGCTTC CCGGCAACAA TTAATAGACT GGATGGAGGC GGATAAAGTT GCAGGACCAC TTCTGCGCTC GGCCCTTCCG GCTGGCTGGT TTATTGCTGA TAAATCTGGA GCCGGTGAGC GTGGGTCTCG CGGTATCATT GCAGCACTGG GGCCAGATGG TAAGCCCTCC CGTATCGTAG TTATCTACAC GACGGGGAGT CAGGCAACTA TGGATGAACG AAATAGACAG ATCGCTGAGA TAGGTGCCTC ACTGATTAAG CATTGGTAAC TGTCAGACCA AGTTTACTCA TATATACTTT AGATTGATTT AAAACTTCAT TTTTAATTTA AAAGGATCTA GGTGAAGATC CTTTTTGATA ATCTCATGAC CAAAATCCCT TAACGTGAGT TTTCGTTCCA CTGAGCGTCA GACCCCGTAG AAAAGATCAA AGGATCTTCT TGAGATCCTT TTTTTCTGCG CGTAATCTGC TGCTTGCAAA CAAAAAAACC ACCGCTACCA GCGGTGGTTT GTTTGCCGGA TCAAGAGCTA CCAACTCTTT TTCCGAAGGT AACTGGCTTC AGCAGAGCGC AGATACCAAA TACTGTCCTT CTAGTGTAGC CGTAGTTAGG CCACCACTTC AAGAACTCTG TAGCACCGCC TACATACCTC GCTCTGCTAA TCCTGTTACC AGTGGCTGCT GCCAGTGGCG ATAAGTCGTG TCTTACCGGG TTGGACTCAA GACGATAGTT ACCGGATAAG GCGCAGCGGT CGGGCTGAAC GGGGGGTTCG TGCACACAGC CCAGCTTGGA GCGAACGACC TACACCGAAC TGAGATACCT ACAGCGTGAG CTATGAGAAA GCGCCACGCT TCCCGAAGGG AGAAAGGCGG ACAGGTATCC GGTAAGCGGC AGGGTCGGAA CAGGAGAGCG CACGAGGGAG CTTCCAGGGG GAAACGCCTG GTATCTTTAT AGTCCTGTCG GGTTTCGCCA CCTCTGACTT GAGCGTCGAT TTTTGTGATG CTCGTCAGGG 
GGGCGGAGCC TATGGAAAAA CGCCAGCAAC GCGGCCTTTT TACGGTTCCT GGCCTTTTGC TGGCCTTTTG CTCACATGTT CTTTCCTGCG TTATCCCCTG ATTCTGTGGA TAACCGTATT ACCGCCTTTG AGTGAGCTGA TACCGCTCGC CGCAGCCGAA CGACCGAGCG CAGCGAGTCA GTGAGCGAGG AAGCGGAAGA GCGCCTGATG CGGTATTTTC TCCTTACGCA TCTGTGCGGT ATTTCACACC GCATATATGG TGCACTCTCA GTACAATCTG CTCTGATGCC GCATAGTTAA GCCAGTATAC ACTCCGCTAT CGCTACGTGA CTGGGTCATG GCTGCGCCCC GACACCCGCC AACACCCGCT GACGCGCCCT GACGGGCTTG TCTGCTCCCG GCATCCGCTT ACAGACAAGC TGTGACCGTC TCCGGGAGCT GCATGTGTCA GAGGTTTTCA CCGTCATCAC CGAAACGCGC GAGGCAGCTG CGGTAAAGCT CATCAGCGTG GTCGTGAAGC GATTCACAGA TGTCTGCCTG TTCATCCGCG TCCAGCTCGT TGAGTTTCTC CAGAAGCGTT AATGTCTGGC TTCTGATAAA GCGGGCCATG TTAAGGGCGG TTTTTTCCTG TTTGGTCACT GATGCCTCCG TGTAAGGGGG ATTTCTGTTC ATGGGGGTAA TGATACCGAT GAAACGAGAG AGGATGCTCA CGATACGGGT TACTGATGAT GAACATGCCC GGTTACTGGA ACGTTGTGAG GGTAAACAAC TGGCGGTATG GATGCGGCGG GACCAGAGAA AAATCACTCA GGGTCAATGC CAGCGCTTCG TTAATACAGA TGTAGGTGTT CCACAGGGTA GCCAGCAGCA TCCTGCGATG CAGATCCGGA ACATAATGGT GCAGGGCGCT GACTTCCGCG TTTCCAGACT TTACGAAACA CGGAAACCGA AGACCATTCA TGTTGTTGCT CAGGTCGCAG ACGTTTTGCA GCAGCAGTCG CTTCACGTTC GCTCGCGTAT CGGTGATTCA TTCTGCTAAC CAGTAAGGCA ACCCCGCCAG CCTAGCCGGG TCCTCAACGA CAGGAGCACG ATCATGCGCA CCCGTGGCCA GGACCCAACG CTGCCCGAGA TGCGCCGCGT GCGGCTGCTG GAGATGGCGG ACGCGATGGA TATGTTCTGC CAAGGGTTGG TTTGCGCATT CACAGTTCTC CGCAAGAATT GATTGGCTCC AATTCTTGGA GTGGTGAATC CGTTAGCGAG GTGCCGCCGG CTTCCATTCA GGTCGAGGTG GCCCGGCTCC ATGCACCGCG ACGCAACGCG GGGAGGCAGA CAAGGTATAG GGCGGCGCCT ACAATCCATG CCAACCCGTT CCATGTGCTC GCCGAGGCGG CATAAATCGC CGTGACGATC AGCGGTCCAG TGATCGAAGT TAGGCTGGTA AGAGCCGCGA GCGATCCTTG AAGCTGTCCC TGATGGTCGT CATCTACCTG CCTGGACAGC ATGGCCTGCA ACGCGGGCAT CCCGATGCCG CCGGAAGCGA GAAGAATCAT AATGGGGAAG GCCATCCAGC CTCGCGTCGT GAACGCCAGC AAGACGTAGC CCAGCGCGTC GGCCGTAACA ACACCATTTA AATGGAGTGG TTACAAATGG AGTGGTTAAT TAACAACACC ATTTGTCGAC GCTCTCCCTT ATGCGACTCC TGCATTAGGA AGCAGCCCAG TAGTAGGTTG AGGCCGTTGA GCACCGCCGC CGCAAGGAAT GGTGCATGCA AGGAGATGGC GCCCAACAGT CCCCCGGCCA CGGGGCCTGC 
CACCATACCC ACGCCGAAAC AAGCGCTCAT GAGCCCGAAG TGGCGAGCCC GATCTTCCCC ATCGGTGATG TCGGCGATAT AGGCGCCAGC AACCGCACCT GTGGCGCCGG TGATGCCGGC CACGATGCGT CCGGCGTAGA GGATCGAGAT CTCGATCCCG CGAAATTAAT ACGACTCACT ATAGGGAGAC CACAACGGTT TCCCTCTAGA AATAATTTTG TTTAACTTTA AGAAGGAGAT ATACCATGGC TCATCACCAT CACCATCATA TGGGTACCCT GGAAGCTCAG ACCCAGGGTC CTGGTTCGAT GAGCGCCGAA AACACCAGCA CTGACGCAGA TCCGACCGCG CATTGGTCAT TTGAAACCAA GCAGATCCAC GCTGGCCAGC AGCCCGATTC CGCCACCAAC GCGCGGGCGC TGCCGATCTA CCAAACCACC TCGTACACCT TCGAAAACAC GGCGCACGCT GCCGCTTTGT TCGGCCTGGA GGTTCCCGGC AACATCTACA CGCGGCTGGG CAACCCCACT ACCGATGTGG TCGAGCAGCG CATCGCCGCG CTCGAAGGTG GGGTCGCCGC GCTGTTCCTG TCCTCCGGTC AGGCCGCGGA AACCTTCGCC ATCCTCAACC TGGCCGGCGC GGGCGATCAC ATCGTGTCCA GCCCCCGCCT CTACGGCGGC ACCTACAACC TGTTCCATTA CTCGCTAGCA AAGCTGGGGA TCGAGGTCAG CTTCGTCGAC GACCCCGACA ACCTGGACTC GTGGCAGGCG GCGGTGCGGC CCAACACCAA GGCGTTTTTT GGTGAGACCA TCTCCAACCC GCAGATCGAC CTGCTCGACA CCCCCGGGGT TGCTGAAGTC GCCCACCGCA ACGGGATACC GCTGATCGTC GACAACACCA TCGCCACGCC GTACCTGATT CGGCCGTTCA CGCAGGGCGC CGACATCGTC GTGCACTCGG CCACCAAGTA CCTGGGCGGG CACGGCGCCG CGATCGCGGG TGTGATCGTC GATGGCGGCA CATTCGACTG GACGCAGGGC CGTTTTCCCG AATTCACCAC ACCGGACCCC AGCTACCACG GCGTGGTGTT TGCCGAGTTG GGCGCGCCGG CCTATGCCCT CAAGGCACGC GTCCAGCTGC TGCGCGACTT GGGGTCGGCG GCCTCGCCGT TCAACGCCTT CCTGGTGGCC CAGGGCCTGG AAACCCTGAG CCTGCGGATC GAACGGCACG TTTCCAACGC GCAGCGCGTC GCAGAGTTCC TGGCCGACCG CGAGGACGTC GTCACGGTCA ACTACGCCGG ACTACCCGGT TCGCCGTGGC ATGAGCGGGC AAAGAAGCTG TCCCCCAAGG GAACCGGGGC GGTGCTGTCG TTCGAGTTGG CCGGCGGCGT CGAGGCCGGT AAAGCATTCG TGAATGCGCT CAAACTGCAC AGCCACGTCG CCAACATCGG CGATGTGCGC TCGCTGGTGA TCCATCCGGC ATCGACCACC CACGCCCAGC TGAGCCCGGC CGAGCAGCTG TCCACCGGCG TCAGTCCGGG ACTGGTGCGT TTAGCAGTGG GCATCGAGGG TATAGAAGAC ATCCTGGCCG ACCTGGAGCT CGGGTTCGCC GCGGCACGCA AGTTCAGCGG CGATTCACAG GCAGTAGCAG CCATCAAACA GCACGAACAA GTTCTGCAGC CAAGCTTCTC GAGGATCCGG CTGCTAACAA AGCCCGAAAG GAAGCTGAGT TGGCTGCTGC CACCGCTGAG CAATAACTAG CATAACCCCT TGGGGCCTCT AAACGGGTCT TGAGGGGTTT TTTGCTGAAA 
GGAGGAACTA TATCCGGATA TCCACAGGAC GGGTGTGGTC GCCATGATCG CGTAGTCGAT AGTGGCTCCA AGTAGCGAAG CGAGCAGGAC TGGGCGGCGG CCAAAGCGGT CGGACAGTGC TCCGAGAACG GGTGCGCATA GAAATTGCAT CAACGCATAT AGCGCTAGCA GCACGCCATA GTGACTGGCG ATGCTGTCGG AATGGACGAT ATCCCGCAAG AGGCCCGGCA GTACCGGCAT AACCAAGCCT ATGCCTACAG CATCCAGGGT GACGGTGCCG AGGATGACGA TGAGCGCATT GTTAGATTTC ATACACGGTG CCTGACTGCG TTAGCAATTT AACTGTGATA AACTACCGCA TTAAAGCTTA TCGATGATAA GCTGTCAAAC ATGAGAA