MytuD.01507.a

fumarate hydratase

CENTER ID: MytuD.01507.a
ORGANISM: Mycobacterium tuberculosis H37Rv
ASSOCIATED DISEASE: Tuberculosis
CURRENT STATUS: crystallized
COMMUNITY REQUEST: False
NIH RISK GROUP: 2
SELECT AGENT: False
NIH PRIORITY
pathogens category:
IIIC

Ordering Clones & Proteins

If there are materials available for this target, they will be listed below. Materials can be ordered from SSGCID using the button in the "order material" column. Clicking the button will add the material to a virtual cart. You may order multiple materials at a time at no cost to you, as this contract is funded by NIAID. When you are ready to place your order, click the "Place Order" link which will appear in the top right corner of the page after you place your first item in your cart.

Clones*

CENTER
REFERENCE ID
DOMAIN/REGION
DESCRIPTION
INFO AA
START
AA
STOP
ORDER
MATERIAL
MytuD.01507.a.A1.GE25971 Full length( MytuD.01507.a ) 1 474
* SSGCID clones represent un-induced expression constructs which have been verified by sequencing from vector primers. Clones may contain a tag, such as N-term 6xHis. Get sequence information using the button in the "info" column.

Proteins

CENTER
REFERENCE ID
DOMAIN/REGION
DESCRIPTION
INFO AA
START
AA
STOP
ORDER
MATERIAL
MytuD.01507.a.A1.PS00180 Full length( MytuD.01507.a ) 1 474

External Resources

RESOURCE REFERENCE ID
BV-BRC: fig|83332.12.peg.1229
RefSeq: NP_215614.1
TubercuList: Rv1098c
MTB Network Portal: Rv1098c
UniProt: P9WN93

Orthologues for Rv1098c

Species % Identity % Coverage Clones Proteins Structures
marinum 92 99 1 1 1
paratuberculosis 91 99 1 0 0
ulcerans 91 99 1 0 1
leprae 90 99 0 0 0
smegmatis 88 97 0 3 1
abscessus 86 97 1 1 1

Sequences

These sequences are the native gene sequence; sequences of constructs derived from these sequences may differ due to codon optimization or other protocols. To find the specific sequence of any material you may have ordered, click on the "info" button next to the name of that material.
AA Sequence
MAVDADSANY RIEHDTMGEV RVPAKALWRA QTQRAVENFP ISGRGLERTQ IRALGLLKGA CAQVNSDLGL LAPEKADAII AAAAEIADGQ HDDQFPIDVF QTGSGTSSNM NTNEVIASIA AKGGVTLHPN DDVNMSQSSN DTFPTATHIA ATEAAVAHLI PALQQLHDAL AAKALDWHTV VKSGRTHLMD AVPVTLGQEF SGYARQIEAG IERVRACLPR LGELAIGGTA VGTGLNAPDD FGVRVVAVLV AQTGLSELRT AANSFEAQAA RDGLVEASGA LRTIAVSLTK IANDIRWMGS GPLTGLAEIQ LPDLQPGSSI MPGKVNPVLP EAVTQVAAQV IGNDAAIAWG GANGAFELNV YIPMMARNIL ESFKLLTNVS RLFAQRCIAG LTANVEHLRR LAESSPSIVT PLNSAIGYEE AAAVAKQALK ERKTIRQTVI DRGLIGDRLS IEDLDRRLDV LAMAKAEQLD SDRL
NT Sequence
ATGGCCGTTG ACGCCGACAG CGCCAATTAC CGCATCGAGC ACGACACCAT GGGCGAAGTC CGGGTGCCGG CAAAAGCGTT GTGGCGCGCG CAAACCCAGC GCGCGGTGGA GAACTTCCCG ATATCCGGCC GCGGGTTGGA GCGCACCCAG ATCCGCGCGC TAGGCCTGCT GAAAGGCGCC TGCGCGCAGG TGAACTCCGA CCTCGGGTTG CTGGCGCCGG AGAAAGCCGA CGCCATCATC GCCGCGGCCG CCGAGATCGC CGACGGTCAA CACGACGACC AGTTTCCCAT CGACGTCTTC CAGACCGGCT CGGGCACCAG CTCCAACATG AACACCAACG AGGTGATTGC GTCCATCGCG GCCAAGGGCG GGGTCACGTT GCATCCCAAC GACGACGTGA ACATGTCGCA GTCGTCCAAC GACACCTTCC CGACGGCCAC CCACATCGCG GCCACCGAGG CCGCGGTCGC TCATCTCATC CCAGCGCTGC AGCAGCTGCA CGACGCATTG GCCGCCAAGG CTCTTGATTG GCACACGGTG GTGAAGTCGG GCCGAACGCA TCTGATGGAC GCCGTTCCGG TGACACTCGG CCAGGAGTTC AGCGGATATG CCCGCCAGAT CGAGGCCGGC ATCGAGCGGG TGCGCGCGTG TCTGCCCAGG CTGGGCGAGC TGGCGATCGG CGGCACCGCG GTGGGTACCG GCCTCAACGC TCCCGACGAC TTCGGCGTCA GAGTGGTCGC GGTGCTGGTC GCGCAGACCG GTCTGTCGGA ATTGCGTACG GCGGCTAATT CTTTCGAAGC TCAGGCTGCC CGCGACGGGC TGGTGGAGGC GTCCGGGGCG CTGCGCACGA TCGCGGTATC GCTGACCAAG ATCGCCAACG ACATCCGCTG GATGGGATCG GGCCCATTGA CCGGCCTGGC CGAGATCCAA CTGCCAGATC TGCAGCCGGG CAGCTCGATC ATGCCGGGAA AGGTGAATCC GGTTCTGCCG GAGGCGGTTA CGCAGGTCGC CGCGCAGGTG ATCGGAAACG ACGCCGCCAT CGCCTGGGGT GGGGCCAACG GCGCATTCGA ACTCAACGTC TACATCCCGA TGATGGCCCG CAACATCCTC GAGTCCTTCA AGCTGCTGAC CAATGTGTCA CGGCTGTTCG CCCAGCGCTG CATAGCAGGG CTGACCGCCA ACGTCGAGCA CCTGCGGCGG CTGGCCGAGT CCTCACCGTC GATCGTGACA CCGTTGAATT CGGCCATCGG CTACGAGGAG GCGGCCGCCG TCGCCAAGCA AGCACTCAAG GAACGCAAAA CGATTCGCCA AACCGTGATC GACCGTGGCC TGATCGGCGA CAGGCTGTCG ATCGAGGATC TGGACCGCCG TCTGGACGTG CTGGCAATGG CCAAGGCCGA GCAGCTCGAC AGCGATCGGC TA
Details for MytuD.01507.a.A1.GE25971
HARVESTED ON: 1/9/2009
SEQUENCED ON: 7/28/2010
EXPECTED MW: 52kDa
OBSERVED MW: 52kDa
ANTIBIOTIC MARKER: ampicillin
COUNT OF EXPRESSION COLONIES: Good (10-50)
TOTAL EXPRESSION LEVEL: High Expression
SOLUBLE EXPRESSION LEVEL High Expression
EXPRESSION HOST: data unavailable
SEQUENCING RESULT: pass with incomplete coverage
PERCENT IDENTITY: 99
PERCENT COVERAGE: 100
Validated AA Sequence
MAHHHHHHMG TLEAQTQGPG SMAVDADSAX YRIEHDTMGE VRVPAKALWR AQTQRAVENF PISGRGLERT QIRALGLLKG ACAQVNSDLG LLAPEKADAI IAAAAEIADG QHDDQFPIDV FQTGSGTSSN MNTNEVIASI AAKGGVTLHP NDDVNMSQSS NDTFPTATHI AATEAAVAHL IPALQQLHDA LAAKALDWHT VVKSGRTHLM DAVPVTLGQE FSGYARQIEA GIERVRACLP RLGELAIGGT AVGTGLNAPD DFGVRVVAVL VAQTGLSELR TAANSXXAQA ARDGLVEASG ALRTIAVSLT KIANDIRWMG SGPLTGLAEI QLPDLQPGSS IMPGKVNPVL PEAVTQVAAQ VIGNDAAIAW GGANGAFELN VYIPMMARNI LESFKLLTNV SRLFAQRCIA GLTANVEHLR RLAESSPSIV TPLNSAIGYE EAAAVAKQAL KERKTIRQTV IDRGLIGDRL SIEDLDRRLD VLAMAKAEQL DSDRL
Validated NT Sequence
atggctcatc accatcacca tcatatgggt accctggaag ctcagaccca gggtcctggt tcgatggccg ttgacgccga cagcgccnat taccgcatcg agcacgacac catgggcgaa gtccgggtgc cggcaaaagc gttgtggcgc gcgcaaaccc agcgcgcggt ggagaacttc ccgatatccg gccgcgggtt ggagcgcacc cagatccgcg cgctaggcct gctgaaaggc gcctgcgcgc aggtgaactc cgacctcggg ttgctggcgc cggagaaagc cgacgccatc atcgccgcgg ccgccgagat cgccgacggt caacacgacg accagtttcc catcgacgtc ttccagaccg gctcgggcac cagctccaac atgaacacca acgaggtgat tgcgtccatc gcggccaagg gcggggtcac gttgcatccc aacgacgacg tgaacatgtc gcagtcgtcc aacgacacct tcccgacggc cacccacatc gcggccaccg aggccgcggt cgctcatctc atcccagcgc tgcagcagct gcacgacgca ttggccgcca aggctcttga ttggcacacg gtggtgaagt cgggccgaac gcatctgatg gacgccgttc cggtgacact cggccaggag ttcagcggat atgcccgcca gatcgaggcc ggcatcgagc gggtgcgcgc gtgtctgccc aggctgggcg agctggcgat cggcggcacc gcggtgggta ccggcctcaa cgctcccgac gacttcggcg tcagagtggt cgcggtgctg gtcgcgcaga ccggtctgtc ggaattgcgt acggcggcta attctttnna agctcaggct gcccgcgacg ggctggtgga ggcgtccggg gcgctgcgca cgatcgcggt atcgctgacc aagatcgcca acgacatccg ctggatggga tcgggcccat tgaccggcct ggccgagatc caactgccag atctgcagcc gggcagctcg atcatgccgg gaaaggtgaa tccggttctg ccggaggcgg ttacgcaggt cgccgcgcag gtgatcggaa acgacgccgc catcgcctgg ggtggggcca acggcgcatt cgaactcaac gtctacatcc cgatgatggc ccgcaacatc ctcgagtcct tcaagctgct gaccaatgtg tcacggctgt tcgcccagcg ctgcatagca gggctgaccg ccaacgtcga gcacctgcgg cggctggccg agtcctcacc gtcgatcgtg acaccgttga attcggccat cggctacgag gaggcggccg ccgtcgccaa gcaagcactc aaggaacgca aaacgattcg ccaaaccgtg atcgaccgtg gcctgatcgg cgacaggctg tcgatcgagg atctggaccg ccgtctggac gtgctggcaa tggccaaggc cgagcagctc gacagcgatc ggctataata a
Expected Protein Sequence
MAHHHHHHMG TLEAQTQGPG SMAVDADSAN YRIEHDTMGE VRVPAKALWR AQTQRAVENF PISGRGLERT QIRALGLLKG ACAQVNSDLG LLAPEKADAI IAAAAEIADG QHDDQFPIDV FQTGSGTSSN MNTNEVIASI AAKGGVTLHP NDDVNMSQSS NDTFPTATHI AATEAAVAHL IPALQQLHDA LAAKALDWHT VVKSGRTHLM DAVPVTLGQE FSGYARQIEA GIERVRACLP RLGELAIGGT AVGTGLNAPD DFGVRVVAVL VAQTGLSELR TAANSFEAQA ARDGLVEASG ALRTIAVSLT KIANDIRWMG SGPLTGLAEI QLPDLQPGSS IMPGKVNPVL PEAVTQVAAQ VIGNDAAIAW GGANGAFELN VYIPMMARNI LESFKLLTNV SRLFAQRCIA GLTANVEHLR RLAESSPSIV TPLNSAIGYE EAAAVAKQAL KERKTIRQTV IDRGLIGDRL SIEDLDRRLD VLAMAKAEQL DSDRL
Full NT Sequence (Expression Vector + Insert)
ttcttgaaga cgaaagggcc tcgtgatacg cctattttta taggttaatg tcatgataat aatggtttct tagacgtcag gtggcacttt tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg tgttgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc tgcagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc actgattaag cattggtaac tgtcagacca agtttactca tatatacttt agattgattt aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc gcatatatgg tgcactctca gtacaatctg ctctgatgcc gcatagttaa gccagtatac actccgctat cgctacgtga ctgggtcatg gctgcgcccc gacacccgcc aacacccgct gacgcgccct gacgggcttg tctgctcccg gcatccgctt acagacaagc tgtgaccgtc tccgggagct gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc gaggcagctg cggtaaagct catcagcgtg gtcgtgaagc gattcacaga tgtctgcctg ttcatccgcg tccagctcgt tgagtttctc cagaagcgtt aatgtctggc ttctgataaa gcgggccatg ttaagggcgg ttttttcctg tttggtcact gatgcctccg tgtaaggggg atttctgttc atgggggtaa tgataccgat gaaacgagag aggatgctca cgatacgggt tactgatgat gaacatgccc ggttactgga acgttgtgag ggtaaacaac tggcggtatg gatgcggcgg gaccagagaa aaatcactca gggtcaatgc cagcgcttcg ttaatacaga tgtaggtgtt ccacagggta gccagcagca tcctgcgatg cagatccgga acataatggt gcagggcgct gacttccgcg tttccagact ttacgaaaca cggaaaccga agaccattca tgttgttgct caggtcgcag acgttttgca gcagcagtcg cttcacgttc gctcgcgtat cggtgattca ttctgctaac cagtaaggca accccgccag cctagccggg tcctcaacga caggagcacg atcatgcgca cccgtggcca ggacccaacg ctgcccgaga tgcgccgcgt gcggctgctg gagatggcgg acgcgatgga tatgttctgc caagggttgg tttgcgcatt cacagttctc cgcaagaatt gattggctcc aattcttgga gtggtgaatc cgttagcgag gtgccgccgg cttccattca ggtcgaggtg gcccggctcc atgcaccgcg acgcaacgcg gggaggcaga caaggtatag ggcggcgcct acaatccatg ccaacccgtt ccatgtgctc gccgaggcgg cataaatcgc cgtgacgatc agcggtccag tgatcgaagt taggctggta agagccgcga gcgatccttg aagctgtccc tgatggtcgt catctacctg cctggacagc atggcctgca acgcgggcat cccgatgccg ccggaagcga gaagaatcat aatggggaag gccatccagc ctcgcgtcgt gaacgccagc aagacgtagc ccagcgcgtc ggccgtaaca acaccattta aatggagtgg ttacaaatgg agtggttaat taacaacacc atttgtcgac gctctccctt atgcgactcc tgcattagga agcagcccag tagtaggttg aggccgttga gcaccgccgc cgcaaggaat ggtgcatgca aggagatggc gcccaacagt cccccggcca cggggcctgc caccataccc acgccgaaac aagcgctcat gagcccgaag tggcgagccc gatcttcccc atcggtgatg tcggcgatat aggcgccagc aaccgcacct gtggcgccgg tgatgccggc cacgatgcgt ccggcgtaga ggatcgagat ctcgatcccg cgaaattaat acgactcact atagggagac cacaacggtt tccctctaga aataattttg tttaacttta agaaggagat ataccatggc tcatcaccat caccatcata tgggtaccct ggaagctcag acccagggtc ctggttcgat ggccgttgac gccgacagcg ccaattaccg catcgagcac gacaccatgg gcgaagtccg ggtgccggca aaagcgttgt ggcgcgcgca aacccagcgc gcggtggaga acttcccgat atccggccgc gggttggagc gcacccagat ccgcgcgcta ggcctgctga aaggcgcctg cgcgcaggtg aactccgacc tcgggttgct ggcgccggag aaagccgacg ccatcatcgc cgcggccgcc gagatcgccg acggtcaaca cgacgaccag tttcccatcg acgtcttcca gaccggctcg ggcaccagct ccaacatgaa caccaacgag gtgattgcgt ccatcgcggc caagggcggg gtcacgttgc atcccaacga cgacgtgaac atgtcgcagt cgtccaacga caccttcccg acggccaccc acatcgcggc caccgaggcc gcggtcgctc atctcatccc agcgctgcag cagctgcacg acgcattggc cgccaaggct cttgattggc acacggtggt gaagtcgggc cgaacgcatc tgatggacgc cgttccggtg acactcggcc aggagttcag cggatatgcc cgccagatcg aggccggcat cgagcgggtg cgcgcgtgtc tgcccaggct gggcgagctg gcgatcggcg gcaccgcggt gggtaccggc ctcaacgctc ccgacgactt cggcgtcaga gtggtcgcgg tgctggtcgc gcagaccggt ctgtcggaat tgcgtacggc ggctaattct ttcgaagctc aggctgcccg cgacgggctg gtggaggcgt ccggggcgct gcgcacgatc gcggtatcgc tgaccaagat cgccaacgac atccgctgga tgggatcggg cccattgacc ggcctggccg agatccaact gccagatctg cagccgggca gctcgatcat gccgggaaag gtgaatccgg ttctgccgga ggcggttacg caggtcgccg cgcaggtgat cggaaacgac gccgccatcg cctggggtgg ggccaacggc gcattcgaac tcaacgtcta catcccgatg atggcccgca acatcctcga gtccttcaag ctgctgacca atgtgtcacg gctgttcgcc cagcgctgca tagcagggct gaccgccaac gtcgagcacc tgcggcggct ggccgagtcc tcaccgtcga tcgtgacacc gttgaattcg gccatcggct acgaggaggc ggccgccgtc gccaagcaag cactcaagga acgcaaaacg attcgccaaa ccgtgatcga ccgtggcctg atcggcgaca ggctgtcgat cgaggatctg gaccgccgtc tggacgtgct ggcaatggcc aaggccgagc agctcgacag cgatcggcta aaacagcacg aacaagttct gcagccaagc ttctcgagga tccggctgct aacaaagccc gaaaggaagc tgagttggct gctgccaccg ctgagcaata actagcataa ccccttgggg cctctaaacg ggtcttgagg ggttttttgc tgaaaggagg aactatatcc ggatatccac aggacgggtg tggtcgccat gatcgcgtag tcgatagtgg ctccaagtag cgaagcgagc aggactgggc ggcggccaaa gcggtcggac agtgctccga gaacgggtgc gcatagaaat tgcatcaacg catatagcgc tagcagcacg ccatagtgac tggcgatgct gtcggaatgg acgatatccc gcaagaggcc cggcagtacc ggcataacca agcctatgcc tacagcatcc agggtgacgg tgccgaggat gacgatgagc gcattgttag atttcataca cggtgcctga ctgcgttagc aatttaactg tgataaacta ccgcattaaa gcttatcgat gataagctgt caaacatgag aa
Details for MytuD.01507.a.A1.PS00180
PURIFICATION DATe: 2/13/2009
CONCENTRATION: 32.2mg/ml
OBSERVED MW: data unavailable
EXPRESSION LEVEL: Moderate Expression
PROTEIN PURIFICATION BUFFER: 20 mM HEPES, pH 7.0, 300 mM NaCl, 5% glycerol and 1 mM TCEP
EXPRESSION HOST: data unavailable
VIAL COUNT (approx.): 3
VIAL VOLUME: 200µl
PERCENT IDENTITY: 99
PERCENT COVERAGE: 100
Protocol Notes
notes unavailable
Validated AA Sequence
MAHHHHHHMG TLEAQTQGPG SMAVDADSAX YRIEHDTMGE VRVPAKALWR AQTQRAVENF PISGRGLERT QIRALGLLKG ACAQVNSDLG LLAPEKADAI IAAAAEIADG QHDDQFPIDV FQTGSGTSSN MNTNEVIASI AAKGGVTLHP NDDVNMSQSS NDTFPTATHI AATEAAVAHL IPALQQLHDA LAAKALDWHT VVKSGRTHLM DAVPVTLGQE FSGYARQIEA GIERVRACLP RLGELAIGGT AVGTGLNAPD DFGVRVVAVL VAQTGLSELR TAANSXXAQA ARDGLVEASG ALRTIAVSLT KIANDIRWMG SGPLTGLAEI QLPDLQPGSS IMPGKVNPVL PEAVTQVAAQ VIGNDAAIAW GGANGAFELN VYIPMMARNI LESFKLLTNV SRLFAQRCIA GLTANVEHLR RLAESSPSIV TPLNSAIGYE EAAAVAKQAL KERKTIRQTV IDRGLIGDRL SIEDLDRRLD VLAMAKAEQL DSDRL
Validated NT Sequence
atggctcatc accatcacca tcatatgggt accctggaag ctcagaccca gggtcctggt tcgatggccg ttgacgccga cagcgccnat taccgcatcg agcacgacac catgggcgaa gtccgggtgc cggcaaaagc gttgtggcgc gcgcaaaccc agcgcgcggt ggagaacttc ccgatatccg gccgcgggtt ggagcgcacc cagatccgcg cgctaggcct gctgaaaggc gcctgcgcgc aggtgaactc cgacctcggg ttgctggcgc cggagaaagc cgacgccatc atcgccgcgg ccgccgagat cgccgacggt caacacgacg accagtttcc catcgacgtc ttccagaccg gctcgggcac cagctccaac atgaacacca acgaggtgat tgcgtccatc gcggccaagg gcggggtcac gttgcatccc aacgacgacg tgaacatgtc gcagtcgtcc aacgacacct tcccgacggc cacccacatc gcggccaccg aggccgcggt cgctcatctc atcccagcgc tgcagcagct gcacgacgca ttggccgcca aggctcttga ttggcacacg gtggtgaagt cgggccgaac gcatctgatg gacgccgttc cggtgacact cggccaggag ttcagcggat atgcccgcca gatcgaggcc ggcatcgagc gggtgcgcgc gtgtctgccc aggctgggcg agctggcgat cggcggcacc gcggtgggta ccggcctcaa cgctcccgac gacttcggcg tcagagtggt cgcggtgctg gtcgcgcaga ccggtctgtc ggaattgcgt acggcggcta attctttnna agctcaggct gcccgcgacg ggctggtgga ggcgtccggg gcgctgcgca cgatcgcggt atcgctgacc aagatcgcca acgacatccg ctggatggga tcgggcccat tgaccggcct ggccgagatc caactgccag atctgcagcc gggcagctcg atcatgccgg gaaaggtgaa tccggttctg ccggaggcgg ttacgcaggt cgccgcgcag gtgatcggaa acgacgccgc catcgcctgg ggtggggcca acggcgcatt cgaactcaac gtctacatcc cgatgatggc ccgcaacatc ctcgagtcct tcaagctgct gaccaatgtg tcacggctgt tcgcccagcg ctgcatagca gggctgaccg ccaacgtcga gcacctgcgg cggctggccg agtcctcacc gtcgatcgtg acaccgttga attcggccat cggctacgag gaggcggccg ccgtcgccaa gcaagcactc aaggaacgca aaacgattcg ccaaaccgtg atcgaccgtg gcctgatcgg cgacaggctg tcgatcgagg atctggaccg ccgtctggac gtgctggcaa tggccaaggc cgagcagctc gacagcgatc ggctataata a
Expressed Protein Sequence
MAHHHHHHMG TLEAQTQGPG SMAVDADSAN YRIEHDTMGE VRVPAKALWR AQTQRAVENF PISGRGLERT QIRALGLLKG ACAQVNSDLG LLAPEKADAI IAAAAEIADG QHDDQFPIDV FQTGSGTSSN MNTNEVIASI AAKGGVTLHP NDDVNMSQSS NDTFPTATHI AATEAAVAHL IPALQQLHDA LAAKALDWHT VVKSGRTHLM DAVPVTLGQE FSGYARQIEA GIERVRACLP RLGELAIGGT AVGTGLNAPD DFGVRVVAVL VAQTGLSELR TAANSFEAQA ARDGLVEASG ALRTIAVSLT KIANDIRWMG SGPLTGLAEI QLPDLQPGSS IMPGKVNPVL PEAVTQVAAQ VIGNDAAIAW GGANGAFELN VYIPMMARNI LESFKLLTNV SRLFAQRCIA GLTANVEHLR RLAESSPSIV TPLNSAIGYE EAAAVAKQAL KERKTIRQTV IDRGLIGDRL SIEDLDRRLD VLAMAKAEQL DSDRL
Full NT Sequence (Expression Vector + Insert)
TTCTTGAAGA CGAAAGGGCC TCGTGATACG CCTATTTTTA TAGGTTAATG TCATGATAAT AATGGTTTCT TAGACGTCAG GTGGCACTTT TCGGGGAAAT GTGCGCGGAA CCCCTATTTG TTTATTTTTC TAAATACATT CAAATATGTA TCCGCTCATG AGACAATAAC CCTGATAAAT GCTTCAATAA TATTGAAAAA GGAAGAGTAT GAGTATTCAA CATTTCCGTG TCGCCCTTAT TCCCTTTTTT GCGGCATTTT GCCTTCCTGT TTTTGCTCAC CCAGAAACGC TGGTGAAAGT AAAAGATGCT GAAGATCAGT TGGGTGCACG AGTGGGTTAC ATCGAACTGG ATCTCAACAG CGGTAAGATC CTTGAGAGTT TTCGCCCCGA AGAACGTTTT CCAATGATGA GCACTTTTAA AGTTCTGCTA TGTGGCGCGG TATTATCCCG TGTTGACGCC GGGCAAGAGC AACTCGGTCG CCGCATACAC TATTCTCAGA ATGACTTGGT TGAGTACTCA CCAGTCACAG AAAAGCATCT TACGGATGGC ATGACAGTAA GAGAATTATG CAGTGCTGCC ATAACCATGA GTGATAACAC TGCGGCCAAC TTACTTCTGA CAACGATCGG AGGACCGAAG GAGCTAACCG CTTTTTTGCA CAACATGGGG GATCATGTAA CTCGCCTTGA TCGTTGGGAA CCGGAGCTGA ATGAAGCCAT ACCAAACGAC GAGCGTGACA CCACGATGCC TGCAGCAATG GCAACAACGT TGCGCAAACT ATTAACTGGC GAACTACTTA CTCTAGCTTC CCGGCAACAA TTAATAGACT GGATGGAGGC GGATAAAGTT GCAGGACCAC TTCTGCGCTC GGCCCTTCCG GCTGGCTGGT TTATTGCTGA TAAATCTGGA GCCGGTGAGC GTGGGTCTCG CGGTATCATT GCAGCACTGG GGCCAGATGG TAAGCCCTCC CGTATCGTAG TTATCTACAC GACGGGGAGT CAGGCAACTA TGGATGAACG AAATAGACAG ATCGCTGAGA TAGGTGCCTC ACTGATTAAG CATTGGTAAC TGTCAGACCA AGTTTACTCA TATATACTTT AGATTGATTT AAAACTTCAT TTTTAATTTA AAAGGATCTA GGTGAAGATC CTTTTTGATA ATCTCATGAC CAAAATCCCT TAACGTGAGT TTTCGTTCCA CTGAGCGTCA GACCCCGTAG AAAAGATCAA AGGATCTTCT TGAGATCCTT TTTTTCTGCG CGTAATCTGC TGCTTGCAAA CAAAAAAACC ACCGCTACCA GCGGTGGTTT GTTTGCCGGA TCAAGAGCTA CCAACTCTTT TTCCGAAGGT AACTGGCTTC AGCAGAGCGC AGATACCAAA TACTGTCCTT CTAGTGTAGC CGTAGTTAGG CCACCACTTC AAGAACTCTG TAGCACCGCC TACATACCTC GCTCTGCTAA TCCTGTTACC AGTGGCTGCT GCCAGTGGCG ATAAGTCGTG TCTTACCGGG TTGGACTCAA GACGATAGTT ACCGGATAAG GCGCAGCGGT CGGGCTGAAC GGGGGGTTCG TGCACACAGC CCAGCTTGGA GCGAACGACC TACACCGAAC TGAGATACCT ACAGCGTGAG CTATGAGAAA GCGCCACGCT TCCCGAAGGG AGAAAGGCGG ACAGGTATCC GGTAAGCGGC AGGGTCGGAA CAGGAGAGCG CACGAGGGAG CTTCCAGGGG GAAACGCCTG GTATCTTTAT AGTCCTGTCG GGTTTCGCCA CCTCTGACTT GAGCGTCGAT TTTTGTGATG CTCGTCAGGG GGGCGGAGCC TATGGAAAAA CGCCAGCAAC GCGGCCTTTT TACGGTTCCT GGCCTTTTGC TGGCCTTTTG CTCACATGTT CTTTCCTGCG TTATCCCCTG ATTCTGTGGA TAACCGTATT ACCGCCTTTG AGTGAGCTGA TACCGCTCGC CGCAGCCGAA CGACCGAGCG CAGCGAGTCA GTGAGCGAGG AAGCGGAAGA GCGCCTGATG CGGTATTTTC TCCTTACGCA TCTGTGCGGT ATTTCACACC GCATATATGG TGCACTCTCA GTACAATCTG CTCTGATGCC GCATAGTTAA GCCAGTATAC ACTCCGCTAT CGCTACGTGA CTGGGTCATG GCTGCGCCCC GACACCCGCC AACACCCGCT GACGCGCCCT GACGGGCTTG TCTGCTCCCG GCATCCGCTT ACAGACAAGC TGTGACCGTC TCCGGGAGCT GCATGTGTCA GAGGTTTTCA CCGTCATCAC CGAAACGCGC GAGGCAGCTG CGGTAAAGCT CATCAGCGTG GTCGTGAAGC GATTCACAGA TGTCTGCCTG TTCATCCGCG TCCAGCTCGT TGAGTTTCTC CAGAAGCGTT AATGTCTGGC TTCTGATAAA GCGGGCCATG TTAAGGGCGG TTTTTTCCTG TTTGGTCACT GATGCCTCCG TGTAAGGGGG ATTTCTGTTC ATGGGGGTAA TGATACCGAT GAAACGAGAG AGGATGCTCA CGATACGGGT TACTGATGAT GAACATGCCC GGTTACTGGA ACGTTGTGAG GGTAAACAAC TGGCGGTATG GATGCGGCGG GACCAGAGAA AAATCACTCA GGGTCAATGC CAGCGCTTCG TTAATACAGA TGTAGGTGTT CCACAGGGTA GCCAGCAGCA TCCTGCGATG CAGATCCGGA ACATAATGGT GCAGGGCGCT GACTTCCGCG TTTCCAGACT TTACGAAACA CGGAAACCGA AGACCATTCA TGTTGTTGCT CAGGTCGCAG ACGTTTTGCA GCAGCAGTCG CTTCACGTTC GCTCGCGTAT CGGTGATTCA TTCTGCTAAC CAGTAAGGCA ACCCCGCCAG CCTAGCCGGG TCCTCAACGA CAGGAGCACG ATCATGCGCA CCCGTGGCCA GGACCCAACG CTGCCCGAGA TGCGCCGCGT GCGGCTGCTG GAGATGGCGG ACGCGATGGA TATGTTCTGC CAAGGGTTGG TTTGCGCATT CACAGTTCTC CGCAAGAATT GATTGGCTCC AATTCTTGGA GTGGTGAATC CGTTAGCGAG GTGCCGCCGG CTTCCATTCA GGTCGAGGTG GCCCGGCTCC ATGCACCGCG ACGCAACGCG GGGAGGCAGA CAAGGTATAG GGCGGCGCCT ACAATCCATG CCAACCCGTT CCATGTGCTC GCCGAGGCGG CATAAATCGC CGTGACGATC AGCGGTCCAG TGATCGAAGT TAGGCTGGTA AGAGCCGCGA GCGATCCTTG AAGCTGTCCC TGATGGTCGT CATCTACCTG CCTGGACAGC ATGGCCTGCA ACGCGGGCAT CCCGATGCCG CCGGAAGCGA GAAGAATCAT AATGGGGAAG GCCATCCAGC CTCGCGTCGT GAACGCCAGC AAGACGTAGC CCAGCGCGTC GGCCGTAACA ACACCATTTA AATGGAGTGG TTACAAATGG AGTGGTTAAT TAACAACACC ATTTGTCGAC GCTCTCCCTT ATGCGACTCC TGCATTAGGA AGCAGCCCAG TAGTAGGTTG AGGCCGTTGA GCACCGCCGC CGCAAGGAAT GGTGCATGCA AGGAGATGGC GCCCAACAGT CCCCCGGCCA CGGGGCCTGC CACCATACCC ACGCCGAAAC AAGCGCTCAT GAGCCCGAAG TGGCGAGCCC GATCTTCCCC ATCGGTGATG TCGGCGATAT AGGCGCCAGC AACCGCACCT GTGGCGCCGG TGATGCCGGC CACGATGCGT CCGGCGTAGA GGATCGAGAT CTCGATCCCG CGAAATTAAT ACGACTCACT ATAGGGAGAC CACAACGGTT TCCCTCTAGA AATAATTTTG TTTAACTTTA AGAAGGAGAT ATACCATGGC TCATCACCAT CACCATCATA TGGGTACCCT GGAAGCTCAG ACCCAGGGTC CTGGTTCGAT GGCCGTTGAC GCCGACAGCG CCAATTACCG CATCGAGCAC GACACCATGG GCGAAGTCCG GGTGCCGGCA AAAGCGTTGT GGCGCGCGCA AACCCAGCGC GCGGTGGAGA ACTTCCCGAT ATCCGGCCGC GGGTTGGAGC GCACCCAGAT CCGCGCGCTA GGCCTGCTGA AAGGCGCCTG CGCGCAGGTG AACTCCGACC TCGGGTTGCT GGCGCCGGAG AAAGCCGACG CCATCATCGC CGCGGCCGCC GAGATCGCCG ACGGTCAACA CGACGACCAG TTTCCCATCG ACGTCTTCCA GACCGGCTCG GGCACCAGCT CCAACATGAA CACCAACGAG GTGATTGCGT CCATCGCGGC CAAGGGCGGG GTCACGTTGC ATCCCAACGA CGACGTGAAC ATGTCGCAGT CGTCCAACGA CACCTTCCCG ACGGCCACCC ACATCGCGGC CACCGAGGCC GCGGTCGCTC ATCTCATCCC AGCGCTGCAG CAGCTGCACG ACGCATTGGC CGCCAAGGCT CTTGATTGGC ACACGGTGGT GAAGTCGGGC CGAACGCATC TGATGGACGC CGTTCCGGTG ACACTCGGCC AGGAGTTCAG CGGATATGCC CGCCAGATCG AGGCCGGCAT CGAGCGGGTG CGCGCGTGTC TGCCCAGGCT GGGCGAGCTG GCGATCGGCG GCACCGCGGT GGGTACCGGC CTCAACGCTC CCGACGACTT CGGCGTCAGA GTGGTCGCGG TGCTGGTCGC GCAGACCGGT CTGTCGGAAT TGCGTACGGC GGCTAATTCT TTCGAAGCTC AGGCTGCCCG CGACGGGCTG GTGGAGGCGT CCGGGGCGCT GCGCACGATC GCGGTATCGC TGACCAAGAT CGCCAACGAC ATCCGCTGGA TGGGATCGGG CCCATTGACC GGCCTGGCCG AGATCCAACT GCCAGATCTG CAGCCGGGCA GCTCGATCAT GCCGGGAAAG GTGAATCCGG TTCTGCCGGA GGCGGTTACG CAGGTCGCCG CGCAGGTGAT CGGAAACGAC GCCGCCATCG CCTGGGGTGG GGCCAACGGC GCATTCGAAC TCAACGTCTA CATCCCGATG ATGGCCCGCA ACATCCTCGA GTCCTTCAAG CTGCTGACCA ATGTGTCACG GCTGTTCGCC CAGCGCTGCA TAGCAGGGCT GACCGCCAAC GTCGAGCACC TGCGGCGGCT GGCCGAGTCC TCACCGTCGA TCGTGACACC GTTGAATTCG GCCATCGGCT ACGAGGAGGC GGCCGCCGTC GCCAAGCAAG CACTCAAGGA ACGCAAAACG ATTCGCCAAA CCGTGATCGA CCGTGGCCTG ATCGGCGACA GGCTGTCGAT CGAGGATCTG GACCGCCGTC TGGACGTGCT GGCAATGGCC AAGGCCGAGC AGCTCGACAG CGATCGGCTA AAACAGCACG AACAAGTTCT GCAGCCAAGC TTCTCGAGGA TCCGGCTGCT AACAAAGCCC GAAAGGAAGC TGAGTTGGCT GCTGCCACCG CTGAGCAATA ACTAGCATAA CCCCTTGGGG CCTCTAAACG GGTCTTGAGG GGTTTTTTGC TGAAAGGAGG AACTATATCC GGATATCCAC AGGACGGGTG TGGTCGCCAT GATCGCGTAG TCGATAGTGG CTCCAAGTAG CGAAGCGAGC AGGACTGGGC GGCGGCCAAA GCGGTCGGAC AGTGCTCCGA GAACGGGTGC GCATAGAAAT TGCATCAACG CATATAGCGC TAGCAGCACG CCATAGTGAC TGGCGATGCT GTCGGAATGG ACGATATCCC GCAAGAGGCC CGGCAGTACC GGCATAACCA AGCCTATGCC TACAGCATCC AGGGTGACGG TGCCGAGGAT GACGATGAGC GCATTGTTAG ATTTCATACA CGGTGCCTGA CTGCGTTAGC AATTTAACTG TGATAAACTA CCGCATTAAA GCTTATCGAT GATAAGCTGT CAAACATGAG AA