ButhA.00545.a

Threonine synthase (EC 4.2.3.1) (BTH_I2199) (thrC)

CENTER ID: ButhA.00545.a
ORGANISM: Burkholderia thailandensis E264
ASSOCIATED DISEASE:
CURRENT STATUS: in PDB
COMMUNITY REQUEST: True
NIH RISK GROUP: 3
SELECT AGENT: False
NIH PRIORITY PATHOGENS CATEGORY: IIIB

Ordering Clones & Proteins

If materials are available for this target, they are listed below. Materials can be ordered from SSGCID using the button in the "order material" column; clicking the button adds the material to a virtual cart. You may order multiple materials at a time at no cost to you, as this contract is funded by NIAID. When you are ready to place your order, click the "Place Order" link, which appears in the top right corner of the page after you add your first item to the cart.

Clones*

CENTER REFERENCE ID | DOMAIN/REGION DESCRIPTION | INFO | AA START | AA STOP | ORDER MATERIAL
ButhA.00545.a.A1.GE32796 | full length | | 1 | 483 |
* SSGCID clones represent un-induced expression constructs which have been verified by sequencing from vector primers. Clones may contain a tag, such as N-term 6xHis. Get sequence information using the button in the "info" column.

Proteins

CENTER REFERENCE ID | DOMAIN/REGION DESCRIPTION | INFO | AA START | AA STOP | ORDER MATERIAL
ButhA.00545.a.A1.PW33400 | full length | | 1 | 483 |
Structures
3V7N
DEPOSITED: 12/21/2011
DETERMINATION: XRay
CLONE: ButhA.00545.a.A1.GE32796
PROTEIN: ButhA.00545.a.A1.PW33400
Publications by SSGCID
Combining functional and structural genomics to sample the essential Burkholderia structome.
Abendroth J, Armour B, Barrett L, Baugh L, Begley DW, Buchko GW, Choi R, Clifton MC, Dieterich SH, Dranow DM, Edwards TE, Fairman JW, Fox D, Gallagher LA, Gardberg AS, Gillespie A, Manoil C, Myler PJ, Nakazawa-Hewitt S, Napuli A, Nguyen MT, Patrapuvich R, Phan I, Stacy R, Staker BL, Stewart LJ, Van Voorhis WC
PLoS ONE - 2013
volume 8, issue 1, pages e53851
PMID: 23382856; PMCID: PMC3561365
External Resources
RESOURCE REFERENCE ID
OrthoMCL: OG5_128662
PATRIC ID: fig|271848.6.peg.5018
RefSeq: YP_442720.1
UniProt: Q2SWH9
Sequences
These are the native gene sequences; the sequences of constructs derived from them may differ due to codon optimization or other protocols. To find the specific sequence of any material you may have ordered, click the "info" button next to the name of that material.
AA Sequence
MNYISTRGAG IGERHTFSDI LLGGLAKDGG LYLPSEYPQV SADELARWRT LPYADLAFEI LSKFCDDIAA ADLRAITRRT YTADVYRHAR RGGNAADITP LTTLGTENGA PVSLLELSNG PTLAFKDMAM QLLGNLFEYT LAKHGETLNI LGATSGDTGS AAEYAMRGKE GVRVFMLSPH KKMSAFQTAQ MYSLQDPNIF NLAVNGVFDD CQDIVKAVSN DHAFKAQQKI GTVNSINWAR VVAQVVYYFK GYFAATRSND ERVSFTVPSG NFGNVCAGHI ARMMGLPIEK LVVATNENDV LDEFFRTGAY RVRSAQDTYH TSSPSMDISK ASNFERFVFD LLGRDPARVV QLFRDVEQKG GFDLAASGDF ARVAEFGFVS GRSTHADRIA TIRDVFERYR TMIDTHTADG LKVAREHLRP GVPMVVLETA QPIKFGESIR EALGQEPSRP AAFDGLEALP QRFEVVDANA QQVKDFIAAH TGA
NT Sequence
atgaattaca tctccacgcg cggcgccggc atcggcgagc gccatacgtt ctccgacatc ctgctcggcg gcctcgcgaa ggacggcggg ctctatctgc cgagcgagta tccgcaggtg agcgcggacg agctcgcgcg ctggcgcacg ctgccgtacg cggatctcgc gttcgagatc ctgtcgaagt tctgcgacga catcgccgcc gccgacctgc gcgcgatcac gcgccgcacg tacacggccg acgtgtaccg ccacgcgcgc cgcggcggga acgcggccga catcacgccg ctcacgacgc tcggcaccga gaacggcgcg cccgtctcgc tgctcgagct gtcgaacggc ccgacgctcg cgttcaagga catggcgatg cagttgctcg gcaatctgtt cgagtacacg ctcgccaagc acggcgaaac gctgaacatc ctcggcgcga cgtcgggcga cacgggcagc gcggccgaat acgcgatgcg cggcaaagag ggcgtgcgcg tgttcatgct gtcgccgcac aagaagatga gcgcgttcca gaccgcgcag atgtacagcc tgcaggaccc gaacatcttc aacctcgcgg tgaacggcgt gttcgacgac tgccaggaca tcgtgaaggc cgtgtcgaac gatcacgcgt tcaaggcgca gcagaagatc ggcaccgtca attcgatcaa ctgggcgcgc gtcgtcgcgc aggtcgtcta ctacttcaag ggctacttcg cggcgacgcg gtcgaatgac gagcgcgtgt cgttcacggt gccgtcgggc aatttcggca acgtctgcgc gggccacatc gcgcgcatga tggggctgcc gatcgagaag ctcgtcgtcg cgacgaacga gaacgacgtg ctcgacgagt ttttccgcac gggcgcgtac cgcgtgcgca gcgcccagga cacgtatcac acgagcagcc cgagcatgga catctcgaag gcgtcgaact tcgagcgctt cgtgttcgat ctgctcggcc gcgatccggc gcgtgtcgtc cagctgtttc gcgatgtcga gcaaaagggc ggcttcgatc tcgcggcgag cggcgatttc gcgcgcgtcg ccgagttcgg cttcgtgtcg ggccgcagca cgcacgcgga ccggatcgcg acgatccgcg acgtgttcga gcgctaccgc acgatgatcg acacgcatac ggccgacggc ctgaaggtcg cgcgcgagca tctgcggccg ggcgtgccga tggtcgtgct cgagaccgcg cagccgatca agttcggcga atcgattcgc gaggcgctcg ggcaggagcc gtcgcggcct gccgcgttcg acgggctcga ggcgctgccg cagcgcttcg aggtcgtcga cgcgaacgcg cagcaagtga aggacttcat cgccgcgcat acgggcgcgt ga
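As a quick consistency check, the native NT sequence above can be translated and compared with the AA sequence. The sketch below is a minimal illustration, assuming the standard genetic code. The helper names (`clean`, `translate`) are hypothetical and not part of any SSGCID tooling, and only the first 30 and last 12 nucleotides are used to keep the example short.

```python
from itertools import product

# Standard genetic code, written in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the 10-character grouping spaces and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(nt):
    """Translate a coding sequence, stopping at the first stop codon."""
    nt = clean(nt)
    protein = []
    for i in range(0, len(nt) - 2, 3):
        aa = CODON_TABLE[nt[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and first 10 aa copied from the native sequences above.
nt_start = "atgaattaca tctccacgcg cggcgccggc"
aa_start = "MNYISTRGAG"
print(translate(nt_start) == clean(aa_start))

# Last 12 nt of the native gene, ending in the tga stop codon.
nt_end = "acgggcgcgt ga"
print(translate(nt_end))
```

Passing the full NT and AA strings to `translate` and `clean` extends the same check to the whole gene.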
Details for ButhA.00545.a.A1.GE32796
HARVESTED ON: 6/28/2011
SEQUENCED ON: 7/25/2011
EXPECTED MW: 55kDa
OBSERVED MW: 55kDa
ANTIBIOTIC MARKER: ampicillin
COUNT OF EXPRESSION COLONIES: Too many to count (100+)
TOTAL EXPRESSION LEVEL: Moderate Expression
SOLUBLE EXPRESSION LEVEL: High Expression
EXPRESSION HOST: data unavailable
SEQUENCING RESULT: pass with incomplete coverage
PERCENT IDENTITY: 100
PERCENT COVERAGE: 94
Validated AA Sequence
MNYISTRGAG IGERHTFSDI LLGGLAKDGG LYLPSEYPQV SADELARWRT LPYADLAFEI LSKFCDDIAA ADLRAITRRT YTADVYRHAR RGGNAADITP LTTLGTENGA PVSLLELSNG PTLAFKDMAM QLLGNLFEYT LAKHGETLNI LGATSGDTGS AAEYAMRGKE GVRVFMLSPH KKMSAFQTAQ MYSLQDPNIF NLAVNGVFDD CQDIVKAVSN DHAFKAQQKI GTVNSINWAR VVAQVVYYFK GYFAATRSND ERVSFTVPSG NFGNVCAGHI ARMMGLPIEK LVVATNENDV LDEFFRTGAY RVRSAQDTYH TSSPSMDISK ASNFERFVFD LLGRDPARVV QLFRDVEQKG GFDLAASGDF ARVAEFGFVS GRSTHADRIA TIRDVFERYR TMIDTHTADG LKVAREHLRP GVPMVVLETA QPIKFGESIR EALGQEPSRP AAFDGLEALP QRFEVVDANA QQVKDFI
Validated NT Sequence
ggcgatgaag tccttcactt gctgcgcgtt cgcgtcgacg acctcgaagc gctgcggcag cgcctcgagc ccgtcgaacg cggcaggccg cgacggctcc tgcccgagcg cctcgcgaat cgattcgccg aacttgatcg gctgcgcggt ctcgagcacg accatcggca cgcccggccg cagatgctcg cgcgcgacct tcaggccgtc ggccgtatgc gtgtcgatca tcgtgcggta gcgctcgaac acgtcgcgga tcgtcgcgat ccggtccgcg tgcgtgctgc ggcccgacac gaagccgaac tcggcgacgc gcgcgaaatc gccgctcgcc gcgagatcga agccgccctt ttgctcgaca tcgcgaaaca gctggacgac acgcgccgga tcgcggccga gcagatcgaa cacgaagcgc tcgaagttcg acgccttcga gatgtccatg ctcgggctgc tcgtgtgata cgtgtcctgg gcgctgcgca cgcggtacgc gcccgtgcgg aaaaactcgt cgagcacgtc gttctcgttc gtcgcgacga cgagcttctc gatcggcagc cccatcatgc gcgcgatgtg gcccgcgcag acgttgccga aattgcccga cggcaccgtg aacgacacgc gctcgtcatt cgaccgcgtc gccgcgaagt agcccttgaa gtagtagacg acctgcgcga cgacgcgcgc ccagttgatc gaattgacgg tgccgatctt ctgctgcgcc ttgaacgcgt gatcgttcga cacggccttc acgatgtcct ggcagtcgtc gaacacgccg ttcaccgcga ggttgaagat gttcgggtcc tgcaggctgt acatctgcgc ggtctggaac gcgctcatct tcttgtgcgg cgacagcatg aacacgcgca cgccctcttt gccgcgcatc gcgtattcgg ccgcgctgcc cgtgtcgccc gacgtcgcgc cgaggatgtt cagcgtttcg ccgtgcttgg cgagcgtgta ctcgaacaga ttgccgagca actgcatcgc catgtccttg aacgcgagcg tcgggccgtt cgacagctcg agcagcgaga cgggcgcgcc gttctcggtg ccgagcgtcg tgagcggcgt gatgtcggcc gcgttcccgc cgcggcgcgc gtggcggtac acgtcggccg tgtacgtgcg gcgcgtgatc gcgcgcaggt cggcggcggc gatgtcgtcg cagaacttcg acaggatctc gaacgcgaga tccgcgtacg gcagcgtgcg ccagcgcgcg agctcgtccg cgctcacctg cggatactcg ctcggcagat agagcccgcc gtccttcgcg aggccgccga gcaggatgtc ggagaacgta tggcgctcgc cgatgccggc gccgcgcgtg gagatgtaat tcatcgaacc aggaccctgg gtctgagctt
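The validated read appears to be reported on the reverse strand: its first bases are the reverse complement of the region near the 3' end of the native coding sequence. A minimal sketch for putting such a read back into coding orientation is given here. The helper name `reverse_complement` is hypothetical, and the code assumes only unambiguous a/c/g/t bases.

```python
# Translation table mapping each base to its complement (both cases).
COMPLEMENT = str.maketrans("acgtACGT", "tgcaTGCA")


def reverse_complement(seq):
    """Strip grouping whitespace, complement each base, and reverse the result."""
    bases = "".join(seq.split())
    return bases.translate(COMPLEMENT)[::-1]


# First 20 bases of the validated NT sequence above.
read_start = "ggcgatgaag tccttcactt"
print(reverse_complement(read_start))
```

The printed string can then be searched for in the native NT sequence to confirm the orientation.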
Expected Protein Sequence
MAHHHHHHMG TLEAQTQGPG SMNYISTRGA GIGERHTFSD ILLGGLAKDG GLYLPSEYPQ VSADELARWR TLPYADLAFE ILSKFCDDIA AADLRAITRR TYTADVYRHA RRGGNAADIT PLTTLGTENG APVSLLELSN GPTLAFKDMA MQLLGNLFEY TLAKHGETLN ILGATSGDTG SAAEYAMRGK EGVRVFMLSP HKKMSAFQTA QMYSLQDPNI FNLAVNGVFD DCQDIVKAVS NDHAFKAQQK IGTVNSINWA RVVAQVVYYF KGYFAATRSN DERVSFTVPS GNFGNVCAGH IARMMGLPIE KLVVATNEND VLDEFFRTGA YRVRSAQDTY HTSSPSMDIS KASNFERFVF DLLGRDPARV VQLFRDVEQK GGFDLAASGD FARVAEFGFV SGRSTHADRI ATIRDVFERY RTMIDTHTAD GLKVAREHLR PGVPMVVLET AQPIKFGESI REALGQEPSR PAAFDGLEAL PQRFEVVDAN AQQVKDFIAA HTGA
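The expected protein sequence is the native sequence preceded by an N-terminal tag that includes a 6xHis stretch. The sketch below separates the tag from the native portion and estimates an average molecular weight for comparison with the EXPECTED MW field. It is only an illustration: the helper names (`split_tag`, `average_mass`) are hypothetical, and the residue masses are standard average values entered by hand, not values taken from this record.

```python
# Average residue masses in Da (free amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # one water is added back for the free N- and C-termini

# Expected protein sequence copied from the record above.
EXPECTED_PROTEIN = """
MAHHHHHHMG TLEAQTQGPG SMNYISTRGA GIGERHTFSD ILLGGLAKDG GLYLPSEYPQ VSADELARWR
TLPYADLAFE ILSKFCDDIA AADLRAITRR TYTADVYRHA RRGGNAADIT PLTTLGTENG APVSLLELSN
GPTLAFKDMA MQLLGNLFEY TLAKHGETLN ILGATSGDTG SAAEYAMRGK EGVRVFMLSP HKKMSAFQTA
QMYSLQDPNI FNLAVNGVFD DCQDIVKAVS NDHAFKAQQK IGTVNSINWA RVVAQVVYYF KGYFAATRSN
DERVSFTVPS GNFGNVCAGH IARMMGLPIE KLVVATNEND VLDEFFRTGA YRVRSAQDTY HTSSPSMDIS
KASNFERFVF DLLGRDPARV VQLFRDVEQK GGFDLAASGD FARVAEFGFV SGRSTHADRI ATIRDVFERY
RTMIDTHTAD GLKVAREHLR PGVPMVVLET AQPIKFGESI REALGQEPSR PAAFDGLEAL PQRFEVVDAN
AQQVKDFIAA HTGA
"""


def clean(seq):
    """Remove grouping whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def split_tag(expressed, native_start):
    """Return (tag, remainder), splitting where the native sequence begins."""
    expressed, native_start = clean(expressed), clean(native_start)
    start = expressed.find(native_start)
    if start < 0:
        raise ValueError("native sequence not found in expressed sequence")
    return expressed[:start], expressed[start:]


def average_mass(seq):
    """Sum the average residue masses and add one water."""
    return sum(RESIDUE_MASS[aa] for aa in clean(seq)) + WATER


# The first 20 native residues are enough to locate the tag boundary.
tag, native = split_tag(EXPECTED_PROTEIN, "MNYISTRGAG IGERHTFSDI")
print("tag:", tag, "length:", len(tag))
print("native length:", len(native))
print("estimated mass (kDa):", round(average_mass(EXPECTED_PROTEIN) / 1000, 1))
```

The estimate ignores any post-translational changes, so it is best read as a rough cross-check on the reported value, not a measurement.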
Full NT Sequence (Expression Vector + Insert)
ttcttgaaga cgaaagggcc tcgtgatacg cctattttta taggttaatg tcatgataat aatggtttct tagacgtcag gtggcacttt tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg tgttgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc tgcagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc actgattaag cattggtaac tgtcagacca agtttactca tatatacttt agattgattt aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg 
gggcggagcc tatggaaaaa cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc gcatatatgg tgcactctca gtacaatctg ctctgatgcc gcatagttaa gccagtatac actccgctat cgctacgtga ctgggtcatg gctgcgcccc gacacccgcc aacacccgct gacgcgccct gacgggcttg tctgctcccg gcatccgctt acagacaagc tgtgaccgtc tccgggagct gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc gaggcagctg cggtaaagct catcagcgtg gtcgtgaagc gattcacaga tgtctgcctg ttcatccgcg tccagctcgt tgagtttctc cagaagcgtt aatgtctggc ttctgataaa gcgggccatg ttaagggcgg ttttttcctg tttggtcact gatgcctccg tgtaaggggg atttctgttc atgggggtaa tgataccgat gaaacgagag aggatgctca cgatacgggt tactgatgat gaacatgccc ggttactgga acgttgtgag ggtaaacaac tggcggtatg gatgcggcgg gaccagagaa aaatcactca gggtcaatgc cagcgcttcg ttaatacaga tgtaggtgtt ccacagggta gccagcagca tcctgcgatg cagatccgga acataatggt gcagggcgct gacttccgcg tttccagact ttacgaaaca cggaaaccga agaccattca tgttgttgct caggtcgcag acgttttgca gcagcagtcg cttcacgttc gctcgcgtat cggtgattca ttctgctaac cagtaaggca accccgccag cctagccggg tcctcaacga caggagcacg atcatgcgca cccgtggcca ggacccaacg ctgcccgaga tgcgccgcgt gcggctgctg gagatggcgg acgcgatgga tatgttctgc caagggttgg tttgcgcatt cacagttctc cgcaagaatt gattggctcc aattcttgga gtggtgaatc cgttagcgag gtgccgccgg cttccattca ggtcgaggtg gcccggctcc atgcaccgcg acgcaacgcg gggaggcaga caaggtatag ggcggcgcct acaatccatg ccaacccgtt ccatgtgctc gccgaggcgg cataaatcgc cgtgacgatc agcggtccag tgatcgaagt taggctggta agagccgcga gcgatccttg aagctgtccc tgatggtcgt catctacctg cctggacagc atggcctgca acgcgggcat cccgatgccg ccggaagcga gaagaatcat aatggggaag gccatccagc ctcgcgtcgt gaacgccagc aagacgtagc ccagcgcgtc ggccgtaaca acaccattta aatggagtgg ttacaaatgg agtggttaat taacaacacc atttgtcgac gctctccctt atgcgactcc tgcattagga agcagcccag tagtaggttg aggccgttga gcaccgccgc cgcaaggaat ggtgcatgca aggagatggc gcccaacagt cccccggcca cggggcctgc 
caccataccc acgccgaaac aagcgctcat gagcccgaag tggcgagccc gatcttcccc atcggtgatg tcggcgatat aggcgccagc aaccgcacct gtggcgccgg tgatgccggc cacgatgcgt ccggcgtaga ggatcgagat ctcgatcccg cgaaattaat acgactcact atagggagac cacaacggtt tccctctaga aataattttg tttaacttta agaaggagat ataccatggc tcatcaccat caccatcata tgggtaccct ggaagctcag acccagggtc ctggttcgat gaattacatc tccacgcgcg gcgccggcat cggcgagcgc catacgttct ccgacatcct gctcggcggc ctcgcgaagg acggcgggct ctatctgccg agcgagtatc cgcaggtgag cgcggacgag ctcgcgcgct ggcgcacgct gccgtacgcg gatctcgcgt tcgagatcct gtcgaagttc tgcgacgaca tcgccgccgc cgacctgcgc gcgatcacgc gccgcacgta cacggccgac gtgtaccgcc acgcgcgccg cggcgggaac gcggccgaca tcacgccgct cacgacgctc ggcaccgaga acggcgcgcc cgtctcgctg ctcgagctgt cgaacggccc gacgctcgcg ttcaaggaca tggcgatgca gttgctcggc aatctgttcg agtacacgct cgccaagcac ggcgaaacgc tgaacatcct cggcgcgacg tcgggcgaca cgggcagcgc ggccgaatac gcgatgcgcg gcaaagaggg cgtgcgcgtg ttcatgctgt cgccgcacaa gaagatgagc gcgttccaga ccgcgcagat gtacagcctg caggacccga acatcttcaa cctcgcggtg aacggcgtgt tcgacgactg ccaggacatc gtgaaggccg tgtcgaacga tcacgcgttc aaggcgcagc agaagatcgg caccgtcaat tcgatcaact gggcgcgcgt cgtcgcgcag gtcgtctact acttcaaggg ctacttcgcg gcgacgcggt cgaatgacga gcgcgtgtcg ttcacggtgc cgtcgggcaa tttcggcaac gtctgcgcgg gccacatcgc gcgcatgatg gggctgccga tcgagaagct cgtcgtcgcg acgaacgaga acgacgtgct cgacgagttt ttccgcacgg gcgcgtaccg cgtgcgcagc gcccaggaca cgtatcacac gagcagcccg agcatggaca tctcgaaggc gtcgaacttc gagcgcttcg tgttcgatct gctcggccgc gatccggcgc gtgtcgtcca gctgtttcgc gatgtcgagc aaaagggcgg cttcgatctc gcggcgagcg gcgatttcgc gcgcgtcgcc gagttcggct tcgtgtcggg ccgcagcacg cacgcggacc ggatcgcgac gatccgcgac gtgttcgagc gctaccgcac gatgatcgac acgcatacgg ccgacggcct gaaggtcgcg cgcgagcatc tgcggccggg cgtgccgatg gtcgtgctcg agaccgcgca gccgatcaag ttcggcgaat cgattcgcga ggcgctcggg caggagccgt cgcggcctgc cgcgttcgac gggctcgagg cgctgccgca gcgcttcgag gtcgtcgacg cgaacgcgca gcaagtgaag gacttcatcg ccgcgcatac gggcgcgtaa acagcacgaa caagttctgc agccaagctt ctcgaggatc cggctgctaa 
caaagcccga aaggaagctg agttggctgc tgccaccgct gagcaataac tagcataacc ccttggggcc tctaaacggg tcttgagggg ttttttgctg aaaggaggaa ctatatccgg atatccacag gacgggtgtg gtcgccatga tcgcgtagtc gatagtggct ccaagtagcg aagcgagcag gactgggcgg cggccaaagc ggtcggacag tgctccgaga acgggtgcgc atagaaattg catcaacgca tatagcgcta gcagcacgcc atagtgactg gcgatgctgt cggaatggac gatatcccgc aagaggcccg gcagtaccgg cataaccaag cctatgccta cagcatccag ggtgacggtg ccgaggatga cgatgagcgc attgttagat ttcatacacg gtgcctgact gcgttagcaa tttaactgtg ataaactacc gcattaaagc ttatcgatga taagctgtca aacatgagaa
Details for ButhA.00545.a.A1.PW33400
PURIFICATION DATE: 8/19/2011
CONCENTRATION: 44.82mg/ml
OBSERVED MW: data unavailable
EXPRESSION LEVEL: High Expression
PROTEIN PURIFICATION BUFFER: 25 mM HEPES pH 7.0, 500 mM NaCl, 5% glycerol, 2 mM DTT, and 0.025% azide
EXPRESSION HOST: data unavailable
VIAL COUNT (approx.): 10
VIAL VOLUME: 200µl
PERCENT IDENTITY: 100
PERCENT COVERAGE: 94
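For planning purposes, the concentration, vial volume, and vial count above translate into an approximate amount of protein per vial and in total. The short sketch below does that arithmetic. The molar concentration uses 55 kDa as an assumed molecular weight, borrowed from the EXPECTED MW of the clone because OBSERVED MW is unavailable for this protein; the variable names are illustrative only.

```python
# Values copied from the protein details above.
concentration_mg_per_ml = 44.82
vial_volume_ul = 200
vial_count = 10  # listed as approximate

# Assumption: molecular weight taken from the clone's EXPECTED MW (55 kDa).
assumed_mw_da = 55_000

# mg per vial = (mg/mL) * (uL / 1000 uL per mL)
mg_per_vial = concentration_mg_per_ml * vial_volume_ul / 1000
total_mg = mg_per_vial * vial_count

# mg/mL equals g/L, so dividing by g/mol gives mol/L; times 1000 gives mM.
molar_mM = concentration_mg_per_ml / assumed_mw_da * 1000

print(f"{mg_per_vial:.3f} mg per vial")
print(f"{total_mg:.2f} mg across {vial_count} vials")
print(f"{molar_mM:.3f} mM at the assumed molecular weight")
```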
Protocol Notes
notes unavailable
Validated AA Sequence
MNYISTRGAG IGERHTFSDI LLGGLAKDGG LYLPSEYPQV SADELARWRT LPYADLAFEI LSKFCDDIAA ADLRAITRRT YTADVYRHAR RGGNAADITP LTTLGTENGA PVSLLELSNG PTLAFKDMAM QLLGNLFEYT LAKHGETLNI LGATSGDTGS AAEYAMRGKE GVRVFMLSPH KKMSAFQTAQ MYSLQDPNIF NLAVNGVFDD CQDIVKAVSN DHAFKAQQKI GTVNSINWAR VVAQVVYYFK GYFAATRSND ERVSFTVPSG NFGNVCAGHI ARMMGLPIEK LVVATNENDV LDEFFRTGAY RVRSAQDTYH TSSPSMDISK ASNFERFVFD LLGRDPARVV QLFRDVEQKG GFDLAASGDF ARVAEFGFVS GRSTHADRIA TIRDVFERYR TMIDTHTADG LKVAREHLRP GVPMVVLETA QPIKFGESIR EALGQEPSRP AAFDGLEALP QRFEVVDANA QQVKDFI
Validated NT Sequence
ggcgatgaag tccttcactt gctgcgcgtt cgcgtcgacg acctcgaagc gctgcggcag cgcctcgagc ccgtcgaacg cggcaggccg cgacggctcc tgcccgagcg cctcgcgaat cgattcgccg aacttgatcg gctgcgcggt ctcgagcacg accatcggca cgcccggccg cagatgctcg cgcgcgacct tcaggccgtc ggccgtatgc gtgtcgatca tcgtgcggta gcgctcgaac acgtcgcgga tcgtcgcgat ccggtccgcg tgcgtgctgc ggcccgacac gaagccgaac tcggcgacgc gcgcgaaatc gccgctcgcc gcgagatcga agccgccctt ttgctcgaca tcgcgaaaca gctggacgac acgcgccgga tcgcggccga gcagatcgaa cacgaagcgc tcgaagttcg acgccttcga gatgtccatg ctcgggctgc tcgtgtgata cgtgtcctgg gcgctgcgca cgcggtacgc gcccgtgcgg aaaaactcgt cgagcacgtc gttctcgttc gtcgcgacga cgagcttctc gatcggcagc cccatcatgc gcgcgatgtg gcccgcgcag acgttgccga aattgcccga cggcaccgtg aacgacacgc gctcgtcatt cgaccgcgtc gccgcgaagt agcccttgaa gtagtagacg acctgcgcga cgacgcgcgc ccagttgatc gaattgacgg tgccgatctt ctgctgcgcc ttgaacgcgt gatcgttcga cacggccttc acgatgtcct ggcagtcgtc gaacacgccg ttcaccgcga ggttgaagat gttcgggtcc tgcaggctgt acatctgcgc ggtctggaac gcgctcatct tcttgtgcgg cgacagcatg aacacgcgca cgccctcttt gccgcgcatc gcgtattcgg ccgcgctgcc cgtgtcgccc gacgtcgcgc cgaggatgtt cagcgtttcg ccgtgcttgg cgagcgtgta ctcgaacaga ttgccgagca actgcatcgc catgtccttg aacgcgagcg tcgggccgtt cgacagctcg agcagcgaga cgggcgcgcc gttctcggtg ccgagcgtcg tgagcggcgt gatgtcggcc gcgttcccgc cgcggcgcgc gtggcggtac acgtcggccg tgtacgtgcg gcgcgtgatc gcgcgcaggt cggcggcggc gatgtcgtcg cagaacttcg acaggatctc gaacgcgaga tccgcgtacg gcagcgtgcg ccagcgcgcg agctcgtccg cgctcacctg cggatactcg ctcggcagat agagcccgcc gtccttcgcg aggccgccga gcaggatgtc ggagaacgta tggcgctcgc cgatgccggc gccgcgcgtg gagatgtaat tcatcgaacc aggaccctgg gtctgagctt
Expressed Protein Sequence
MAHHHHHHMG TLEAQTQGPG SMNYISTRGA GIGERHTFSD ILLGGLAKDG GLYLPSEYPQ VSADELARWR TLPYADLAFE ILSKFCDDIA AADLRAITRR TYTADVYRHA RRGGNAADIT PLTTLGTENG APVSLLELSN GPTLAFKDMA MQLLGNLFEY TLAKHGETLN ILGATSGDTG SAAEYAMRGK EGVRVFMLSP HKKMSAFQTA QMYSLQDPNI FNLAVNGVFD DCQDIVKAVS NDHAFKAQQK IGTVNSINWA RVVAQVVYYF KGYFAATRSN DERVSFTVPS GNFGNVCAGH IARMMGLPIE KLVVATNEND VLDEFFRTGA YRVRSAQDTY HTSSPSMDIS KASNFERFVF DLLGRDPARV VQLFRDVEQK GGFDLAASGD FARVAEFGFV SGRSTHADRI ATIRDVFERY RTMIDTHTAD GLKVAREHLR PGVPMVVLET AQPIKFGESI REALGQEPSR PAAFDGLEAL PQRFEVVDAN AQQVKDFIAA HTGA
Full NT Sequence (Expression Vector + Insert)
TTCTTGAAGA CGAAAGGGCC TCGTGATACG CCTATTTTTA TAGGTTAATG TCATGATAAT AATGGTTTCT TAGACGTCAG GTGGCACTTT TCGGGGAAAT GTGCGCGGAA CCCCTATTTG TTTATTTTTC TAAATACATT CAAATATGTA TCCGCTCATG AGACAATAAC CCTGATAAAT GCTTCAATAA TATTGAAAAA GGAAGAGTAT GAGTATTCAA CATTTCCGTG TCGCCCTTAT TCCCTTTTTT GCGGCATTTT GCCTTCCTGT TTTTGCTCAC CCAGAAACGC TGGTGAAAGT AAAAGATGCT GAAGATCAGT TGGGTGCACG AGTGGGTTAC ATCGAACTGG ATCTCAACAG CGGTAAGATC CTTGAGAGTT TTCGCCCCGA AGAACGTTTT CCAATGATGA GCACTTTTAA AGTTCTGCTA TGTGGCGCGG TATTATCCCG TGTTGACGCC GGGCAAGAGC AACTCGGTCG CCGCATACAC TATTCTCAGA ATGACTTGGT TGAGTACTCA CCAGTCACAG AAAAGCATCT TACGGATGGC ATGACAGTAA GAGAATTATG CAGTGCTGCC ATAACCATGA GTGATAACAC TGCGGCCAAC TTACTTCTGA CAACGATCGG AGGACCGAAG GAGCTAACCG CTTTTTTGCA CAACATGGGG GATCATGTAA CTCGCCTTGA TCGTTGGGAA CCGGAGCTGA ATGAAGCCAT ACCAAACGAC GAGCGTGACA CCACGATGCC TGCAGCAATG GCAACAACGT TGCGCAAACT ATTAACTGGC GAACTACTTA CTCTAGCTTC CCGGCAACAA TTAATAGACT GGATGGAGGC GGATAAAGTT GCAGGACCAC TTCTGCGCTC GGCCCTTCCG GCTGGCTGGT TTATTGCTGA TAAATCTGGA GCCGGTGAGC GTGGGTCTCG CGGTATCATT GCAGCACTGG GGCCAGATGG TAAGCCCTCC CGTATCGTAG TTATCTACAC GACGGGGAGT CAGGCAACTA TGGATGAACG AAATAGACAG ATCGCTGAGA TAGGTGCCTC ACTGATTAAG CATTGGTAAC TGTCAGACCA AGTTTACTCA TATATACTTT AGATTGATTT AAAACTTCAT TTTTAATTTA AAAGGATCTA GGTGAAGATC CTTTTTGATA ATCTCATGAC CAAAATCCCT TAACGTGAGT TTTCGTTCCA CTGAGCGTCA GACCCCGTAG AAAAGATCAA AGGATCTTCT TGAGATCCTT TTTTTCTGCG CGTAATCTGC TGCTTGCAAA CAAAAAAACC ACCGCTACCA GCGGTGGTTT GTTTGCCGGA TCAAGAGCTA CCAACTCTTT TTCCGAAGGT AACTGGCTTC AGCAGAGCGC AGATACCAAA TACTGTCCTT CTAGTGTAGC CGTAGTTAGG CCACCACTTC AAGAACTCTG TAGCACCGCC TACATACCTC GCTCTGCTAA TCCTGTTACC AGTGGCTGCT GCCAGTGGCG ATAAGTCGTG TCTTACCGGG TTGGACTCAA GACGATAGTT ACCGGATAAG GCGCAGCGGT CGGGCTGAAC GGGGGGTTCG TGCACACAGC CCAGCTTGGA GCGAACGACC TACACCGAAC TGAGATACCT ACAGCGTGAG CTATGAGAAA GCGCCACGCT TCCCGAAGGG AGAAAGGCGG ACAGGTATCC GGTAAGCGGC AGGGTCGGAA CAGGAGAGCG CACGAGGGAG CTTCCAGGGG GAAACGCCTG GTATCTTTAT AGTCCTGTCG GGTTTCGCCA CCTCTGACTT GAGCGTCGAT TTTTGTGATG CTCGTCAGGG 
GGGCGGAGCC TATGGAAAAA CGCCAGCAAC GCGGCCTTTT TACGGTTCCT GGCCTTTTGC TGGCCTTTTG CTCACATGTT CTTTCCTGCG TTATCCCCTG ATTCTGTGGA TAACCGTATT ACCGCCTTTG AGTGAGCTGA TACCGCTCGC CGCAGCCGAA CGACCGAGCG CAGCGAGTCA GTGAGCGAGG AAGCGGAAGA GCGCCTGATG CGGTATTTTC TCCTTACGCA TCTGTGCGGT ATTTCACACC GCATATATGG TGCACTCTCA GTACAATCTG CTCTGATGCC GCATAGTTAA GCCAGTATAC ACTCCGCTAT CGCTACGTGA CTGGGTCATG GCTGCGCCCC GACACCCGCC AACACCCGCT GACGCGCCCT GACGGGCTTG TCTGCTCCCG GCATCCGCTT ACAGACAAGC TGTGACCGTC TCCGGGAGCT GCATGTGTCA GAGGTTTTCA CCGTCATCAC CGAAACGCGC GAGGCAGCTG CGGTAAAGCT CATCAGCGTG GTCGTGAAGC GATTCACAGA TGTCTGCCTG TTCATCCGCG TCCAGCTCGT TGAGTTTCTC CAGAAGCGTT AATGTCTGGC TTCTGATAAA GCGGGCCATG TTAAGGGCGG TTTTTTCCTG TTTGGTCACT GATGCCTCCG TGTAAGGGGG ATTTCTGTTC ATGGGGGTAA TGATACCGAT GAAACGAGAG AGGATGCTCA CGATACGGGT TACTGATGAT GAACATGCCC GGTTACTGGA ACGTTGTGAG GGTAAACAAC TGGCGGTATG GATGCGGCGG GACCAGAGAA AAATCACTCA GGGTCAATGC CAGCGCTTCG TTAATACAGA TGTAGGTGTT CCACAGGGTA GCCAGCAGCA TCCTGCGATG CAGATCCGGA ACATAATGGT GCAGGGCGCT GACTTCCGCG TTTCCAGACT TTACGAAACA CGGAAACCGA AGACCATTCA TGTTGTTGCT CAGGTCGCAG ACGTTTTGCA GCAGCAGTCG CTTCACGTTC GCTCGCGTAT CGGTGATTCA TTCTGCTAAC CAGTAAGGCA ACCCCGCCAG CCTAGCCGGG TCCTCAACGA CAGGAGCACG ATCATGCGCA CCCGTGGCCA GGACCCAACG CTGCCCGAGA TGCGCCGCGT GCGGCTGCTG GAGATGGCGG ACGCGATGGA TATGTTCTGC CAAGGGTTGG TTTGCGCATT CACAGTTCTC CGCAAGAATT GATTGGCTCC AATTCTTGGA GTGGTGAATC CGTTAGCGAG GTGCCGCCGG CTTCCATTCA GGTCGAGGTG GCCCGGCTCC ATGCACCGCG ACGCAACGCG GGGAGGCAGA CAAGGTATAG GGCGGCGCCT ACAATCCATG CCAACCCGTT CCATGTGCTC GCCGAGGCGG CATAAATCGC CGTGACGATC AGCGGTCCAG TGATCGAAGT TAGGCTGGTA AGAGCCGCGA GCGATCCTTG AAGCTGTCCC TGATGGTCGT CATCTACCTG CCTGGACAGC ATGGCCTGCA ACGCGGGCAT CCCGATGCCG CCGGAAGCGA GAAGAATCAT AATGGGGAAG GCCATCCAGC CTCGCGTCGT GAACGCCAGC AAGACGTAGC CCAGCGCGTC GGCCGTAACA ACACCATTTA AATGGAGTGG TTACAAATGG AGTGGTTAAT TAACAACACC ATTTGTCGAC GCTCTCCCTT ATGCGACTCC TGCATTAGGA AGCAGCCCAG TAGTAGGTTG AGGCCGTTGA GCACCGCCGC CGCAAGGAAT GGTGCATGCA AGGAGATGGC GCCCAACAGT CCCCCGGCCA CGGGGCCTGC 
CACCATACCC ACGCCGAAAC AAGCGCTCAT GAGCCCGAAG TGGCGAGCCC GATCTTCCCC ATCGGTGATG TCGGCGATAT AGGCGCCAGC AACCGCACCT GTGGCGCCGG TGATGCCGGC CACGATGCGT CCGGCGTAGA GGATCGAGAT CTCGATCCCG CGAAATTAAT ACGACTCACT ATAGGGAGAC CACAACGGTT TCCCTCTAGA AATAATTTTG TTTAACTTTA AGAAGGAGAT ATACCATGGC TCATCACCAT CACCATCATA TGGGTACCCT GGAAGCTCAG ACCCAGGGTC CTGGTTCGAT GAATTACATC TCCACGCGCG GCGCCGGCAT CGGCGAGCGC CATACGTTCT CCGACATCCT GCTCGGCGGC CTCGCGAAGG ACGGCGGGCT CTATCTGCCG AGCGAGTATC CGCAGGTGAG CGCGGACGAG CTCGCGCGCT GGCGCACGCT GCCGTACGCG GATCTCGCGT TCGAGATCCT GTCGAAGTTC TGCGACGACA TCGCCGCCGC CGACCTGCGC GCGATCACGC GCCGCACGTA CACGGCCGAC GTGTACCGCC ACGCGCGCCG CGGCGGGAAC GCGGCCGACA TCACGCCGCT CACGACGCTC GGCACCGAGA ACGGCGCGCC CGTCTCGCTG CTCGAGCTGT CGAACGGCCC GACGCTCGCG TTCAAGGACA TGGCGATGCA GTTGCTCGGC AATCTGTTCG AGTACACGCT CGCCAAGCAC GGCGAAACGC TGAACATCCT CGGCGCGACG TCGGGCGACA CGGGCAGCGC GGCCGAATAC GCGATGCGCG GCAAAGAGGG CGTGCGCGTG TTCATGCTGT CGCCGCACAA GAAGATGAGC GCGTTCCAGA CCGCGCAGAT GTACAGCCTG CAGGACCCGA ACATCTTCAA CCTCGCGGTG AACGGCGTGT TCGACGACTG CCAGGACATC GTGAAGGCCG TGTCGAACGA TCACGCGTTC AAGGCGCAGC AGAAGATCGG CACCGTCAAT TCGATCAACT GGGCGCGCGT CGTCGCGCAG GTCGTCTACT ACTTCAAGGG CTACTTCGCG GCGACGCGGT CGAATGACGA GCGCGTGTCG TTCACGGTGC CGTCGGGCAA TTTCGGCAAC GTCTGCGCGG GCCACATCGC GCGCATGATG GGGCTGCCGA TCGAGAAGCT CGTCGTCGCG ACGAACGAGA ACGACGTGCT CGACGAGTTT TTCCGCACGG GCGCGTACCG CGTGCGCAGC GCCCAGGACA CGTATCACAC GAGCAGCCCG AGCATGGACA TCTCGAAGGC GTCGAACTTC GAGCGCTTCG TGTTCGATCT GCTCGGCCGC GATCCGGCGC GTGTCGTCCA GCTGTTTCGC GATGTCGAGC AAAAGGGCGG CTTCGATCTC GCGGCGAGCG GCGATTTCGC GCGCGTCGCC GAGTTCGGCT TCGTGTCGGG CCGCAGCACG CACGCGGACC GGATCGCGAC GATCCGCGAC GTGTTCGAGC GCTACCGCAC GATGATCGAC ACGCATACGG CCGACGGCCT GAAGGTCGCG CGCGAGCATC TGCGGCCGGG CGTGCCGATG GTCGTGCTCG AGACCGCGCA GCCGATCAAG TTCGGCGAAT CGATTCGCGA GGCGCTCGGG CAGGAGCCGT CGCGGCCTGC CGCGTTCGAC GGGCTCGAGG CGCTGCCGCA GCGCTTCGAG GTCGTCGACG CGAACGCGCA GCAAGTGAAG GACTTCATCG CCGCGCATAC GGGCGCGTAA ACAGCACGAA CAAGTTCTGC AGCCAAGCTT CTCGAGGATC CGGCTGCTAA 
CAAAGCCCGA AAGGAAGCTG AGTTGGCTGC TGCCACCGCT GAGCAATAAC TAGCATAACC CCTTGGGGCC TCTAAACGGG TCTTGAGGGG TTTTTTGCTG AAAGGAGGAA CTATATCCGG ATATCCACAG GACGGGTGTG GTCGCCATGA TCGCGTAGTC GATAGTGGCT CCAAGTAGCG AAGCGAGCAG GACTGGGCGG CGGCCAAAGC GGTCGGACAG TGCTCCGAGA ACGGGTGCGC ATAGAAATTG CATCAACGCA TATAGCGCTA GCAGCACGCC ATAGTGACTG GCGATGCTGT CGGAATGGAC GATATCCCGC AAGAGGCCCG GCAGTACCGG CATAACCAAG CCTATGCCTA CAGCATCCAG GGTGACGGTG CCGAGGATGA CGATGAGCGC ATTGTTAGAT TTCATACACG GTGCCTGACT GCGTTAGCAA TTTAACTGTG ATAAACTACC GCATTAAAGC TTATCGATGA TAAGCTGTCA AACATGAGAA