ButhA.17876.a

GMP synthase (BTH_I2058)

CENTER ID: ButhA.17876.a
ORGANISM: Burkholderia thailandensis E264
ASSOCIATED DISEASE:
CURRENT STATUS: purified
COMMUNITY REQUEST: True
NIH RISK GROUP: 3
SELECT AGENT: False
NIH PRIORITY
pathogens category:
IIIB

Ordering Clones & Proteins

If there are materials available for this target, they will be listed below. Materials can be ordered from SSGCID using the button in the "order material" column. Clicking the button will add the material to a virtual cart. You may order multiple materials at a time at no cost to you, as this contract is funded by NIAID. When you are ready to place your order, click the "Place Order" link which will appear in the top right corner of the page after you place your first item in your cart.

Clones*

CENTER
REFERENCE ID
DOMAIN/REGION
DESCRIPTION
INFO AA
START
AA
STOP
ORDER
MATERIAL
ButhA.17876.a.A1.GE32897 full length 1 599
* SSGCID clones represent un-induced expression constructs which have been verified by sequencing from vector primers. Clones may contain a tag, such as N-term 6xHis. Get sequence information using the button in the "info" column.

Proteins

CENTER
REFERENCE ID
DOMAIN/REGION
DESCRIPTION
INFO AA
START
AA
STOP
ORDER
MATERIAL
ButhA.17876.a.A1.PW33501 full length 1 599

External Resources

RESOURCE REFERENCE ID
BV-BRC: fig|271848.6.peg.4870
RefSeq: YP_442582.1
UniProt: Q2SWW7

Sequences

These sequences are the native gene sequence; sequences of constructs derived from these sequences may differ due to codon optimization or other protocols. To find the specific sequence of any material you may have ordered, click on the "info" button next to the name of that material.
AA Sequence
MLRRASSATR RGRRRNRDRH GSPDCQAAAC RGGAIARLRI RCSASCAAFV IPLFFSPSAA MHDKILILDF GSQVTQLIAR RVREAHVYCE IHPNDVSDDF VREFAPKGVI LSGSHASTYE DHQLRAPQAV WDLGVPVLGI CYGMQTMAVQ LGGKVEWSDH REFGYAEVRA HGHTRLLDGI QDFATPEGHG MLKVWMSHGD KVAAMPPGFA LMASTPSCPI AGMADEARGY YAVQFHPEVT HTVQGRKLLE RFVLDIAGAK PDWIMRDHIE EAVARIREQV GDEEVILGLS GGVDSSVAAA LIHRAIGDQL TCVFVDHGLL RLNEGKMVLD MFEGRLHAKV VHVDASEQFL GHLAGVADPE HKRKIIGREF VEVFQAEAKK LTNAKWLAQG TIYPDVIESG GAKTKKATTI KSHHNVGGLP ETLGLKLLEP LRDLFKDEVR ELGVALGLPE EMVYRHPFPG PGLGVRILGE VKRDYADLLR RADAIFIEEL RGTLATEQDA AAGLCEPSQV GKSWYDLTSQ AFAVFLPVKS VGVMGDGRTY DYVTALRAVQ TTDFMTAHWA HLPYALLGRA SNRIINEVRG INRVVYDVSG KPPATIEWE
NT Sequence
atgctgcgac gagcaagttc ggcgacgcgg cgcggccgcc gtcgcaaccg tgatcggcac ggctctcccg actgtcaggc cgccgcgtgt cgcggcggcg cgatcgcccg gctacgtatt cgatgcagcg cctcgtgcgc tgcttttgtt attccattgt tcttttctcc gtccgctgcc atgcacgaca aaatcctgat tctcgacttc ggttcgcaag tcactcaact gattgcgcgg cgcgtccgcg aagcgcacgt ctactgcgag atccacccga acgacgtctc cgacgacttc gtccgcgaat tcgcaccgaa gggcgtgatt ctgtccggca gccacgcgag cacctatgag gaccaccaac tgcgcgcgcc gcaggcggtg tgggatctcg gcgtgcccgt gctcggcatt tgttacggga tgcagacgat ggccgtgcag ttgggcggca aggtcgagtg gagcgatcat cgcgagttcg gctatgcgga agtgcgcgcg cacggccaca cgcgcctgct cgacggcatc caggatttcg cgacgccgga aggccacgga atgctgaagg tgtggatgag tcatggcgac aaggtcgccg cgatgccgcc gggcttcgcg ctgatggcgt cgacgccgag ctgcccgatc gccggcatgg ccgacgaggc gcgcggctac tacgcggtgc agttccatcc ggaagtcacg cacacggtgc agggccgcaa gctgctcgag cgcttcgtgc tcgacatcgc cggcgcgaaa cccgactgga tcatgcgtga ccacatcgaa gaggcggtcg cgcggattcg cgagcaagtc ggcgacgagg aggtgattct cggcctgtcg ggcggcgtgg attcgagcgt cgcggcggcg ctgatccacc gcgcgatcgg cgatcaactg acctgcgtgt tcgtcgatca cggcctgctg cgcctgaacg aaggcaagat ggtgctcgac atgttcgaag gccggctcca tgcgaaggtg gtccatgtcg acgcgtccga gcagttcctg ggccatctcg cgggcgtcgc cgatccggag cacaagcgca agatcattgg ccgcgagttc gtcgaggtgt tccaggccga ggcgaagaag ctgacgaacg cgaagtggct cgcgcagggc acgatctatc cggacgtaat cgaatcgggc ggcgcaaaga cgaagaaggc gacgacgatc aagagccatc acaatgtcgg cggcctgccg gagacgctcg gcctgaaact gctcgagccg ttgcgcgacc tgttcaagga tgaagtgcgc gaactcggcg tcgcgctcgg cttgcccgag gagatggtgt atcggcatcc gttcccgggc ccgggcctcg gcgtgcggat tctgggcgaa gtgaagcgag actacgcgga tctgctgcgc cgcgcggatg cgatcttcat cgaagagctt cgcggcacgt tggcgaccga gcaggacgcg gcggcgggcc tttgcgagcc gtcgcaggtc ggcaagagct ggtatgacct gacgagccag gcgttcgcgg tgttcctgcc ggtgaagtcg gtcggcgtga tgggcgatgg ccgcacgtac gactacgtga ccgcgctgcg cgccgtgcag acgaccgact tcatgaccgc gcactgggcg catctgccgt atgcgctgct cggccgcgcg tcgaaccgaa tcatcaacga agtgcgcggc atcaaccggg tcgtgtacga cgtgtcgggc aagccgccgg cgacgatcga gtgggaatga
Details for ButhA.17876.a.A1.GE32897
HARVESTED ON: 7/12/2011
SEQUENCED ON: 7/28/2011
EXPECTED MW: 68kDa
OBSERVED MW: 68kDa
ANTIBIOTIC MARKER: ampicillin
COUNT OF EXPRESSION COLONIES: Too many to count (100+)
TOTAL EXPRESSION LEVEL: Moderate Expression
SOLUBLE EXPRESSION LEVEL Low Expression
EXPRESSION HOST: data unavailable
SEQUENCING RESULT: pass with incomplete coverage
PERCENT IDENTITY: 69
PERCENT COVERAGE: 80
Validated AA Sequence
MAHHHHHHMG TLEAQTQGPG SMLRRASSAT RRGRRRNRDR HGSPDCQAAA CRGGAIARLR IRCSASCAAF VIPLFFSPSA AMHDKILILD FGSQVTQLIA RRVREAHVYC EIHPNDVSDD FVREFAPKGV ILSGSHASTY EDHQLRAPQA VWDLGVPVLG ICYGMQTMAV QLGGKVEWSD HREFGYAEVR AHGHTRLLDG IQDFATPEGH GMLKVWMSHG DKVAAMPPGF ALMASTPSCP IAGMADEARG YYAVQXHPEV THTXQGRKLL ERFVLDIAXA KPDXXMRDHX EXXVARXREQ VGXXEVILGL SGGVDXSVAA AXXXXAIXDQ LXACSSITXA AXEXXQDGAR HVRRPAPCEG GPCRRVRAVX GPSRXRRRSG AQAQDHWPRV RRGVPGRGEE ADEREVARAG HDLSGRNRIG RRKDEEGDDD QEPSQCRRPA GDARPETAXL XXXXIGRPQR FXXXXXXXXX XXXXXXWLIT ITIIWVPWKL RPRVLVR
Validated NT Sequence
cgagaacgac gagccgcanc cgcaggtcgt ggtcgcgttc gggttcttga tgacgaattg cgcgccgttc anatcgtcct tgtagtcgat ctcggcgccg acgagatatt gatagctcat cnngncgacn agnancacga cgccnttctt gttgagcacg gtgtcgtcct cgttgacttc ctcgtcgaac gtgaagccat actggaaacc ggagcagccg ccgccttgca cgannncncg cagctngagg tcggggttgc cctcttcnnc gatcanttgc ttgaccttgt cggccgccgc nncngtgaag acgaacggag ccggcntctc ggtggttgcc gcggattcgg taacngcgtt catcgaacca ggaccctggg tctgagcttc cagggtaccc atatgatggt gatggtgatg agccatnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn aaaccgttgt ggtctcccta tannnnnnnn nattaannga gcagtttcag gccgagcgtc tccggcaggc cgccgacatt gtgatggctc ttgatcgtcg tcgccttctt cgtctttgcg ccgcccgatt cgattacgtc cggatagatc gtgccctgcg cgagccactt cgcgttcgtc agcttcttcg cctcggcctg gaacacctcg acgaactcgc ggccaatgat cttgcgcttg tgctccggat cggcgacgcn cgcgagatgg cccangaact gctcggacgc gtcgacatgg accaccttcg catggagccg gccttcgaac atgtcgagca ccatcttgnc ttcnttcang cgcagcnnnc gtgatcgacg aacacgcnnn cagttgatcg ncgatcgcgc ngnggnncan cgccgccgcg acgctcgnat ccacgccgcc cgacaggccg agaatcacct cctngtngcc gacttgctcg cgannccgcg cgaccnnntn ttcgangtgg tcacgcatgn nccngtcggg tttcgcgcng gcgatgtcga gcacgaagcg ctcgagcagc ttgcggccct gcnccgtgtg cgtgacttcc ggatgnaact gcaccgcgta gtagccgcgc gcctcgtcgg ccatgccggc gatcgggcag ctcggcgtcg acgccatcag cgcgaagccc ggcggcatcg cggcgacctt gtcgccatga ctcatccaca ccttcagcat tccgtggcct tccggcgtcg cgaaatcctg gatgccgtcg agcaggcgcg tgtggccgtg cgcgcgcact tccgcatagc cgaactcgcg atgatcgctc cactcgacct tgccgcccaa ctgcacggcc atcgtctgca tcccgtaaca aatgccgagc acgggcacgc cgagatccca caccgcctgc ggcgcgcgca gttggtggtc ctcataggtg ctcgcgtggc tgccggacag aatcacgccc ttcggtgcga attcgcggac gaagtcgtcg gagacgtcgt tcgggtggat ctcgcagtag acgtgcgctt cgcggacgcg ccgcgcaatc agttgagtga cttgcgaacc gaagtcgaga atcaggattt tgtcgtgcat ggcagcggac ggagaaaaga acaatggaat aacaaaagca gcgcacgagg cgctgcatcg aatacgtagc cgggcgatcg cgccgccgcg acacgcggcg gcctgacagt cgggagagcc gtgccgatca cggttgcgac ggcggccgcg ccgcgtcgcc gaacttgctc gtcgcagcat cgaaccagga ccctgggtct gagcttccag ggtacccata tgatggtgat ggtgatgagc cat
Expected Protein Sequence
MAHHHHHHMG TLEAQTQGPG SMLRRASSAT RRGRRRNRDR HGSPDCQAAA CRGGAIARLR IRCSASCAAF VIPLFFSPSA AMHDKILILD FGSQVTQLIA RRVREAHVYC EIHPNDVSDD FVREFAPKGV ILSGSHASTY EDHQLRAPQA VWDLGVPVLG ICYGMQTMAV QLGGKVEWSD HREFGYAEVR AHGHTRLLDG IQDFATPEGH GMLKVWMSHG DKVAAMPPGF ALMASTPSCP IAGMADEARG YYAVQFHPEV THTVQGRKLL ERFVLDIAGA KPDWIMRDHI EEAVARIREQ VGDEEVILGL SGGVDSSVAA ALIHRAIGDQ LTCVFVDHGL LRLNEGKMVL DMFEGRLHAK VVHVDASEQF LGHLAGVADP EHKRKIIGRE FVEVFQAEAK KLTNAKWLAQ GTIYPDVIES GGAKTKKATT IKSHHNVGGL PETLGLKLLE PLRDLFKDEV RELGVALGLP EEMVYRHPFP GPGLGVRILG EVKRDYADLL RRADAIFIEE LRGTLATEQD AAAGLCEPSQ VGKSWYDLTS QAFAVFLPVK SVGVMGDGRT YDYVTALRAV QTTDFMTAHW AHLPYALLGR ASNRIINEVR GINRVVYDVS GKPPATIEWE
Full NT Sequence (Expression Vector + Insert)
ttcttgaaga cgaaagggcc tcgtgatacg cctattttta taggttaatg tcatgataat aatggtttct tagacgtcag gtggcacttt tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg tgttgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc tgcagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc actgattaag cattggtaac tgtcagacca agtttactca tatatacttt agattgattt aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc gcatatatgg tgcactctca gtacaatctg ctctgatgcc gcatagttaa gccagtatac actccgctat cgctacgtga ctgggtcatg gctgcgcccc gacacccgcc aacacccgct gacgcgccct gacgggcttg tctgctcccg gcatccgctt acagacaagc tgtgaccgtc tccgggagct gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc gaggcagctg cggtaaagct catcagcgtg gtcgtgaagc gattcacaga tgtctgcctg ttcatccgcg tccagctcgt tgagtttctc cagaagcgtt aatgtctggc ttctgataaa gcgggccatg ttaagggcgg ttttttcctg tttggtcact gatgcctccg tgtaaggggg atttctgttc atgggggtaa tgataccgat gaaacgagag aggatgctca cgatacgggt tactgatgat gaacatgccc ggttactgga acgttgtgag ggtaaacaac tggcggtatg gatgcggcgg gaccagagaa aaatcactca gggtcaatgc cagcgcttcg ttaatacaga tgtaggtgtt ccacagggta gccagcagca tcctgcgatg cagatccgga acataatggt gcagggcgct gacttccgcg tttccagact ttacgaaaca cggaaaccga agaccattca tgttgttgct caggtcgcag acgttttgca gcagcagtcg cttcacgttc gctcgcgtat cggtgattca ttctgctaac cagtaaggca accccgccag cctagccggg tcctcaacga caggagcacg atcatgcgca cccgtggcca ggacccaacg ctgcccgaga tgcgccgcgt gcggctgctg gagatggcgg acgcgatgga tatgttctgc caagggttgg tttgcgcatt cacagttctc cgcaagaatt gattggctcc aattcttgga gtggtgaatc cgttagcgag gtgccgccgg cttccattca ggtcgaggtg gcccggctcc atgcaccgcg acgcaacgcg gggaggcaga caaggtatag ggcggcgcct acaatccatg ccaacccgtt ccatgtgctc gccgaggcgg cataaatcgc cgtgacgatc agcggtccag tgatcgaagt taggctggta agagccgcga gcgatccttg aagctgtccc tgatggtcgt catctacctg cctggacagc atggcctgca acgcgggcat cccgatgccg ccggaagcga gaagaatcat aatggggaag gccatccagc ctcgcgtcgt gaacgccagc aagacgtagc ccagcgcgtc ggccgtaaca acaccattta aatggagtgg ttacaaatgg agtggttaat taacaacacc atttgtcgac gctctccctt atgcgactcc tgcattagga agcagcccag tagtaggttg aggccgttga gcaccgccgc cgcaaggaat ggtgcatgca aggagatggc gcccaacagt cccccggcca cggggcctgc caccataccc acgccgaaac aagcgctcat gagcccgaag tggcgagccc gatcttcccc atcggtgatg tcggcgatat aggcgccagc aaccgcacct gtggcgccgg tgatgccggc cacgatgcgt ccggcgtaga ggatcgagat ctcgatcccg cgaaattaat acgactcact atagggagac cacaacggtt tccctctaga aataattttg tttaacttta agaaggagat ataccatggc tcatcaccat caccatcata tgggtaccct ggaagctcag acccagggtc ctggttcgat gctgcgacga gcaagttcgg cgacgcggcg cggccgccgt cgcaaccgtg atcggcacgg ctctcccgac tgtcaggccg ccgcgtgtcg cggcggcgcg atcgcccggc tacgtattcg atgcagcgcc tcgtgcgctg cttttgttat tccattgttc ttttctccgt ccgctgccat gcacgacaaa atcctgattc tcgacttcgg ttcgcaagtc actcaactga ttgcgcggcg cgtccgcgaa gcgcacgtct actgcgagat ccacccgaac gacgtctccg acgacttcgt ccgcgaattc gcaccgaagg gcgtgattct gtccggcagc cacgcgagca cctatgagga ccaccaactg cgcgcgccgc aggcggtgtg ggatctcggc gtgcccgtgc tcggcatttg ttacgggatg cagacgatgg ccgtgcagtt gggcggcaag gtcgagtgga gcgatcatcg cgagttcggc tatgcggaag tgcgcgcgca cggccacacg cgcctgctcg acggcatcca ggatttcgcg acgccggaag gccacggaat gctgaaggtg tggatgagtc atggcgacaa ggtcgccgcg atgccgccgg gcttcgcgct gatggcgtcg acgccgagct gcccgatcgc cggcatggcc gacgaggcgc gcggctacta cgcggtgcag ttccatccgg aagtcacgca cacggtgcag ggccgcaagc tgctcgagcg cttcgtgctc gacatcgccg gcgcgaaacc cgactggatc atgcgtgacc acatcgaaga ggcggtcgcg cggattcgcg agcaagtcgg cgacgaggag gtgattctcg gcctgtcggg cggcgtggat tcgagcgtcg cggcggcgct gatccaccgc gcgatcggcg atcaactgac ctgcgtgttc gtcgatcacg gcctgctgcg cctgaacgaa ggcaagatgg tgctcgacat gttcgaaggc cggctccatg cgaaggtggt ccatgtcgac gcgtccgagc agttcctggg ccatctcgcg ggcgtcgccg atccggagca caagcgcaag atcattggcc gcgagttcgt cgaggtgttc caggccgagg cgaagaagct gacgaacgcg aagtggctcg cgcagggcac gatctatccg gacgtaatcg aatcgggcgg cgcaaagacg aagaaggcga cgacgatcaa gagccatcac aatgtcggcg gcctgccgga gacgctcggc ctgaaactgc tcgagccgtt gcgcgacctg ttcaaggatg aagtgcgcga actcggcgtc gcgctcggct tgcccgagga gatggtgtat cggcatccgt tcccgggccc gggcctcggc gtgcggattc tgggcgaagt gaagcgagac tacgcggatc tgctgcgccg cgcggatgcg atcttcatcg aagagcttcg cggcacgttg gcgaccgagc aggacgcggc ggcgggcctt tgcgagccgt cgcaggtcgg caagagctgg tatgacctga cgagccaggc gttcgcggtg ttcctgccgg tgaagtcggt cggcgtgatg ggcgatggcc gcacgtacga ctacgtgacc gcgctgcgcg ccgtgcagac gaccgacttc atgaccgcgc actgggcgca tctgccgtat gcgctgctcg gccgcgcgtc gaaccgaatc atcaacgaag tgcgcggcat caaccgggtc gtgtacgacg tgtcgggcaa gccgccggcg acgatcgagt gggaataaac agcacgaaca agttctgcag ccaagcttct cgaggatccg gctgctaaca aagcccgaaa ggaagctgag ttggctgctg ccaccgctga gcaataacta gcataacccc ttggggcctc taaacgggtc ttgaggggtt ttttgctgaa aggaggaact atatccggat atccacagga cgggtgtggt cgccatgatc gcgtagtcga tagtggctcc aagtagcgaa gcgagcagga ctgggcggcg gccaaagcgg tcggacagtg ctccgagaac gggtgcgcat agaaattgca tcaacgcata tagcgctagc agcacgccat agtgactggc gatgctgtcg gaatggacga tatcccgcaa gaggcccggc agtaccggca taaccaagcc tatgcctaca gcatccaggg tgacggtgcc gaggatgacg atgagcgcat tgttagattt catacacggt gcctgactgc gttagcaatt taactgtgat aaactaccgc attaaagctt atcgatgata agctgtcaaa catgagaa
Details for ButhA.17876.a.A1.PW33501
PURIFICATION DATe: 10/28/2011
CONCENTRATION: 10.72mg/ml
OBSERVED MW: data unavailable
EXPRESSION LEVEL: Low Expression
PROTEIN PURIFICATION BUFFER: 25 mM HEPES pH 7.0, 500 mM NaCl, 5% Glycerol , 2 mM DTT, and 0.025% Azide
EXPRESSION HOST: data unavailable
VIAL COUNT (approx.): 3
VIAL VOLUME: 50µl
PERCENT IDENTITY: 69
PERCENT COVERAGE: 80
Protocol Notes
notes unavailable
Validated AA Sequence
MAHHHHHHMG TLEAQTQGPG SMLRRASSAT RRGRRRNRDR HGSPDCQAAA CRGGAIARLR IRCSASCAAF VIPLFFSPSA AMHDKILILD FGSQVTQLIA RRVREAHVYC EIHPNDVSDD FVREFAPKGV ILSGSHASTY EDHQLRAPQA VWDLGVPVLG ICYGMQTMAV QLGGKVEWSD HREFGYAEVR AHGHTRLLDG IQDFATPEGH GMLKVWMSHG DKVAAMPPGF ALMASTPSCP IAGMADEARG YYAVQXHPEV THTXQGRKLL ERFVLDIAXA KPDXXMRDHX EXXVARXREQ VGXXEVILGL SGGVDXSVAA AXXXXAIXDQ LXACSSITXA AXEXXQDGAR HVRRPAPCEG GPCRRVRAVX GPSRXRRRSG AQAQDHWPRV RRGVPGRGEE ADEREVARAG HDLSGRNRIG RRKDEEGDDD QEPSQCRRPA GDARPETAXL XXXXIGRPQR FXXXXXXXXX XXXXXXWLIT ITIIWVPWKL RPRVLVR
Validated NT Sequence
cgagaacgac gagccgcanc cgcaggtcgt ggtcgcgttc gggttcttga tgacgaattg cgcgccgttc anatcgtcct tgtagtcgat ctcggcgccg acgagatatt gatagctcat cnngncgacn agnancacga cgccnttctt gttgagcacg gtgtcgtcct cgttgacttc ctcgtcgaac gtgaagccat actggaaacc ggagcagccg ccgccttgca cgannncncg cagctngagg tcggggttgc cctcttcnnc gatcanttgc ttgaccttgt cggccgccgc nncngtgaag acgaacggag ccggcntctc ggtggttgcc gcggattcgg taacngcgtt catcgaacca ggaccctggg tctgagcttc cagggtaccc atatgatggt gatggtgatg agccatnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn aaaccgttgt ggtctcccta tannnnnnnn nattaannga gcagtttcag gccgagcgtc tccggcaggc cgccgacatt gtgatggctc ttgatcgtcg tcgccttctt cgtctttgcg ccgcccgatt cgattacgtc cggatagatc gtgccctgcg cgagccactt cgcgttcgtc agcttcttcg cctcggcctg gaacacctcg acgaactcgc ggccaatgat cttgcgcttg tgctccggat cggcgacgcn cgcgagatgg cccangaact gctcggacgc gtcgacatgg accaccttcg catggagccg gccttcgaac atgtcgagca ccatcttgnc ttcnttcang cgcagcnnnc gtgatcgacg aacacgcnnn cagttgatcg ncgatcgcgc ngnggnncan cgccgccgcg acgctcgnat ccacgccgcc cgacaggccg agaatcacct cctngtngcc gacttgctcg cgannccgcg cgaccnnntn ttcgangtgg tcacgcatgn nccngtcggg tttcgcgcng gcgatgtcga gcacgaagcg ctcgagcagc ttgcggccct gcnccgtgtg cgtgacttcc ggatgnaact gcaccgcgta gtagccgcgc gcctcgtcgg ccatgccggc gatcgggcag ctcggcgtcg acgccatcag cgcgaagccc ggcggcatcg cggcgacctt gtcgccatga ctcatccaca ccttcagcat tccgtggcct tccggcgtcg cgaaatcctg gatgccgtcg agcaggcgcg tgtggccgtg cgcgcgcact tccgcatagc cgaactcgcg atgatcgctc cactcgacct tgccgcccaa ctgcacggcc atcgtctgca tcccgtaaca aatgccgagc acgggcacgc cgagatccca caccgcctgc ggcgcgcgca gttggtggtc ctcataggtg ctcgcgtggc tgccggacag aatcacgccc ttcggtgcga attcgcggac gaagtcgtcg gagacgtcgt tcgggtggat ctcgcagtag acgtgcgctt cgcggacgcg ccgcgcaatc agttgagtga cttgcgaacc gaagtcgaga atcaggattt tgtcgtgcat ggcagcggac ggagaaaaga acaatggaat aacaaaagca gcgcacgagg cgctgcatcg aatacgtagc cgggcgatcg cgccgccgcg acacgcggcg gcctgacagt cgggagagcc gtgccgatca cggttgcgac ggcggccgcg ccgcgtcgcc gaacttgctc gtcgcagcat cgaaccagga ccctgggtct gagcttccag ggtacccata tgatggtgat ggtgatgagc cat
Expressed Protein Sequence
MAHHHHHHMG TLEAQTQGPG SMLRRASSAT RRGRRRNRDR HGSPDCQAAA CRGGAIARLR IRCSASCAAF VIPLFFSPSA AMHDKILILD FGSQVTQLIA RRVREAHVYC EIHPNDVSDD FVREFAPKGV ILSGSHASTY EDHQLRAPQA VWDLGVPVLG ICYGMQTMAV QLGGKVEWSD HREFGYAEVR AHGHTRLLDG IQDFATPEGH GMLKVWMSHG DKVAAMPPGF ALMASTPSCP IAGMADEARG YYAVQFHPEV THTVQGRKLL ERFVLDIAGA KPDWIMRDHI EEAVARIREQ VGDEEVILGL SGGVDSSVAA ALIHRAIGDQ LTCVFVDHGL LRLNEGKMVL DMFEGRLHAK VVHVDASEQF LGHLAGVADP EHKRKIIGRE FVEVFQAEAK KLTNAKWLAQ GTIYPDVIES GGAKTKKATT IKSHHNVGGL PETLGLKLLE PLRDLFKDEV RELGVALGLP EEMVYRHPFP GPGLGVRILG EVKRDYADLL RRADAIFIEE LRGTLATEQD AAAGLCEPSQ VGKSWYDLTS QAFAVFLPVK SVGVMGDGRT YDYVTALRAV QTTDFMTAHW AHLPYALLGR ASNRIINEVR GINRVVYDVS GKPPATIEWE
Full NT Sequence (Expression Vector + Insert)
TTCTTGAAGA CGAAAGGGCC TCGTGATACG CCTATTTTTA TAGGTTAATG TCATGATAAT AATGGTTTCT TAGACGTCAG GTGGCACTTT TCGGGGAAAT GTGCGCGGAA CCCCTATTTG TTTATTTTTC TAAATACATT CAAATATGTA TCCGCTCATG AGACAATAAC CCTGATAAAT GCTTCAATAA TATTGAAAAA GGAAGAGTAT GAGTATTCAA CATTTCCGTG TCGCCCTTAT TCCCTTTTTT GCGGCATTTT GCCTTCCTGT TTTTGCTCAC CCAGAAACGC TGGTGAAAGT AAAAGATGCT GAAGATCAGT TGGGTGCACG AGTGGGTTAC ATCGAACTGG ATCTCAACAG CGGTAAGATC CTTGAGAGTT TTCGCCCCGA AGAACGTTTT CCAATGATGA GCACTTTTAA AGTTCTGCTA TGTGGCGCGG TATTATCCCG TGTTGACGCC GGGCAAGAGC AACTCGGTCG CCGCATACAC TATTCTCAGA ATGACTTGGT TGAGTACTCA CCAGTCACAG AAAAGCATCT TACGGATGGC ATGACAGTAA GAGAATTATG CAGTGCTGCC ATAACCATGA GTGATAACAC TGCGGCCAAC TTACTTCTGA CAACGATCGG AGGACCGAAG GAGCTAACCG CTTTTTTGCA CAACATGGGG GATCATGTAA CTCGCCTTGA TCGTTGGGAA CCGGAGCTGA ATGAAGCCAT ACCAAACGAC GAGCGTGACA CCACGATGCC TGCAGCAATG GCAACAACGT TGCGCAAACT ATTAACTGGC GAACTACTTA CTCTAGCTTC CCGGCAACAA TTAATAGACT GGATGGAGGC GGATAAAGTT GCAGGACCAC TTCTGCGCTC GGCCCTTCCG GCTGGCTGGT TTATTGCTGA TAAATCTGGA GCCGGTGAGC GTGGGTCTCG CGGTATCATT GCAGCACTGG GGCCAGATGG TAAGCCCTCC CGTATCGTAG TTATCTACAC GACGGGGAGT CAGGCAACTA TGGATGAACG AAATAGACAG ATCGCTGAGA TAGGTGCCTC ACTGATTAAG CATTGGTAAC TGTCAGACCA AGTTTACTCA TATATACTTT AGATTGATTT AAAACTTCAT TTTTAATTTA AAAGGATCTA GGTGAAGATC CTTTTTGATA ATCTCATGAC CAAAATCCCT TAACGTGAGT TTTCGTTCCA CTGAGCGTCA GACCCCGTAG AAAAGATCAA AGGATCTTCT TGAGATCCTT TTTTTCTGCG CGTAATCTGC TGCTTGCAAA CAAAAAAACC ACCGCTACCA GCGGTGGTTT GTTTGCCGGA TCAAGAGCTA CCAACTCTTT TTCCGAAGGT AACTGGCTTC AGCAGAGCGC AGATACCAAA TACTGTCCTT CTAGTGTAGC CGTAGTTAGG CCACCACTTC AAGAACTCTG TAGCACCGCC TACATACCTC GCTCTGCTAA TCCTGTTACC AGTGGCTGCT GCCAGTGGCG ATAAGTCGTG TCTTACCGGG TTGGACTCAA GACGATAGTT ACCGGATAAG GCGCAGCGGT CGGGCTGAAC GGGGGGTTCG TGCACACAGC CCAGCTTGGA GCGAACGACC TACACCGAAC TGAGATACCT ACAGCGTGAG CTATGAGAAA GCGCCACGCT TCCCGAAGGG AGAAAGGCGG ACAGGTATCC GGTAAGCGGC AGGGTCGGAA CAGGAGAGCG CACGAGGGAG CTTCCAGGGG GAAACGCCTG GTATCTTTAT AGTCCTGTCG GGTTTCGCCA CCTCTGACTT GAGCGTCGAT TTTTGTGATG CTCGTCAGGG GGGCGGAGCC TATGGAAAAA CGCCAGCAAC GCGGCCTTTT TACGGTTCCT GGCCTTTTGC TGGCCTTTTG CTCACATGTT CTTTCCTGCG TTATCCCCTG ATTCTGTGGA TAACCGTATT ACCGCCTTTG AGTGAGCTGA TACCGCTCGC CGCAGCCGAA CGACCGAGCG CAGCGAGTCA GTGAGCGAGG AAGCGGAAGA GCGCCTGATG CGGTATTTTC TCCTTACGCA TCTGTGCGGT ATTTCACACC GCATATATGG TGCACTCTCA GTACAATCTG CTCTGATGCC GCATAGTTAA GCCAGTATAC ACTCCGCTAT CGCTACGTGA CTGGGTCATG GCTGCGCCCC GACACCCGCC AACACCCGCT GACGCGCCCT GACGGGCTTG TCTGCTCCCG GCATCCGCTT ACAGACAAGC TGTGACCGTC TCCGGGAGCT GCATGTGTCA GAGGTTTTCA CCGTCATCAC CGAAACGCGC GAGGCAGCTG CGGTAAAGCT CATCAGCGTG GTCGTGAAGC GATTCACAGA TGTCTGCCTG TTCATCCGCG TCCAGCTCGT TGAGTTTCTC CAGAAGCGTT AATGTCTGGC TTCTGATAAA GCGGGCCATG TTAAGGGCGG TTTTTTCCTG TTTGGTCACT GATGCCTCCG TGTAAGGGGG ATTTCTGTTC ATGGGGGTAA TGATACCGAT GAAACGAGAG AGGATGCTCA CGATACGGGT TACTGATGAT GAACATGCCC GGTTACTGGA ACGTTGTGAG GGTAAACAAC TGGCGGTATG GATGCGGCGG GACCAGAGAA AAATCACTCA GGGTCAATGC CAGCGCTTCG TTAATACAGA TGTAGGTGTT CCACAGGGTA GCCAGCAGCA TCCTGCGATG CAGATCCGGA ACATAATGGT GCAGGGCGCT GACTTCCGCG TTTCCAGACT TTACGAAACA CGGAAACCGA AGACCATTCA TGTTGTTGCT CAGGTCGCAG ACGTTTTGCA GCAGCAGTCG CTTCACGTTC GCTCGCGTAT CGGTGATTCA TTCTGCTAAC CAGTAAGGCA ACCCCGCCAG CCTAGCCGGG TCCTCAACGA CAGGAGCACG ATCATGCGCA CCCGTGGCCA GGACCCAACG CTGCCCGAGA TGCGCCGCGT GCGGCTGCTG GAGATGGCGG ACGCGATGGA TATGTTCTGC CAAGGGTTGG TTTGCGCATT CACAGTTCTC CGCAAGAATT GATTGGCTCC AATTCTTGGA GTGGTGAATC CGTTAGCGAG GTGCCGCCGG CTTCCATTCA GGTCGAGGTG GCCCGGCTCC ATGCACCGCG ACGCAACGCG GGGAGGCAGA CAAGGTATAG GGCGGCGCCT ACAATCCATG CCAACCCGTT CCATGTGCTC GCCGAGGCGG CATAAATCGC CGTGACGATC AGCGGTCCAG TGATCGAAGT TAGGCTGGTA AGAGCCGCGA GCGATCCTTG AAGCTGTCCC TGATGGTCGT CATCTACCTG CCTGGACAGC ATGGCCTGCA ACGCGGGCAT CCCGATGCCG CCGGAAGCGA GAAGAATCAT AATGGGGAAG GCCATCCAGC CTCGCGTCGT GAACGCCAGC AAGACGTAGC CCAGCGCGTC GGCCGTAACA ACACCATTTA AATGGAGTGG TTACAAATGG AGTGGTTAAT TAACAACACC ATTTGTCGAC GCTCTCCCTT ATGCGACTCC TGCATTAGGA AGCAGCCCAG TAGTAGGTTG AGGCCGTTGA GCACCGCCGC CGCAAGGAAT GGTGCATGCA AGGAGATGGC GCCCAACAGT CCCCCGGCCA CGGGGCCTGC CACCATACCC ACGCCGAAAC AAGCGCTCAT GAGCCCGAAG TGGCGAGCCC GATCTTCCCC ATCGGTGATG TCGGCGATAT AGGCGCCAGC AACCGCACCT GTGGCGCCGG TGATGCCGGC CACGATGCGT CCGGCGTAGA GGATCGAGAT CTCGATCCCG CGAAATTAAT ACGACTCACT ATAGGGAGAC CACAACGGTT TCCCTCTAGA AATAATTTTG TTTAACTTTA AGAAGGAGAT ATACCATGGC TCATCACCAT CACCATCATA TGGGTACCCT GGAAGCTCAG ACCCAGGGTC CTGGTTCGAT GCTGCGACGA GCAAGTTCGG CGACGCGGCG CGGCCGCCGT CGCAACCGTG ATCGGCACGG CTCTCCCGAC TGTCAGGCCG CCGCGTGTCG CGGCGGCGCG ATCGCCCGGC TACGTATTCG ATGCAGCGCC TCGTGCGCTG CTTTTGTTAT TCCATTGTTC TTTTCTCCGT CCGCTGCCAT GCACGACAAA ATCCTGATTC TCGACTTCGG TTCGCAAGTC ACTCAACTGA TTGCGCGGCG CGTCCGCGAA GCGCACGTCT ACTGCGAGAT CCACCCGAAC GACGTCTCCG ACGACTTCGT CCGCGAATTC GCACCGAAGG GCGTGATTCT GTCCGGCAGC CACGCGAGCA CCTATGAGGA CCACCAACTG CGCGCGCCGC AGGCGGTGTG GGATCTCGGC GTGCCCGTGC TCGGCATTTG TTACGGGATG CAGACGATGG CCGTGCAGTT GGGCGGCAAG GTCGAGTGGA GCGATCATCG CGAGTTCGGC TATGCGGAAG TGCGCGCGCA CGGCCACACG CGCCTGCTCG ACGGCATCCA GGATTTCGCG ACGCCGGAAG GCCACGGAAT GCTGAAGGTG TGGATGAGTC ATGGCGACAA GGTCGCCGCG ATGCCGCCGG GCTTCGCGCT GATGGCGTCG ACGCCGAGCT GCCCGATCGC CGGCATGGCC GACGAGGCGC GCGGCTACTA CGCGGTGCAG TTCCATCCGG AAGTCACGCA CACGGTGCAG GGCCGCAAGC TGCTCGAGCG CTTCGTGCTC GACATCGCCG GCGCGAAACC CGACTGGATC ATGCGTGACC ACATCGAAGA GGCGGTCGCG CGGATTCGCG AGCAAGTCGG CGACGAGGAG GTGATTCTCG GCCTGTCGGG CGGCGTGGAT TCGAGCGTCG CGGCGGCGCT GATCCACCGC GCGATCGGCG ATCAACTGAC CTGCGTGTTC GTCGATCACG GCCTGCTGCG CCTGAACGAA GGCAAGATGG TGCTCGACAT GTTCGAAGGC CGGCTCCATG CGAAGGTGGT CCATGTCGAC GCGTCCGAGC AGTTCCTGGG CCATCTCGCG GGCGTCGCCG ATCCGGAGCA CAAGCGCAAG ATCATTGGCC GCGAGTTCGT CGAGGTGTTC CAGGCCGAGG CGAAGAAGCT GACGAACGCG AAGTGGCTCG CGCAGGGCAC GATCTATCCG GACGTAATCG AATCGGGCGG CGCAAAGACG AAGAAGGCGA CGACGATCAA GAGCCATCAC AATGTCGGCG GCCTGCCGGA GACGCTCGGC CTGAAACTGC TCGAGCCGTT GCGCGACCTG TTCAAGGATG AAGTGCGCGA ACTCGGCGTC GCGCTCGGCT TGCCCGAGGA GATGGTGTAT CGGCATCCGT TCCCGGGCCC GGGCCTCGGC GTGCGGATTC TGGGCGAAGT GAAGCGAGAC TACGCGGATC TGCTGCGCCG CGCGGATGCG ATCTTCATCG AAGAGCTTCG CGGCACGTTG GCGACCGAGC AGGACGCGGC GGCGGGCCTT TGCGAGCCGT CGCAGGTCGG CAAGAGCTGG TATGACCTGA CGAGCCAGGC GTTCGCGGTG TTCCTGCCGG TGAAGTCGGT CGGCGTGATG GGCGATGGCC GCACGTACGA CTACGTGACC GCGCTGCGCG CCGTGCAGAC GACCGACTTC ATGACCGCGC ACTGGGCGCA TCTGCCGTAT GCGCTGCTCG GCCGCGCGTC GAACCGAATC ATCAACGAAG TGCGCGGCAT CAACCGGGTC GTGTACGACG TGTCGGGCAA GCCGCCGGCG ACGATCGAGT GGGAATAAAC AGCACGAACA AGTTCTGCAG CCAAGCTTCT CGAGGATCCG GCTGCTAACA AAGCCCGAAA GGAAGCTGAG TTGGCTGCTG CCACCGCTGA GCAATAACTA GCATAACCCC TTGGGGCCTC TAAACGGGTC TTGAGGGGTT TTTTGCTGAA AGGAGGAACT ATATCCGGAT ATCCACAGGA CGGGTGTGGT CGCCATGATC GCGTAGTCGA TAGTGGCTCC AAGTAGCGAA GCGAGCAGGA CTGGGCGGCG GCCAAAGCGG TCGGACAGTG CTCCGAGAAC GGGTGCGCAT AGAAATTGCA TCAACGCATA TAGCGCTAGC AGCACGCCAT AGTGACTGGC GATGCTGTCG GAATGGACGA TATCCCGCAA GAGGCCCGGC AGTACCGGCA TAACCAAGCC TATGCCTACA GCATCCAGGG TGACGGTGCC GAGGATGACG ATGAGCGCAT TGTTAGATTT CATACACGGT GCCTGACTGC GTTAGCAATT TAACTGTGAT AAACTACCGC ATTAAAGCTT ATCGATGATA AGCTGTCAAA CATGAGAA