ButhA.18002.a

Glutamine dependent NAD+ synthetase (BTH_I0882)

CENTER ID: ButhA.18002.a
ORGANISM: Burkholderia thailandensis E264
ASSOCIATED DISEASE:
CURRENT STATUS: in PDB
COMMUNITY REQUEST: True
NIH RISK GROUP: 3
SELECT AGENT: False
NIH PRIORITY
pathogens category:
IIIB

Ordering Clones & Proteins

If there are materials available for this target, they will be listed below. Materials can be ordered from SSGCID using the button in the "order material" column. Clicking the button will add the material to a virtual cart. You may order multiple materials at a time at no cost to you, as this contract is funded by NIAID. When you are ready to place your order, click the "Place Order" link which will appear in the top right corner of the page after you place your first item in your cart.

Clones*

CENTER
REFERENCE ID
DOMAIN/REGION
DESCRIPTION
INFO AA
START
AA
STOP
ORDER
MATERIAL
ButhA.18002.a.A1.GE32972 full length 1 561
* SSGCID clones represent un-induced expression constructs which have been verified by sequencing from vector primers. Clones may contain a tag, such as N-term 6xHis. Get sequence information using the button in the "info" column.

Proteins

CENTER
REFERENCE ID
DOMAIN/REGION
DESCRIPTION
INFO AA
START
AA
STOP
ORDER
MATERIAL
ButhA.18002.a.A1.PS01197 full length 1 561

Structures

4F4H
DEPOSITED: 5/10/2012
DETERMINATION: XRay
CLONE: ButhA.18002.a.A1.GE32972
PROTEIN: ButhA.18002.a.A1.PS01197

Publications by SSGCID

Combining functional and structural genomics to sample the essential Burkholderia structome.
Abendroth J, Armour B, Barrett L, Baugh L, Begley DW, Buchko GW, Choi R, Clifton MC, Dieterich SH, Dranow DM, Edwards TE, Fairman JW, Fox D, Gallagher LA, Gardberg AS, Gillespie A, Manoil C, Myler PJ, Nakazawa-Hewitt S, Napuli A, Nguyen MT, Patrapuvich R, Phan I, Stacy R, Staker BL, Stewart LJ, Van Voorhis WC
PLoS ONE - 2012
volume 8, issue 1, pages e53851
PMID: 23382856; PMCID: PMC3561365

External Resources

RESOURCE REFERENCE ID
OrthoMCL: OG5_127076
PATRIC ID: fig|271848.6.peg.3606
RefSeq: YP_441438.1
UniProt: Q2T061

Sequences

These sequences are the native gene sequence; sequences of constructs derived from these sequences may differ due to codon optimization or other protocols. To find the specific sequence of any material you may have ordered, click on the "info" button next to the name of that material.
AA Sequence
MKTRIALAQL NVTVGDFAGN VAKIVAAAQA AHDAGAHFLI APELALSGYP PEDLLLRPAF YAASDAALAE LAAQLKPFAG LAVLVGHPLR APSADGNANR AIERGVPPVD TYNAASLIVG GEVAGTYRKQ DLPNTEVFDE KRYFATDAAP YVFELNGVKF GVVICEDVWH ASAAQLAKAA GAQVLIVPNG SPYHMNKDAV RIDILRARIR ETGLPMVYVN LVGGQDELVF DGGSFVLDGA GELVAKMPQF EEGNAIVEFD GARALPAAIA PALSVEAQVY RALVLGVRDY IGKNGFPGAI IGLSGGVDSA LVLAVAVDAL GAERVRAVMM PSRYTAGIST TDAADMARRV GVRYDEIAIA PMFDAFRASL AAEFAGLAED ATEENIQARI RGTLLMALSN KFGSIVLTTG NKSEMAVGYC TLYGDMAGGF AVIKDIAKTL VYRLCRYRNA AAEYGQPDIV PERILTRAPS AELRENQTDQ DSLPPYDVLD AIMRMYMEED RPLAEIVAAG YSEADVKRVT RLIKINEYKR RQAPVGIRVT HRAFGRDWRY PITSRFVESI D
NT Sequence
atgaagaccc gtatcgcact tgcccaactc aacgtcaccg tcggcgattt cgccggcaac gtcgccaaga tcgtcgccgc cgcgcaagcc gcgcacgatg ccggcgcgca cttcctgatc gcgcccgagc tcgcgctgtc cggctacccg cccgaggatc tgctgctgcg ccccgcgttc tacgcggcgt ccgacgcggc gctcgccgag ctcgccgcgc aactcaagcc gttcgccggg ctcgcggtgc tcgtcggcca cccgctgcgc gcaccaagcg ccgatggtaa tgcaaaccgt gcgatcgagc gcggcgtccc gccggtcgac acgtacaacg cggcatcgct gatcgtcggc ggcgaggtcg ccggcacgta ccgcaagcag gacttgccga acaccgaggt gttcgacgag aagcgctatt tcgcgaccga cgccgcgccg tacgtattcg agctgaacgg cgtgaagttc ggcgtcgtga tctgcgagga cgtgtggcat gcgtcggccg cgcagctcgc gaaggcggcg ggcgcgcagg tgctgatcgt gccgaacggc tcgccgtacc acatgaacaa ggacgcggtg cgcatcgaca tcctgcgcgc gcggattcgc gaaacgggcc tgccgatggt ctacgtgaat ctcgtcggcg gccaggacga gctcgtgttc gacggcggct ctttcgtgct cgacggcgcg ggcgagctgg tcgcgaagat gccgcagttc gaggagggca atgcgatcgt cgagttcgac ggcgcgcgag cgctgcccgc cgctatcgcg ccggcgctca gcgtcgaggc gcaggtgtat cgcgcgctcg tgctcggcgt gcgcgactac atcggcaaga acggtttccc cggcgcgatc atcgggctgt cgggcggcgt cgattcggcg ctcgtgctcg cggtggccgt cgacgcgctc ggtgccgagc gcgtgcgcgc ggtgatgatg ccgtcgcgct acacggccgg catctcgacg accgacgcgg ccgacatggc gcggcgcgtc ggcgtgcgct acgacgagat cgcgatcgcg ccgatgttcg atgcgttccg cgcgtcgctc gcggccgagt tcgcgggcct cgccgaagac gcgacggagg agaacatcca ggcgcgcatt cgcggcacgc tgctgatggc gctgtcgaac aagttcggct cgatcgtgct gacgacgggc aacaagagcg agatggcggt cggctactgc acgctttacg gcgacatggc gggcggcttc gcggtgatca aggacatcgc gaagacgctc gtctaccggc tctgccgtta ccgcaacgcg gcggccgaat acggccagcc cgacatcgtt cccgagcgga ttctcacgcg cgcgccgtcg gccgagctgc gcgagaacca gaccgaccag gacagcctgc cgccgtacga tgtgctcgac gcgatcatgc gcatgtacat ggaggaggac cggccgctcg cggagatcgt cgcggcgggc tattcggagg cggacgtgaa gcgcgtcacg cggctcatca agatcaacga atacaagcgc cggcaggcgc ccgtcggcat tcgcgtcacg caccgcgcgt tcgggcgcga ctggcgctat ccgatcacgt cgcgcttcgt cgagagcatc gactga
Details for ButhA.18002.a.A1.GE32972
HARVESTED ON: 7/12/2011
SEQUENCED ON: 7/28/2011
EXPECTED MW: 62kDa
OBSERVED MW: 64kDa
ANTIBIOTIC MARKER: ampicillin
COUNT OF EXPRESSION COLONIES: Too many to count (100+)
TOTAL EXPRESSION LEVEL: High Expression
SOLUBLE EXPRESSION LEVEL High Expression
EXPRESSION HOST: data unavailable
SEQUENCING RESULT: pass with incomplete coverage
PERCENT IDENTITY: 69
PERCENT COVERAGE: 95
Validated AA Sequence
MKTRIALAQL NVTVGDFAGN VAKIVAAAQA AHDAGAHFLI APELALSGYP PEDLLLRPAF YAASDAALAE LAAQLKPFAG LAVLVGHPLR APSADGNANR AIERGVPPVD TYNAASLIVG GEVAGTYRKQ DLPNTEVFDE KRYFATDAAP YVFELNGVKF GVVICEDVWH ASAAQLAKAA GAQVLIVPNG SPYHMNKDAV RIDILRARIR ETGLPMVYVN LVGGQDELVF DGGSFVLDGA GELVAKMPQF EEGNAIVEFD GARALPAAIA PALSVEAQVY RALVLGVRDY IXKNGFPGAI IGLSGGVDXX XXARGXXXRA RCRARARGDD AVALHGRHLD DRRGRHGAAR RRALRRDRDR ADVRCVPRVA RGRVRGPRRR RDGGEHPGAH SRHAADGAVE QVRLDRADDG QQERDGGRLL HALRRHGGRL RGDQGHREDA RLPALPLPQR GGRIRPARHR SRADSHARAV GRAAREPDRP GQPAAVRCAR RDHAHVHGGG PAARGDRRGG LFGGGREARH AAHQDQRIQA PAGARRHSRH APRVRARLAL SDHV
Validated NT Sequence
gcgacgtgat cggatagcgc cagtcgcgcc cgaacgcgcg gtgcgtgacg cgaatgccga cgggcgcctg ccggcgcttg tattcgttga tcttgatgag ccgcgtgacg cgcttcacgt ccgcctccga atagcccgcc gcgacgatct ccgcgagcgg ccggtcctcc tccatgtaca tgcgcatgat cgcgtcgagc acatcgtacg gcggcaggct gtcctggtcg gtctggttct cgcgcagctc ggccgacggc gcgcgcgtga gaatccgctc gggaacgatg tcgggctggc cgtattcggc cgccgcgttg cggtaacggc agagccggta gacgagcgtc ttcgcgatgt ccttgatcac cgcgaagccg cccgccatgt cgccgtaaag cgtgcagtag ccgaccgcca tctcgctctt gttgcccgtc gtcagcacga tcgagccgaa cttgttcgac agcgccatca gcagcgtgcc gcgaatgcgc gcctggatgt tctcctccgt cgcgtcttcg gcgaggcccg cgaactcggc cgcgagcgac gcgcggaacg catcgaacat cggcgcgatc gcgatctcgt cgtagcgcac gccgacgcgc cgcgccatgt cggccgcgtc ggtcgtcgag atgccggccg tgtagcgcga cggcatcatc accgcgcgca cgcgctcggc accgagcgcg tcnacngnca ccgcgagcac naannnncna atcgacgccg cccgacagcc cgatgatcgc gccggggaaa ccgttcttgn cgatgtagtc gcgcacgccg agcacgagcg cgcgatacac ctgcgcctcg acgctgagcg cnggcgcgat agcggcgggc agcgctcgcg cgccgtcgaa ctcgacgatc gcattgccct cctcgaactg cggcatcttc gcgaccagct cgcccgcgcc gtcgagcacg aaagagccgc cgtcgaacac gagctcgtcc tggccgccga cgagattcac gtagaccatc ggcaggcccg tttcgcgaat ccgcgcgcgc aggatgtcga tgcgcaccgc gtccttgttc atgtggtacg gcgagccgtt cggcacgatc agcacctgcg cgcccgccgc cttcgcgagc tgcgcggccg acgcatgcca cacgtcctcg cagatcacga cgccgaactt cacgccgttc agctcgaata cgtacggcgc ggcgtcggtc gcgaaatagc gcttctcgtc gaacacctcg gtgttcggca agtcctgctt gcggtacgtg ccggcgacct cgccgccgac gatcagcgat gccgcgttgt acgtgtcgac cggcgggacg ccgcgctcga tcgcacggtt tgcattacca tcggcgcttg gtgcgcgcag cgggtggccg acgagcaccg cgagcccggc gaacggcttg agttgcgcgg cgagctcggc gagcgccgcg tcggacgccg cgtagaacgc ggggcgcagc agcagatcct cgggcgggta gccggacagc gcgagctcgg gcgcgatcag gaagtgcgcg ccggcatcgt gcgcggcttg cgcggcggcg acgatcttgg cgacgttgcc ggcgaaatcg ccgacggtga cgttgagttg ggcaagtgcg atacgggtct tcatcgaacc aggaccctgg gtctgagctt ccagggtac
Expected Protein Sequence
MAHHHHHHMG TLEAQTQGPG SMKTRIALAQ LNVTVGDFAG NVAKIVAAAQ AAHDAGAHFL IAPELALSGY PPEDLLLRPA FYAASDAALA ELAAQLKPFA GLAVLVGHPL RAPSADGNAN RAIERGVPPV DTYNAASLIV GGEVAGTYRK QDLPNTEVFD EKRYFATDAA PYVFELNGVK FGVVICEDVW HASAAQLAKA AGAQVLIVPN GSPYHMNKDA VRIDILRARI RETGLPMVYV NLVGGQDELV FDGGSFVLDG AGELVAKMPQ FEEGNAIVEF DGARALPAAI APALSVEAQV YRALVLGVRD YIGKNGFPGA IIGLSGGVDS ALVLAVAVDA LGAERVRAVM MPSRYTAGIS TTDAADMARR VGVRYDEIAI APMFDAFRAS LAAEFAGLAE DATEENIQAR IRGTLLMALS NKFGSIVLTT GNKSEMAVGY CTLYGDMAGG FAVIKDIAKT LVYRLCRYRN AAAEYGQPDI VPERILTRAP SAELRENQTD QDSLPPYDVL DAIMRMYMEE DRPLAEIVAA GYSEADVKRV TRLIKINEYK RRQAPVGIRV THRAFGRDWR YPITSRFVES ID
Full NT Sequence (Expression Vector + Insert)
ttcttgaaga cgaaagggcc tcgtgatacg cctattttta taggttaatg tcatgataat aatggtttct tagacgtcag gtggcacttt tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg tgttgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc tgcagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc actgattaag cattggtaac tgtcagacca agtttactca tatatacttt agattgattt aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc gcatatatgg tgcactctca gtacaatctg ctctgatgcc gcatagttaa gccagtatac actccgctat cgctacgtga ctgggtcatg gctgcgcccc gacacccgcc aacacccgct gacgcgccct gacgggcttg tctgctcccg gcatccgctt acagacaagc tgtgaccgtc tccgggagct gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc gaggcagctg cggtaaagct catcagcgtg gtcgtgaagc gattcacaga tgtctgcctg ttcatccgcg tccagctcgt tgagtttctc cagaagcgtt aatgtctggc ttctgataaa gcgggccatg ttaagggcgg ttttttcctg tttggtcact gatgcctccg tgtaaggggg atttctgttc atgggggtaa tgataccgat gaaacgagag aggatgctca cgatacgggt tactgatgat gaacatgccc ggttactgga acgttgtgag ggtaaacaac tggcggtatg gatgcggcgg gaccagagaa aaatcactca gggtcaatgc cagcgcttcg ttaatacaga tgtaggtgtt ccacagggta gccagcagca tcctgcgatg cagatccgga acataatggt gcagggcgct gacttccgcg tttccagact ttacgaaaca cggaaaccga agaccattca tgttgttgct caggtcgcag acgttttgca gcagcagtcg cttcacgttc gctcgcgtat cggtgattca ttctgctaac cagtaaggca accccgccag cctagccggg tcctcaacga caggagcacg atcatgcgca cccgtggcca ggacccaacg ctgcccgaga tgcgccgcgt gcggctgctg gagatggcgg acgcgatgga tatgttctgc caagggttgg tttgcgcatt cacagttctc cgcaagaatt gattggctcc aattcttgga gtggtgaatc cgttagcgag gtgccgccgg cttccattca ggtcgaggtg gcccggctcc atgcaccgcg acgcaacgcg gggaggcaga caaggtatag ggcggcgcct acaatccatg ccaacccgtt ccatgtgctc gccgaggcgg cataaatcgc cgtgacgatc agcggtccag tgatcgaagt taggctggta agagccgcga gcgatccttg aagctgtccc tgatggtcgt catctacctg cctggacagc atggcctgca acgcgggcat cccgatgccg ccggaagcga gaagaatcat aatggggaag gccatccagc ctcgcgtcgt gaacgccagc aagacgtagc ccagcgcgtc ggccgtaaca acaccattta aatggagtgg ttacaaatgg agtggttaat taacaacacc atttgtcgac gctctccctt atgcgactcc tgcattagga agcagcccag tagtaggttg aggccgttga gcaccgccgc cgcaaggaat ggtgcatgca aggagatggc gcccaacagt cccccggcca cggggcctgc caccataccc acgccgaaac aagcgctcat gagcccgaag tggcgagccc gatcttcccc atcggtgatg tcggcgatat aggcgccagc aaccgcacct gtggcgccgg tgatgccggc cacgatgcgt ccggcgtaga ggatcgagat ctcgatcccg cgaaattaat acgactcact atagggagac cacaacggtt tccctctaga aataattttg tttaacttta agaaggagat ataccatggc tcatcaccat caccatcata tgggtaccct ggaagctcag acccagggtc ctggttcgat gaagacccgt atcgcacttg cccaactcaa cgtcaccgtc ggcgatttcg ccggcaacgt cgccaagatc gtcgccgccg cgcaagccgc gcacgatgcc ggcgcgcact tcctgatcgc gcccgagctc gcgctgtccg gctacccgcc cgaggatctg ctgctgcgcc ccgcgttcta cgcggcgtcc gacgcggcgc tcgccgagct cgccgcgcaa ctcaagccgt tcgccgggct cgcggtgctc gtcggccacc cgctgcgcgc accaagcgcc gatggtaatg caaaccgtgc gatcgagcgc ggcgtcccgc cggtcgacac gtacaacgcg gcatcgctga tcgtcggcgg cgaggtcgcc ggcacgtacc gcaagcagga cttgccgaac accgaggtgt tcgacgagaa gcgctatttc gcgaccgacg ccgcgccgta cgtattcgag ctgaacggcg tgaagttcgg cgtcgtgatc tgcgaggacg tgtggcatgc gtcggccgcg cagctcgcga aggcggcggg cgcgcaggtg ctgatcgtgc cgaacggctc gccgtaccac atgaacaagg acgcggtgcg catcgacatc ctgcgcgcgc ggattcgcga aacgggcctg ccgatggtct acgtgaatct cgtcggcggc caggacgagc tcgtgttcga cggcggctct ttcgtgctcg acggcgcggg cgagctggtc gcgaagatgc cgcagttcga ggagggcaat gcgatcgtcg agttcgacgg cgcgcgagcg ctgcccgccg ctatcgcgcc ggcgctcagc gtcgaggcgc aggtgtatcg cgcgctcgtg ctcggcgtgc gcgactacat cggcaagaac ggtttccccg gcgcgatcat cgggctgtcg ggcggcgtcg attcggcgct cgtgctcgcg gtggccgtcg acgcgctcgg tgccgagcgc gtgcgcgcgg tgatgatgcc gtcgcgctac acggccggca tctcgacgac cgacgcggcc gacatggcgc ggcgcgtcgg cgtgcgctac gacgagatcg cgatcgcgcc gatgttcgat gcgttccgcg cgtcgctcgc ggccgagttc gcgggcctcg ccgaagacgc gacggaggag aacatccagg cgcgcattcg cggcacgctg ctgatggcgc tgtcgaacaa gttcggctcg atcgtgctga cgacgggcaa caagagcgag atggcggtcg gctactgcac gctttacggc gacatggcgg gcggcttcgc ggtgatcaag gacatcgcga agacgctcgt ctaccggctc tgccgttacc gcaacgcggc ggccgaatac ggccagcccg acatcgttcc cgagcggatt ctcacgcgcg cgccgtcggc cgagctgcgc gagaaccaga ccgaccagga cagcctgccg ccgtacgatg tgctcgacgc gatcatgcgc atgtacatgg aggaggaccg gccgctcgcg gagatcgtcg cggcgggcta ttcggaggcg gacgtgaagc gcgtcacgcg gctcatcaag atcaacgaat acaagcgccg gcaggcgccc gtcggcattc gcgtcacgca ccgcgcgttc gggcgcgact ggcgctatcc gatcacgtcg cgcttcgtcg agagcatcga ctaaacagca cgaacaagtt ctgcagccaa gcttctcgag gatccggctg ctaacaaagc ccgaaaggaa gctgagttgg ctgctgccac cgctgagcaa taactagcat aaccccttgg ggcctctaaa cgggtcttga ggggtttttt gctgaaagga ggaactatat ccggatatcc acaggacggg tgtggtcgcc atgatcgcgt agtcgatagt ggctccaagt agcgaagcga gcaggactgg gcggcggcca aagcggtcgg acagtgctcc gagaacgggt gcgcatagaa attgcatcaa cgcatatagc gctagcagca cgccatagtg actggcgatg ctgtcggaat ggacgatatc ccgcaagagg cccggcagta ccggcataac caagcctatg cctacagcat ccagggtgac ggtgccgagg atgacgatga gcgcattgtt agatttcata cacggtgcct gactgcgtta gcaatttaac tgtgataaac taccgcatta aagcttatcg atgataagct gtcaaacatg agaa
Details for ButhA.18002.a.A1.PS01197
PURIFICATION DATe: 8/31/2011
CONCENTRATION: 57.1mg/ml
OBSERVED MW: data unavailable
EXPRESSION LEVEL: Moderate Expression
PROTEIN PURIFICATION BUFFER: 20 mM HEPES, pH 7.0, 300 mM NaCl, 5% glycerol and 1 mM TCEP
EXPRESSION HOST: data unavailable
VIAL COUNT (approx.): 6
VIAL VOLUME: 200µl
PERCENT IDENTITY: 69
PERCENT COVERAGE: 95
Protocol Notes
notes unavailable
Validated AA Sequence
MKTRIALAQL NVTVGDFAGN VAKIVAAAQA AHDAGAHFLI APELALSGYP PEDLLLRPAF YAASDAALAE LAAQLKPFAG LAVLVGHPLR APSADGNANR AIERGVPPVD TYNAASLIVG GEVAGTYRKQ DLPNTEVFDE KRYFATDAAP YVFELNGVKF GVVICEDVWH ASAAQLAKAA GAQVLIVPNG SPYHMNKDAV RIDILRARIR ETGLPMVYVN LVGGQDELVF DGGSFVLDGA GELVAKMPQF EEGNAIVEFD GARALPAAIA PALSVEAQVY RALVLGVRDY IXKNGFPGAI IGLSGGVDXX XXARGXXXRA RCRARARGDD AVALHGRHLD DRRGRHGAAR RRALRRDRDR ADVRCVPRVA RGRVRGPRRR RDGGEHPGAH SRHAADGAVE QVRLDRADDG QQERDGGRLL HALRRHGGRL RGDQGHREDA RLPALPLPQR GGRIRPARHR SRADSHARAV GRAAREPDRP GQPAAVRCAR RDHAHVHGGG PAARGDRRGG LFGGGREARH AAHQDQRIQA PAGARRHSRH APRVRARLAL SDHV
Validated NT Sequence
gcgacgtgat cggatagcgc cagtcgcgcc cgaacgcgcg gtgcgtgacg cgaatgccga cgggcgcctg ccggcgcttg tattcgttga tcttgatgag ccgcgtgacg cgcttcacgt ccgcctccga atagcccgcc gcgacgatct ccgcgagcgg ccggtcctcc tccatgtaca tgcgcatgat cgcgtcgagc acatcgtacg gcggcaggct gtcctggtcg gtctggttct cgcgcagctc ggccgacggc gcgcgcgtga gaatccgctc gggaacgatg tcgggctggc cgtattcggc cgccgcgttg cggtaacggc agagccggta gacgagcgtc ttcgcgatgt ccttgatcac cgcgaagccg cccgccatgt cgccgtaaag cgtgcagtag ccgaccgcca tctcgctctt gttgcccgtc gtcagcacga tcgagccgaa cttgttcgac agcgccatca gcagcgtgcc gcgaatgcgc gcctggatgt tctcctccgt cgcgtcttcg gcgaggcccg cgaactcggc cgcgagcgac gcgcggaacg catcgaacat cggcgcgatc gcgatctcgt cgtagcgcac gccgacgcgc cgcgccatgt cggccgcgtc ggtcgtcgag atgccggccg tgtagcgcga cggcatcatc accgcgcgca cgcgctcggc accgagcgcg tcnacngnca ccgcgagcac naannnncna atcgacgccg cccgacagcc cgatgatcgc gccggggaaa ccgttcttgn cgatgtagtc gcgcacgccg agcacgagcg cgcgatacac ctgcgcctcg acgctgagcg cnggcgcgat agcggcgggc agcgctcgcg cgccgtcgaa ctcgacgatc gcattgccct cctcgaactg cggcatcttc gcgaccagct cgcccgcgcc gtcgagcacg aaagagccgc cgtcgaacac gagctcgtcc tggccgccga cgagattcac gtagaccatc ggcaggcccg tttcgcgaat ccgcgcgcgc aggatgtcga tgcgcaccgc gtccttgttc atgtggtacg gcgagccgtt cggcacgatc agcacctgcg cgcccgccgc cttcgcgagc tgcgcggccg acgcatgcca cacgtcctcg cagatcacga cgccgaactt cacgccgttc agctcgaata cgtacggcgc ggcgtcggtc gcgaaatagc gcttctcgtc gaacacctcg gtgttcggca agtcctgctt gcggtacgtg ccggcgacct cgccgccgac gatcagcgat gccgcgttgt acgtgtcgac cggcgggacg ccgcgctcga tcgcacggtt tgcattacca tcggcgcttg gtgcgcgcag cgggtggccg acgagcaccg cgagcccggc gaacggcttg agttgcgcgg cgagctcggc gagcgccgcg tcggacgccg cgtagaacgc ggggcgcagc agcagatcct cgggcgggta gccggacagc gcgagctcgg gcgcgatcag gaagtgcgcg ccggcatcgt gcgcggcttg cgcggcggcg acgatcttgg cgacgttgcc ggcgaaatcg ccgacggtga cgttgagttg ggcaagtgcg atacgggtct tcatcgaacc aggaccctgg gtctgagctt ccagggtac
Expressed Protein Sequence
MAHHHHHHMG TLEAQTQGPG SMKTRIALAQ LNVTVGDFAG NVAKIVAAAQ AAHDAGAHFL IAPELALSGY PPEDLLLRPA FYAASDAALA ELAAQLKPFA GLAVLVGHPL RAPSADGNAN RAIERGVPPV DTYNAASLIV GGEVAGTYRK QDLPNTEVFD EKRYFATDAA PYVFELNGVK FGVVICEDVW HASAAQLAKA AGAQVLIVPN GSPYHMNKDA VRIDILRARI RETGLPMVYV NLVGGQDELV FDGGSFVLDG AGELVAKMPQ FEEGNAIVEF DGARALPAAI APALSVEAQV YRALVLGVRD YIGKNGFPGA IIGLSGGVDS ALVLAVAVDA LGAERVRAVM MPSRYTAGIS TTDAADMARR VGVRYDEIAI APMFDAFRAS LAAEFAGLAE DATEENIQAR IRGTLLMALS NKFGSIVLTT GNKSEMAVGY CTLYGDMAGG FAVIKDIAKT LVYRLCRYRN AAAEYGQPDI VPERILTRAP SAELRENQTD QDSLPPYDVL DAIMRMYMEE DRPLAEIVAA GYSEADVKRV TRLIKINEYK RRQAPVGIRV THRAFGRDWR YPITSRFVES ID
Full NT Sequence (Expression Vector + Insert)
TTCTTGAAGA CGAAAGGGCC TCGTGATACG CCTATTTTTA TAGGTTAATG TCATGATAAT AATGGTTTCT TAGACGTCAG GTGGCACTTT TCGGGGAAAT GTGCGCGGAA CCCCTATTTG TTTATTTTTC TAAATACATT CAAATATGTA TCCGCTCATG AGACAATAAC CCTGATAAAT GCTTCAATAA TATTGAAAAA GGAAGAGTAT GAGTATTCAA CATTTCCGTG TCGCCCTTAT TCCCTTTTTT GCGGCATTTT GCCTTCCTGT TTTTGCTCAC CCAGAAACGC TGGTGAAAGT AAAAGATGCT GAAGATCAGT TGGGTGCACG AGTGGGTTAC ATCGAACTGG ATCTCAACAG CGGTAAGATC CTTGAGAGTT TTCGCCCCGA AGAACGTTTT CCAATGATGA GCACTTTTAA AGTTCTGCTA TGTGGCGCGG TATTATCCCG TGTTGACGCC GGGCAAGAGC AACTCGGTCG CCGCATACAC TATTCTCAGA ATGACTTGGT TGAGTACTCA CCAGTCACAG AAAAGCATCT TACGGATGGC ATGACAGTAA GAGAATTATG CAGTGCTGCC ATAACCATGA GTGATAACAC TGCGGCCAAC TTACTTCTGA CAACGATCGG AGGACCGAAG GAGCTAACCG CTTTTTTGCA CAACATGGGG GATCATGTAA CTCGCCTTGA TCGTTGGGAA CCGGAGCTGA ATGAAGCCAT ACCAAACGAC GAGCGTGACA CCACGATGCC TGCAGCAATG GCAACAACGT TGCGCAAACT ATTAACTGGC GAACTACTTA CTCTAGCTTC CCGGCAACAA TTAATAGACT GGATGGAGGC GGATAAAGTT GCAGGACCAC TTCTGCGCTC GGCCCTTCCG GCTGGCTGGT TTATTGCTGA TAAATCTGGA GCCGGTGAGC GTGGGTCTCG CGGTATCATT GCAGCACTGG GGCCAGATGG TAAGCCCTCC CGTATCGTAG TTATCTACAC GACGGGGAGT CAGGCAACTA TGGATGAACG AAATAGACAG ATCGCTGAGA TAGGTGCCTC ACTGATTAAG CATTGGTAAC TGTCAGACCA AGTTTACTCA TATATACTTT AGATTGATTT AAAACTTCAT TTTTAATTTA AAAGGATCTA GGTGAAGATC CTTTTTGATA ATCTCATGAC CAAAATCCCT TAACGTGAGT TTTCGTTCCA CTGAGCGTCA GACCCCGTAG AAAAGATCAA AGGATCTTCT TGAGATCCTT TTTTTCTGCG CGTAATCTGC TGCTTGCAAA CAAAAAAACC ACCGCTACCA GCGGTGGTTT GTTTGCCGGA TCAAGAGCTA CCAACTCTTT TTCCGAAGGT AACTGGCTTC AGCAGAGCGC AGATACCAAA TACTGTCCTT CTAGTGTAGC CGTAGTTAGG CCACCACTTC AAGAACTCTG TAGCACCGCC TACATACCTC GCTCTGCTAA TCCTGTTACC AGTGGCTGCT GCCAGTGGCG ATAAGTCGTG TCTTACCGGG TTGGACTCAA GACGATAGTT ACCGGATAAG GCGCAGCGGT CGGGCTGAAC GGGGGGTTCG TGCACACAGC CCAGCTTGGA GCGAACGACC TACACCGAAC TGAGATACCT ACAGCGTGAG CTATGAGAAA GCGCCACGCT TCCCGAAGGG AGAAAGGCGG ACAGGTATCC GGTAAGCGGC AGGGTCGGAA CAGGAGAGCG CACGAGGGAG CTTCCAGGGG GAAACGCCTG GTATCTTTAT AGTCCTGTCG GGTTTCGCCA CCTCTGACTT GAGCGTCGAT TTTTGTGATG CTCGTCAGGG GGGCGGAGCC TATGGAAAAA CGCCAGCAAC GCGGCCTTTT TACGGTTCCT GGCCTTTTGC TGGCCTTTTG CTCACATGTT CTTTCCTGCG TTATCCCCTG ATTCTGTGGA TAACCGTATT ACCGCCTTTG AGTGAGCTGA TACCGCTCGC CGCAGCCGAA CGACCGAGCG CAGCGAGTCA GTGAGCGAGG AAGCGGAAGA GCGCCTGATG CGGTATTTTC TCCTTACGCA TCTGTGCGGT ATTTCACACC GCATATATGG TGCACTCTCA GTACAATCTG CTCTGATGCC GCATAGTTAA GCCAGTATAC ACTCCGCTAT CGCTACGTGA CTGGGTCATG GCTGCGCCCC GACACCCGCC AACACCCGCT GACGCGCCCT GACGGGCTTG TCTGCTCCCG GCATCCGCTT ACAGACAAGC TGTGACCGTC TCCGGGAGCT GCATGTGTCA GAGGTTTTCA CCGTCATCAC CGAAACGCGC GAGGCAGCTG CGGTAAAGCT CATCAGCGTG GTCGTGAAGC GATTCACAGA TGTCTGCCTG TTCATCCGCG TCCAGCTCGT TGAGTTTCTC CAGAAGCGTT AATGTCTGGC TTCTGATAAA GCGGGCCATG TTAAGGGCGG TTTTTTCCTG TTTGGTCACT GATGCCTCCG TGTAAGGGGG ATTTCTGTTC ATGGGGGTAA TGATACCGAT GAAACGAGAG AGGATGCTCA CGATACGGGT TACTGATGAT GAACATGCCC GGTTACTGGA ACGTTGTGAG GGTAAACAAC TGGCGGTATG GATGCGGCGG GACCAGAGAA AAATCACTCA GGGTCAATGC CAGCGCTTCG TTAATACAGA TGTAGGTGTT CCACAGGGTA GCCAGCAGCA TCCTGCGATG CAGATCCGGA ACATAATGGT GCAGGGCGCT GACTTCCGCG TTTCCAGACT TTACGAAACA CGGAAACCGA AGACCATTCA TGTTGTTGCT CAGGTCGCAG ACGTTTTGCA GCAGCAGTCG CTTCACGTTC GCTCGCGTAT CGGTGATTCA TTCTGCTAAC CAGTAAGGCA ACCCCGCCAG CCTAGCCGGG TCCTCAACGA CAGGAGCACG ATCATGCGCA CCCGTGGCCA GGACCCAACG CTGCCCGAGA TGCGCCGCGT GCGGCTGCTG GAGATGGCGG ACGCGATGGA TATGTTCTGC CAAGGGTTGG TTTGCGCATT CACAGTTCTC CGCAAGAATT GATTGGCTCC AATTCTTGGA GTGGTGAATC CGTTAGCGAG GTGCCGCCGG CTTCCATTCA GGTCGAGGTG GCCCGGCTCC ATGCACCGCG ACGCAACGCG GGGAGGCAGA CAAGGTATAG GGCGGCGCCT ACAATCCATG CCAACCCGTT CCATGTGCTC GCCGAGGCGG CATAAATCGC CGTGACGATC AGCGGTCCAG TGATCGAAGT TAGGCTGGTA AGAGCCGCGA GCGATCCTTG AAGCTGTCCC TGATGGTCGT CATCTACCTG CCTGGACAGC ATGGCCTGCA ACGCGGGCAT CCCGATGCCG CCGGAAGCGA GAAGAATCAT AATGGGGAAG GCCATCCAGC CTCGCGTCGT GAACGCCAGC AAGACGTAGC CCAGCGCGTC GGCCGTAACA ACACCATTTA AATGGAGTGG TTACAAATGG AGTGGTTAAT TAACAACACC ATTTGTCGAC GCTCTCCCTT ATGCGACTCC TGCATTAGGA AGCAGCCCAG TAGTAGGTTG AGGCCGTTGA GCACCGCCGC CGCAAGGAAT GGTGCATGCA AGGAGATGGC GCCCAACAGT CCCCCGGCCA CGGGGCCTGC CACCATACCC ACGCCGAAAC AAGCGCTCAT GAGCCCGAAG TGGCGAGCCC GATCTTCCCC ATCGGTGATG TCGGCGATAT AGGCGCCAGC AACCGCACCT GTGGCGCCGG TGATGCCGGC CACGATGCGT CCGGCGTAGA GGATCGAGAT CTCGATCCCG CGAAATTAAT ACGACTCACT ATAGGGAGAC CACAACGGTT TCCCTCTAGA AATAATTTTG TTTAACTTTA AGAAGGAGAT ATACCATGGC TCATCACCAT CACCATCATA TGGGTACCCT GGAAGCTCAG ACCCAGGGTC CTGGTTCGAT GAAGACCCGT ATCGCACTTG CCCAACTCAA CGTCACCGTC GGCGATTTCG CCGGCAACGT CGCCAAGATC GTCGCCGCCG CGCAAGCCGC GCACGATGCC GGCGCGCACT TCCTGATCGC GCCCGAGCTC GCGCTGTCCG GCTACCCGCC CGAGGATCTG CTGCTGCGCC CCGCGTTCTA CGCGGCGTCC GACGCGGCGC TCGCCGAGCT CGCCGCGCAA CTCAAGCCGT TCGCCGGGCT CGCGGTGCTC GTCGGCCACC CGCTGCGCGC ACCAAGCGCC GATGGTAATG CAAACCGTGC GATCGAGCGC GGCGTCCCGC CGGTCGACAC GTACAACGCG GCATCGCTGA TCGTCGGCGG CGAGGTCGCC GGCACGTACC GCAAGCAGGA CTTGCCGAAC ACCGAGGTGT TCGACGAGAA GCGCTATTTC GCGACCGACG CCGCGCCGTA CGTATTCGAG CTGAACGGCG TGAAGTTCGG CGTCGTGATC TGCGAGGACG TGTGGCATGC GTCGGCCGCG CAGCTCGCGA AGGCGGCGGG CGCGCAGGTG CTGATCGTGC CGAACGGCTC GCCGTACCAC ATGAACAAGG ACGCGGTGCG CATCGACATC CTGCGCGCGC GGATTCGCGA AACGGGCCTG CCGATGGTCT ACGTGAATCT CGTCGGCGGC CAGGACGAGC TCGTGTTCGA CGGCGGCTCT TTCGTGCTCG ACGGCGCGGG CGAGCTGGTC GCGAAGATGC CGCAGTTCGA GGAGGGCAAT GCGATCGTCG AGTTCGACGG CGCGCGAGCG CTGCCCGCCG CTATCGCGCC GGCGCTCAGC GTCGAGGCGC AGGTGTATCG CGCGCTCGTG CTCGGCGTGC GCGACTACAT CGGCAAGAAC GGTTTCCCCG GCGCGATCAT CGGGCTGTCG GGCGGCGTCG ATTCGGCGCT CGTGCTCGCG GTGGCCGTCG ACGCGCTCGG TGCCGAGCGC GTGCGCGCGG TGATGATGCC GTCGCGCTAC ACGGCCGGCA TCTCGACGAC CGACGCGGCC GACATGGCGC GGCGCGTCGG CGTGCGCTAC GACGAGATCG CGATCGCGCC GATGTTCGAT GCGTTCCGCG CGTCGCTCGC GGCCGAGTTC GCGGGCCTCG CCGAAGACGC GACGGAGGAG AACATCCAGG CGCGCATTCG CGGCACGCTG CTGATGGCGC TGTCGAACAA GTTCGGCTCG ATCGTGCTGA CGACGGGCAA CAAGAGCGAG ATGGCGGTCG GCTACTGCAC GCTTTACGGC GACATGGCGG GCGGCTTCGC GGTGATCAAG GACATCGCGA AGACGCTCGT CTACCGGCTC TGCCGTTACC GCAACGCGGC GGCCGAATAC GGCCAGCCCG ACATCGTTCC CGAGCGGATT CTCACGCGCG CGCCGTCGGC CGAGCTGCGC GAGAACCAGA CCGACCAGGA CAGCCTGCCG CCGTACGATG TGCTCGACGC GATCATGCGC ATGTACATGG AGGAGGACCG GCCGCTCGCG GAGATCGTCG CGGCGGGCTA TTCGGAGGCG GACGTGAAGC GCGTCACGCG GCTCATCAAG ATCAACGAAT ACAAGCGCCG GCAGGCGCCC GTCGGCATTC GCGTCACGCA CCGCGCGTTC GGGCGCGACT GGCGCTATCC GATCACGTCG CGCTTCGTCG AGAGCATCGA CTAAACAGCA CGAACAAGTT CTGCAGCCAA GCTTCTCGAG GATCCGGCTG CTAACAAAGC CCGAAAGGAA GCTGAGTTGG CTGCTGCCAC CGCTGAGCAA TAACTAGCAT AACCCCTTGG GGCCTCTAAA CGGGTCTTGA GGGGTTTTTT GCTGAAAGGA GGAACTATAT CCGGATATCC ACAGGACGGG TGTGGTCGCC ATGATCGCGT AGTCGATAGT GGCTCCAAGT AGCGAAGCGA GCAGGACTGG GCGGCGGCCA AAGCGGTCGG ACAGTGCTCC GAGAACGGGT GCGCATAGAA ATTGCATCAA CGCATATAGC GCTAGCAGCA CGCCATAGTG ACTGGCGATG CTGTCGGAAT GGACGATATC CCGCAAGAGG CCCGGCAGTA CCGGCATAAC CAAGCCTATG CCTACAGCAT CCAGGGTGAC GGTGCCGAGG ATGACGATGA GCGCATTGTT AGATTTCATA CACGGTGCCT GACTGCGTTA GCAATTTAAC TGTGATAAAC TACCGCATTA AAGCTTATCG ATGATAAGCT GTCAAACATG AGAA