BupsE.19908.a

Putative exported protein (BPSS2145)

CENTER ID: BupsE.19908.a
ORGANISM: Burkholderia pseudomallei K96243
ASSOCIATED DISEASE: Melioidosis
CURRENT STATUS: in PDB
COMMUNITY REQUEST: True
NIH RISK GROUP: 3
SELECT AGENT: True
NIH PRIORITY PATHOGENS CATEGORY: IIIB

Ordering Clones & Proteins

If materials are available for this target, they are listed below. To order from SSGCID, use the button in the "order material" column, which adds the material to a virtual cart. You may order multiple materials at a time at no cost to you, as this contract is funded by NIAID. When you are ready to place your order, click the "Place Order" link, which appears in the top right corner of the page once your cart contains at least one item.

Clones*

CENTER REFERENCE ID        DOMAIN/REGION DESCRIPTION   INFO       AA START   AA STOP   ORDER MATERIAL
BupsE.19908.a.B2.GE42226   mature protein              (button)   34         563       (button)
* SSGCID clones represent un-induced expression constructs which have been verified by sequencing from vector primers. Clones may contain a tag, such as N-term 6xHis. Get sequence information using the button in the "info" column.

Proteins

CENTER REFERENCE ID        DOMAIN/REGION DESCRIPTION   INFO       AA START   AA STOP   ORDER MATERIAL
BupsE.19908.a.B2.PW38478   mature protein              (button)   34         563       (button)
Structures
6OZD
DEPOSITED: 5/15/2019
DETERMINATION: XRay
CLONE: BupsE.19908.a.B2.GE42226
PROTEIN: BupsE.19908.a.B2.PW38478
External Resources
RESOURCE REFERENCE ID
PATRIC ID: fig|272560.51.peg.5714
UniProt: Q63IC6
Sequences
These are the native gene sequences; the sequences of constructs derived from them may differ due to codon optimization or other protocols. To find the specific sequence of any material you have ordered, click the "info" button next to the name of that material.
AA Sequence
MRTTKSPGIS VIRQRLAVAL PLALTLTLAS CGGDDLTPAA QRWAMPGTEL PLGPQGLAQS VSTQTLAAGV AYYQIKRGAA SAADFWTVNL GFYATQAAAQ ADAANLAAAG FATRVDASAG TDLQGKVLGY WLSAGRYATQ AEATAAAARI AQATQNRYKP GTRHTSLAGA PTTGPWIVNV LAIDPSRAGA ALSLALPGGN DLGAGGETVS AARARVNALA GVNGGFFTNI NPFGAPLPPR SPVGATVVDG RLVAAAIGRR PGLLLARDAN GRQRATVVRN LATAITLTDA QGRAIAVQTL NRPILGTVVN CGAQARTPTS EPAQDTVCTN YDDLVMYDSL YLRGGASNTL VDAGYQGARY ELVVDANGAV VAGHATLGAP PPPNGYVLQG LGASAAWLQA HATPGTRLAV SRRLSADGAD LALASGTSLV EAGPTLSVPN LAQSAAQEGF APTVGGVDAG EGAAANGNWY NGWYVARNGR TAAGVAADGT ILLVEIDGRQ PTLSVGTSIP ETAAVMAWLG ATSAVNLDGG GSSNMVVGGK MVGHPSDAVG ERGVGDTLML LPG
NT Sequence
ttgcggacga cgaaatctcc cggcatctcc gtgattcggc agcggctcgc cgttgcgctg ccgcttgcgt tgacgctcac cctcgcgagc tgcggcggtg acgatctgac gcccgccgcg cagcgctggg cgatgcccgg caccgaactg ccgctcgggc cgcagggcct cgcgcagagc gtgtcgacgc agacgctcgc cgcaggcgtc gcctattacc agatcaagcg cggcgcggcg agcgcggccg atttctggac cgtcaacctc ggcttctacg cgacgcaggc cgcggcgcag gccgatgcgg cgaatctcgc ggcggccggc ttcgcgacgc gcgtcgacgc gtcggcgggc accgacctgc agggcaaggt gctcggctac tggctgtcgg ccggccgcta cgcgacgcag gccgaggcga cggcggccgc cgcgcgcatc gcgcaggcca cgcagaaccg ctacaagccg ggcacgcggc atacgtcgct cgccggcgcg ccgacgacgg ggccgtggat cgtcaacgtg ctcgcgatcg acccgtcgcg cgccggcgcg gcgctgtcgc tcgcgctgcc gggcggcaac gatctcggtg cgggcggcga gacggtttcg gccgcgcggg cgcgtgtgaa cgcgctcgcc ggcgtcaacg gcggcttttt cacgaacatc aatccgttcg gcgcgccgct gccgccgcgc tcgcccgtcg gcgcgacggt agtcgacggg cggctcgtcg cggccgcgat cggcaggcgc cccggcctgc tgctcgcgcg cgacgcgaac ggccgccaac gcgcgacggt cgtgcgcaat ctcgcgacgg cgatcacgct gaccgacgcg caaggccgtg cgatcgcggt ccagacgctg aaccggccga tcctcggtac ggtcgtcaat tgcggcgcgc aggcgcgcac gccgacgagc gagccggcgc aggacacggt gtgcacgaac tacgatgacc tcgtgatgta cgactcgcta tatctgcgcg gcggtgcgtc gaacacgctc gtcgacgccg gctaccaggg cgcgcgatac gaactcgtgg tcgacgcgaa cggcgccgtc gtcgccggcc atgcgacgct cggcgcgccg ccgccgccga acggctacgt gctgcagggg ctcggcgcga gcgccgcgtg gctgcaggcg catgcgacgc cgggcacgcg cctcgcggta tcgcgccggc tgtcggccga cggcgcggat ctcgcgctcg cgtcgggcac gtcgctcgtc gaggcggggc cgacgctgtc cgtgccgaat ctcgcgcaaa gcgccgcgca agagggcttc gcgccgacgg tgggcggcgt cgacgcgggc gaaggcgccg cggcgaacgg caactggtac aacggctggt atgtcgcgcg caatgggcgc accgcggcgg gcgtcgcggc ggacggcacg atcctgctcg tcgagatcga cggccggcag cccacgttga gcgtcggcac gagcattccg gagacggcgg cggtgatggc atggctcggt gcgacgtcgg ccgtcaatct cgacggcggc ggctcgagca acatggtggt cggcggcaag atggtcggac atccgtccga cgccgtgggc gagcggggcg tcggcgatac gctgatgctg ctgccgggct ga
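The sequences on this page are printed in space-separated blocks of ten characters, and the clone and protein tables give 1-based, inclusive AA START and AA STOP coordinates. A minimal Python sketch, assuming the text has been copied from the page by hand, joins the blocks and slices out a region. The function names `clean_sequence` and `region` are illustrative and are not part of any SSGCID tool.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def region(sequence: str, start: int, stop: int) -> str:
    """Return residues start..stop using 1-based, inclusive coordinates."""
    if not 1 <= start <= stop <= len(sequence):
        raise ValueError("coordinates fall outside the sequence")
    return sequence[start - 1:stop]


# First four blocks (40 residues) of the native AA sequence listed above.
native_head = clean_sequence("MRTTKSPGIS VIRQRLAVAL PLALTLTLAS CGGDDLTPAA")

# The clone table gives AA START 34 for the mature protein.
print(region(native_head, 34, 40))  # DDLTPAA
```

The printed residues match the start of the expected protein sequence after its MAHHHHHH tag.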
Details for BupsE.19908.a.B2.GE42226
HARVESTED ON: 2/23/2018
SEQUENCED ON: 3/1/2018
EXPECTED MW: 54 kDa
OBSERVED MW: 52 kDa
ANTIBIOTIC MARKER: ampicillin
COUNT OF EXPRESSION COLONIES: Good (10-50)
TOTAL EXPRESSION LEVEL: Moderate Expression
SOLUBLE EXPRESSION LEVEL: Low Expression
EXPRESSION HOST: BL21(DE3) Rosetta
SEQUENCING RESULT:
PERCENT IDENTITY: 98
PERCENT COVERAGE: 98
Validated AA Sequence
MAHHHHHHMD DLTPAAQRWA MPGTELPLGP QGLAQSVSTQ TLAAGVAYYQ IKRGAASAAD FWTVNLGFYA TQAAAQADAA NLAAAGFATR VDASAGTDLQ GKVLGYWLSA GRYATQAEAT AAAARIAQAT QNRYKPGTRH TSLAGAPTTG PWIVNVLAID PSRAGAALSL ALPGGNDLGA GGETVSAARA RVNALAGVNG GFFTNINPFG APLPPRSPVG ATVVDGRLVA AAIGRRPGLL LARXANGRQR ATXVRNLATA ITXTXAQGRA IAVQTXXXPX XXTVVNCGAQ ARTPTSEPAQ DTVCTNYDDL VMYDSLYLRG GASNTLVDAG YQGARYELVV DANGAVVAGH ATLGAPPPPN GYVLQGLGAS AAWLQAHATP GTRLAVSRRL SADGADLALA SGTSLVEAGP TLSVPNLAQS AAQEGFAPTV GGVDAGEGAA ANGNWYNGWY VARNGRTAAG VAADGTILLV EIDGRQPTLS VGTSIPETAA VMAWLGATSA VNLDGGGSSN MVVGGKMVGH PSDAVGERGV G
Validated NT Sequence
tcgccgacgc cccgctcgcc cacggcgtcg gacggatgtc cgaccatctt gccgccgacc accatgttgc tcgagccgcc gccgtcgaga ttgacggccg acgtcgcacc gagccatgcc atcaccgccg ccgtctccgg aatgctcgtg ccgacgctca acgtgggctg ccggccgtcg atctcgacga gcaggatcgt gccgtccgcc gcgacgcccg ccgcggtgcg cccattgcgc gcgacatacc agccgttgta ccagttgccg ttcgccgcgg cgccttcgcc cgcgtcgacg ccgcccaccg tcggcgcgaa gccctcttgc gcggcgcttt gcgcgagatt cggcacggac agcgtcggcc ccgcctcgac gagcgacgtg cccgacgcga gcgcgagatc cgcgccgtcg gccgacagcc ggcgcgatac cgcgaggcgc gtgcccggcg tcgcatgcgc ctgcagccac gcggcgctcg cgccgagccc ctgcagcacg tagccgttcg gcggcggcgg cgcgccgagc gtcgcatggc cggcgacgac ggcgccgttc gcgtcgacca cgagttcgta tcgcgcgccc tggtagccgg cgtcgacgag cgtgttcgac gcaccgccgc gcagatatag cgagtcgtac atcacgaggt catcgtagtt cgtgcacacc gtgtcctgcg ccggctcgct cgtcggcgtg cgcgcctgcg cgccgcaatt gacgaccgtn nnnnnnntcg gcnnnntcnn ngtctggacc gcgatcgcac ggccttgcgc gtnngtnngc gtgatcgccg tcgcgagatt gcgcacganc gtcgcgcgtt ggcggccgtt cgcgncgcgc gcgagcagca ggccggggcg cctgccgatc gcggccgcga cgagccgccc gtcgactacc gtcgcgccga cgggcgagcg cggcggcagc ggcgcgccga acggattgat gttcgtgaaa aagccgccgt tgacgccggc gagcgcgttc acacgcgccc gcgcggccga aaccgtctcg ccgcccgcac cgagatcgtt gccgcccggc agcgcgagcg acagcgccgc gccggcgcgc gacgggtcga tcgcgagcac gttgacgatc cacggccccg tcgtcggcgc gccggcgagc gacgtatgcc gcgtgcccgg cttgtagcgg ttctgcgtgg cctgcgcgat gcgcgcggcg gccgccgtcg cctcggcctg cgtcgcgtag cggccggccg acagccagta gccgagcacc ttgccctgca ggtcggtgcc cgccgacgcg tcgacgcgcg tcgcgaagcc ggccgccgcg agattcgccg catcggcctg cgccgcggcc tgcgtcgcgt agaagccgag gttgacggtc cagaaatcgg ccgcgctcgc cgcgccgcgc ttgatctggt aataggcgac gcctgcggcg agcgtctgcg tcgacacgct ctgcgcgagg ccctgcggcc cgagcggcag ttcggtgccg ggcatcgccc agcgctgcgc ggcgggcgtc agatcgtcca tatggtggtg gtggtggtga gccat
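The validated NT read above appears to be reported on the strand opposite to the coding sequence: its last 27 bases reverse-complement to an ATG start followed by histidine codons. The Python sketch below illustrates that check. The helper names are illustrative, and the codon table is deliberately limited to the codons in this fragment, so it is not a general translator.

```python
# Complement table covering both cases and the ambiguity code n.
COMPLEMENT = str.maketrans("acgtnACGTN", "tgcanTGCAN")

# Only the codons that occur in the fragment below; real use needs a full table.
CODONS = {"atg": "M", "gct": "A", "cac": "H", "cat": "H"}


def reverse_complement(seq: str) -> str:
    """Reverse-complement a nucleotide string, ignoring whitespace."""
    return "".join(seq.split()).translate(COMPLEMENT)[::-1]


def translate(seq: str) -> str:
    """Translate complete codons; codons missing from CODONS become 'X'."""
    return "".join(
        CODONS.get(seq[i:i + 3].lower(), "X") for i in range(0, len(seq) - 2, 3)
    )


read_tail = "catatggtgg tggtggtggt gagccat"  # last 27 bases of the validated read
coding = reverse_complement(read_tail)
print(coding)             # atggctcaccaccaccaccaccatatg
print(translate(coding))  # MAHHHHHHM
```

The translation begins MAHHHHHHM, which agrees with the start of the validated AA sequence.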
Expected Protein Sequence
MAHHHHHHDD LTPAAQRWAM PGTELPLGPQ GLAQSVSTQT LAAGVAYYQI KRGAASAADF WTVNLGFYAT QAAAQADAAN LAAAGFATRV DASAGTDLQG KVLGYWLSAG RYATQAEATA AAARIAQATQ NRYKPGTRHT SLAGAPTTGP WIVNVLAIDP SRAGAALSLA LPGGNDLGAG GETVSAARAR VNALAGVNGG FFTNINPFGA PLPPRSPVGA TVVDGRLVAA AIGRRPGLLL ARDANGRQRA TVVRNLATAI TLTDAQGRAI AVQTLNRPIL GTVVNCGAQA RTPTSEPAQD TVCTNYDDLV MYDSLYLRGG ASNTLVDAGY QGARYELVVD ANGAVVAGHA TLGAPPPPNG YVLQGLGASA AWLQAHATPG TRLAVSRRLS ADGADLALAS GTSLVEAGPT LSVPNLAQSA AQEGFAPTVG GVDAGEGAAA NGNWYNGWYV ARNGRTAAGV AADGTILLVE IDGRQPTLSV GTSIPETAAV MAWLGATSAV NLDGGGSSNM VVGGKMVGHP SDAVGERGVG DTLMLLPG
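The 54 kDa expected MW listed for this clone can be sanity-checked against the expected protein sequence. The sketch below sums average residue masses and adds one water molecule. It is a rough estimate under that assumption, not the calculation SSGCID used, and the function name `approx_mw_kda` is illustrative.

```python
# Average residue masses in daltons (free amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.015  # added once for the free N- and C-termini


def approx_mw_kda(sequence: str) -> float:
    """Estimate the molecular weight in kDa of a block-formatted AA sequence."""
    seq = "".join(sequence.split()).upper()
    unknown = set(seq) - set(RESIDUE_MASS)
    if unknown:
        raise ValueError(f"unsupported residue codes: {sorted(unknown)}")
    return (sum(RESIDUE_MASS[aa] for aa in seq) + WATER) / 1000.0


# Example call on the first two blocks of the expected protein sequence.
print(round(approx_mw_kda("MAHHHHHHDD LTPAAQRWAM"), 2))
```

Passing the full expected protein sequence gives an estimate that can be compared with the listed value. Ambiguous residues such as X raise an error instead of being silently skipped.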
Full NT Sequence (Expression Vector + Insert)
taatacgact cactataggg agaccacaac ggtttccctc tagaaataat tttgtttaac tttaagaagg agatatacca tggctcacca ccaccaccac catgacgatc tgacgcccgc cgcgcagcgc tgggcgatgc ccggcaccga actgccgctc gggccgcagg gcctcgcgca gagcgtgtcg acgcagacgc tcgccgcagg cgtcgcctat taccagatca agcgcggcgc ggcgagcgcg gccgatttct ggaccgtcaa cctcggcttc tacgcgacgc aggccgcggc gcaggccgat gcggcgaatc tcgcggcggc cggcttcgcg acgcgcgtcg acgcgtcggc gggcaccgac ctgcagggca aggtgctcgg ctactggctg tcggccggcc gctacgcgac gcaggccgag gcgacggcgg ccgccgcgcg catcgcgcag gccacgcaga accgctacaa gccgggcacg cggcatacgt cgctcgccgg cgcgccgacg acggggccgt ggatcgtcaa cgtgctcgcg atcgacccgt cgcgcgccgg cgcggcgctg tcgctcgcgc tgccgggcgg caacgatctc ggtgcgggcg gcgagacggt ttcggccgcg cgggcgcgtg tgaacgcgct cgccggcgtc aacggcggct ttttcacgaa catcaatccg ttcggcgcgc cgctgccgcc gcgctcgccc gtcggcgcga cggtagtcga cgggcggctc gtcgcggccg cgatcggcag gcgccccggc ctgctgctcg cgcgcgacgc gaacggccgc caacgcgcga cggtcgtgcg caatctcgcg acggcgatca cgctgaccga cgcgcaaggc cgtgcgatcg cggtccagac gctgaaccgg ccgatcctcg gtacggtcgt caattgcggc gcgcaggcgc gcacgccgac gagcgagccg gcgcaggaca cggtgtgcac gaactacgat gacctcgtga tgtacgactc gctatatctg cgcggcggtg cgtcgaacac gctcgtcgac gccggctacc agggcgcgcg atacgaactc gtggtcgacg cgaacggcgc cgtcgtcgcc ggccatgcga cgctcggcgc gccgccgccg ccgaacggct acgtgctgca ggggctcggc gcgagcgccg cgtggctgca ggcgcatgcg acgccgggca cgcgcctcgc ggtatcgcgc cggctgtcgg ccgacggcgc ggatctcgcg ctcgcgtcgg gcacgtcgct cgtcgaggcg gggccgacgc tgtccgtgcc gaatctcgcg caaagcgccg cgcaagaggg cttcgcgccg acggtgggcg gcgtcgacgc gggcgaaggc gccgcggcga acggcaactg gtacaacggc tggtatgtcg cgcgcaatgg gcgcaccgcg gcgggcgtcg cggcggacgg cacgatcctg ctcgtcgaga tcgacggccg gcagcccacg ttgagcgtcg gcacgagcat tccggagacg gcggcggtga tggcatggct cggtgcgacg tcggccgtca atctcgacgg cggcggctcg agcaacatgg tggtcggcgg caagatggtc ggacatccgt ccgacgccgt gggcgagcgg ggcgtcggcg atacgctgat gctgctgccg ggctgagtaa gataggatcc ggctgctaac aaagcccgaa aggaagctga gttggctgct gccaccgctg agcaataact agcataaccc cttggggcct ctaaacgggt cttgaggggt 
tttttgctga aaggaggaac tatatccgga tatccacagg acgggtgtgg tcgccatgat cgcgtagtcg atagtggctc caagtagcga agcgagcagg actgggcggc ggccaaagcg gtcggacagt gctccgagaa cgggtgcgca tagaaattgc atcaacgcat atagcgctag cagcacgcca tagtgactgg cgatgctgtc ggaatggacg atatcccgca agaggcccgg cagtaccggc ataaccaagc ctatgcctac agcatccagg gtgacggtgc cgaggatgac gatgagcgca ttgttagatt tcatacacgg tgcctgactg cgttagcaat ttaactgtga taaactaccg cattaaagct tatcgatgat aagctgtcaa acatgagaat tcttgaagac gaaagggcct cgtgatacgc ctatttttat aggttaatgt catgataata atggtttctt agacgtcagg tggcactttt cggggaaatg tgcgcggaac ccctatttgt ttatttttct aaatacattc aaatatgtat ccgctcatga gacaataacc ctgataaatg cttcaataat attgaaaaag gaagagtatg agtattcaac atttccgtgt cgcccttatt cccttttttg cggcattttg ccttcctgtt tttgctcacc cagaaacgct ggtgaaagta aaagatgctg aagatcagtt gggtgcacga gtgggttaca tcgaactgga tctcaacagc ggtaagatcc ttgagagttt tcgccccgaa gaacgttttc caatgatgag cacttttaaa gttctgctat gtggcgcggt attatcccgt gttgacgccg ggcaagagca actcggtcgc cgcatacact attctcagaa tgacttggtt gagtactcac cagtcacaga aaagcatctt acggatggca tgacagtaag agaattatgc agtgctgcca taaccatgag tgataacact gcggccaact tacttctgac aacgatcgga ggaccgaagg agctaaccgc ttttttgcac aacatggggg atcatgtaac tcgccttgat cgttgggaac cggagctgaa tgaagccata ccaaacgacg agcgtgacac cacgatgcct gcagcaatgg caacaacgtt gcgcaaacta ttaactggcg aactacttac tctagcttcc cggcaacaat taatagactg gatggaggcg gataaagttg caggaccact tctgcgctcg gcccttccgg ctggctggtt tattgctgat aaatctggag ccggtgagcg tgggtctcgc ggtatcattg cagcactggg gccagatggt aagccctccc gtatcgtagt tatctacacg acggggagtc aggcaactat ggatgaacga aatagacaga tcgctgagat aggtgcctca ctgattaagc attggtaact gtcagaccaa gtttactcat atatacttta gattgattta aaacttcatt tttaatttaa aaggatctag gtgaagatcc tttttgataa tctcatgacc aaaatccctt aacgtgagtt ttcgttccac tgagcgtcag accccgtaga aaagatcaaa ggatcttctt gagatccttt ttttctgcgc gtaatctgct gcttgcaaac aaaaaaacca ccgctaccag cggtggtttg tttgccggat caagagctac caactctttt tccgaaggta actggcttca gcagagcgca gataccaaat actgtccttc tagtgtagcc 
gtagttaggc caccacttca agaactctgt agcaccgcct acatacctcg ctctgctaat cctgttacca gtggctgctg ccagtggcga taagtcgtgt cttaccgggt tggactcaag acgatagtta ccggataagg cgcagcggtc gggctgaacg gggggttcgt gcacacagcc cagcttggag cgaacgacct acaccgaact gagataccta cagcgtgagc tatgagaaag cgccacgctt cccgaaggga gaaaggcgga caggtatccg gtaagcggca gggtcggaac aggagagcgc acgagggagc ttccaggggg aaacgcctgg tatctttata gtcctgtcgg gtttcgccac ctctgacttg agcgtcgatt tttgtgatgc tcgtcagggg ggcggagcct atggaaaaac gccagcaacg cggccttttt acggttcctg gccttttgct ggccttttgc tcacatgttc tttcctgcgt tatcccctga ttctgtggat aaccgtatta ccgcctttga gtgagctgat accgctcgcc gcagccgaac gaccgagcgc agcgagtcag tgagcgagga agcggaagag cgcctgatgc ggtattttct ccttacgcat ctgtgcggta tttcacaccg catatatggt gcactctcag tacaatctgc tctgatgccg catagttaag ccagtataca ctccgctatc gctacgtgac tgggtcatgg ctgcgccccg acacccgcca acacccgctg acgcgccctg acgggcttgt ctgctcccgg catccgctta cagacaagct gtgaccgtct ccgggagctg catgtgtcag aggttttcac cgtcatcacc gaaacgcgcg aggcagctgc ggtaaagctc atcagcgtgg tcgtgaagcg attcacagat gtctgcctgt tcatccgcgt ccagctcgtt gagtttctcc agaagcgtta atgtctggct tctgataaag cgggccatgt taagggcggt tttttcctgt ttggtcactg atgcctccgt gtaaggggga tttctgttca tgggggtaat gataccgatg aaacgagaga ggatgctcac gatacgggtt actgatgatg aacatgcccg gttactggaa cgttgtgagg gtaaacaact ggcggtatgg atgcggcggg accagagaaa aatcactcag ggtcaatgcc agcgcttcgt taatacagat gtaggtgttc cacagggtag ccagcagcat cctgcgatgc agatccggaa cataatggtg cagggcgctg acttccgcgt ttccagactt tacgaaacac ggaaaccgaa gaccattcat gttgttgctc aggtcgcaga cgttttgcag cagcagtcgc ttcacgttcg ctcgcgtatc ggtgattcat tctgctaacc agtaaggcaa ccccgccagc ctagccgggt cctcaacgac aggagcacga tcatgcgcac ccgtggccag gacccaacgc tgcccgagat gcgccgcgtg cggctgctgg agatggcgga cgcgatggat atgttctgcc aagggttggt ttgcgcattc acagttctcc gcaagaattg attggctcca attcttggag tggtgaatcc gttagcgagg tgccgccggc ttccattcag gtcgaggtgg cccggctcca tgcaccgcga cgcaacgcgg ggaggcagac aaggtatagg gcggcgccta caatccatgc caacccgttc catgtgctcg ccgaggcggc ataaatcgcc 
gtgacgatca gcggtccagt gatcgaagtt aggctggtaa gagccgcgag cgatccttga agctgtccct gatggtcgtc atctacctgc ctggacagca tggcctgcaa cgcgggcatc ccgatgccgc cggaagcgag aagaatcata atggggaagg ccatccagcc tcgcgtcgcg aacgccagca agacgtagcc cagcgcgtcg gccgccatgc cggcgataat ggcctgcttc tcgccgaaac gtttggtggc gggaccagtg acgaaggctt gagcgagggc gtgcaagatt ccgaataccg caagcgacag gccgatcatc gtcgcgctcc agcgaaagcg gtcctcgccg aaaatgaccc agagcgctgc cggcacctgt cctacgagtt gcatgataaa gaagacagtc ataagtgcgg cgacgatagt catgccccgc gcccaccgga aggagctgac tgggttgaag gctctcaagg gcatcggtcg acgctctccc ttatgcgact cctgcattag gaagcagccc agtagtaggt tgaggccgtt gagcaccgcc gccgcaagga atggtgcatg caaggagatg gcgcccaaca gtcccccggc cacggggcct gccaccatac ccacgccgaa acaagcgctc atgagcccga agtggcgagc ccgatcttcc ccatcggtga tgtcggcgat ataggcgcca gcaaccgcac ctgtggcgcc ggtgatgccg gccacgatgc gtccggcgta gaggatcgag atctcgatcc cgcgaaat
Details for BupsE.19908.a.B2.PW38478
PURIFICATION DATE: 5/21/2018
CONCENTRATION: 15.79 mg/mL
OBSERVED MW: 54 kDa
EXPRESSION LEVEL: Low Expression
PROTEIN PURIFICATION BUFFER: 25 mM HEPES pH 7, 500 mM NaCl, 5% glycerol, 2 mM DTT, 0.025% azide
EXPRESSION HOST: BL21(DE3) Rosetta
VIAL COUNT (approx.): 2
VIAL VOLUME: 100 µL
PERCENT IDENTITY: 98
PERCENT COVERAGE: 98
Protocol Notes
notes unavailable
Validated AA Sequence
MAHHHHHHMD DLTPAAQRWA MPGTELPLGP QGLAQSVSTQ TLAAGVAYYQ IKRGAASAAD FWTVNLGFYA TQAAAQADAA NLAAAGFATR VDASAGTDLQ GKVLGYWLSA GRYATQAEAT AAAARIAQAT QNRYKPGTRH TSLAGAPTTG PWIVNVLAID PSRAGAALSL ALPGGNDLGA GGETVSAARA RVNALAGVNG GFFTNINPFG APLPPRSPVG ATVVDGRLVA AAIGRRPGLL LARXANGRQR ATXVRNLATA ITXTXAQGRA IAVQTXXXPX XXTVVNCGAQ ARTPTSEPAQ DTVCTNYDDL VMYDSLYLRG GASNTLVDAG YQGARYELVV DANGAVVAGH ATLGAPPPPN GYVLQGLGAS AAWLQAHATP GTRLAVSRRL SADGADLALA SGTSLVEAGP TLSVPNLAQS AAQEGFAPTV GGVDAGEGAA ANGNWYNGWY VARNGRTAAG VAADGTILLV EIDGRQPTLS VGTSIPETAA VMAWLGATSA VNLDGGGSSN MVVGGKMVGH PSDAVGERGV G
Validated NT Sequence
tcgccgacgc cccgctcgcc cacggcgtcg gacggatgtc cgaccatctt gccgccgacc accatgttgc tcgagccgcc gccgtcgaga ttgacggccg acgtcgcacc gagccatgcc atcaccgccg ccgtctccgg aatgctcgtg ccgacgctca acgtgggctg ccggccgtcg atctcgacga gcaggatcgt gccgtccgcc gcgacgcccg ccgcggtgcg cccattgcgc gcgacatacc agccgttgta ccagttgccg ttcgccgcgg cgccttcgcc cgcgtcgacg ccgcccaccg tcggcgcgaa gccctcttgc gcggcgcttt gcgcgagatt cggcacggac agcgtcggcc ccgcctcgac gagcgacgtg cccgacgcga gcgcgagatc cgcgccgtcg gccgacagcc ggcgcgatac cgcgaggcgc gtgcccggcg tcgcatgcgc ctgcagccac gcggcgctcg cgccgagccc ctgcagcacg tagccgttcg gcggcggcgg cgcgccgagc gtcgcatggc cggcgacgac ggcgccgttc gcgtcgacca cgagttcgta tcgcgcgccc tggtagccgg cgtcgacgag cgtgttcgac gcaccgccgc gcagatatag cgagtcgtac atcacgaggt catcgtagtt cgtgcacacc gtgtcctgcg ccggctcgct cgtcggcgtg cgcgcctgcg cgccgcaatt gacgaccgtn nnnnnnntcg gcnnnntcnn ngtctggacc gcgatcgcac ggccttgcgc gtnngtnngc gtgatcgccg tcgcgagatt gcgcacganc gtcgcgcgtt ggcggccgtt cgcgncgcgc gcgagcagca ggccggggcg cctgccgatc gcggccgcga cgagccgccc gtcgactacc gtcgcgccga cgggcgagcg cggcggcagc ggcgcgccga acggattgat gttcgtgaaa aagccgccgt tgacgccggc gagcgcgttc acacgcgccc gcgcggccga aaccgtctcg ccgcccgcac cgagatcgtt gccgcccggc agcgcgagcg acagcgccgc gccggcgcgc gacgggtcga tcgcgagcac gttgacgatc cacggccccg tcgtcggcgc gccggcgagc gacgtatgcc gcgtgcccgg cttgtagcgg ttctgcgtgg cctgcgcgat gcgcgcggcg gccgccgtcg cctcggcctg cgtcgcgtag cggccggccg acagccagta gccgagcacc ttgccctgca ggtcggtgcc cgccgacgcg tcgacgcgcg tcgcgaagcc ggccgccgcg agattcgccg catcggcctg cgccgcggcc tgcgtcgcgt agaagccgag gttgacggtc cagaaatcgg ccgcgctcgc cgcgccgcgc ttgatctggt aataggcgac gcctgcggcg agcgtctgcg tcgacacgct ctgcgcgagg ccctgcggcc cgagcggcag ttcggtgccg ggcatcgccc agcgctgcgc ggcgggcgtc agatcgtcca tatggtggtg gtggtggtga gccat
Expressed Protein Sequence
MAHHHHHHDD LTPAAQRWAM PGTELPLGPQ GLAQSVSTQT LAAGVAYYQI KRGAASAADF WTVNLGFYAT QAAAQADAAN LAAAGFATRV DASAGTDLQG KVLGYWLSAG RYATQAEATA AAARIAQATQ NRYKPGTRHT SLAGAPTTGP WIVNVLAIDP SRAGAALSLA LPGGNDLGAG GETVSAARAR VNALAGVNGG FFTNINPFGA PLPPRSPVGA TVVDGRLVAA AIGRRPGLLL ARDANGRQRA TVVRNLATAI TLTDAQGRAI AVQTLNRPIL GTVVNCGAQA RTPTSEPAQD TVCTNYDDLV MYDSLYLRGG ASNTLVDAGY QGARYELVVD ANGAVVAGHA TLGAPPPPNG YVLQGLGASA AWLQAHATPG TRLAVSRRLS ADGADLALAS GTSLVEAGPT LSVPNLAQSA AQEGFAPTVG GVDAGEGAAA NGNWYNGWYV ARNGRTAAGV AADGTILLVE IDGRQPTLSV GTSIPETAAV MAWLGATSAV NLDGGGSSNM VVGGKMVGHP SDAVGERGVG DTLMLLPG
Full NT Sequence (Expression Vector + Insert)
TAATACGACT CACTATAGGG AGACCACAAC GGTTTCCCTC TAGAAATAAT TTTGTTTAAC TTTAAGAAGG AGATATACCA TGGCTCACCA CCACCACCAC CATGACGATC TGACGCCCGC CGCGCAGCGC TGGGCGATGC CCGGCACCGA ACTGCCGCTC GGGCCGCAGG GCCTCGCGCA GAGCGTGTCG ACGCAGACGC TCGCCGCAGG CGTCGCCTAT TACCAGATCA AGCGCGGCGC GGCGAGCGCG GCCGATTTCT GGACCGTCAA CCTCGGCTTC TACGCGACGC AGGCCGCGGC GCAGGCCGAT GCGGCGAATC TCGCGGCGGC CGGCTTCGCG ACGCGCGTCG ACGCGTCGGC GGGCACCGAC CTGCAGGGCA AGGTGCTCGG CTACTGGCTG TCGGCCGGCC GCTACGCGAC GCAGGCCGAG GCGACGGCGG CCGCCGCGCG CATCGCGCAG GCCACGCAGA ACCGCTACAA GCCGGGCACG CGGCATACGT CGCTCGCCGG CGCGCCGACG ACGGGGCCGT GGATCGTCAA CGTGCTCGCG ATCGACCCGT CGCGCGCCGG CGCGGCGCTG TCGCTCGCGC TGCCGGGCGG CAACGATCTC GGTGCGGGCG GCGAGACGGT TTCGGCCGCG CGGGCGCGTG TGAACGCGCT CGCCGGCGTC AACGGCGGCT TTTTCACGAA CATCAATCCG TTCGGCGCGC CGCTGCCGCC GCGCTCGCCC GTCGGCGCGA CGGTAGTCGA CGGGCGGCTC GTCGCGGCCG CGATCGGCAG GCGCCCCGGC CTGCTGCTCG CGCGCGACGC GAACGGCCGC CAACGCGCGA CGGTCGTGCG CAATCTCGCG ACGGCGATCA CGCTGACCGA CGCGCAAGGC CGTGCGATCG CGGTCCAGAC GCTGAACCGG CCGATCCTCG GTACGGTCGT CAATTGCGGC GCGCAGGCGC GCACGCCGAC GAGCGAGCCG GCGCAGGACA CGGTGTGCAC GAACTACGAT GACCTCGTGA TGTACGACTC GCTATATCTG CGCGGCGGTG CGTCGAACAC GCTCGTCGAC GCCGGCTACC AGGGCGCGCG ATACGAACTC GTGGTCGACG CGAACGGCGC CGTCGTCGCC GGCCATGCGA CGCTCGGCGC GCCGCCGCCG CCGAACGGCT ACGTGCTGCA GGGGCTCGGC GCGAGCGCCG CGTGGCTGCA GGCGCATGCG ACGCCGGGCA CGCGCCTCGC GGTATCGCGC CGGCTGTCGG CCGACGGCGC GGATCTCGCG CTCGCGTCGG GCACGTCGCT CGTCGAGGCG GGGCCGACGC TGTCCGTGCC GAATCTCGCG CAAAGCGCCG CGCAAGAGGG CTTCGCGCCG ACGGTGGGCG GCGTCGACGC GGGCGAAGGC GCCGCGGCGA ACGGCAACTG GTACAACGGC TGGTATGTCG CGCGCAATGG GCGCACCGCG GCGGGCGTCG CGGCGGACGG CACGATCCTG CTCGTCGAGA TCGACGGCCG GCAGCCCACG TTGAGCGTCG GCACGAGCAT TCCGGAGACG GCGGCGGTGA TGGCATGGCT CGGTGCGACG TCGGCCGTCA ATCTCGACGG CGGCGGCTCG AGCAACATGG TGGTCGGCGG CAAGATGGTC GGACATCCGT CCGACGCCGT GGGCGAGCGG GGCGTCGGCG ATACGCTGAT GCTGCTGCCG GGCTGAGTAA GATAGGATCC GGCTGCTAAC AAAGCCCGAA AGGAAGCTGA GTTGGCTGCT GCCACCGCTG AGCAATAACT AGCATAACCC CTTGGGGCCT CTAAACGGGT CTTGAGGGGT 
TTTTTGCTGA AAGGAGGAAC TATATCCGGA TATCCACAGG ACGGGTGTGG TCGCCATGAT CGCGTAGTCG ATAGTGGCTC CAAGTAGCGA AGCGAGCAGG ACTGGGCGGC GGCCAAAGCG GTCGGACAGT GCTCCGAGAA CGGGTGCGCA TAGAAATTGC ATCAACGCAT ATAGCGCTAG CAGCACGCCA TAGTGACTGG CGATGCTGTC GGAATGGACG ATATCCCGCA AGAGGCCCGG CAGTACCGGC ATAACCAAGC CTATGCCTAC AGCATCCAGG GTGACGGTGC CGAGGATGAC GATGAGCGCA TTGTTAGATT TCATACACGG TGCCTGACTG CGTTAGCAAT TTAACTGTGA TAAACTACCG CATTAAAGCT TATCGATGAT AAGCTGTCAA ACATGAGAAT TCTTGAAGAC GAAAGGGCCT CGTGATACGC CTATTTTTAT AGGTTAATGT CATGATAATA ATGGTTTCTT AGACGTCAGG TGGCACTTTT CGGGGAAATG TGCGCGGAAC CCCTATTTGT TTATTTTTCT AAATACATTC AAATATGTAT CCGCTCATGA GACAATAACC CTGATAAATG CTTCAATAAT ATTGAAAAAG GAAGAGTATG AGTATTCAAC ATTTCCGTGT CGCCCTTATT CCCTTTTTTG CGGCATTTTG CCTTCCTGTT TTTGCTCACC CAGAAACGCT GGTGAAAGTA AAAGATGCTG AAGATCAGTT GGGTGCACGA GTGGGTTACA TCGAACTGGA TCTCAACAGC GGTAAGATCC TTGAGAGTTT TCGCCCCGAA GAACGTTTTC CAATGATGAG CACTTTTAAA GTTCTGCTAT GTGGCGCGGT ATTATCCCGT GTTGACGCCG GGCAAGAGCA ACTCGGTCGC CGCATACACT ATTCTCAGAA TGACTTGGTT GAGTACTCAC CAGTCACAGA AAAGCATCTT ACGGATGGCA TGACAGTAAG AGAATTATGC AGTGCTGCCA TAACCATGAG TGATAACACT GCGGCCAACT TACTTCTGAC AACGATCGGA GGACCGAAGG AGCTAACCGC TTTTTTGCAC AACATGGGGG ATCATGTAAC TCGCCTTGAT CGTTGGGAAC CGGAGCTGAA TGAAGCCATA CCAAACGACG AGCGTGACAC CACGATGCCT GCAGCAATGG CAACAACGTT GCGCAAACTA TTAACTGGCG AACTACTTAC TCTAGCTTCC CGGCAACAAT TAATAGACTG GATGGAGGCG GATAAAGTTG CAGGACCACT TCTGCGCTCG GCCCTTCCGG CTGGCTGGTT TATTGCTGAT AAATCTGGAG CCGGTGAGCG TGGGTCTCGC GGTATCATTG CAGCACTGGG GCCAGATGGT AAGCCCTCCC GTATCGTAGT TATCTACACG ACGGGGAGTC AGGCAACTAT GGATGAACGA AATAGACAGA TCGCTGAGAT AGGTGCCTCA CTGATTAAGC ATTGGTAACT GTCAGACCAA GTTTACTCAT ATATACTTTA GATTGATTTA AAACTTCATT TTTAATTTAA AAGGATCTAG GTGAAGATCC TTTTTGATAA TCTCATGACC AAAATCCCTT AACGTGAGTT TTCGTTCCAC TGAGCGTCAG ACCCCGTAGA AAAGATCAAA GGATCTTCTT GAGATCCTTT TTTTCTGCGC GTAATCTGCT GCTTGCAAAC AAAAAAACCA CCGCTACCAG CGGTGGTTTG TTTGCCGGAT CAAGAGCTAC CAACTCTTTT TCCGAAGGTA ACTGGCTTCA GCAGAGCGCA GATACCAAAT ACTGTCCTTC TAGTGTAGCC 
GTAGTTAGGC CACCACTTCA AGAACTCTGT AGCACCGCCT ACATACCTCG CTCTGCTAAT CCTGTTACCA GTGGCTGCTG CCAGTGGCGA TAAGTCGTGT CTTACCGGGT TGGACTCAAG ACGATAGTTA CCGGATAAGG CGCAGCGGTC GGGCTGAACG GGGGGTTCGT GCACACAGCC CAGCTTGGAG CGAACGACCT ACACCGAACT GAGATACCTA CAGCGTGAGC TATGAGAAAG CGCCACGCTT CCCGAAGGGA GAAAGGCGGA CAGGTATCCG GTAAGCGGCA GGGTCGGAAC AGGAGAGCGC ACGAGGGAGC TTCCAGGGGG AAACGCCTGG TATCTTTATA GTCCTGTCGG GTTTCGCCAC CTCTGACTTG AGCGTCGATT TTTGTGATGC TCGTCAGGGG GGCGGAGCCT ATGGAAAAAC GCCAGCAACG CGGCCTTTTT ACGGTTCCTG GCCTTTTGCT GGCCTTTTGC TCACATGTTC TTTCCTGCGT TATCCCCTGA TTCTGTGGAT AACCGTATTA CCGCCTTTGA GTGAGCTGAT ACCGCTCGCC GCAGCCGAAC GACCGAGCGC AGCGAGTCAG TGAGCGAGGA AGCGGAAGAG CGCCTGATGC GGTATTTTCT CCTTACGCAT CTGTGCGGTA TTTCACACCG CATATATGGT GCACTCTCAG TACAATCTGC TCTGATGCCG CATAGTTAAG CCAGTATACA CTCCGCTATC GCTACGTGAC TGGGTCATGG CTGCGCCCCG ACACCCGCCA ACACCCGCTG ACGCGCCCTG ACGGGCTTGT CTGCTCCCGG CATCCGCTTA CAGACAAGCT GTGACCGTCT CCGGGAGCTG CATGTGTCAG AGGTTTTCAC CGTCATCACC GAAACGCGCG AGGCAGCTGC GGTAAAGCTC ATCAGCGTGG TCGTGAAGCG ATTCACAGAT GTCTGCCTGT TCATCCGCGT CCAGCTCGTT GAGTTTCTCC AGAAGCGTTA ATGTCTGGCT TCTGATAAAG CGGGCCATGT TAAGGGCGGT TTTTTCCTGT TTGGTCACTG ATGCCTCCGT GTAAGGGGGA TTTCTGTTCA TGGGGGTAAT GATACCGATG AAACGAGAGA GGATGCTCAC GATACGGGTT ACTGATGATG AACATGCCCG GTTACTGGAA CGTTGTGAGG GTAAACAACT GGCGGTATGG ATGCGGCGGG ACCAGAGAAA AATCACTCAG GGTCAATGCC AGCGCTTCGT TAATACAGAT GTAGGTGTTC CACAGGGTAG CCAGCAGCAT CCTGCGATGC AGATCCGGAA CATAATGGTG CAGGGCGCTG ACTTCCGCGT TTCCAGACTT TACGAAACAC GGAAACCGAA GACCATTCAT GTTGTTGCTC AGGTCGCAGA CGTTTTGCAG CAGCAGTCGC TTCACGTTCG CTCGCGTATC GGTGATTCAT TCTGCTAACC AGTAAGGCAA CCCCGCCAGC CTAGCCGGGT CCTCAACGAC AGGAGCACGA TCATGCGCAC CCGTGGCCAG GACCCAACGC TGCCCGAGAT GCGCCGCGTG CGGCTGCTGG AGATGGCGGA CGCGATGGAT ATGTTCTGCC AAGGGTTGGT TTGCGCATTC ACAGTTCTCC GCAAGAATTG ATTGGCTCCA ATTCTTGGAG TGGTGAATCC GTTAGCGAGG TGCCGCCGGC TTCCATTCAG GTCGAGGTGG CCCGGCTCCA TGCACCGCGA CGCAACGCGG GGAGGCAGAC AAGGTATAGG GCGGCGCCTA CAATCCATGC CAACCCGTTC CATGTGCTCG CCGAGGCGGC ATAAATCGCC 
GTGACGATCA GCGGTCCAGT GATCGAAGTT AGGCTGGTAA GAGCCGCGAG CGATCCTTGA AGCTGTCCCT GATGGTCGTC ATCTACCTGC CTGGACAGCA TGGCCTGCAA CGCGGGCATC CCGATGCCGC CGGAAGCGAG AAGAATCATA ATGGGGAAGG CCATCCAGCC TCGCGTCGCG AACGCCAGCA AGACGTAGCC CAGCGCGTCG GCCGCCATGC CGGCGATAAT GGCCTGCTTC TCGCCGAAAC GTTTGGTGGC GGGACCAGTG ACGAAGGCTT GAGCGAGGGC GTGCAAGATT CCGAATACCG CAAGCGACAG GCCGATCATC GTCGCGCTCC AGCGAAAGCG GTCCTCGCCG AAAATGACCC AGAGCGCTGC CGGCACCTGT CCTACGAGTT GCATGATAAA GAAGACAGTC ATAAGTGCGG CGACGATAGT CATGCCCCGC GCCCACCGGA AGGAGCTGAC TGGGTTGAAG GCTCTCAAGG GCATCGGTCG ACGCTCTCCC TTATGCGACT CCTGCATTAG GAAGCAGCCC AGTAGTAGGT TGAGGCCGTT GAGCACCGCC GCCGCAAGGA ATGGTGCATG CAAGGAGATG GCGCCCAACA GTCCCCCGGC CACGGGGCCT GCCACCATAC CCACGCCGAA ACAAGCGCTC ATGAGCCCGA AGTGGCGAGC CCGATCTTCC CCATCGGTGA TGTCGGCGAT ATAGGCGCCA GCAACCGCAC CTGTGGCGCC GGTGATGCCG GCCACGATGC GTCCGGCGTA GAGGATCGAG ATCTCGATCC CGCGAAAT