ButhA.00020.t

Aldehyde dehydrogenase

CENTER ID: ButhA.00020.t
ORGANISM: Burkholderia thailandensis E264
ASSOCIATED DISEASE:
CURRENT STATUS: in PDB
COMMUNITY REQUEST: True
NIH RISK GROUP: 3
SELECT AGENT: False
NIH PRIORITY PATHOGENS CATEGORY: IIIB

Ordering Clones & Proteins

If materials are available for this target, they are listed below. Materials can be ordered from SSGCID using the button in the "order material" column, which adds the material to a virtual cart. You may order multiple materials at a time at no cost to you, as this contract is funded by NIAID. When you are ready to place your order, click the "Place Order" link, which appears in the top right corner of the page once your cart contains an item.

Clones*

CENTER REFERENCE ID        DOMAIN/REGION DESCRIPTION     AA START  AA STOP
ButhA.00020.t.B1.GE39398   Full length (ButhA.00020.t)   1         477
* SSGCID clones represent un-induced expression constructs which have been verified by sequencing from vector primers. Clones may contain a tag, such as N-term 6xHis. Get sequence information using the button in the "info" column.

Proteins

CENTER REFERENCE ID        DOMAIN/REGION DESCRIPTION     AA START  AA STOP
ButhA.00020.t.B1.PW37792   Full length (ButhA.00020.t)   1         477
Structures
5J6B
DEPOSITED: 4/4/2016
DETERMINATION: XRay
CLONE: ButhA.00020.t.B1.GE39398
PROTEIN: ButhA.00020.t.B1.PW37792
External Resources
RESOURCE REFERENCE ID
OrthoMCL: OG5_126638
PATRIC ID: fig|271848.6.peg.569
RefSeq: YP_438698.1
UniProt: Q2T801
Sequences
These sequences are the native gene sequence; sequences of constructs derived from these sequences may differ due to codon optimization or other protocols. To find the specific sequence of any material you may have ordered, click on the "info" button next to the name of that material.
AA Sequence
MLKETYPYYL ANEAVYANAE LEVTDKYTGK VATRVALADA SAIDAAIAAA VGAQKPLRAL PAFRRQAILE HCVARFRERF DELAQALCIE AGKPINDSKG EVTRLIDTFR VAAEESVRIE GGLVNLEISP RAQGYSGYYK RVPIGPCSFI SPFNFPLNLA AHKVAPALAA GCPFVLKPAS RTPIGALIIG EVLAETDLPK GAFSILPAHR DGADLFTTDE RFKLLSFTGS PTVGWELKKK AGKKKVVLEL GGNAAAIVDA DQREVLDYVV ERLAFGAYYQ SGQSCIGVQR IIAHADVYDA LREKLIAKTR SLKMGDPKDP ATFVGPMISE SEARRLAGWM EAAVAAGAKI VAGGKVDGAM FEATLLEGVG RDQDLYRKEA FGPVALLERF SDFDDALARV NDSDFGLQAG VFTDSLSHAQ RAWDELEVGG VVINDVPSFR VDNMPYGGVK DSGLGREGIR YAIEDMTELR LMVVRRR
NT Sequence
ATGCTGAAGG AAACCTATCC GTATTACCTC GCGAACGAAG CGGTGTACGC GAACGCCGAG CTCGAAGTCA CCGACAAATA CACCGGCAAG GTCGCGACGC GCGTCGCGCT GGCCGACGCG AGCGCGATCG ACGCGGCGAT CGCGGCGGCC GTCGGCGCGC AGAAGCCGTT GCGCGCATTG CCCGCGTTCA GGCGGCAGGC GATCCTCGAA CACTGCGTCG CGCGCTTTCG CGAGCGCTTC GACGAGCTCG CGCAGGCGCT GTGCATCGAG GCGGGCAAGC CGATCAACGA TTCGAAGGGC GAGGTGACGC GCCTCATCGA TACGTTTCGC GTCGCGGCCG AGGAATCGGT GCGCATCGAA GGCGGTCTCG TCAATCTCGA AATCTCGCCG CGCGCGCAGG GCTACAGCGG CTACTACAAG CGCGTGCCGA TCGGCCCGTG CTCGTTCATC TCGCCGTTCA ATTTTCCGCT GAACCTCGCC GCGCACAAGG TCGCGCCCGC GCTCGCCGCC GGCTGCCCGT TCGTGCTGAA GCCCGCGAGC CGCACGCCGA TCGGCGCGCT GATCATCGGC GAGGTGCTCG CGGAAACCGA CTTGCCGAAG GGCGCGTTCT CGATCCTGCC CGCGCATCGC GACGGCGCGG ATCTGTTCAC GACCGACGAG CGCTTCAAGC TGCTGTCGTT CACGGGCTCG CCCACCGTCG GCTGGGAACT GAAGAAGAAG GCGGGCAAGA AGAAGGTCGT GCTCGAGCTG GGCGGCAATG CGGCGGCGAT CGTCGATGCC GATCAGCGCG AGGTGCTCGA CTACGTCGTC GAGCGGCTCG CGTTCGGCGC GTACTACCAG TCGGGGCAGA GCTGCATCGG CGTGCAGCGG ATCATCGCGC ATGCGGACGT CTATGACGCG CTGCGCGAGA AGCTGATCGC GAAGACGCGC TCGCTGAAGA TGGGCGATCC GAAGGACCCG GCGACGTTCG TCGGCCCGAT GATCTCCGAA TCCGAAGCGC GGCGGCTCGC CGGCTGGATG GAGGCGGCGG TGGCGGCGGG CGCGAAGATC GTCGCGGGCG GCAAAGTCGA CGGTGCGATG TTCGAGGCGA CGCTGCTCGA AGGCGTGGGC CGCGATCAGG ATCTGTATCG CAAGGAGGCG TTCGGCCCGG TCGCGCTGCT CGAGCGGTTC TCCGATTTCG ACGACGCGCT CGCGCGCGTG AACGACAGCG ATTTCGGCCT GCAGGCGGGC GTGTTCACCG ATTCGCTGTC GCATGCGCAG CGCGCATGGG ACGAGCTCGA AGTGGGCGGC GTCGTGATCA ACGATGTGCC GTCGTTTCGC GTCGACAACA TGCCGTACGG CGGCGTGAAG GACTCGGGGC TCGGCCGCGA AGGGATTCGC TATGCGATCG AGGATATGAC CGAACTGCGC CTGATGGTCG TGCGCCGGCG C
Details for ButhA.00020.t.B1.GE39398
HARVESTED ON: 8/5/2015
SEQUENCED ON: 8/13/2015
EXPECTED MW: 53kDa
OBSERVED MW: 53kDa
ANTIBIOTIC MARKER: ampicillin
COUNT OF EXPRESSION COLONIES: Too many to count (100+)
TOTAL EXPRESSION LEVEL: High Expression
SOLUBLE EXPRESSION LEVEL: High Expression
EXPRESSION HOST: data unavailable
SEQUENCING RESULT:
PERCENT IDENTITY: 100
PERCENT COVERAGE: 98
Validated AA Sequence
MAHHHHHHML KETYPYYLAN EAVYANAELE VTDKYTGKVA TRVALADASA IDAAIAAAVG AQKPLRALPA FRRQAILEHC VARFRERFDE LAQALCIEAG KPINDSKGEV TRLIDTFRVA AEESVRIEGG LVNLEISPRA QGYSGYYKRV PIGPCSFISP FNFPLNLAAH KVAPALAAGC PFVLKPASRT PIGALIIGEV LAETDLPKGA FSILPAHRDG ADLFTTDERF KLLSFTGSPT VGWELKKKAG KKKVVLELGG NAAAIVDADQ REVLDYVVER LAFGAYYQSG QSCIGVQRII AHADVYDALR EKLIAKTRSL KMGDPKDPAT FVGPMISESE ARRLAGWMEA AVAAGAKIVA GGKVDGAMFE ATLLEGVGRD QDLYRKEAFG PVALLERFSD FDDALARVND SDFGLQAGVF TDSLSHAQRA WDELEVGGVV INDVPSFRVD NMPYGGVKDS GLGREGIRYA IEDMTE
Validated NT Sequence
agttcggtca tatcctcgat cgcatagcga atcccttcgc ggccgagccc cgagtccttc acgccgccgt acggcatgtt gtcgacgcga aacgacggca catcgttgat cacgacgccg cccacttcga gctcgtccca tgcgcgctgc gcatgcgaca gcgaatcggt gaacacgccc gcctgcaggc cgaaatcgct gtcgttcacg cgcgcgagcg cgtcgtcgaa atcggagaac cgctcgagca gcgcgaccgg gccgaacgcc tccttgcgat acagatcctg atcgcggccc acgccttcga gcagcgtcgc ctcgaacatc gcaccgtcga ctttgccgcc cgcgacgatc ttcgcgcccg ccgccaccgc cgcctccatc cagccggcga gccgccgcgc ttcggattcg gagatcatcg ggccgacgaa cgtcgccggg tccttcggat cgcccatctt cagcgagcgc gtcttcgcga tcagcttctc gcgcagcgcg tcatagacgt ccgcatgcgc gatgatccgc tgcacgccga tgcagctctg ccccgactgg tagtacgcgc cgaacgcgag ccgctcgacg acgtagtcga gcacctcgcg ctgatcggca tcgacgatcg ccgccgcatt gccgcccagc tcgagcacga ccttcttctt gcccgccttc ttcttcagtt cccagccgac ggtgggcgag cccgtgaacg acagcagctt gaagcgctcg tcggtcgtga acagatccgc gccgtcgcga tgcgcgggca ggatcgagaa cgcgcccttc ggcaagtcgg tttccgcgag cacctcgccg atgatcagcg cgccgatcgg cgtgcggctc gcgggcttca gcacgaacgg gcagccggcg gcgagcgcgg gcgcgacctt gtgcgcggcg aggttcagcg gaaaattgaa cggcgagatg aacgagcacg ggccgatcgg cacgcgcttg tagtagccgc tgtagccctg cgcgcgcggc gagatttcga gattgacgag accgccttcg atgcgcaccg attcctcggc cgcgacgcga aacgtatcga tgaggcgcgt cacctcgccc ttcgaatcgt tgatcggctt gcccgcctcg atgcacagcg cctgcgcgag ctcgtcgaag cgctcgcgaa agcgcgcgac gcagtgttcg aggatcgcct gccgcctgaa cgcgggcaat gcgcgcaacg gcttctgcgc gccgacggcc gccgcgatcg ccgcgtcgat cgcgctcgcg tcggccagcg cgacgcgcgt cgcgaccttg ccggtgtatt tgtcggtgac ttcgagctcg gcgttcgcgt acaccgcttc gttcgcgagg taatacggat aggtttcctt cagcatatgg tggtggtggt ggtgagccat
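The validated NT sequence above appears to be reported on the strand opposite to the insert: its final 27 bases, catatggtggtggtggtggtgagccat, are the reverse complement of the atggctcaccaccaccaccaccatatg that opens the coding region in the Full NT Sequence listed further down this record. A minimal Python sketch of that check follows. The function names are illustrative and not part of any SSGCID tooling, and the translation assumes the standard genetic code.

```python
from itertools import product

# Complement map for both upper- and lower-case bases.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Reverse-complement a sequence, ignoring the spaces that group bases in tens."""
    return "".join(seq.split()).translate(_COMPLEMENT)[::-1]


# Standard genetic code (NCBI translation table 1), codons enumerated in TCAG order.
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
_CODONS = {"".join(c): aa for c, aa in zip(product("TCAG", repeat=3), _AMINO)}


def translate(seq: str) -> str:
    """Translate frame 1 of a DNA sequence; a trailing partial codon is dropped."""
    dna = "".join(seq.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(_CODONS[dna[i:i + 3]] for i in range(0, usable, 3))


# Last 27 bases of the validated NT sequence, copied from this record.
tail = "catatgg tggtggtggt ggtgagccat"
sense = reverse_complement(tail)
print(sense)             # atggctcaccaccaccaccaccatatg
print(translate(sense))  # MAHHHHHHM
```

The translated fragment matches the first nine residues of the validated AA sequence. The lookup raises KeyError on ambiguous bases such as N, which is acceptable for a quick check but would need handling for raw reads.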
Expected Protein Sequence
MAHHHHHHML KETYPYYLAN EAVYANAELE VTDKYTGKVA TRVALADASA IDAAIAAAVG AQKPLRALPA FRRQAILEHC VARFRERFDE LAQALCIEAG KPINDSKGEV TRLIDTFRVA AEESVRIEGG LVNLEISPRA QGYSGYYKRV PIGPCSFISP FNFPLNLAAH KVAPALAAGC PFVLKPASRT PIGALIIGEV LAETDLPKGA FSILPAHRDG ADLFTTDERF KLLSFTGSPT VGWELKKKAG KKKVVLELGG NAAAIVDADQ REVLDYVVER LAFGAYYQSG QSCIGVQRII AHADVYDALR EKLIAKTRSL KMGDPKDPAT FVGPMISESE ARRLAGWMEA AVAAGAKIVA GGKVDGAMFE ATLLEGVGRD QDLYRKEAFG PVALLERFSD FDDALARVND SDFGLQAGVF TDSLSHAQRA WDELEVGGVV INDVPSFRVD NMPYGGVKDS GLGREGIRYA IEDMTELRLM VVRRR
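The 98 percent coverage reported above is consistent with the two protein sequences as listed: the validated sequence stops at ...IEDMTE, nine residues short of the expected C-terminus ...IEDMTELRLMVVRRR, which gives 476 validated residues against 485 expected. The sketch below assumes that coverage is simply validated length divided by expected length. The page does not define the metric, so that formula and the function names are assumptions made for illustration.

```python
def residues(grouped: str) -> str:
    """Strip the spaces that group residues in tens and normalise case."""
    return "".join(grouped.split()).upper()


def percent_coverage(validated: str, expected: str) -> float:
    """Assumed definition: validated length as a percentage of expected length."""
    v, e = residues(validated), residues(expected)
    if not e:
        raise ValueError("expected sequence is empty")
    return 100.0 * len(v) / len(e)


def missing_c_terminus(validated: str, expected: str) -> str:
    """Return the expected residues that follow the end of the validated sequence."""
    v, e = residues(validated), residues(expected)
    if not e.startswith(v):
        raise ValueError("validated sequence is not a prefix of the expected one")
    return e[len(v):]


# C-terminal ends of the two sequences, copied from this record.
print(missing_c_terminus("IEDMTE", "IEDMTELRLM VVRRR"))  # LRLMVVRRR

# Residue counts for the full validated and expected sequences in this record.
print(round(100.0 * 476 / 485, 1))  # 98.1
```

Passing the complete validated and expected sequences to percent_coverage should reproduce the same figure, provided the spaces are the only non-residue characters.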
Full NT Sequence (Expression Vector + Insert)
taatacgact cactataggg agaccacaac ggtttccctc tagaaataat tttgtttaac tttaagaagg agatatacca tggctcacca ccaccaccac catatgctga aggaaaccta tccgtattac ctcgcgaacg aagcggtgta cgcgaacgcc gagctcgaag tcaccgacaa atacaccggc aaggtcgcga cgcgcgtcgc gctggccgac gcgagcgcga tcgacgcggc gatcgcggcg gccgtcggcg cgcagaagcc gttgcgcgca ttgcccgcgt tcaggcggca ggcgatcctc gaacactgcg tcgcgcgctt tcgcgagcgc ttcgacgagc tcgcgcaggc gctgtgcatc gaggcgggca agccgatcaa cgattcgaag ggcgaggtga cgcgcctcat cgatacgttt cgcgtcgcgg ccgaggaatc ggtgcgcatc gaaggcggtc tcgtcaatct cgaaatctcg ccgcgcgcgc agggctacag cggctactac aagcgcgtgc cgatcggccc gtgctcgttc atctcgccgt tcaattttcc gctgaacctc gccgcgcaca aggtcgcgcc cgcgctcgcc gccggctgcc cgttcgtgct gaagcccgcg agccgcacgc cgatcggcgc gctgatcatc ggcgaggtgc tcgcggaaac cgacttgccg aagggcgcgt tctcgatcct gcccgcgcat cgcgacggcg cggatctgtt cacgaccgac gagcgcttca agctgctgtc gttcacgggc tcgcccaccg tcggctggga actgaagaag aaggcgggca agaagaaggt cgtgctcgag ctgggcggca atgcggcggc gatcgtcgat gccgatcagc gcgaggtgct cgactacgtc gtcgagcggc tcgcgttcgg cgcgtactac cagtcggggc agagctgcat cggcgtgcag cggatcatcg cgcatgcgga cgtctatgac gcgctgcgcg agaagctgat cgcgaagacg cgctcgctga agatgggcga tccgaaggac ccggcgacgt tcgtcggccc gatgatctcc gaatccgaag cgcggcggct cgccggctgg atggaggcgg cggtggcggc gggcgcgaag atcgtcgcgg gcggcaaagt cgacggtgcg atgttcgagg cgacgctgct cgaaggcgtg ggccgcgatc aggatctgta tcgcaaggag gcgttcggcc cggtcgcgct gctcgagcgg ttctccgatt tcgacgacgc gctcgcgcgc gtgaacgaca gcgatttcgg cctgcaggcg ggcgtgttca ccgattcgct gtcgcatgcg cagcgcgcat gggacgagct cgaagtgggc ggcgtcgtga tcaacgatgt gccgtcgttt cgcgtcgaca acatgccgta cggcggcgtg aaggactcgg ggctcggccg cgaagggatt cgctatgcga tcgaggatat gaccgaactg cgcctgatgg tcgtgcgccg gcgctgagta agataggatc cggctgctaa caaagcccga aaggaagctg agttggctgc tgccaccgct gagcaataac tagcataacc ccttggggcc tctaaacggg tcttgagggg ttttttgctg aaaggaggaa ctatatccgg atatccacag gacgggtgtg gtcgccatga tcgcgtagtc gatagtggct ccaagtagcg aagcgagcag gactgggcgg cggccaaagc ggtcggacag tgctccgaga acgggtgcgc atagaaattg 
catcaacgca tatagcgcta gcagcacgcc atagtgactg gcgatgctgt cggaatggac gatatcccgc aagaggcccg gcagtaccgg cataaccaag cctatgccta cagcatccag ggtgacggtg ccgaggatga cgatgagcgc attgttagat ttcatacacg gtgcctgact gcgttagcaa tttaactgtg ataaactacc gcattaaagc ttatcgatga taagctgtca aacatgagaa ttcttgaaga cgaaagggcc tcgtgatacg cctattttta taggttaatg tcatgataat aatggtttct tagacgtcag gtggcacttt tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg tgttgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc tgcagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc actgattaag cattggtaac tgtcagacca agtttactca tatatacttt agattgattt aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt accggataag gcgcagcggt cgggctgaac 
ggggggttcg tgcacacagc ccagcttgga gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc gcatatatgg tgcactctca gtacaatctg ctctgatgcc gcatagttaa gccagtatac actccgctat cgctacgtga ctgggtcatg gctgcgcccc gacacccgcc aacacccgct gacgcgccct gacgggcttg tctgctcccg gcatccgctt acagacaagc tgtgaccgtc tccgggagct gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc gaggcagctg cggtaaagct catcagcgtg gtcgtgaagc gattcacaga tgtctgcctg ttcatccgcg tccagctcgt tgagtttctc cagaagcgtt aatgtctggc ttctgataaa gcgggccatg ttaagggcgg ttttttcctg tttggtcact gatgcctccg tgtaaggggg atttctgttc atgggggtaa tgataccgat gaaacgagag aggatgctca cgatacgggt tactgatgat gaacatgccc ggttactgga acgttgtgag ggtaaacaac tggcggtatg gatgcggcgg gaccagagaa aaatcactca gggtcaatgc cagcgcttcg ttaatacaga tgtaggtgtt ccacagggta gccagcagca tcctgcgatg cagatccgga acataatggt gcagggcgct gacttccgcg tttccagact ttacgaaaca cggaaaccga agaccattca tgttgttgct caggtcgcag acgttttgca gcagcagtcg cttcacgttc gctcgcgtat cggtgattca ttctgctaac cagtaaggca accccgccag cctagccggg tcctcaacga caggagcacg atcatgcgca cccgtggcca ggacccaacg ctgcccgaga tgcgccgcgt gcggctgctg gagatggcgg acgcgatgga tatgttctgc caagggttgg tttgcgcatt cacagttctc cgcaagaatt gattggctcc aattcttgga gtggtgaatc cgttagcgag gtgccgccgg cttccattca ggtcgaggtg gcccggctcc atgcaccgcg acgcaacgcg gggaggcaga caaggtatag ggcggcgcct acaatccatg ccaacccgtt ccatgtgctc gccgaggcgg cataaatcgc cgtgacgatc agcggtccag tgatcgaagt taggctggta agagccgcga gcgatccttg aagctgtccc tgatggtcgt catctacctg cctggacagc atggcctgca acgcgggcat cccgatgccg ccggaagcga gaagaatcat aatggggaag 
gccatccagc ctcgcgtcgc gaacgccagc aagacgtagc ccagcgcgtc ggccgccatg ccggcgataa tggcctgctt ctcgccgaaa cgtttggtgg cgggaccagt gacgaaggct tgagcgaggg cgtgcaagat tccgaatacc gcaagcgaca ggccgatcat cgtcgcgctc cagcgaaagc ggtcctcgcc gaaaatgacc cagagcgctg ccggcacctg tcctacgagt tgcatgataa agaagacagt cataagtgcg gcgacgatag tcatgccccg cgcccaccgg aaggagctga ctgggttgaa ggctctcaag ggcatcggtc gacgctctcc cttatgcgac tcctgcatta ggaagcagcc cagtagtagg ttgaggccgt tgagcaccgc cgccgcaagg aatggtgcat gcaaggagat ggcgcccaac agtcccccgg ccacggggcc tgccaccata cccacgccga aacaagcgct catgagcccg aagtggcgag cccgatcttc cccatcggtg atgtcggcga tataggcgcc agcaaccgca cctgtggcgc cggtgatgcc ggccacgatg cgtccggcgt agaggatcga gatctcgatc ccgcgaaat
Details for ButhA.00020.t.B1.PW37792
PURIFICATION DATE: 9/23/2015
CONCENTRATION: 34mg/ml
OBSERVED MW: 54kDa
EXPRESSION LEVEL: Moderate Expression
PROTEIN PURIFICATION BUFFER: 25 mM HEPES pH 7.0, 500 mM NaCl, 5% Glycerol, 2 mM DTT, and 0.025% Azide
EXPRESSION HOST: data unavailable
VIAL COUNT (approx.): 13
VIAL VOLUME: 200µl
PERCENT IDENTITY: 100
PERCENT COVERAGE: 98
Protocol Notes
notes unavailable
Validated AA Sequence
MAHHHHHHML KETYPYYLAN EAVYANAELE VTDKYTGKVA TRVALADASA IDAAIAAAVG AQKPLRALPA FRRQAILEHC VARFRERFDE LAQALCIEAG KPINDSKGEV TRLIDTFRVA AEESVRIEGG LVNLEISPRA QGYSGYYKRV PIGPCSFISP FNFPLNLAAH KVAPALAAGC PFVLKPASRT PIGALIIGEV LAETDLPKGA FSILPAHRDG ADLFTTDERF KLLSFTGSPT VGWELKKKAG KKKVVLELGG NAAAIVDADQ REVLDYVVER LAFGAYYQSG QSCIGVQRII AHADVYDALR EKLIAKTRSL KMGDPKDPAT FVGPMISESE ARRLAGWMEA AVAAGAKIVA GGKVDGAMFE ATLLEGVGRD QDLYRKEAFG PVALLERFSD FDDALARVND SDFGLQAGVF TDSLSHAQRA WDELEVGGVV INDVPSFRVD NMPYGGVKDS GLGREGIRYA IEDMTE
Validated NT Sequence
agttcggtca tatcctcgat cgcatagcga atcccttcgc ggccgagccc cgagtccttc acgccgccgt acggcatgtt gtcgacgcga aacgacggca catcgttgat cacgacgccg cccacttcga gctcgtccca tgcgcgctgc gcatgcgaca gcgaatcggt gaacacgccc gcctgcaggc cgaaatcgct gtcgttcacg cgcgcgagcg cgtcgtcgaa atcggagaac cgctcgagca gcgcgaccgg gccgaacgcc tccttgcgat acagatcctg atcgcggccc acgccttcga gcagcgtcgc ctcgaacatc gcaccgtcga ctttgccgcc cgcgacgatc ttcgcgcccg ccgccaccgc cgcctccatc cagccggcga gccgccgcgc ttcggattcg gagatcatcg ggccgacgaa cgtcgccggg tccttcggat cgcccatctt cagcgagcgc gtcttcgcga tcagcttctc gcgcagcgcg tcatagacgt ccgcatgcgc gatgatccgc tgcacgccga tgcagctctg ccccgactgg tagtacgcgc cgaacgcgag ccgctcgacg acgtagtcga gcacctcgcg ctgatcggca tcgacgatcg ccgccgcatt gccgcccagc tcgagcacga ccttcttctt gcccgccttc ttcttcagtt cccagccgac ggtgggcgag cccgtgaacg acagcagctt gaagcgctcg tcggtcgtga acagatccgc gccgtcgcga tgcgcgggca ggatcgagaa cgcgcccttc ggcaagtcgg tttccgcgag cacctcgccg atgatcagcg cgccgatcgg cgtgcggctc gcgggcttca gcacgaacgg gcagccggcg gcgagcgcgg gcgcgacctt gtgcgcggcg aggttcagcg gaaaattgaa cggcgagatg aacgagcacg ggccgatcgg cacgcgcttg tagtagccgc tgtagccctg cgcgcgcggc gagatttcga gattgacgag accgccttcg atgcgcaccg attcctcggc cgcgacgcga aacgtatcga tgaggcgcgt cacctcgccc ttcgaatcgt tgatcggctt gcccgcctcg atgcacagcg cctgcgcgag ctcgtcgaag cgctcgcgaa agcgcgcgac gcagtgttcg aggatcgcct gccgcctgaa cgcgggcaat gcgcgcaacg gcttctgcgc gccgacggcc gccgcgatcg ccgcgtcgat cgcgctcgcg tcggccagcg cgacgcgcgt cgcgaccttg ccggtgtatt tgtcggtgac ttcgagctcg gcgttcgcgt acaccgcttc gttcgcgagg taatacggat aggtttcctt cagcatatgg tggtggtggt ggtgagccat
Expressed Protein Sequence
MAHHHHHHML KETYPYYLAN EAVYANAELE VTDKYTGKVA TRVALADASA IDAAIAAAVG AQKPLRALPA FRRQAILEHC VARFRERFDE LAQALCIEAG KPINDSKGEV TRLIDTFRVA AEESVRIEGG LVNLEISPRA QGYSGYYKRV PIGPCSFISP FNFPLNLAAH KVAPALAAGC PFVLKPASRT PIGALIIGEV LAETDLPKGA FSILPAHRDG ADLFTTDERF KLLSFTGSPT VGWELKKKAG KKKVVLELGG NAAAIVDADQ REVLDYVVER LAFGAYYQSG QSCIGVQRII AHADVYDALR EKLIAKTRSL KMGDPKDPAT FVGPMISESE ARRLAGWMEA AVAAGAKIVA GGKVDGAMFE ATLLEGVGRD QDLYRKEAFG PVALLERFSD FDDALARVND SDFGLQAGVF TDSLSHAQRA WDELEVGGVV INDVPSFRVD NMPYGGVKDS GLGREGIRYA IEDMTELRLM VVRRR
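Comparing the expressed protein sequence with the native AA sequence listed earlier in this record shows a difference at the N-terminus: the construct begins MAHHHHHH before the native MLKETYPYYL..., in line with the N-term 6xHis tag mentioned in the clone footnote. A small sketch of how that extension could be separated programmatically is given below. It uses only the first residues of each sequence, and the function name is hypothetical.

```python
from typing import Tuple


def split_n_terminal_extension(construct: str, native: str) -> Tuple[str, str]:
    """Split a construct into (N-terminal extension, remainder starting at the native sequence).

    Spaces used to group residues in tens are ignored.
    """
    c = "".join(construct.split()).upper()
    n = "".join(native.split()).upper()
    start = c.find(n)
    if start < 0:
        raise ValueError("native sequence not found in construct")
    return c[:start], c[start:]


# Leading residues copied from the expressed and native sequences in this record.
construct_start = "MAHHHHHHML KETYPYYLAN EAVYANAELE"
native_start = "MLKETYPYYL ANEAVYANAE"

tag, remainder = split_n_terminal_extension(construct_start, native_start)
print(tag)        # MAHHHHHH
print(len(tag))   # 8
```

An exact substring search is enough here because the record reports 100 percent identity; a construct with point mutations would need an alignment instead.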
Full NT Sequence (Expression Vector + Insert)
TAATACGACT CACTATAGGG AGACCACAAC GGTTTCCCTC TAGAAATAAT TTTGTTTAAC TTTAAGAAGG AGATATACCA TGGCTCACCA CCACCACCAC CATATGCTGA AGGAAACCTA TCCGTATTAC CTCGCGAACG AAGCGGTGTA CGCGAACGCC GAGCTCGAAG TCACCGACAA ATACACCGGC AAGGTCGCGA CGCGCGTCGC GCTGGCCGAC GCGAGCGCGA TCGACGCGGC GATCGCGGCG GCCGTCGGCG CGCAGAAGCC GTTGCGCGCA TTGCCCGCGT TCAGGCGGCA GGCGATCCTC GAACACTGCG TCGCGCGCTT TCGCGAGCGC TTCGACGAGC TCGCGCAGGC GCTGTGCATC GAGGCGGGCA AGCCGATCAA CGATTCGAAG GGCGAGGTGA CGCGCCTCAT CGATACGTTT CGCGTCGCGG CCGAGGAATC GGTGCGCATC GAAGGCGGTC TCGTCAATCT CGAAATCTCG CCGCGCGCGC AGGGCTACAG CGGCTACTAC AAGCGCGTGC CGATCGGCCC GTGCTCGTTC ATCTCGCCGT TCAATTTTCC GCTGAACCTC GCCGCGCACA AGGTCGCGCC CGCGCTCGCC GCCGGCTGCC CGTTCGTGCT GAAGCCCGCG AGCCGCACGC CGATCGGCGC GCTGATCATC GGCGAGGTGC TCGCGGAAAC CGACTTGCCG AAGGGCGCGT TCTCGATCCT GCCCGCGCAT CGCGACGGCG CGGATCTGTT CACGACCGAC GAGCGCTTCA AGCTGCTGTC GTTCACGGGC TCGCCCACCG TCGGCTGGGA ACTGAAGAAG AAGGCGGGCA AGAAGAAGGT CGTGCTCGAG CTGGGCGGCA ATGCGGCGGC GATCGTCGAT GCCGATCAGC GCGAGGTGCT CGACTACGTC GTCGAGCGGC TCGCGTTCGG CGCGTACTAC CAGTCGGGGC AGAGCTGCAT CGGCGTGCAG CGGATCATCG CGCATGCGGA CGTCTATGAC GCGCTGCGCG AGAAGCTGAT CGCGAAGACG CGCTCGCTGA AGATGGGCGA TCCGAAGGAC CCGGCGACGT TCGTCGGCCC GATGATCTCC GAATCCGAAG CGCGGCGGCT CGCCGGCTGG ATGGAGGCGG CGGTGGCGGC GGGCGCGAAG ATCGTCGCGG GCGGCAAAGT CGACGGTGCG ATGTTCGAGG CGACGCTGCT CGAAGGCGTG GGCCGCGATC AGGATCTGTA TCGCAAGGAG GCGTTCGGCC CGGTCGCGCT GCTCGAGCGG TTCTCCGATT TCGACGACGC GCTCGCGCGC GTGAACGACA GCGATTTCGG CCTGCAGGCG GGCGTGTTCA CCGATTCGCT GTCGCATGCG CAGCGCGCAT GGGACGAGCT CGAAGTGGGC GGCGTCGTGA TCAACGATGT GCCGTCGTTT CGCGTCGACA ACATGCCGTA CGGCGGCGTG AAGGACTCGG GGCTCGGCCG CGAAGGGATT CGCTATGCGA TCGAGGATAT GACCGAACTG CGCCTGATGG TCGTGCGCCG GCGCTGAGTA AGATAGGATC CGGCTGCTAA CAAAGCCCGA AAGGAAGCTG AGTTGGCTGC TGCCACCGCT GAGCAATAAC TAGCATAACC CCTTGGGGCC TCTAAACGGG TCTTGAGGGG TTTTTTGCTG AAAGGAGGAA CTATATCCGG ATATCCACAG GACGGGTGTG GTCGCCATGA TCGCGTAGTC GATAGTGGCT CCAAGTAGCG AAGCGAGCAG GACTGGGCGG CGGCCAAAGC GGTCGGACAG TGCTCCGAGA ACGGGTGCGC ATAGAAATTG 
CATCAACGCA TATAGCGCTA GCAGCACGCC ATAGTGACTG GCGATGCTGT CGGAATGGAC GATATCCCGC AAGAGGCCCG GCAGTACCGG CATAACCAAG CCTATGCCTA CAGCATCCAG GGTGACGGTG CCGAGGATGA CGATGAGCGC ATTGTTAGAT TTCATACACG GTGCCTGACT GCGTTAGCAA TTTAACTGTG ATAAACTACC GCATTAAAGC TTATCGATGA TAAGCTGTCA AACATGAGAA TTCTTGAAGA CGAAAGGGCC TCGTGATACG CCTATTTTTA TAGGTTAATG TCATGATAAT AATGGTTTCT TAGACGTCAG GTGGCACTTT TCGGGGAAAT GTGCGCGGAA CCCCTATTTG TTTATTTTTC TAAATACATT CAAATATGTA TCCGCTCATG AGACAATAAC CCTGATAAAT GCTTCAATAA TATTGAAAAA GGAAGAGTAT GAGTATTCAA CATTTCCGTG TCGCCCTTAT TCCCTTTTTT GCGGCATTTT GCCTTCCTGT TTTTGCTCAC CCAGAAACGC TGGTGAAAGT AAAAGATGCT GAAGATCAGT TGGGTGCACG AGTGGGTTAC ATCGAACTGG ATCTCAACAG CGGTAAGATC CTTGAGAGTT TTCGCCCCGA AGAACGTTTT CCAATGATGA GCACTTTTAA AGTTCTGCTA TGTGGCGCGG TATTATCCCG TGTTGACGCC GGGCAAGAGC AACTCGGTCG CCGCATACAC TATTCTCAGA ATGACTTGGT TGAGTACTCA CCAGTCACAG AAAAGCATCT TACGGATGGC ATGACAGTAA GAGAATTATG CAGTGCTGCC ATAACCATGA GTGATAACAC TGCGGCCAAC TTACTTCTGA CAACGATCGG AGGACCGAAG GAGCTAACCG CTTTTTTGCA CAACATGGGG GATCATGTAA CTCGCCTTGA TCGTTGGGAA CCGGAGCTGA ATGAAGCCAT ACCAAACGAC GAGCGTGACA CCACGATGCC TGCAGCAATG GCAACAACGT TGCGCAAACT ATTAACTGGC GAACTACTTA CTCTAGCTTC CCGGCAACAA TTAATAGACT GGATGGAGGC GGATAAAGTT GCAGGACCAC TTCTGCGCTC GGCCCTTCCG GCTGGCTGGT TTATTGCTGA TAAATCTGGA GCCGGTGAGC GTGGGTCTCG CGGTATCATT GCAGCACTGG GGCCAGATGG TAAGCCCTCC CGTATCGTAG TTATCTACAC GACGGGGAGT CAGGCAACTA TGGATGAACG AAATAGACAG ATCGCTGAGA TAGGTGCCTC ACTGATTAAG CATTGGTAAC TGTCAGACCA AGTTTACTCA TATATACTTT AGATTGATTT AAAACTTCAT TTTTAATTTA AAAGGATCTA GGTGAAGATC CTTTTTGATA ATCTCATGAC CAAAATCCCT TAACGTGAGT TTTCGTTCCA CTGAGCGTCA GACCCCGTAG AAAAGATCAA AGGATCTTCT TGAGATCCTT TTTTTCTGCG CGTAATCTGC TGCTTGCAAA CAAAAAAACC ACCGCTACCA GCGGTGGTTT GTTTGCCGGA TCAAGAGCTA CCAACTCTTT TTCCGAAGGT AACTGGCTTC AGCAGAGCGC AGATACCAAA TACTGTCCTT CTAGTGTAGC CGTAGTTAGG CCACCACTTC AAGAACTCTG TAGCACCGCC TACATACCTC GCTCTGCTAA TCCTGTTACC AGTGGCTGCT GCCAGTGGCG ATAAGTCGTG TCTTACCGGG TTGGACTCAA GACGATAGTT ACCGGATAAG GCGCAGCGGT CGGGCTGAAC 
GGGGGGTTCG TGCACACAGC CCAGCTTGGA GCGAACGACC TACACCGAAC TGAGATACCT ACAGCGTGAG CTATGAGAAA GCGCCACGCT TCCCGAAGGG AGAAAGGCGG ACAGGTATCC GGTAAGCGGC AGGGTCGGAA CAGGAGAGCG CACGAGGGAG CTTCCAGGGG GAAACGCCTG GTATCTTTAT AGTCCTGTCG GGTTTCGCCA CCTCTGACTT GAGCGTCGAT TTTTGTGATG CTCGTCAGGG GGGCGGAGCC TATGGAAAAA CGCCAGCAAC GCGGCCTTTT TACGGTTCCT GGCCTTTTGC TGGCCTTTTG CTCACATGTT CTTTCCTGCG TTATCCCCTG ATTCTGTGGA TAACCGTATT ACCGCCTTTG AGTGAGCTGA TACCGCTCGC CGCAGCCGAA CGACCGAGCG CAGCGAGTCA GTGAGCGAGG AAGCGGAAGA GCGCCTGATG CGGTATTTTC TCCTTACGCA TCTGTGCGGT ATTTCACACC GCATATATGG TGCACTCTCA GTACAATCTG CTCTGATGCC GCATAGTTAA GCCAGTATAC ACTCCGCTAT CGCTACGTGA CTGGGTCATG GCTGCGCCCC GACACCCGCC AACACCCGCT GACGCGCCCT GACGGGCTTG TCTGCTCCCG GCATCCGCTT ACAGACAAGC TGTGACCGTC TCCGGGAGCT GCATGTGTCA GAGGTTTTCA CCGTCATCAC CGAAACGCGC GAGGCAGCTG CGGTAAAGCT CATCAGCGTG GTCGTGAAGC GATTCACAGA TGTCTGCCTG TTCATCCGCG TCCAGCTCGT TGAGTTTCTC CAGAAGCGTT AATGTCTGGC TTCTGATAAA GCGGGCCATG TTAAGGGCGG TTTTTTCCTG TTTGGTCACT GATGCCTCCG TGTAAGGGGG ATTTCTGTTC ATGGGGGTAA TGATACCGAT GAAACGAGAG AGGATGCTCA CGATACGGGT TACTGATGAT GAACATGCCC GGTTACTGGA ACGTTGTGAG GGTAAACAAC TGGCGGTATG GATGCGGCGG GACCAGAGAA AAATCACTCA GGGTCAATGC CAGCGCTTCG TTAATACAGA TGTAGGTGTT CCACAGGGTA GCCAGCAGCA TCCTGCGATG CAGATCCGGA ACATAATGGT GCAGGGCGCT GACTTCCGCG TTTCCAGACT TTACGAAACA CGGAAACCGA AGACCATTCA TGTTGTTGCT CAGGTCGCAG ACGTTTTGCA GCAGCAGTCG CTTCACGTTC GCTCGCGTAT CGGTGATTCA TTCTGCTAAC CAGTAAGGCA ACCCCGCCAG CCTAGCCGGG TCCTCAACGA CAGGAGCACG ATCATGCGCA CCCGTGGCCA GGACCCAACG CTGCCCGAGA TGCGCCGCGT GCGGCTGCTG GAGATGGCGG ACGCGATGGA TATGTTCTGC CAAGGGTTGG TTTGCGCATT CACAGTTCTC CGCAAGAATT GATTGGCTCC AATTCTTGGA GTGGTGAATC CGTTAGCGAG GTGCCGCCGG CTTCCATTCA GGTCGAGGTG GCCCGGCTCC ATGCACCGCG ACGCAACGCG GGGAGGCAGA CAAGGTATAG GGCGGCGCCT ACAATCCATG CCAACCCGTT CCATGTGCTC GCCGAGGCGG CATAAATCGC CGTGACGATC AGCGGTCCAG TGATCGAAGT TAGGCTGGTA AGAGCCGCGA GCGATCCTTG AAGCTGTCCC TGATGGTCGT CATCTACCTG CCTGGACAGC ATGGCCTGCA ACGCGGGCAT CCCGATGCCG CCGGAAGCGA GAAGAATCAT AATGGGGAAG 
GCCATCCAGC CTCGCGTCGC GAACGCCAGC AAGACGTAGC CCAGCGCGTC GGCCGCCATG CCGGCGATAA TGGCCTGCTT CTCGCCGAAA CGTTTGGTGG CGGGACCAGT GACGAAGGCT TGAGCGAGGG CGTGCAAGAT TCCGAATACC GCAAGCGACA GGCCGATCAT CGTCGCGCTC CAGCGAAAGC GGTCCTCGCC GAAAATGACC CAGAGCGCTG CCGGCACCTG TCCTACGAGT TGCATGATAA AGAAGACAGT CATAAGTGCG GCGACGATAG TCATGCCCCG CGCCCACCGG AAGGAGCTGA CTGGGTTGAA GGCTCTCAAG GGCATCGGTC GACGCTCTCC CTTATGCGAC TCCTGCATTA GGAAGCAGCC CAGTAGTAGG TTGAGGCCGT TGAGCACCGC CGCCGCAAGG AATGGTGCAT GCAAGGAGAT GGCGCCCAAC AGTCCCCCGG CCACGGGGCC TGCCACCATA CCCACGCCGA AACAAGCGCT CATGAGCCCG AAGTGGCGAG CCCGATCTTC CCCATCGGTG ATGTCGGCGA TATAGGCGCC AGCAACCGCA CCTGTGGCGC CGGTGATGCC GGCCACGATG CGTCCGGCGT AGAGGATCGA GATCTCGATC CCGCGAAAT