TrbrA.00795.a

Hexokinase TbHK1

CENTER ID: TrbrA.00795.a
ORGANISM: Trypanosoma brucei TREU927
ASSOCIATED DISEASE: African trypanosomiasis (African Sleeping Sickness)
CURRENT STATUS: purified
COMMUNITY REQUEST: True
NIH RISK GROUP: 2
SELECT AGENT: False
NIH PRIORITY
pathogens category:
unclassified

Ordering Clones & Proteins

If there are materials available for this target, they will be listed below. Materials can be ordered from SSGCID using the button in the "order material" column. Clicking the button will add the material to a virtual cart. You may order multiple materials at a time at no cost to you, as this contract is funded by NIAID. When you are ready to place your order, click the "Place Order" link which will appear in the top right corner of the page after you place your first item in your cart.

Proteins

CENTER
REFERENCE ID
DOMAIN/REGION
DESCRIPTION
INFO AA
START
AA
STOP
ORDER
MATERIAL
TrbrA.00795.a.E1.PS00966 Hexokinase TbHK1 (N-truncated) 2 471
TrbrA.00795.a.E1.PS01750 Hexokinase TbHK1 (N-truncated) 2 471
External Resources
RESOURCE REFERENCE ID
EuPathDB: TritrypDB:Tb927.10.2010
OrthoMCL: OG5_126743
RefSeq: XP_822456.1
UniProt: Q38C42
Sequences
These sequences are the native gene sequence; sequences of constructs derived from these sequences may differ due to codon optimization or other protocols. To find the specific sequence of any material you may have ordered, click on the "info" button next to the name of that material.
AA Sequence
MSRRLNNILE HISIQGNDGE TVRAVKRDVA MAALTNQFTM SVESMRQIMT YLLYEMVEGL EGRESTVRML PSYVYKADPK RATGVFYALD LGGTNFRVLR VACKEGAVVD SSTSAFKIPK YALEGNATDL FGFIASNVKK TMETRAPEDL NRTVPLGFTF SFPVEQTKVN RGVLIRWTKG FSTKGVQGND VIALLQAAFG RVSLKVNVVA LCNDTVGTLI SHYFKDPEVQ VGVIIGTGSN ACYFETASAV TKDPAVAARG SALTPINMES GNFDSKYRFV LPTTKFDLDI DDASLNKGQQ ALEKMISGMY LGEIARRVIV HLSSINCLPA ALQTALGNRG SFESRFAGMI SADRMPGLQF TRSTIQKVCG VDVQSIEDLR IIRDVCRLVR GRAAQLSASF CCAPLVKTQT QGRATIAIDG SVFEKIPSFR RVLQDNINRI LGPECDVRAV LAKDGSGIGA AFISAMVVND K
NT Sequence
atgtctagac gcctaaacaa tatcctcgaa cacatctcga tccagggaaa tgatggtgag actgtgcgtg ccgttaagcg tgatgttgca atggcagcgc tgaccaacca attcacaatg agtgtcgagt ctatgcgaca gatcatgaca tacctcctgt acgagatggt ggagggtctt gagggtcgtg aaagcaccgt ccgcatgtta ccatcttatg tatacaaggc ggaccctaag cgtgctactg gcgtcttcta cgcacttgac ctcggtggta ccaacttccg tgtgctgcgc gttgcatgca aggagggtgc cgtggtggat tcctctactt ctgcattcaa gattcccaaa tatgcccttg agggtaacgc caccgatctg tttggcttca ttgcatccaa tgtgaagaaa accatggaaa ctcgtgcacc tgaggacctc aatcgcacag ttcctcttgg gtttaccttc agtttccccg tggagcagac gaaggttaac cgtggtgtgc ttatccggtg gacgaagggc ttcagcacga aaggcgttca aggaaatgat gtgattgccc ttcttcaggc tgcttttggg cgagtgagct tgaaggtgaa tgttgtggcg ttgtgcaacg acactgttgg aacattaatt tcgcattact ttaaggaccc tgaggtacag gttggtgtga ttatcggcac tggttccaat gcgtgctact ttgagacggc gtctgctgtg acgaaggacc ctgccgttgc tgctcgtggg tcagcactta ctcccatcaa tatggaaagc ggcaactttg actccaagta ccggtttgtc ctccctacga cgaagttcga cttggatatt gacgatgcgt cgttgaacaa aggtcaacag gcgctggaga agatgatatc cggcatgtac ctcggcgaaa tcgcccgccg cgttattgtg cacctgtcgt ctattaactg ccttcctgcg gcactgcaga ctgctttggg caaccggggg tcgtttgagt cccgatttgc cgggatgatc agtgctgacc gtatgcccgg acttcagttc actcgcagca cgatccagaa ggtgtgtggt gttgacgtgc agtcaattga agaccttcgc atcattcgcg atgtgtgccg ccttgtccgt gggagggctg cgcaactctc tgcttccttc tgctgcgctc cactggttaa gactcaaaca cagggccgtg caactattgc aattgacggc tccgtgtttg agaagattcc gtcattccgc cgcgtcttgc aggacaacat caaccgtatc cttggccctg agtgcgatgt cagggccgtt ctcgcaaagg atggcagtgg aattggtgct gcatttattt ccgcaatggt ggtgaacgac aagtaa
Details for TrbrA.00795.a.E1.PS00966
PURIFICATION DATe: 12/14/2010
CONCENTRATION: 6.88mg/ml
OBSERVED MW: data unavailable
EXPRESSION LEVEL: Moderate Expression
PROTEIN PURIFICATION BUFFER: 20 mM HEPES, pH 7.0, 300 mM NaCl, 5% glycerol and 1 mM TCEP
EXPRESSION HOST: data unavailable
VIAL COUNT (approx.): 2
VIAL VOLUME: 100µl
PERCENT IDENTITY: 95
PERCENT COVERAGE: 95
Protocol Notes
notes unavailable
Validated AA Sequence
MSRRLNNILE HISIQGNDGE TVRAVKRDVA MAALTNQFTM SVESMRQIMT YLLYEMVEGL EGRESTVRML PSYVYKADPK RATGVFYALD LGGTNFRVLR VACKEGAVVD SSTSAFKIPK YALEGNATDL FDFIASNVKK TMETRAPEDL NRTVPLGFTF SFPVEQTXVN RGVLIRWTXG FSTKGVQGND VIALLXAAFG RVSLXVNVVA LCNDTVGTLI SHYFXDPEVX VGVIIGTGSN ACYFETASAV TKXPAVAARG SALTPINMES GNFDSKYRFX XXTTKXDLDI DDASLNKGQQ ALEKMISGMY XGEXARRVIV HLSSINCXPX ALQTALGNRG SFESRFAGMI SADRMPGXQF TRSTIQKVCG VDVQSIEDLR IIRDVCRLVR GRXAQLXASX CCAPLVKTQT QGRATIAIDG SVFEKIPSFR RVLQDNINRI LGPECDVRAV LAKDGSGIGX AFISAMVV
Validated NT Sequence
ttcaccacca ttgcggaaat aaatgcanca ccaattccac tgccatcctt tgcgagaacg gccctgacat cgcactcagg gccaaggata cggttgatgt tgtcctgcaa gacgcggcgg aatgacggaa tcttctcaaa cacggagccg tcaattgcaa tagttgcacg gccctgtgtt tgagtcttaa ccagtggagc gcagcanaag gaagcanana gttgcgcanc cctcccacgg acaaggcggc acacatcgcg aatgatgcga aggtcttcaa ttgactgcac gtcaacacca cacaccttct ggatcgtgct gcgagtgaac tgaantccgg gcatacggtc agcactgatc atcccggcaa atcgggactc aaacgacccc cggttgccca aagcagtctg cagtgccnca gganngcagt taatagacga caggtgcaca ataacgcggc gggcganttc accgangtac atgccggata tcatcttctc cagcgcctgt tgacctttgt tcaacgacgc atcgtcaata tccaagtcna acttcgtcgt anggangana aaccggtact tggagtcaaa gttgccgctt tccatattga tgggagtaag tgctgaccca cgagcagcaa cggcagggnc cttcgtcaca gcagacgccg tctcaaagta gcacgcattn gaaccagtgc cgataatcac gccaacntgt acctcagggt cnttaaagta atgcgaaatt aatgttccaa cggtgtcgtt gcacaacgcc acaacattca cnttcaggct cactcgccca aaagcagcnt gaagaagggc aatcacatcg tttccttgaa cgcctttcgt gctgaagccn ttcgtccacc ggataagcac accacggtta acnttcgtct gctccacggg gaaactgaag gtaaacccaa gaggaactgt gcgattgagg tcctcaggtg cacgagtttc catggttttc ttcacattgg atgcaatgaa gtcaaacaga tcggtggcgt taccctcaag ggcatatttg ggaatcttga atgcagaagt agaggaatcc accacggcac cctccttgca tgcaacacgc aacacacgga agttggtacc accgaggtca agtgcgtaga agacgccagt agcacgctta gggtccgcct tgtacacata agatggtaac atgcggacgg tgctttcacg accctcaaga ccctccacca tctcgtacag gaggtatgtc atgatctgtc gcatagactc cacactcatt gtgaattggt tggtcagcgc tgccattgca acatcacgct taacggcacg cacagtctca ccatcatttc cctggatcga gatgtgttcg aggatattgt ttaggcgtct agacatatgg tggtggtg
Expressed Protein Sequence
MAHHHHHHMG TLEAQTQGPG SSRRLNNILE HISIQGNDGE TVRAVKRDVA MAALTNQFTM SVESMRQIMT YLLYEMVEGL EGRESTVRML PSYVYKADPK RATGVFYALD LGGTNFRVLR VACKEGAVVD SSTSAFKIPK YALEGNATDL FGFIASNVKK TMETRAPEDL NRTVPLGFTF SFPVEQTKVN RGVLIRWTKG FSTKGVQGND VIALLQAAFG RVSLKVNVVA LCNDTVGTLI SHYFKDPEVQ VGVIIGTGSN ACYFETASAV TKDPAVAARG SALTPINMES GNFDSKYRFV LPTTKFDLDI DDASLNKGQQ ALEKMISGMY LGEIARRVIV HLSSINCLPA ALQTALGNRG SFESRFAGMI SADRMPGLQF TRSTIQKVCG VDVQSIEDLR IIRDVCRLVR GRAAQLSASF CCAPLVKTQT QGRATIAIDG SVFEKIPSFR RVLQDNINRI LGPECDVRAV LAKDGSGIGA AFISAMVVND K
Full NT Sequence (Expression Vector + Insert)
GAACTCACCT ATCTCCCCAA CACCTAATAA CATTCAATCA CTCTTTCCAC TAACCACCTA TCTACATCAC CAAGATATCA CTAGTTCTCG AGATGGCTCA CCACCACCAC CACCATATGG GTACCCTGGA AGCTCAGACC CAGGGTCCTG GTTCGTCTAG ACGCCTAAAC AATATCCTCG AACACATCTC GATCCAGGGA AATGATGGTG AGACTGTGCG TGCCGTTAAG CGTGATGTTG CAATGGCAGC GCTGACCAAC CAATTCACAA TGAGTGTCGA GTCTATGCGA CAGATCATGA CATACCTCCT GTACGAGATG GTGGAGGGTC TTGAGGGTCG TGAAAGCACC GTCCGCATGT TACCATCTTA TGTATACAAG GCGGACCCTA AGCGTGCTAC TGGCGTCTTC TACGCACTTG ACCTCGGTGG TACCAACTTC CGTGTGCTGC GCGTTGCATG CAAGGAGGGT GCCGTGGTGG ATTCCTCTAC TTCTGCATTC AAGATTCCCA AATATGCCCT TGAGGGTAAC GCCACCGATC TGTTTGGCTT CATTGCATCC AATGTGAAGA AAACCATGGA AACTCGTGCA CCTGAGGACC TCAATCGCAC AGTTCCTCTT GGGTTTACCT TCAGTTTCCC CGTGGAGCAG ACGAAGGTTA ACCGTGGTGT GCTTATCCGG TGGACGAAGG GCTTCAGCAC GAAAGGCGTT CAAGGAAATG ATGTGATTGC CCTTCTTCAG GCTGCTTTTG GGCGAGTGAG CTTGAAGGTG AATGTTGTGG CGTTGTGCAA CGACACTGTT GGAACATTAA TTTCGCATTA CTTTAAGGAC CCTGAGGTAC AGGTTGGTGT GATTATCGGC ACTGGTTCCA ATGCGTGCTA CTTTGAGACG GCGTCTGCTG TGACGAAGGA CCCTGCCGTT GCTGCTCGTG GGTCAGCACT TACTCCCATC AATATGGAAA GCGGCAACTT TGACTCCAAG TACCGGTTTG TCCTCCCTAC GACGAAGTTC GACTTGGATA TTGACGATGC GTCGTTGAAC AAAGGTCAAC AGGCGCTGGA GAAGATGATA TCCGGCATGT ACCTCGGCGA AATCGCCCGC CGCGTTATTG TGCACCTGTC GTCTATTAAC TGCCTTCCTG CGGCACTGCA GACTGCTTTG GGCAACCGGG GGTCGTTTGA GTCCCGATTT GCCGGGATGA TCAGTGCTGA CCGTATGCCC GGACTTCAGT TCACTCGCAG CACGATCCAG AAGGTGTGTG GTGTTGACGT GCAGTCAATT GAAGACCTTC GCATCATTCG CGATGTGTGC CGCCTTGTCC GTGGGAGGGC TGCGCAACTC TCTGCTTCCT TCTGCTGCGC TCCACTGGTT AAGACTCAAA CACAGGGCCG TGCAACTATT GCAATTGACG GCTCCGTGTT TGAGAAGATT CCGTCATTCC GCCGCGTCTT GCAGGACAAC ATCAACCGTA TCCTTGGCCC TGAGTGCGAT GTCAGGGCCG TTCTCGCAAA GGATGGCAGT GGAATTGGTG CTGCATTTAT TTCCGCAATG GTGGTGAACG ACAAGACGCG TTAACCACGT GAGTAAGATA GGGATCCATA TATAGGGCCC GGGTTATAAT TACCTCAGGT CGACGTCCCA TGGTTTTGTA TAGAATTTAC GGCTAGCGCC GGATGCGACG CCGGTCGCGT CTTATCCGGC CTTCCTATAT CAGGCGGTGT TTAAGACGCC GCCGCTTCGC CCAAATCCTT ATGCCGGTTC GACGACTGGA CAAAATACTG TTTATCTTCC CAGCGCAGGC AGGTTAATGT ACCACCCCAG CAGCAGCCGG TATCCAGCGC GTATATACCT TCCGGCGTAC CTTTGCCCTC CAGCGATGCC CAGTGACCAA AGGCGATGCT GTATTCTTCA GCGACAGGGC CAGGAATCGC AAACCACGGT TTCAGTGGGG CAGGGGCCTC TTCCGGCGAT TCTTACTAGC TAGTATGCAT AGGTGCTGAA ATATAAAGTT TGTGTTTCTA AAACACACTT GGTACGTACG ATAACGTACA GTGTTTTTCC CTCCACTTAA ATCGAAGGGT AGTGTCTTGG AGCGCGCGGA GTAAACATAT ATGGTTCATA TATGTCCGTA GGCACGTAAA AAAAGCGAGG GATTCGAATT CCCCCGGAAC CCCCGGTTGG GGCCCACGCC TCGATCGAGC AAAAAAAAAA AAAAAGAAAA AAAAAAAAAA AAAAAGCTTT CCCGCGGCCA GCTTGGCGTA ATCATGGTCA TAGCTGTTTC CTGTGTGAAA TTGTTATCCG CTCACAATTC CACACAACAT ACGAGCCGGA AGCATAAAGT GTAAAGCCTG GGGTGCCTAA TGAGTGAGCT AACTCACATT AATTGCGTTG CGCTCACTGC CCGCTTTCCA GTCGGGAAAC CTGTCGTGCC AGCTGCATTA ATGAATCGGC CAACGCGCGG GGAGAGGCGG TTTGCGTATT GGGCGCTCTT CCGCTTCCTC ACTCACTGAC TCGCTGCGCT CGGTCGCTCG GCTGCGGCGA GCGGTATCAG CTCACTCAAA GGCGGTAATA CGGTTATCCA CAGAATCAGG GGATAACGCA GGAAAGAACA TGTGAGCAAA AGGCCAGCAA AAGGCCAGGA ACCGTAAAAA GGCCGCGTTG CTGGCGTTTT TCCATAGGCT CCGCCCCCCT GACGAGCATC ACAAAAATCG ACGCTCAAGT CAGAGGTGGC GAAACCCGAC AGGACTATAA AGATACCAGG CGTTTCCCCC TGGAAGCTCC CTCGTGCGCT CTCCTGTTCC GACCCTGCCG CTTACCGGAT ACCTGTCCGC CTTTCTCCCT TCGGGAAGCG TGGCGCTTTC TCATAGCTCA CGCTGTAGGT ATCTCAGTTC GGTGTAGGTC GTTCGCTCCA AGCTGGGCTG TGTGCACGAA CCCCCCGTTC AGCCCGACCG CTGCGCCTTA TCCGGTAACT ATCGTCTTGA GTCCAACCCG GTAAGACACG ACTTATCGCC ACTGGCAGCA GCCACTGGTA ACAGGATTAG CAGAGCGAGG TATGTAGGCG GTGCTACAGA GTTCTTGAAG TGGTGGCCTA ACTACGGCTA CACTAGAAGA ACAGTATTTG GTATCTGCGC TCTGCTGAAG CCAGTTACCT TCGGAAAAAG AGTTGGTAGC TCTTGATCCG GCAAACAAAC CACCGCTGGT AGCGGTGGTT TTTTTGTTTG CAAGCAGCAG ATTACGCGCA GAAAAAAAGG ATCTCAAGAA GATCCTTTGA TCTTTTCTAC GGGGTCTGAC GCTCAGTGGA ACGAAAACTC ACGTTAAGGG ATTTTGGTCA TGAGATTATC AAAAAGGATC TTCACCTAGA TCCTTTTAAA TTAAAAATGA AGTTTTAAAT CAATCTAAAG TATATATGAG TAAACTTGGT CTGACAGTTA CCAATGCTTA ATCAGTGAGG CACCTATCTC AGCGATCTGT CTATTTCGTT CATCCATAGT TGCCTGACTC CCCGTCGTGT AGATAACTAC GATACGGGAG GGCTTACCAT CTGGCCCCAG TGCTGCAATG ATACCGCGAG ACCCACGCTC ACCGGCTCCA GATTTATCAG CAATAAACCA GCCAGCCGGA AGGGCCGAGC GCAGAAGTGG TCCTGCAACT TTATCCGCCT CCATCCAGTC TATTAATTGT TGCCGGGAAG CTAGAGTAAG TAGTTCGCCA GTTAATAGTT TGCGCAACGT TGTTGCCATT GCTACAGGCA TCGTGGTGTC ACGCTCGTCG TTTGGTATGG CTTCATTCAG CTCCGGTTCC CAACGATCAA GGCGAGTTAC ATGATCCCCC ATGTTGTGCA AAAAAGCGGT TAGCTCCTTC GGTCCTCCGA TCGTTGTCAG AAGTAAGTTG GCCGCAGTGT TATCACTCAT GGTTATGGCA GCACTGCATA ATTCTCTTAC TGTCATGCCA TCCGTAAGAT GCTTTTCTGT GACTGGTGAG TACTCAACCA AGTCATTCTG AGAATAGCGT ATGCGGCGAC CGAGTTGCTC TTGCCCGGCG TCAATACGGG ATAATACCGC GCCACATAGC AGAACTTTAA AAGTGCTCAT CATTGGAAAA CGTTCTTCGG GGCGAAAACT CTCAAGGATC TTACCGCTGT TGAGATCCAG TTCGATGTAA CCCACTCGTG CACCCAACTG ATCTTCAGCA TCTTTTACTT TCACCAGCGT TTCTGGGTGA GCAAAAACAG GAAGGCAAAA TGCCGCAAAA AAGGGAATAA GGGCGACACG GAAATGTTGA ATACTCATAC TCTTCCTTTT TCAATATTAT TGAAGCATTT ATCAGGGTTA TTGTCTCATG AGCGGATACA TATTTGAATG TATTTAGAAA AATAAACAAA TAGGGGTTCC GCGCACATTT CCCCGAAAAG TGCCACCTGA CGTCTAAGAA ACCATTATTA TCATGACATT AACCTATAAA AATAGGCGTA TCACGAGGCC CTTTCGTCTC GCGCGTTTCG GTGATGACGG TGAAAACCTC TGACACATGC AGCTCCCGGA GACGGTCACA GCTTGTCTGT AAGCGGATGC CGGGAGCAGA CAAGCCCGTC AGGGCGCGTC AGCGGGTGTT GGCGGGTGTC GGGGCTGGCT TAACTATGCG GCATCAGAGC AGATTGTACT GAGAGTGCAC CATTCGACGC TCTCCCTTAT GCGACTCCTG CATTAGGAAG CAGCCCAGTA GTAGGTTGAG GCCGTTGAGC ACCGCCGCCG CAAGGAATGG TGCATGCAAG GAGATGGCGC CCAACAGTCC CCCGGCCACG GGGCCTGCCA CCATACCCAC GCCGAAACAA GCGCTCATGA GCCCGAAGTG GCGAGCCCGA TCTTCCCCAT CGGTGATGTC GGCGATATAG GCGCCAGCAA CCGCACCTGT GGCGCCGGTG ATGCCGGCCA CGATGCGTCC GGCGTAGAGG ATCTGGCTAG CGATGACCCT GCTGATTGGT TCGCTGACCA TTTCCGGGTG CGGGACGGCG TTACCAGAAA CTCAGAAGGT TCGTCCAACC AAACCGACTC TGGCGGCAGT TTACGAGAGA GATGATAGGG TCTGCTTCAG TAAGCCAGAT GCTACACAAT TAGGCTTGTA CATACTGTCG TTAGAACGCG GCTACAATTA ATACATAACC TTATGTATCA TACACATACG ATTTAGGTGA CACTATA
Details for TrbrA.00795.a.E1.PS01750
PURIFICATION DATe: 1/12/2011
CONCENTRATION: 9.33mg/ml
OBSERVED MW: 55kDa
EXPRESSION LEVEL: Low Expression
PROTEIN PURIFICATION BUFFER: 20 mM HEPES, pH 7.0, 300 mM NaCl, 5% glycerol and 1 mM TCEP
EXPRESSION HOST: data unavailable
VIAL COUNT (approx.): 5
VIAL VOLUME: 50µl
PERCENT IDENTITY: 95
PERCENT COVERAGE: 95
Protocol Notes
notes unavailable
Validated AA Sequence
MSRRLNNILE HISIQGNDGE TVRAVKRDVA MAALTNQFTM SVESMRQIMT YLLYEMVEGL EGRESTVRML PSYVYKADPK RATGVFYALD LGGTNFRVLR VACKEGAVVD SSTSAFKIPK YALEGNATDL FDFIASNVKK TMETRAPEDL NRTVPLGFTF SFPVEQTXVN RGVLIRWTXG FSTKGVQGND VIALLXAAFG RVSLXVNVVA LCNDTVGTLI SHYFXDPEVX VGVIIGTGSN ACYFETASAV TKXPAVAARG SALTPINMES GNFDSKYRFX XXTTKXDLDI DDASLNKGQQ ALEKMISGMY XGEXARRVIV HLSSINCXPX ALQTALGNRG SFESRFAGMI SADRMPGXQF TRSTIQKVCG VDVQSIEDLR IIRDVCRLVR GRXAQLXASX CCAPLVKTQT QGRATIAIDG SVFEKIPSFR RVLQDNINRI LGPECDVRAV LAKDGSGIGX AFISAMVV
Validated NT Sequence
ttcaccacca ttgcggaaat aaatgcanca ccaattccac tgccatcctt tgcgagaacg gccctgacat cgcactcagg gccaaggata cggttgatgt tgtcctgcaa gacgcggcgg aatgacggaa tcttctcaaa cacggagccg tcaattgcaa tagttgcacg gccctgtgtt tgagtcttaa ccagtggagc gcagcanaag gaagcanana gttgcgcanc cctcccacgg acaaggcggc acacatcgcg aatgatgcga aggtcttcaa ttgactgcac gtcaacacca cacaccttct ggatcgtgct gcgagtgaac tgaantccgg gcatacggtc agcactgatc atcccggcaa atcgggactc aaacgacccc cggttgccca aagcagtctg cagtgccnca gganngcagt taatagacga caggtgcaca ataacgcggc gggcganttc accgangtac atgccggata tcatcttctc cagcgcctgt tgacctttgt tcaacgacgc atcgtcaata tccaagtcna acttcgtcgt anggangana aaccggtact tggagtcaaa gttgccgctt tccatattga tgggagtaag tgctgaccca cgagcagcaa cggcagggnc cttcgtcaca gcagacgccg tctcaaagta gcacgcattn gaaccagtgc cgataatcac gccaacntgt acctcagggt cnttaaagta atgcgaaatt aatgttccaa cggtgtcgtt gcacaacgcc acaacattca cnttcaggct cactcgccca aaagcagcnt gaagaagggc aatcacatcg tttccttgaa cgcctttcgt gctgaagccn ttcgtccacc ggataagcac accacggtta acnttcgtct gctccacggg gaaactgaag gtaaacccaa gaggaactgt gcgattgagg tcctcaggtg cacgagtttc catggttttc ttcacattgg atgcaatgaa gtcaaacaga tcggtggcgt taccctcaag ggcatatttg ggaatcttga atgcagaagt agaggaatcc accacggcac cctccttgca tgcaacacgc aacacacgga agttggtacc accgaggtca agtgcgtaga agacgccagt agcacgctta gggtccgcct tgtacacata agatggtaac atgcggacgg tgctttcacg accctcaaga ccctccacca tctcgtacag gaggtatgtc atgatctgtc gcatagactc cacactcatt gtgaattggt tggtcagcgc tgccattgca acatcacgct taacggcacg cacagtctca ccatcatttc cctggatcga gatgtgttcg aggatattgt ttaggcgtct agacatatgg tggtggtg
Expressed Protein Sequence
MAHHHHHHMG TLEAQTQGPG SSRRLNNILE HISIQGNDGE TVRAVKRDVA MAALTNQFTM SVESMRQIMT YLLYEMVEGL EGRESTVRML PSYVYKADPK RATGVFYALD LGGTNFRVLR VACKEGAVVD SSTSAFKIPK YALEGNATDL FGFIASNVKK TMETRAPEDL NRTVPLGFTF SFPVEQTKVN RGVLIRWTKG FSTKGVQGND VIALLQAAFG RVSLKVNVVA LCNDTVGTLI SHYFKDPEVQ VGVIIGTGSN ACYFETASAV TKDPAVAARG SALTPINMES GNFDSKYRFV LPTTKFDLDI DDASLNKGQQ ALEKMISGMY LGEIARRVIV HLSSINCLPA ALQTALGNRG SFESRFAGMI SADRMPGLQF TRSTIQKVCG VDVQSIEDLR IIRDVCRLVR GRAAQLSASF CCAPLVKTQT QGRATIAIDG SVFEKIPSFR RVLQDNINRI LGPECDVRAV LAKDGSGIGA AFISAMVVND K
Full NT Sequence (Expression Vector + Insert)
GAACTCACCT ATCTCCCCAA CACCTAATAA CATTCAATCA CTCTTTCCAC TAACCACCTA TCTACATCAC CAAGATATCA CTAGTTCTCG AGATGGCTCA CCACCACCAC CACCATATGG GTACCCTGGA AGCTCAGACC CAGGGTCCTG GTTCGTCTAG ACGCCTAAAC AATATCCTCG AACACATCTC GATCCAGGGA AATGATGGTG AGACTGTGCG TGCCGTTAAG CGTGATGTTG CAATGGCAGC GCTGACCAAC CAATTCACAA TGAGTGTCGA GTCTATGCGA CAGATCATGA CATACCTCCT GTACGAGATG GTGGAGGGTC TTGAGGGTCG TGAAAGCACC GTCCGCATGT TACCATCTTA TGTATACAAG GCGGACCCTA AGCGTGCTAC TGGCGTCTTC TACGCACTTG ACCTCGGTGG TACCAACTTC CGTGTGCTGC GCGTTGCATG CAAGGAGGGT GCCGTGGTGG ATTCCTCTAC TTCTGCATTC AAGATTCCCA AATATGCCCT TGAGGGTAAC GCCACCGATC TGTTTGGCTT CATTGCATCC AATGTGAAGA AAACCATGGA AACTCGTGCA CCTGAGGACC TCAATCGCAC AGTTCCTCTT GGGTTTACCT TCAGTTTCCC CGTGGAGCAG ACGAAGGTTA ACCGTGGTGT GCTTATCCGG TGGACGAAGG GCTTCAGCAC GAAAGGCGTT CAAGGAAATG ATGTGATTGC CCTTCTTCAG GCTGCTTTTG GGCGAGTGAG CTTGAAGGTG AATGTTGTGG CGTTGTGCAA CGACACTGTT GGAACATTAA TTTCGCATTA CTTTAAGGAC CCTGAGGTAC AGGTTGGTGT GATTATCGGC ACTGGTTCCA ATGCGTGCTA CTTTGAGACG GCGTCTGCTG TGACGAAGGA CCCTGCCGTT GCTGCTCGTG GGTCAGCACT TACTCCCATC AATATGGAAA GCGGCAACTT TGACTCCAAG TACCGGTTTG TCCTCCCTAC GACGAAGTTC GACTTGGATA TTGACGATGC GTCGTTGAAC AAAGGTCAAC AGGCGCTGGA GAAGATGATA TCCGGCATGT ACCTCGGCGA AATCGCCCGC CGCGTTATTG TGCACCTGTC GTCTATTAAC TGCCTTCCTG CGGCACTGCA GACTGCTTTG GGCAACCGGG GGTCGTTTGA GTCCCGATTT GCCGGGATGA TCAGTGCTGA CCGTATGCCC GGACTTCAGT TCACTCGCAG CACGATCCAG AAGGTGTGTG GTGTTGACGT GCAGTCAATT GAAGACCTTC GCATCATTCG CGATGTGTGC CGCCTTGTCC GTGGGAGGGC TGCGCAACTC TCTGCTTCCT TCTGCTGCGC TCCACTGGTT AAGACTCAAA CACAGGGCCG TGCAACTATT GCAATTGACG GCTCCGTGTT TGAGAAGATT CCGTCATTCC GCCGCGTCTT GCAGGACAAC ATCAACCGTA TCCTTGGCCC TGAGTGCGAT GTCAGGGCCG TTCTCGCAAA GGATGGCAGT GGAATTGGTG CTGCATTTAT TTCCGCAATG GTGGTGAACG ACAAGACGCG TTAACCACGT GAGTAAGATA GGGATCCATA TATAGGGCCC GGGTTATAAT TACCTCAGGT CGACGTCCCA TGGTTTTGTA TAGAATTTAC GGCTAGCGCC GGATGCGACG CCGGTCGCGT CTTATCCGGC CTTCCTATAT CAGGCGGTGT TTAAGACGCC GCCGCTTCGC CCAAATCCTT ATGCCGGTTC GACGACTGGA CAAAATACTG TTTATCTTCC CAGCGCAGGC AGGTTAATGT ACCACCCCAG CAGCAGCCGG TATCCAGCGC GTATATACCT TCCGGCGTAC CTTTGCCCTC CAGCGATGCC CAGTGACCAA AGGCGATGCT GTATTCTTCA GCGACAGGGC CAGGAATCGC AAACCACGGT TTCAGTGGGG CAGGGGCCTC TTCCGGCGAT TCTTACTAGC TAGTATGCAT AGGTGCTGAA ATATAAAGTT TGTGTTTCTA AAACACACTT GGTACGTACG ATAACGTACA GTGTTTTTCC CTCCACTTAA ATCGAAGGGT AGTGTCTTGG AGCGCGCGGA GTAAACATAT ATGGTTCATA TATGTCCGTA GGCACGTAAA AAAAGCGAGG GATTCGAATT CCCCCGGAAC CCCCGGTTGG GGCCCACGCC TCGATCGAGC AAAAAAAAAA AAAAAGAAAA AAAAAAAAAA AAAAAGCTTT CCCGCGGCCA GCTTGGCGTA ATCATGGTCA TAGCTGTTTC CTGTGTGAAA TTGTTATCCG CTCACAATTC CACACAACAT ACGAGCCGGA AGCATAAAGT GTAAAGCCTG GGGTGCCTAA TGAGTGAGCT AACTCACATT AATTGCGTTG CGCTCACTGC CCGCTTTCCA GTCGGGAAAC CTGTCGTGCC AGCTGCATTA ATGAATCGGC CAACGCGCGG GGAGAGGCGG TTTGCGTATT GGGCGCTCTT CCGCTTCCTC ACTCACTGAC TCGCTGCGCT CGGTCGCTCG GCTGCGGCGA GCGGTATCAG CTCACTCAAA GGCGGTAATA CGGTTATCCA CAGAATCAGG GGATAACGCA GGAAAGAACA TGTGAGCAAA AGGCCAGCAA AAGGCCAGGA ACCGTAAAAA GGCCGCGTTG CTGGCGTTTT TCCATAGGCT CCGCCCCCCT GACGAGCATC ACAAAAATCG ACGCTCAAGT CAGAGGTGGC GAAACCCGAC AGGACTATAA AGATACCAGG CGTTTCCCCC TGGAAGCTCC CTCGTGCGCT CTCCTGTTCC GACCCTGCCG CTTACCGGAT ACCTGTCCGC CTTTCTCCCT TCGGGAAGCG TGGCGCTTTC TCATAGCTCA CGCTGTAGGT ATCTCAGTTC GGTGTAGGTC GTTCGCTCCA AGCTGGGCTG TGTGCACGAA CCCCCCGTTC AGCCCGACCG CTGCGCCTTA TCCGGTAACT ATCGTCTTGA GTCCAACCCG GTAAGACACG ACTTATCGCC ACTGGCAGCA GCCACTGGTA ACAGGATTAG CAGAGCGAGG TATGTAGGCG GTGCTACAGA GTTCTTGAAG TGGTGGCCTA ACTACGGCTA CACTAGAAGA ACAGTATTTG GTATCTGCGC TCTGCTGAAG CCAGTTACCT TCGGAAAAAG AGTTGGTAGC TCTTGATCCG GCAAACAAAC CACCGCTGGT AGCGGTGGTT TTTTTGTTTG CAAGCAGCAG ATTACGCGCA GAAAAAAAGG ATCTCAAGAA GATCCTTTGA TCTTTTCTAC GGGGTCTGAC GCTCAGTGGA ACGAAAACTC ACGTTAAGGG ATTTTGGTCA TGAGATTATC AAAAAGGATC TTCACCTAGA TCCTTTTAAA TTAAAAATGA AGTTTTAAAT CAATCTAAAG TATATATGAG TAAACTTGGT CTGACAGTTA CCAATGCTTA ATCAGTGAGG CACCTATCTC AGCGATCTGT CTATTTCGTT CATCCATAGT TGCCTGACTC CCCGTCGTGT AGATAACTAC GATACGGGAG GGCTTACCAT CTGGCCCCAG TGCTGCAATG ATACCGCGAG ACCCACGCTC ACCGGCTCCA GATTTATCAG CAATAAACCA GCCAGCCGGA AGGGCCGAGC GCAGAAGTGG TCCTGCAACT TTATCCGCCT CCATCCAGTC TATTAATTGT TGCCGGGAAG CTAGAGTAAG TAGTTCGCCA GTTAATAGTT TGCGCAACGT TGTTGCCATT GCTACAGGCA TCGTGGTGTC ACGCTCGTCG TTTGGTATGG CTTCATTCAG CTCCGGTTCC CAACGATCAA GGCGAGTTAC ATGATCCCCC ATGTTGTGCA AAAAAGCGGT TAGCTCCTTC GGTCCTCCGA TCGTTGTCAG AAGTAAGTTG GCCGCAGTGT TATCACTCAT GGTTATGGCA GCACTGCATA ATTCTCTTAC TGTCATGCCA TCCGTAAGAT GCTTTTCTGT GACTGGTGAG TACTCAACCA AGTCATTCTG AGAATAGCGT ATGCGGCGAC CGAGTTGCTC TTGCCCGGCG TCAATACGGG ATAATACCGC GCCACATAGC AGAACTTTAA AAGTGCTCAT CATTGGAAAA CGTTCTTCGG GGCGAAAACT CTCAAGGATC TTACCGCTGT TGAGATCCAG TTCGATGTAA CCCACTCGTG CACCCAACTG ATCTTCAGCA TCTTTTACTT TCACCAGCGT TTCTGGGTGA GCAAAAACAG GAAGGCAAAA TGCCGCAAAA AAGGGAATAA GGGCGACACG GAAATGTTGA ATACTCATAC TCTTCCTTTT TCAATATTAT TGAAGCATTT ATCAGGGTTA TTGTCTCATG AGCGGATACA TATTTGAATG TATTTAGAAA AATAAACAAA TAGGGGTTCC GCGCACATTT CCCCGAAAAG TGCCACCTGA CGTCTAAGAA ACCATTATTA TCATGACATT AACCTATAAA AATAGGCGTA TCACGAGGCC CTTTCGTCTC GCGCGTTTCG GTGATGACGG TGAAAACCTC TGACACATGC AGCTCCCGGA GACGGTCACA GCTTGTCTGT AAGCGGATGC CGGGAGCAGA CAAGCCCGTC AGGGCGCGTC AGCGGGTGTT GGCGGGTGTC GGGGCTGGCT TAACTATGCG GCATCAGAGC AGATTGTACT GAGAGTGCAC CATTCGACGC TCTCCCTTAT GCGACTCCTG CATTAGGAAG CAGCCCAGTA GTAGGTTGAG GCCGTTGAGC ACCGCCGCCG CAAGGAATGG TGCATGCAAG GAGATGGCGC CCAACAGTCC CCCGGCCACG GGGCCTGCCA CCATACCCAC GCCGAAACAA GCGCTCATGA GCCCGAAGTG GCGAGCCCGA TCTTCCCCAT CGGTGATGTC GGCGATATAG GCGCCAGCAA CCGCACCTGT GGCGCCGGTG ATGCCGGCCA CGATGCGTCC GGCGTAGAGG ATCTGGCTAG CGATGACCCT GCTGATTGGT TCGCTGACCA TTTCCGGGTG CGGGACGGCG TTACCAGAAA CTCAGAAGGT TCGTCCAACC AAACCGACTC TGGCGGCAGT TTACGAGAGA GATGATAGGG TCTGCTTCAG TAAGCCAGAT GCTACACAAT TAGGCTTGTA CATACTGTCG TTAGAACGCG GCTACAATTA ATACATAACC TTATGTATCA TACACATACG ATTTAGGTGA CACTATA