EnhiA.01624.b

Cysteine proteinase 1 (EC 3.4.22.-)

CENTER ID: EnhiA.01624.b
ORGANISM: Entamoeba histolytica HM-1:IMSS
ASSOCIATED DISEASE: Amoebic dysentery
CURRENT STATUS: purified
COMMUNITY REQUEST: True
NIH RISK GROUP: 2
SELECT AGENT: False
NIH PRIORITY
pathogens category:
IIIB

Ordering Clones & Proteins

If there are materials available for this target, they will be listed below. Materials can be ordered from SSGCID using the button in the "order material" column. Clicking the button will add the material to a virtual cart. You may order multiple materials at a time at no cost to you, as this contract is funded by NIAID. When you are ready to place your order, click the "Place Order" link which will appear in the top right corner of the page after you place your first item in your cart.

Clones*

CENTER
REFERENCE ID
DOMAIN/REGION
DESCRIPTION
INFO AA
START
AA
STOP
ORDER
MATERIAL
EnhiA.01624.b.A12.GE33878 full length 1 311
EnhiA.01624.b.A13.GE33879 full length 9 315
EnhiA.01624.b.A14.GE33880 full length 9 311
EnhiA.01624.b.A15.GE33881 full length 29 315
EnhiA.01624.b.A16.GE33882 full length 29 311
* SSGCID clones represent un-induced expression constructs which have been verified by sequencing from vector primers. Clones may contain a tag, such as N-term 6xHis. Get sequence information using the button in the "info" column.
External Resources
RESOURCE REFERENCE ID
EuPathDB: AmoebaDB:EHI_074180
OrthoMCL: OG5_132389
UniProt: Q01957
Sequences
These sequences are the native gene sequence; sequences of constructs derived from these sequences may differ due to codon optimization or other protocols. To find the specific sequence of any material you may have ordered, click on the "info" button next to the name of that material.
AA Sequence
MFTFILMFYI GYGIDFNTWV ANNNKHFTAV ESLRRRAIFN MNARIVAENN RKETFKLSVD GPFAAMTNEE YNSLLKLKRS GEEKGEVRYL NIQAPKAVDW RKKGKVTPIR DQGNCGSCYT FGSIAALEGR LLIEKGGDSE TLDLSEEHMV QCTREDGNNG CNGGLGSNVY NYIMENGIAK ESDYPYTGSD STCRSDVKAF AKIKSYNRVA RNNEVELKAA ISQGLVDVSI DASSVQFQLY KSGAYTDTQC KNNYFALNHE VCAVGYGVVD GKECWIVRNS WGTGWGEKGY INMVIEGNTC GVATDPLYPT GVEYL
NT Sequence
atgttcactt tcattttgat gttttatatt ggatatggga ttgatttcaa tacatgggtt gccaataaca ataaacactt cacagcagtt gagtcactcc gaagaagagc aatcttcaat atgaatgcaa gaattgttgc agaaaacaat agaaaagaaa cattcaaatt atcagtagat ggaccatttg ctgctatgac aaatgaagaa tataatagtc ttctgaaatt aaaacgaagt ggtgaagaaa aaggagaagt tagatatttg aatatccaag cacccaaagc agtagattgg agaaaaaaag ggaaagtaac accaattcga gatcaaggga attgtgggtc atgttataca tttggatcga ttgcagcact tgaaggaaga ttattaattg agaaaggtgg tgatagtgag acacttgatc tttcagaaga acatatggtt caatgtacta gggaagatgg aaataatgga tgtaatggag gacttggatc aaatgtttat aattatatta tggaaaatgg aattgctaaa gaaagtgatt atccatacac aggaagtgat tcaacatgta gaagtgatgt gaaagcattt gctaaaatca agagttataa tcgagttgca agaaataatg aagttgaact taaagcagca atttcacaag gtcttgttga tgtttcaatt gatgcatcat ctgttcaatt ccagttatac aagagtggag catatacaga cacacaatgc aagaataact attttgcatt gaatcatgaa gtttgtgctg ttggatatgg tgttgttgat gggaaagaat gttggatagt tagaaactca tggggaacag gatggggaga gaaaggatat atcaacatgg ttattgaagg aaatacatgt ggtgttgcta ctgatccact ttatccaact ggtgttgaat atctctga
Details for EnhiA.01624.b.A12.GE33878
HARVESTED ON: 10/15/2011
SEQUENCED ON: 10/21/2011
EXPECTED MW: 36kDa
OBSERVED MW: 36kDa
ANTIBIOTIC MARKER: ampicillin
COUNT OF EXPRESSION COLONIES: Too many to count (100+)
TOTAL EXPRESSION LEVEL: Low Expression
SOLUBLE EXPRESSION LEVEL No Expression
EXPRESSION HOST: data unavailable
SEQUENCING RESULT: pass
PERCENT IDENTITY: 98
PERCENT COVERAGE: 110
Validated AA Sequence
MAHHHHHHMG TLEAQTQGPG SMFTFILMFY IGYGIDFNTW VANNNKHFTA VESLRRRAIF NMNARIVAEN NRKETFKLSV DGPFAAMTNE EYNSLLKLKR SGEEKGEVRY LNIQAPKAVD WRKKGKVTPI RDQGNCGSCY TFGSIAALEG RLLIEKGGDS ETLDLSEEHM VQCTREDGNN GCNGGLGSNV YNYIMENGIA KESDYPYTGS DSTCRSDVKA FAKIKSYNRV ARNNEVELKA AISQGLVDVS IDASSVQFQL YKSGAYTDTQ CKNNYFALNH EVCAVXYGVV DGKECWIVRN SWGTXWGEKG YINXXIEGXT CGVATDPLYP TXLINSTNXX LXXAXXXXXC XXXXXXXXXX XXXXXXX
Validated NT Sequence
attttgttnn ntttaagaag gagatatacc atggctcatc accatcacca tcatatgggt accctggaag ctcagaccca gggtcctggt tcgatgttca ccttcatcct gatgttctac ataggctatg gcattgactt caacacgtgg gttgctaaca ataacaaaca cttcactgct gttgaatccc tgagacgtcg cgctatcttc aacatgaacg ctcgtatagt tgcagagaac aaccgtaaag aaaccttcaa actgagcgta gacggcccgt tcgccgctat gaccaatgaa gaatacaact ctctgctgaa actgaaacgt agcggtgaag agaagggtga agtgcgctac ctgaacatcc aggctccgaa agctgttgac tggcgtaaga agggtaaagt tactcctatt agagaccagg gtaactgcgg ctcttgctac actttcggct ctatcgcagc gctggaaggt cgtctgctga tcgagaaggg cggcgatagc gagacccttg acctgagcga agaacatatg gttcagtgca ctcgtgagga cggtaacaac ggttgtaacg gtggtctcgg ctctaacgtg tataactaca taatggagaa tggtatcgct aaggagtctg actacccata caccggttca gactctactt gccgttctga cgttaaggct ttcgctaaga ttaaatctta taaccgtgta gctcgtaaca acgaagttga acttaaagct gctatctccc agggtttagt tgacgtttcc atcgacgcta gctcagttca gttccagctg tataaatctg gcgcttatac ggatactcag tgtaagaaca actacttcgc cctgaaccac gaagtttgcg ctgtnngcta cggcgttgta gacggtaaag aatgctggat cgttcgtaat tcttggggca cnngctgggg tgagaaaggc tacatcaacn tggnnatcga aggnancacc tgtggcgttg cgactgaccc gctgtaccct accnggctaa taaacagcac gaacnanntt ctgcannnng ctnnnnnnnn nnccnnctgc tnnnnannnn nnnncnnnnn ngnnnnnann nnnnnnnnnn nnnnnnnnnn anna
Expected Protein Sequence
MAHHHHHHMG TLEAQTQGPG SMFTFILMFY IGYGIDFNTW VANNNKHFTA VESLRRRAIF NMNARIVAEN NRKETFKLSV DGPFAAMTNE EYNSLLKLKR SGEEKGEVRY LNIQAPKAVD WRKKGKVTPI RDQGNCGSCY TFGSIAALEG RLLIEKGGDS ETLDLSEEHM VQCTREDGNN GCNGGLGSNV YNYIMENGIA KESDYPYTGS DSTCRSDVKA FAKIKSYNRV ARNNEVELKA AISQGLVDVS IDASSVQFQL YKSGAYTDTQ CKNNYFALNH EVCAVGYGVV DGKECWIVRN SWGTGWGEKG YINMVIEGNT CGVATDPLYP TG
Full NT Sequence (Expression Vector + Insert)
ttcttgaaga cgaaagggcc tcgtgatacg cctattttta taggttaatg tcatgataat aatggtttct tagacgtcag gtggcacttt tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg tgttgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc tgcagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc actgattaag cattggtaac tgtcagacca agtttactca tatatacttt agattgattt aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc gcatatatgg tgcactctca gtacaatctg ctctgatgcc gcatagttaa gccagtatac actccgctat cgctacgtga ctgggtcatg gctgcgcccc gacacccgcc aacacccgct gacgcgccct gacgggcttg tctgctcccg gcatccgctt acagacaagc tgtgaccgtc tccgggagct gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc gaggcagctg cggtaaagct catcagcgtg gtcgtgaagc gattcacaga tgtctgcctg ttcatccgcg tccagctcgt tgagtttctc cagaagcgtt aatgtctggc ttctgataaa gcgggccatg ttaagggcgg ttttttcctg tttggtcact gatgcctccg tgtaaggggg atttctgttc atgggggtaa tgataccgat gaaacgagag aggatgctca cgatacgggt tactgatgat gaacatgccc ggttactgga acgttgtgag ggtaaacaac tggcggtatg gatgcggcgg gaccagagaa aaatcactca gggtcaatgc cagcgcttcg ttaatacaga tgtaggtgtt ccacagggta gccagcagca tcctgcgatg cagatccgga acataatggt gcagggcgct gacttccgcg tttccagact ttacgaaaca cggaaaccga agaccattca tgttgttgct caggtcgcag acgttttgca gcagcagtcg cttcacgttc gctcgcgtat cggtgattca ttctgctaac cagtaaggca accccgccag cctagccggg tcctcaacga caggagcacg atcatgcgca cccgtggcca ggacccaacg ctgcccgaga tgcgccgcgt gcggctgctg gagatggcgg acgcgatgga tatgttctgc caagggttgg tttgcgcatt cacagttctc cgcaagaatt gattggctcc aattcttgga gtggtgaatc cgttagcgag gtgccgccgg cttccattca ggtcgaggtg gcccggctcc atgcaccgcg acgcaacgcg gggaggcaga caaggtatag ggcggcgcct acaatccatg ccaacccgtt ccatgtgctc gccgaggcgg cataaatcgc cgtgacgatc agcggtccag tgatcgaagt taggctggta agagccgcga gcgatccttg aagctgtccc tgatggtcgt catctacctg cctggacagc atggcctgca acgcgggcat cccgatgccg ccggaagcga gaagaatcat aatggggaag gccatccagc ctcgcgtcgt gaacgccagc aagacgtagc ccagcgcgtc ggccgtaaca acaccattta aatggagtgg ttacaaatgg agtggttaat taacaacacc atttgtcgac gctctccctt atgcgactcc tgcattagga agcagcccag tagtaggttg aggccgttga gcaccgccgc cgcaaggaat ggtgcatgca aggagatggc gcccaacagt cccccggcca cggggcctgc caccataccc acgccgaaac aagcgctcat gagcccgaag tggcgagccc gatcttcccc atcggtgatg tcggcgatat aggcgccagc aaccgcacct gtggcgccgg tgatgccggc cacgatgcgt ccggcgtaga ggatcgagat ctcgatcccg cgaaattaat acgactcact atagggagac cacaacggtt tccctctaga aataattttg tttaacttta agaaggagat ataccatggc tcatcaccat caccatcata tgggtaccct ggaagctcag acccagggtc ctggttcgat gttcaccttc atcctgatgt tctacatagg ctatggcatt gacttcaaca cgtgggttgc taacaataac aaacacttca ctgctgttga atccctgaga cgtcgcgcta tcttcaacat gaacgctcgt atagttgcag agaacaaccg taaagaaacc ttcaaactga gcgtagacgg cccgttcgcc gctatgacca atgaagaata caactctctg ctgaaactga aacgtagcgg tgaagagaag ggtgaagtgc gctacctgaa catccaggct ccgaaagctg ttgactggcg taagaagggt aaagttactc ctattagaga ccagggtaac tgcggctctt gctacacttt cggctctatc gcagcgctgg aaggtcgtct gctgatcgag aagggcggcg atagcgagac ccttgacctg agcgaagaac atatggttca gtgcactcgt gaggacggta acaacggttg taacggtggt ctcggctcta acgtgtataa ctacataatg gagaatggta tcgctaagga gtctgactac ccatacaccg gttcagactc tacttgccgt tctgacgtta aggctttcgc taagattaaa tcttataacc gtgtagctcg taacaacgaa gttgaactta aagctgctat ctcccagggt ttagttgacg tttccatcga cgctagctca gttcagttcc agctgtataa atctggcgct tatacggata ctcagtgtaa gaacaactac ttcgccctga accacgaagt ttgcgctgta ggctacggcg ttgtagacgg taaagaatgc tggatcgttc gtaattcttg gggcacaggc tggggtgaga aaggctacat caacatggta atcgaaggta acacctgtgg cgttgcgact gacccgctgt accctaccgg ctaaacagca cgaacaagtt ctgcagccaa gcttctcgag gatccggctg ctaacaaagc ccgaaaggaa gctgagttgg ctgctgccac cgctgagcaa taactagcat aaccccttgg ggcctctaaa cgggtcttga ggggtttttt gctgaaagga ggaactatat ccggatatcc acaggacggg tgtggtcgcc atgatcgcgt agtcgatagt ggctccaagt agcgaagcga gcaggactgg gcggcggcca aagcggtcgg acagtgctcc gagaacgggt gcgcatagaa attgcatcaa cgcatatagc gctagcagca cgccatagtg actggcgatg ctgtcggaat ggacgatatc ccgcaagagg cccggcagta ccggcataac caagcctatg cctacagcat ccagggtgac ggtgccgagg atgacgatga gcgcattgtt agatttcata cacggtgcct gactgcgtta gcaatttaac tgtgataaac taccgcatta aagcttatcg atgataagct gtcaaacatg agaa
Details for EnhiA.01624.b.A13.GE33879
HARVESTED ON: 10/15/2011
SEQUENCED ON: 10/21/2011
EXPECTED MW: 36kDa
OBSERVED MW: 36kDa
ANTIBIOTIC MARKER: ampicillin
COUNT OF EXPRESSION COLONIES: Too many to count (100+)
TOTAL EXPRESSION LEVEL: Moderate Expression
SOLUBLE EXPRESSION LEVEL No Expression
EXPRESSION HOST: data unavailable
SEQUENCING RESULT: pass
PERCENT IDENTITY: 98
PERCENT COVERAGE: 100
Validated AA Sequence
MAHHHHHHMG TLEAQTQGPG SMYIGYGIDF NTWVANNNKH FTAVESLRRR AIFNMNARIV AENNRKETFK LSVDGPFAAM TNEEYNSLLK LKRSGEEKGE VRYLNIQAPK AVDWRKKGKV TPIRDQGNCG SCYTFGSIAA LEGRLLIEKG GDSETLDLSE EHMVQCTRED GNNGCNGGLG SNVYNYIMEN GIAKESDYPY TGSDSTCRSD VKAFAKIKSY NRVARNNEVE LKAAISQGLV DVSIDASSVQ FQLYKSGAYT DTQCKNNYFA LNHEVCAVGY GVVDGKECWI VRNSWGTGWG EKGYINMXIX XNTCGVATDP LYPTXVEYLX
Validated NT Sequence
attttgtnnn ntttaagaag gagatatacc atggctcatc accatcacca tcatatgggt accctggaag ctcagaccca gggtcctggt tcgatgtaca taggctatgg cattgacttc aacacgtggg ttgctaacaa taacaaacac ttcactgctg ttgaatccct gagacgtcgc gctatcttca acatgaacgc tcgtatagtt gcagagaaca accgtaaaga aaccttcaaa ctgagcgtag acggcccgtt cgccgctatg accaatgaag aatacaactc tctgctgaaa ctgaaacgta gcggtgaaga gaagggtgaa gtgcgctacc tgaacatcca ggctccgaaa gctgttgact ggcgtaagaa gggtaaagtt actcctatta gagaccaggg taactgcggc tcttgctaca ctttcggctc tatcgcagcg ctggaaggtc gtctgctgat cgagaagggc ggcgatagcg agacccttga cctgagcgaa gaacatatgg ttcagtgcac tcgtgaggac ggtaacaacg gttgtaacgg tggtctcggc tctaacgtgt ataactacat aatggagaat ggtatcgcta aggagtctga ctacccatac accggttcag actctacttg ccgttctgac gttaaggctt tcgctaagat taaatcttat aaccgtgtag ctcgtaacaa cgaagttgaa cttaaagctg ctatctccca gggtttagtt gacgtttcca tcgacgctag ctcagttcag ttccagctgt ataaatctgg cgcttatacg gatactcagt gtaagaacaa ctacttcgcc ctgaaccacg aagtttgcgc tgtaggctac ggcgttgtag acggtaaaga atgctggatc gttcgtaatt cttggggcac aggctggggt gagaaaggnt acatcaacat gnnaatcgan ngnaacacct gtggcgttgc gactgacccg ctgtacccta ccngngttga atacctgnaa taaacagcac gaacaagtnc tgcancnnag ctnnnnnnng nnccnnctgc tnncnaancc cnaannnnct nnntgnngct nnnnnnnnnn nnng
Expected Protein Sequence
MAHHHHHHMG TLEAQTQGPG SMYIGYGIDF NTWVANNNKH FTAVESLRRR AIFNMNARIV AENNRKETFK LSVDGPFAAM TNEEYNSLLK LKRSGEEKGE VRYLNIQAPK AVDWRKKGKV TPIRDQGNCG SCYTFGSIAA LEGRLLIEKG GDSETLDLSE EHMVQCTRED GNNGCNGGLG SNVYNYIMEN GIAKESDYPY TGSDSTCRSD VKAFAKIKSY NRVARNNEVE LKAAISQGLV DVSIDASSVQ FQLYKSGAYT DTQCKNNYFA LNHEVCAVGY GVVDGKECWI VRNSWGTGWG EKGYINMVIE GNTCGVATDP LYPTGVEYL
Full NT Sequence (Expression Vector + Insert)
ttcttgaaga cgaaagggcc tcgtgatacg cctattttta taggttaatg tcatgataat aatggtttct tagacgtcag gtggcacttt tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg tgttgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc tgcagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc actgattaag cattggtaac tgtcagacca agtttactca tatatacttt agattgattt aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc gcatatatgg tgcactctca gtacaatctg ctctgatgcc gcatagttaa gccagtatac actccgctat cgctacgtga ctgggtcatg gctgcgcccc gacacccgcc aacacccgct gacgcgccct gacgggcttg tctgctcccg gcatccgctt acagacaagc tgtgaccgtc tccgggagct gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc gaggcagctg cggtaaagct catcagcgtg gtcgtgaagc gattcacaga tgtctgcctg ttcatccgcg tccagctcgt tgagtttctc cagaagcgtt aatgtctggc ttctgataaa gcgggccatg ttaagggcgg ttttttcctg tttggtcact gatgcctccg tgtaaggggg atttctgttc atgggggtaa tgataccgat gaaacgagag aggatgctca cgatacgggt tactgatgat gaacatgccc ggttactgga acgttgtgag ggtaaacaac tggcggtatg gatgcggcgg gaccagagaa aaatcactca gggtcaatgc cagcgcttcg ttaatacaga tgtaggtgtt ccacagggta gccagcagca tcctgcgatg cagatccgga acataatggt gcagggcgct gacttccgcg tttccagact ttacgaaaca cggaaaccga agaccattca tgttgttgct caggtcgcag acgttttgca gcagcagtcg cttcacgttc gctcgcgtat cggtgattca ttctgctaac cagtaaggca accccgccag cctagccggg tcctcaacga caggagcacg atcatgcgca cccgtggcca ggacccaacg ctgcccgaga tgcgccgcgt gcggctgctg gagatggcgg acgcgatgga tatgttctgc caagggttgg tttgcgcatt cacagttctc cgcaagaatt gattggctcc aattcttgga gtggtgaatc cgttagcgag gtgccgccgg cttccattca ggtcgaggtg gcccggctcc atgcaccgcg acgcaacgcg gggaggcaga caaggtatag ggcggcgcct acaatccatg ccaacccgtt ccatgtgctc gccgaggcgg cataaatcgc cgtgacgatc agcggtccag tgatcgaagt taggctggta agagccgcga gcgatccttg aagctgtccc tgatggtcgt catctacctg cctggacagc atggcctgca acgcgggcat cccgatgccg ccggaagcga gaagaatcat aatggggaag gccatccagc ctcgcgtcgt gaacgccagc aagacgtagc ccagcgcgtc ggccgtaaca acaccattta aatggagtgg ttacaaatgg agtggttaat taacaacacc atttgtcgac gctctccctt atgcgactcc tgcattagga agcagcccag tagtaggttg aggccgttga gcaccgccgc cgcaaggaat ggtgcatgca aggagatggc gcccaacagt cccccggcca cggggcctgc caccataccc acgccgaaac aagcgctcat gagcccgaag tggcgagccc gatcttcccc atcggtgatg tcggcgatat aggcgccagc aaccgcacct gtggcgccgg tgatgccggc cacgatgcgt ccggcgtaga ggatcgagat ctcgatcccg cgaaattaat acgactcact atagggagac cacaacggtt tccctctaga aataattttg tttaacttta agaaggagat ataccatggc tcatcaccat caccatcata tgggtaccct ggaagctcag acccagggtc ctggttcgat gtacataggc tatggcattg acttcaacac gtgggttgct aacaataaca aacacttcac tgctgttgaa tccctgagac gtcgcgctat cttcaacatg aacgctcgta tagttgcaga gaacaaccgt aaagaaacct tcaaactgag cgtagacggc ccgttcgccg ctatgaccaa tgaagaatac aactctctgc tgaaactgaa acgtagcggt gaagagaagg gtgaagtgcg ctacctgaac atccaggctc cgaaagctgt tgactggcgt aagaagggta aagttactcc tattagagac cagggtaact gcggctcttg ctacactttc ggctctatcg cagcgctgga aggtcgtctg ctgatcgaga agggcggcga tagcgagacc cttgacctga gcgaagaaca tatggttcag tgcactcgtg aggacggtaa caacggttgt aacggtggtc tcggctctaa cgtgtataac tacataatgg agaatggtat cgctaaggag tctgactacc catacaccgg ttcagactct acttgccgtt ctgacgttaa ggctttcgct aagattaaat cttataaccg tgtagctcgt aacaacgaag ttgaacttaa agctgctatc tcccagggtt tagttgacgt ttccatcgac gctagctcag ttcagttcca gctgtataaa tctggcgctt atacggatac tcagtgtaag aacaactact tcgccctgaa ccacgaagtt tgcgctgtag gctacggcgt tgtagacggt aaagaatgct ggatcgttcg taattcttgg ggcacaggct ggggtgagaa aggctacatc aacatggtaa tcgaaggtaa cacctgtggc gttgcgactg acccgctgta ccctaccggc gttgaatacc tgtaaacagc acgaacaagt tctgcagcca agcttctcga ggatccggct gctaacaaag cccgaaagga agctgagttg gctgctgcca ccgctgagca ataactagca taaccccttg gggcctctaa acgggtcttg aggggttttt tgctgaaagg aggaactata tccggatatc cacaggacgg gtgtggtcgc catgatcgcg tagtcgatag tggctccaag tagcgaagcg agcaggactg ggcggcggcc aaagcggtcg gacagtgctc cgagaacggg tgcgcataga aattgcatca acgcatatag cgctagcagc acgccatagt gactggcgat gctgtcggaa tggacgatat cccgcaagag gcccggcagt accggcataa ccaagcctat gcctacagca tccagggtga cggtgccgag gatgacgatg agcgcattgt tagatttcat acacggtgcc tgactgcgtt agcaatttaa ctgtgataaa ctaccgcatt aaagcttatc gatgataagc tgtcaaacat gagaa
Details for EnhiA.01624.b.A14.GE33880
HARVESTED ON: 10/15/2011
SEQUENCED ON: 10/21/2011
EXPECTED MW: 35kDa
OBSERVED MW: 36kDa
ANTIBIOTIC MARKER: ampicillin
COUNT OF EXPRESSION COLONIES: Too many to count (100+)
TOTAL EXPRESSION LEVEL: Moderate Expression
SOLUBLE EXPRESSION LEVEL No Expression
EXPRESSION HOST: data unavailable
SEQUENCING RESULT: pass
PERCENT IDENTITY: 98
PERCENT COVERAGE: 86
Validated AA Sequence
MAHHHHHHMG TLEAQTQGPG SMYIGYGIDF NTWVANNNKH FTAVESLRRR AIFNMNARIV AENNRKETFK LSVDGPFAAM TNEEYNSLLK LKRSGEEKGE VRYLNIQAPK AVDWRKKGKV TPIRDQGNCG SCYTFGSIAA LEGRLLIEKG GDSETLDLSE EHMVQCTRED GNNGCNGGLG SNVYNYIMEN GIAKESDYPY TGSDSTCRSD VKAFAKIKSY NRVARNNEVE LKAAISQGLV DVSIDASSVQ FQLYKSGAYT DTQCKNNYFA LNHEVCAVXT AL
Validated NT Sequence
tttgtttnnc tttaagaagg agatatacca tggctcatca ccatcaccat catatgggta ccctggaagc tcagacccag ggtcctggtt cgatgtacat aggctatggc attgacttca acacgtgggt tgctaacaat aacaaacact tcactgctgt tgaatccctg agacgtcgcg ctatcttcaa catgaacgct cgtatagttg cagagaacaa ccgtaaagaa accttcaaac tgagcgtaga cggcccgttc gccgctatga ccaatgaaga atacaactct ctgctgaaac tgaaacgtag cggtgaagag aagggtgaag tgcgctacct gaacatccag gctccgaaag ctgttgactg gcgtaagaag ggtaaagtta ctcctattag agaccagggt aactgcggct cttgctacac tttcggctct atcgcagcgc tggaaggtcg tctgctgatc gagaagggcg gcgatagcga gacccttgac ctgagcgaag aacatatggt tcagtgcact cgtgaggacg gtaacaacgg ttgtaacggt ggtctcggct ctaacgtgta taactacata atggagaatg gtatcgctaa ggagtctgac tacccataca ccggttcaga ctctacttgc cgttctgacg ttaaggcttt cgctaagatt aaatcttata accgtgtagc tcgtaacaac gaagttgaac ttaaagctgc tatctcccag ggtttagttg acgtttccat cgacgctagc tcagttcagt tccagctgta taaatctggc gcttatacgg atactcagtg taagaacaac tacttcgccc tgaaccacga agtttgcgct gtnnctacgg cgttgtagan ggtaaagaat gctggatcgt tcgtaattct tggggcacan gctggggtga gaanggntac atcaacatgg taatcgnang gtaacacctg tggcgttgcg actgacccgc tgnaccctac cngctaataa acagcacgaa caagtnctgc anccnagctn nnncnnngnn ccnnctgcta ncaaancccn nannnnnnct gantnnntgn tgcnnnnnnn nna
Expected Protein Sequence
MAHHHHHHMG TLEAQTQGPG SMYIGYGIDF NTWVANNNKH FTAVESLRRR AIFNMNARIV AENNRKETFK LSVDGPFAAM TNEEYNSLLK LKRSGEEKGE VRYLNIQAPK AVDWRKKGKV TPIRDQGNCG SCYTFGSIAA LEGRLLIEKG GDSETLDLSE EHMVQCTRED GNNGCNGGLG SNVYNYIMEN GIAKESDYPY TGSDSTCRSD VKAFAKIKSY NRVARNNEVE LKAAISQGLV DVSIDASSVQ FQLYKSGAYT DTQCKNNYFA LNHEVCAVGY GVVDGKECWI VRNSWGTGWG EKGYINMVIE GNTCGVATDP LYPTG
Full NT Sequence (Expression Vector + Insert)
ttcttgaaga cgaaagggcc tcgtgatacg cctattttta taggttaatg tcatgataat aatggtttct tagacgtcag gtggcacttt tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg tgttgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc tgcagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc actgattaag cattggtaac tgtcagacca agtttactca tatatacttt agattgattt aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc gcatatatgg tgcactctca gtacaatctg ctctgatgcc gcatagttaa gccagtatac actccgctat cgctacgtga ctgggtcatg gctgcgcccc gacacccgcc aacacccgct gacgcgccct gacgggcttg tctgctcccg gcatccgctt acagacaagc tgtgaccgtc tccgggagct gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc gaggcagctg cggtaaagct catcagcgtg gtcgtgaagc gattcacaga tgtctgcctg ttcatccgcg tccagctcgt tgagtttctc cagaagcgtt aatgtctggc ttctgataaa gcgggccatg ttaagggcgg ttttttcctg tttggtcact gatgcctccg tgtaaggggg atttctgttc atgggggtaa tgataccgat gaaacgagag aggatgctca cgatacgggt tactgatgat gaacatgccc ggttactgga acgttgtgag ggtaaacaac tggcggtatg gatgcggcgg gaccagagaa aaatcactca gggtcaatgc cagcgcttcg ttaatacaga tgtaggtgtt ccacagggta gccagcagca tcctgcgatg cagatccgga acataatggt gcagggcgct gacttccgcg tttccagact ttacgaaaca cggaaaccga agaccattca tgttgttgct caggtcgcag acgttttgca gcagcagtcg cttcacgttc gctcgcgtat cggtgattca ttctgctaac cagtaaggca accccgccag cctagccggg tcctcaacga caggagcacg atcatgcgca cccgtggcca ggacccaacg ctgcccgaga tgcgccgcgt gcggctgctg gagatggcgg acgcgatgga tatgttctgc caagggttgg tttgcgcatt cacagttctc cgcaagaatt gattggctcc aattcttgga gtggtgaatc cgttagcgag gtgccgccgg cttccattca ggtcgaggtg gcccggctcc atgcaccgcg acgcaacgcg gggaggcaga caaggtatag ggcggcgcct acaatccatg ccaacccgtt ccatgtgctc gccgaggcgg cataaatcgc cgtgacgatc agcggtccag tgatcgaagt taggctggta agagccgcga gcgatccttg aagctgtccc tgatggtcgt catctacctg cctggacagc atggcctgca acgcgggcat cccgatgccg ccggaagcga gaagaatcat aatggggaag gccatccagc ctcgcgtcgt gaacgccagc aagacgtagc ccagcgcgtc ggccgtaaca acaccattta aatggagtgg ttacaaatgg agtggttaat taacaacacc atttgtcgac gctctccctt atgcgactcc tgcattagga agcagcccag tagtaggttg aggccgttga gcaccgccgc cgcaaggaat ggtgcatgca aggagatggc gcccaacagt cccccggcca cggggcctgc caccataccc acgccgaaac aagcgctcat gagcccgaag tggcgagccc gatcttcccc atcggtgatg tcggcgatat aggcgccagc aaccgcacct gtggcgccgg tgatgccggc cacgatgcgt ccggcgtaga ggatcgagat ctcgatcccg cgaaattaat acgactcact atagggagac cacaacggtt tccctctaga aataattttg tttaacttta agaaggagat ataccatggc tcatcaccat caccatcata tgggtaccct ggaagctcag acccagggtc ctggttcgat gtacataggc tatggcattg acttcaacac gtgggttgct aacaataaca aacacttcac tgctgttgaa tccctgagac gtcgcgctat cttcaacatg aacgctcgta tagttgcaga gaacaaccgt aaagaaacct tcaaactgag cgtagacggc ccgttcgccg ctatgaccaa tgaagaatac aactctctgc tgaaactgaa acgtagcggt gaagagaagg gtgaagtgcg ctacctgaac atccaggctc cgaaagctgt tgactggcgt aagaagggta aagttactcc tattagagac cagggtaact gcggctcttg ctacactttc ggctctatcg cagcgctgga aggtcgtctg ctgatcgaga agggcggcga tagcgagacc cttgacctga gcgaagaaca tatggttcag tgcactcgtg aggacggtaa caacggttgt aacggtggtc tcggctctaa cgtgtataac tacataatgg agaatggtat cgctaaggag tctgactacc catacaccgg ttcagactct acttgccgtt ctgacgttaa ggctttcgct aagattaaat cttataaccg tgtagctcgt aacaacgaag ttgaacttaa agctgctatc tcccagggtt tagttgacgt ttccatcgac gctagctcag ttcagttcca gctgtataaa tctggcgctt atacggatac tcagtgtaag aacaactact tcgccctgaa ccacgaagtt tgcgctgtag gctacggcgt tgtagacggt aaagaatgct ggatcgttcg taattcttgg ggcacaggct ggggtgagaa aggctacatc aacatggtaa tcgaaggtaa cacctgtggc gttgcgactg acccgctgta ccctaccggc taaacagcac gaacaagttc tgcagccaag cttctcgagg atccggctgc taacaaagcc cgaaaggaag ctgagttggc tgctgccacc gctgagcaat aactagcata accccttggg gcctctaaac gggtcttgag gggttttttg ctgaaaggag gaactatatc cggatatcca caggacgggt gtggtcgcca tgatcgcgta gtcgatagtg gctccaagta gcgaagcgag caggactggg cggcggccaa agcggtcgga cagtgctccg agaacgggtg cgcatagaaa ttgcatcaac gcatatagcg ctagcagcac gccatagtga ctggcgatgc tgtcggaatg gacgatatcc cgcaagaggc ccggcagtac cggcataacc aagcctatgc ctacagcatc cagggtgacg gtgccgagga tgacgatgag cgcattgtta gatttcatac acggtgcctg actgcgttag caatttaact gtgataaact accgcattaa agcttatcga tgataagctg tcaaacatga gaa
Details for EnhiA.01624.b.A15.GE33881
HARVESTED ON: 10/15/2011
SEQUENCED ON: 10/21/2011
EXPECTED MW: 34kDa
OBSERVED MW: 36kDa
ANTIBIOTIC MARKER: ampicillin
COUNT OF EXPRESSION COLONIES: Too many to count (100+)
TOTAL EXPRESSION LEVEL: Moderate Expression
SOLUBLE EXPRESSION LEVEL No Expression
EXPRESSION HOST: data unavailable
SEQUENCING RESULT: pass
PERCENT IDENTITY: 96
PERCENT COVERAGE: 96
Validated AA Sequence
MAHHHHHHMG TLEAQTQGPG SMAVESLRRR AIFNMNARIV AENNRKETFK LSVDGPFAAM TNEEYNSLLK LKRSGEEKGE VRYLNIQAPK AVDWRKKGKV TPIRDQGNCG SCYTFGSIAA LEGRLLIEKG GDSETLDLSE EHMVQCTRED GNNGCNGGLG SNVYNYIMEN GIAKESDYPY TGSDSTCRSD VKAFAKIKSY NRVARNNEVE LKAAISQGLV DVSIDASSVQ FQLYKSGAYT DTQCKNNYFA LNHEVCAVXY GVVDGKECWI VRNSWGTGWG EKGYINMVIX XXHLWRCD
Validated NT Sequence
ttttntttac tttaagaagg agatatacca tggctcatca ccatcaccat catatgggta ccctggaagc tcagacccag ggtcctggtt cgatggctgt tgaatccctg agacgtcgcg ctatcttcaa catgaacgct cgtatagttg cagagaacaa ccgtaaagaa accttcaaac tgagcgtaga cggcccgttc gccgctatga ccaatgaaga atacaactct ctgctgaaac tgaaacgtag cggtgaagag aagggtgaag tgcgctacct gaacatccag gctccgaaag ctgttgactg gcgtaagaag ggtaaagtta ctcctattag agaccagggt aactgcggct cttgctacac tttcggctct atcgcagcgc tggaaggtcg tctgctgatc gagaagggcg gcgatagcga gacccttgac ctgagcgaag aacatatggt tcagtgcact cgtgaggacg gtaacaacgg ttgtaacggt ggtctcggct ctaacgtgta taactacata atggagaatg gtatcgctaa ggagtctgac tacccataca ccggttcaga ctctacttgc cgttctgacg ttaaggcttt cgctaagatt aaatcttata accgtgtagc tcgtaacaac gaagttgaac ttaaagctgc tatctcccag ggtttagttg acgtttccat cgacgctagc tcagttcagt tccagctgta taaatctggc gcttatacgg atactcagtg taagaacaac tacttcgccc tgaaccacga agtttgcgct gtangctacg gcgttgtaga cggtaaagaa tgctggatcg ttcgtaattc ttggggcaca ggctggggtg agaaaggcta catcaacatg gtaatcgnan ggnaacacct gtggcgttgc gactgacccg ctgtacccta ccggcgttga atacctgtaa taaacagcac gaacaagttc tgcagccaag ctnctcgagg atcnnnctgc tancaaagcc cgaanngnag ctganttngc tgctgcnnna naaaa
Expected Protein Sequence
MAHHHHHHMG TLEAQTQGPG SMAVESLRRR AIFNMNARIV AENNRKETFK LSVDGPFAAM TNEEYNSLLK LKRSGEEKGE VRYLNIQAPK AVDWRKKGKV TPIRDQGNCG SCYTFGSIAA LEGRLLIEKG GDSETLDLSE EHMVQCTRED GNNGCNGGLG SNVYNYIMEN GIAKESDYPY TGSDSTCRSD VKAFAKIKSY NRVARNNEVE LKAAISQGLV DVSIDASSVQ FQLYKSGAYT DTQCKNNYFA LNHEVCAVGY GVVDGKECWI VRNSWGTGWG EKGYINMVIE GNTCGVATDP LYPTGVEYL
Full NT Sequence (Expression Vector + Insert)
ttcttgaaga cgaaagggcc tcgtgatacg cctattttta taggttaatg tcatgataat aatggtttct tagacgtcag gtggcacttt tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg tgttgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc tgcagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc actgattaag cattggtaac tgtcagacca agtttactca tatatacttt agattgattt aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc gcatatatgg tgcactctca gtacaatctg ctctgatgcc gcatagttaa gccagtatac actccgctat cgctacgtga ctgggtcatg gctgcgcccc gacacccgcc aacacccgct gacgcgccct gacgggcttg tctgctcccg gcatccgctt acagacaagc tgtgaccgtc tccgggagct gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc gaggcagctg cggtaaagct catcagcgtg gtcgtgaagc gattcacaga tgtctgcctg ttcatccgcg tccagctcgt tgagtttctc cagaagcgtt aatgtctggc ttctgataaa gcgggccatg ttaagggcgg ttttttcctg tttggtcact gatgcctccg tgtaaggggg atttctgttc atgggggtaa tgataccgat gaaacgagag aggatgctca cgatacgggt tactgatgat gaacatgccc ggttactgga acgttgtgag ggtaaacaac tggcggtatg gatgcggcgg gaccagagaa aaatcactca gggtcaatgc cagcgcttcg ttaatacaga tgtaggtgtt ccacagggta gccagcagca tcctgcgatg cagatccgga acataatggt gcagggcgct gacttccgcg tttccagact ttacgaaaca cggaaaccga agaccattca tgttgttgct caggtcgcag acgttttgca gcagcagtcg cttcacgttc gctcgcgtat cggtgattca ttctgctaac cagtaaggca accccgccag cctagccggg tcctcaacga caggagcacg atcatgcgca cccgtggcca ggacccaacg ctgcccgaga tgcgccgcgt gcggctgctg gagatggcgg acgcgatgga tatgttctgc caagggttgg tttgcgcatt cacagttctc cgcaagaatt gattggctcc aattcttgga gtggtgaatc cgttagcgag gtgccgccgg cttccattca ggtcgaggtg gcccggctcc atgcaccgcg acgcaacgcg gggaggcaga caaggtatag ggcggcgcct acaatccatg ccaacccgtt ccatgtgctc gccgaggcgg cataaatcgc cgtgacgatc agcggtccag tgatcgaagt taggctggta agagccgcga gcgatccttg aagctgtccc tgatggtcgt catctacctg cctggacagc atggcctgca acgcgggcat cccgatgccg ccggaagcga gaagaatcat aatggggaag gccatccagc ctcgcgtcgt gaacgccagc aagacgtagc ccagcgcgtc ggccgtaaca acaccattta aatggagtgg ttacaaatgg agtggttaat taacaacacc atttgtcgac gctctccctt atgcgactcc tgcattagga agcagcccag tagtaggttg aggccgttga gcaccgccgc cgcaaggaat ggtgcatgca aggagatggc gcccaacagt cccccggcca cggggcctgc caccataccc acgccgaaac aagcgctcat gagcccgaag tggcgagccc gatcttcccc atcggtgatg tcggcgatat aggcgccagc aaccgcacct gtggcgccgg tgatgccggc cacgatgcgt ccggcgtaga ggatcgagat ctcgatcccg cgaaattaat acgactcact atagggagac cacaacggtt tccctctaga aataattttg tttaacttta agaaggagat ataccatggc tcatcaccat caccatcata tgggtaccct ggaagctcag acccagggtc ctggttcgat ggctgttgaa tccctgagac gtcgcgctat cttcaacatg aacgctcgta tagttgcaga gaacaaccgt aaagaaacct tcaaactgag cgtagacggc ccgttcgccg ctatgaccaa tgaagaatac aactctctgc tgaaactgaa acgtagcggt gaagagaagg gtgaagtgcg ctacctgaac atccaggctc cgaaagctgt tgactggcgt aagaagggta aagttactcc tattagagac cagggtaact gcggctcttg ctacactttc ggctctatcg cagcgctgga aggtcgtctg ctgatcgaga agggcggcga tagcgagacc cttgacctga gcgaagaaca tatggttcag tgcactcgtg aggacggtaa caacggttgt aacggtggtc tcggctctaa cgtgtataac tacataatgg agaatggtat cgctaaggag tctgactacc catacaccgg ttcagactct acttgccgtt ctgacgttaa ggctttcgct aagattaaat cttataaccg tgtagctcgt aacaacgaag ttgaacttaa agctgctatc tcccagggtt tagttgacgt ttccatcgac gctagctcag ttcagttcca gctgtataaa tctggcgctt atacggatac tcagtgtaag aacaactact tcgccctgaa ccacgaagtt tgcgctgtag gctacggcgt tgtagacggt aaagaatgct ggatcgttcg taattcttgg ggcacaggct ggggtgagaa aggctacatc aacatggtaa tcgaaggtaa cacctgtggc gttgcgactg acccgctgta ccctaccggc gttgaatacc tgtaaacagc acgaacaagt tctgcagcca agcttctcga ggatccggct gctaacaaag cccgaaagga agctgagttg gctgctgcca ccgctgagca ataactagca taaccccttg gggcctctaa acgggtcttg aggggttttt tgctgaaagg aggaactata tccggatatc cacaggacgg gtgtggtcgc catgatcgcg tagtcgatag tggctccaag tagcgaagcg agcaggactg ggcggcggcc aaagcggtcg gacagtgctc cgagaacggg tgcgcataga aattgcatca acgcatatag cgctagcagc acgccatagt gactggcgat gctgtcggaa tggacgatat cccgcaagag gcccggcagt accggcataa ccaagcctat gcctacagca tccagggtga cggtgccgag gatgacgatg agcgcattgt tagatttcat acacggtgcc tgactgcgtt agcaatttaa ctgtgataaa ctaccgcatt aaagcttatc gatgataagc tgtcaaacat gagaa
Details for EnhiA.01624.b.A16.GE33882
HARVESTED ON: 10/15/2011
SEQUENCED ON: 10/21/2011
EXPECTED MW: 33kDa
OBSERVED MW: 35kDa
ANTIBIOTIC MARKER: ampicillin
COUNT OF EXPRESSION COLONIES: Too many to count (100+)
TOTAL EXPRESSION LEVEL: Moderate Expression
SOLUBLE EXPRESSION LEVEL No Expression
EXPRESSION HOST: data unavailable
SEQUENCING RESULT: pass
PERCENT IDENTITY: 99
PERCENT COVERAGE: 100
Validated AA Sequence
MAHHHHHHMG TLEAQTQGPG SMAVESLRRR AIFNMNARIV AENNRKETFK LSVDGPFAAM TNEEYNSLLK LKRSGEEKGE VRYLNIQAPK AVDWRKKGKV TPIRDQGNCG SCYTFGSIAA LEGRLLIEKG GDSETLDLSE EHMVQCTRED GNNGCNGGLG SNVYNYIMEN GIAKESDYPY TGSDSTCRSD VKAFAKIKSY NRVARNNEVE LKAAISQGLV DVSIDASSVQ FQLYKSGAYT DTQCKNNYFA LNHEVCAVXY GVVDGKECWI VRNSWGTGWG EKGYINMVIE GNTCGVATDP LYPTG
Validated NT Sequence
ttttntttac tttaagaagg aganatacca tggctcatca ccatcaccat catatgggta ccctggaagc tcagacccag ggtcctggtt cgatggctgt tgaatccctg agacgtcgcg ctatcttcaa catgaacgct cgtatagttg cagagaacaa ccgtaaagaa accttcaaac tgagcgtaga cggcccgttc gccgctatga ccaatgaaga atacaactct ctgctgaaac tgaaacgtag cggtgaagag aagggtgaag tgcgctacct gaacatccag gctccgaaag ctgttgactg gcgtaagaag ggtaaagtta ctcctattag agaccagggt aactgcggct cttgctacac tttcggctct atcgcagcgc tggaaggtcg tctgctgatc gagaagggcg gcgatagcga gacccttgac ctgagcgaag aacatatggt tcagtgcact cgtgaggacg gtaacaacgg ttgtaacggt ggtctcggct ctaacgtgta taactacata atggagaatg gtatcgctaa ggagtctgac tacccataca ccggttcaga ctctacttgc cgttctgacg ttaaggcttt cgctaagatt aaatcttata accgtgtagc tcgtaacaac gaagttgaac ttaaagctgc tatctcccag ggtttagttg acgtttccat cgacgctagc tcagttcagt tccagctgta taaatctggc gcttatacgg atactcagtg taagaacaac tacttcgccc tgaaccacga agtttgcgct gtangctacg gcgttgtaga cggtaaagaa tgctggatcg ttcgtaattc ttggggcaca ggctggggtg agaaaggcta catcaacatg gtaatcgaag gtaacacctg tggcgttgcg actgacccgc tgtaccctac cggctaataa acagcacgaa caagttctgc agccnagctt ctcgaggatc cnnctgctan caaagcccga aaggnanctg anttggctnn tnc
Expected Protein Sequence
MAHHHHHHMG TLEAQTQGPG SMAVESLRRR AIFNMNARIV AENNRKETFK LSVDGPFAAM TNEEYNSLLK LKRSGEEKGE VRYLNIQAPK AVDWRKKGKV TPIRDQGNCG SCYTFGSIAA LEGRLLIEKG GDSETLDLSE EHMVQCTRED GNNGCNGGLG SNVYNYIMEN GIAKESDYPY TGSDSTCRSD VKAFAKIKSY NRVARNNEVE LKAAISQGLV DVSIDASSVQ FQLYKSGAYT DTQCKNNYFA LNHEVCAVGY GVVDGKECWI VRNSWGTGWG EKGYINMVIE GNTCGVATDP LYPTG
Full NT Sequence (Expression Vector + Insert)
ttcttgaaga cgaaagggcc tcgtgatacg cctattttta taggttaatg tcatgataat aatggtttct tagacgtcag gtggcacttt tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg tgttgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc tgcagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc actgattaag cattggtaac tgtcagacca agtttactca tatatacttt agattgattt aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc gcatatatgg tgcactctca gtacaatctg ctctgatgcc gcatagttaa gccagtatac actccgctat cgctacgtga ctgggtcatg gctgcgcccc gacacccgcc aacacccgct gacgcgccct gacgggcttg tctgctcccg gcatccgctt acagacaagc tgtgaccgtc tccgggagct gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc gaggcagctg cggtaaagct catcagcgtg gtcgtgaagc gattcacaga tgtctgcctg ttcatccgcg tccagctcgt tgagtttctc cagaagcgtt aatgtctggc ttctgataaa gcgggccatg ttaagggcgg ttttttcctg tttggtcact gatgcctccg tgtaaggggg atttctgttc atgggggtaa tgataccgat gaaacgagag aggatgctca cgatacgggt tactgatgat gaacatgccc ggttactgga acgttgtgag ggtaaacaac tggcggtatg gatgcggcgg gaccagagaa aaatcactca gggtcaatgc cagcgcttcg ttaatacaga tgtaggtgtt ccacagggta gccagcagca tcctgcgatg cagatccgga acataatggt gcagggcgct gacttccgcg tttccagact ttacgaaaca cggaaaccga agaccattca tgttgttgct caggtcgcag acgttttgca gcagcagtcg cttcacgttc gctcgcgtat cggtgattca ttctgctaac cagtaaggca accccgccag cctagccggg tcctcaacga caggagcacg atcatgcgca cccgtggcca ggacccaacg ctgcccgaga tgcgccgcgt gcggctgctg gagatggcgg acgcgatgga tatgttctgc caagggttgg tttgcgcatt cacagttctc cgcaagaatt gattggctcc aattcttgga gtggtgaatc cgttagcgag gtgccgccgg cttccattca ggtcgaggtg gcccggctcc atgcaccgcg acgcaacgcg gggaggcaga caaggtatag ggcggcgcct acaatccatg ccaacccgtt ccatgtgctc gccgaggcgg cataaatcgc cgtgacgatc agcggtccag tgatcgaagt taggctggta agagccgcga gcgatccttg aagctgtccc tgatggtcgt catctacctg cctggacagc atggcctgca acgcgggcat cccgatgccg ccggaagcga gaagaatcat aatggggaag gccatccagc ctcgcgtcgt gaacgccagc aagacgtagc ccagcgcgtc ggccgtaaca acaccattta aatggagtgg ttacaaatgg agtggttaat taacaacacc atttgtcgac gctctccctt atgcgactcc tgcattagga agcagcccag tagtaggttg aggccgttga gcaccgccgc cgcaaggaat ggtgcatgca aggagatggc gcccaacagt cccccggcca cggggcctgc caccataccc acgccgaaac aagcgctcat gagcccgaag tggcgagccc gatcttcccc atcggtgatg tcggcgatat aggcgccagc aaccgcacct gtggcgccgg tgatgccggc cacgatgcgt ccggcgtaga ggatcgagat ctcgatcccg cgaaattaat acgactcact atagggagac cacaacggtt tccctctaga aataattttg tttaacttta agaaggagat ataccatggc tcatcaccat caccatcata tgggtaccct ggaagctcag acccagggtc ctggttcgat ggctgttgaa tccctgagac gtcgcgctat cttcaacatg aacgctcgta tagttgcaga gaacaaccgt aaagaaacct tcaaactgag cgtagacggc ccgttcgccg ctatgaccaa tgaagaatac aactctctgc tgaaactgaa acgtagcggt gaagagaagg gtgaagtgcg ctacctgaac atccaggctc cgaaagctgt tgactggcgt aagaagggta aagttactcc tattagagac cagggtaact gcggctcttg ctacactttc ggctctatcg cagcgctgga aggtcgtctg ctgatcgaga agggcggcga tagcgagacc cttgacctga gcgaagaaca tatggttcag tgcactcgtg aggacggtaa caacggttgt aacggtggtc tcggctctaa cgtgtataac tacataatgg agaatggtat cgctaaggag tctgactacc catacaccgg ttcagactct acttgccgtt ctgacgttaa ggctttcgct aagattaaat cttataaccg tgtagctcgt aacaacgaag ttgaacttaa agctgctatc tcccagggtt tagttgacgt ttccatcgac gctagctcag ttcagttcca gctgtataaa tctggcgctt atacggatac tcagtgtaag aacaactact tcgccctgaa ccacgaagtt tgcgctgtag gctacggcgt tgtagacggt aaagaatgct ggatcgttcg taattcttgg ggcacaggct ggggtgagaa aggctacatc aacatggtaa tcgaaggtaa cacctgtggc gttgcgactg acccgctgta ccctaccggc taaacagcac gaacaagttc tgcagccaag cttctcgagg atccggctgc taacaaagcc cgaaaggaag ctgagttggc tgctgccacc gctgagcaat aactagcata accccttggg gcctctaaac gggtcttgag gggttttttg ctgaaaggag gaactatatc cggatatcca caggacgggt gtggtcgcca tgatcgcgta gtcgatagtg gctccaagta gcgaagcgag caggactggg cggcggccaa agcggtcgga cagtgctccg agaacgggtg cgcatagaaa ttgcatcaac gcatatagcg ctagcagcac gccatagtga ctggcgatgc tgtcggaatg gacgatatcc cgcaagaggc ccggcagtac cggcataacc aagcctatgc ctacagcatc cagggtgacg gtgccgagga tgacgatgag cgcattgtta gatttcatac acggtgcctg actgcgttag caatttaact gtgataaact accgcattaa agcttatcga tgataagctg tcaaacatga gaa