EnhiA.00186.b

Cysteine protease 12 (Cysteine protease, putative)

CENTER ID: EnhiA.00186.b
ORGANISM: Entamoeba histolytica HM-1:IMSS
ASSOCIATED DISEASE: Amoebic dysentery
CURRENT STATUS: expressed
COMMUNITY REQUEST: True
NIH RISK GROUP: 2
SELECT AGENT: False
NIH PRIORITY
pathogens category:
IIIB

Ordering Clones & Proteins

If there are materials available for this target, they will be listed below. Materials can be ordered from SSGCID using the button in the "order material" column. Clicking the button will add the material to a virtual cart. You may order multiple materials at a time at no cost to you, as this contract is funded by NIAID. When you are ready to place your order, click the "Place Order" link which will appear in the top right corner of the page after you place your first item in your cart.

Clones*

CENTER
REFERENCE ID
DOMAIN/REGION
DESCRIPTION
INFO AA
START
AA
STOP
ORDER
MATERIAL
EnhiA.00186.b.A2.GE33558 soluble domain 31 349
* SSGCID clones represent un-induced expression constructs which have been verified by sequencing from vector primers. Clones may contain a tag, such as N-term 6xHis. Get sequence information using the button in the "info" column.

External Resources

RESOURCE REFERENCE ID
EuPathDB: AmoebaDB:EHI_140220
RefSeq: XP_656747.1
UniProt: Q8I8D6

Sequences

These sequences are the native gene sequence; sequences of constructs derived from these sequences may differ due to codon optimization or other protocols. To find the specific sequence of any material you may have ordered, click on the "info" button next to the name of that material.
AA Sequence
MSHLIIIVSL VICSFSLKVI PKLEYLKGNE NKLWEEWKIK YGKTYNNINE IHRKLIFMKN LMEIKTLNQK REKDVDAYFD LNQWSDLSNQ EFEDLMLMKK PKRSNAKELH DTINITIPYP KGPVPINYSA CNQKTLFGKL NPGEIDFCNG IEFDQQSCGS CYCVSNALAL QLKWANLTYL RDGKPQYKMF SPQQLLDCEV GGYRCAGGYA DSVLDFSHYV STIDDYPYYS GREPSKRMAC VKGKRTPIKL SYTIFDSAED INIIKPIIHH YGGFVSCVYP KYWTAYRGGI LRGLKCEKGV VTTHVVGIVG YGIEDGIEYV VVRNSWGKNW GLGGYIKLGA DSLCGIGGND GGDVPVSVVL HVDFSDVEYG PYGEFRNNTD VPQRLYNESA EANESSSNIE SSNDSSNESQ SLNSRQSEES LVVEDSSSVE PNIRPNNENH LRSITIYVLY VLVGISIITM VVAVIILHRS IIFN
NT Sequence
atgagccatt tgattattat tgtttcattg gttatatgtt ctttttcact caaagtaatt ccaaaattag aatatttaaa aggaaatgag aataaacttt gggaagaatg gaaaataaaa tatggaaaga catataataa tattaatgaa attcatagaa aattaatttt tatgaaaaat ttaatggaaa ttaaaacgtt aaatcaaaaa agagaaaaag atgttgatgc ttattttgat ttaaatcaat ggagtgattt atctaatcaa gagtttgaag atttaatgtt aatgaaaaag cctaaaagaa gtaatgctaa agaattacat gatacaatta atattactat tccatatcca aaaggtcctg ttcctattaa ctattctgct tgtaaccaaa aaacactttt tggaaagtta aatcctggag aaattgattt ttgcaatggg attgagtttg atcaacaatc ttgtgggtca tgttattgtg taagtaatgc tcttgctctt caattaaaat gggctaatct cacttattta agagatggaa agcctcaata caaaatgttt agtcctcaac aacttcttga ctgtgaagtc ggaggatatc gttgtgctgg aggatatgct gatagtgtat tagacttttc gcattatgtt tctacaatag atgattatcc ttattattca ggtagagaac catcaaaaag aatggcttgt gttaaaggaa aaagaacacc aattaaactc agttatacta ttttcgattc tgcagaagac attaatatta ttaaaccaat tattcatcat tatggtggat ttgtttcatg tgtttatcct aaatactgga ctgcatatcg tggaggtata ttaaggggtc ttaaatgtga aaaaggagtt gtcacaactc acgttgttgg aattgttgga tatggtattg aagatggtat tgagtatgtt gttgttcgaa attcatgggg taaaaattgg ggtcttggag gctatattaa acttggagca gattctttat gtggtattgg tgggaatgat ggaggagatg ttcctgtttc agtagttctt catgtagatt tctctgacgt agagtatgga ccatacggtg agtttagaaa taatactgat gtccctcaac gtctatacaa tgagtctgca gaagcaaatg agtcttcaag taatattgaa agtagtaatg attcatccaa tgaaagtcaa agcttaaata gccgtcagtc tgaagagagt ttagtagtag aagactcttc atctgtggag ccaaacatac gacctaacaa tgaaaatcat ttaaggtcta ttacaatcta tgttttatat gtacttgttg gaattagtat tattactatg gtagttgctg tcataattct tcatcgctca ataattttta attga
Details for EnhiA.00186.b.A2.GE33558
HARVESTED ON: 10/31/2011
SEQUENCED ON: 11/4/2011
EXPECTED MW: 38kDa
OBSERVED MW: 38kDa
ANTIBIOTIC MARKER: ampicillin
COUNT OF EXPRESSION COLONIES: Too many to count (100+)
TOTAL EXPRESSION LEVEL: Low Expression
SOLUBLE EXPRESSION LEVEL No Expression
EXPRESSION HOST: data unavailable
SEQUENCING RESULT: pass with incomplete coverage
PERCENT IDENTITY: 97
PERCENT COVERAGE: 83
Validated AA Sequence
MAHHHHHHMG TLEAQTQGPG SMNKLWEEWK IKYGKTYNNI NEIHRKLIFM KNLMEIKTLN QKREKDVDAY FDLNQWSDLS NQEFEDLMLM KKPKRSNAKE LHDTINITIP YPKGPVPINY SACNQKTLFG KLNPGEIDFC NGIEFDQQSC GSCYCVSNAL ALQLKWANLT YLRDGKPQYK MFSPQQLLDC EVGGYRCAGG YADSVLDFSH YVSTIDDYPY YSXREPSXRM ACVKGKRTPI KLSYTIFDSA EDINIIKPII HHYGGFVSCV YPKYWTAYRX XIXXGX
Validated NT Sequence
ttttgtttan ctttaagaag gagatatacc atggctcatc accatcacca tcatatgggt accctggaag ctcagaccca gggtcctggt tcgatgaata aactttggga agaatggaaa ataaaatatg gaaagacata taataatatt aatgaaattc atagaaaatt aatttttatg aaaaatttaa tggaaattaa aacgttaaat caaaaaagag aaaaagatgt tgatgcttat tttgatttaa atcaatggag tgatttatct aatcaagagt ttgaagattt aatgttaatg aaaaagccta aaagaagtaa tgctaaagaa ttacatgata caattaatat tactattcca tatccaaaag gtcctgttcc tattaactat tctgcttgta accaaaaaac actttttgga aagttaaatc ctggagaaat tgatttttgc aatgggattg agtttgatca acaatcttgt gggtcatgtt attgtgtaag taatgctctt gctcttcaat taaaatgggc taatctcact tatttaagag atggaaagcc tcaatacaaa atgtttagtc ctcaacaact tcttgactgt gaagtcggag gatatcgttg tgctggagga tatgctgata gtgtattaga cttttcgcat tatgtttcta caatagatga ttatccttat tattcangta gagaaccatc nnaaagaatg gcttgtgtta aaggaaaaag aacaccaatt aaactcagtt atactatttt cgattctgca gaagacatta atattattaa accnattatt catcattatg gtggatttgt ttcatgtgtt tatcctaaat actggactgc atatcgtgnn ngtatattna nggggnctta aatgtgaaaa nngagttgtc acaactcacg ttgntngaat tgttggatan ngnantgaan anggtattga gtnngntgnn gttcnanatn cntggnnnaa aantnggnnc ntnnnntann nnnnntnnnn nnnatnnngn nnntnntngn nnnnnnnnnn nnacantnng nancnnncnn ngcnnnna
Expected Protein Sequence
MAHHHHHHMG TLEAQTQGPG SMNKLWEEWK IKYGKTYNNI NEIHRKLIFM KNLMEIKTLN QKREKDVDAY FDLNQWSDLS NQEFEDLMLM KKPKRSNAKE LHDTINITIP YPKGPVPINY SACNQKTLFG KLNPGEIDFC NGIEFDQQSC GSCYCVSNAL ALQLKWANLT YLRDGKPQYK MFSPQQLLDC EVGGYRCAGG YADSVLDFSH YVSTIDDYPY YSGREPSKRM ACVKGKRTPI KLSYTIFDSA EDINIIKPII HHYGGFVSCV YPKYWTAYRG GILRGLKCEK GVVTTHVVGI VGYGIEDGIE YVVVRNSWGK NWGLGGYIKL GADSLCGIGG N
Full NT Sequence (Expression Vector + Insert)
ttcttgaaga cgaaagggcc tcgtgatacg cctattttta taggttaatg tcatgataat aatggtttct tagacgtcag gtggcacttt tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg tgttgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc tgcagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc actgattaag cattggtaac tgtcagacca agtttactca tatatacttt agattgattt aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc gcatatatgg tgcactctca gtacaatctg ctctgatgcc gcatagttaa gccagtatac actccgctat cgctacgtga ctgggtcatg gctgcgcccc gacacccgcc aacacccgct gacgcgccct gacgggcttg tctgctcccg gcatccgctt acagacaagc tgtgaccgtc tccgggagct gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc gaggcagctg cggtaaagct catcagcgtg gtcgtgaagc gattcacaga tgtctgcctg ttcatccgcg tccagctcgt tgagtttctc cagaagcgtt aatgtctggc ttctgataaa gcgggccatg ttaagggcgg ttttttcctg tttggtcact gatgcctccg tgtaaggggg atttctgttc atgggggtaa tgataccgat gaaacgagag aggatgctca cgatacgggt tactgatgat gaacatgccc ggttactgga acgttgtgag ggtaaacaac tggcggtatg gatgcggcgg gaccagagaa aaatcactca gggtcaatgc cagcgcttcg ttaatacaga tgtaggtgtt ccacagggta gccagcagca tcctgcgatg cagatccgga acataatggt gcagggcgct gacttccgcg tttccagact ttacgaaaca cggaaaccga agaccattca tgttgttgct caggtcgcag acgttttgca gcagcagtcg cttcacgttc gctcgcgtat cggtgattca ttctgctaac cagtaaggca accccgccag cctagccggg tcctcaacga caggagcacg atcatgcgca cccgtggcca ggacccaacg ctgcccgaga tgcgccgcgt gcggctgctg gagatggcgg acgcgatgga tatgttctgc caagggttgg tttgcgcatt cacagttctc cgcaagaatt gattggctcc aattcttgga gtggtgaatc cgttagcgag gtgccgccgg cttccattca ggtcgaggtg gcccggctcc atgcaccgcg acgcaacgcg gggaggcaga caaggtatag ggcggcgcct acaatccatg ccaacccgtt ccatgtgctc gccgaggcgg cataaatcgc cgtgacgatc agcggtccag tgatcgaagt taggctggta agagccgcga gcgatccttg aagctgtccc tgatggtcgt catctacctg cctggacagc atggcctgca acgcgggcat cccgatgccg ccggaagcga gaagaatcat aatggggaag gccatccagc ctcgcgtcgt gaacgccagc aagacgtagc ccagcgcgtc ggccgtaaca acaccattta aatggagtgg ttacaaatgg agtggttaat taacaacacc atttgtcgac gctctccctt atgcgactcc tgcattagga agcagcccag tagtaggttg aggccgttga gcaccgccgc cgcaaggaat ggtgcatgca aggagatggc gcccaacagt cccccggcca cggggcctgc caccataccc acgccgaaac aagcgctcat gagcccgaag tggcgagccc gatcttcccc atcggtgatg tcggcgatat aggcgccagc aaccgcacct gtggcgccgg tgatgccggc cacgatgcgt ccggcgtaga ggatcgagat ctcgatcccg cgaaattaat acgactcact atagggagac cacaacggtt tccctctaga aataattttg tttaacttta agaaggagat ataccatggc tcatcaccat caccatcata tgggtaccct ggaagctcag acccagggtc ctggttcgat gaataaactt tgggaagaat ggaaaataaa atatggaaag acatataata atattaatga aattcataga aaattaattt ttatgaaaaa tttaatggaa attaaaacgt taaatcaaaa aagagaaaaa gatgttgatg cttattttga tttaaatcaa tggagtgatt tatctaatca agagtttgaa gatttaatgt taatgaaaaa gcctaaaaga agtaatgcta aagaattaca tgatacaatt aatattacta ttccatatcc aaaaggtcct gttcctatta actattctgc ttgtaaccaa aaaacacttt ttggaaagtt aaatcctgga gaaattgatt tttgcaatgg gattgagttt gatcaacaat cttgtgggtc atgttattgt gtaagtaatg ctcttgctct tcaattaaaa tgggctaatc tcacttattt aagagatgga aagcctcaat acaaaatgtt tagtcctcaa caacttcttg actgtgaagt cggaggatat cgttgtgctg gaggatatgc tgatagtgta ttagactttt cgcattatgt ttctacaata gatgattatc cttattattc aggtagagaa ccatcaaaaa gaatggcttg tgttaaagga aaaagaacac caattaaact cagttatact attttcgatt ctgcagaaga cattaatatt attaaaccaa ttattcatca ttatggtgga tttgtttcat gtgtttatcc taaatactgg actgcatatc gtggaggtat attaaggggt cttaaatgtg aaaaaggagt tgtcacaact cacgttgttg gaattgttgg atatggtatt gaagatggta ttgagtatgt tgttgttcga aattcatggg gtaaaaattg gggtcttgga ggctatatta aacttggagc agattcttta tgtggtattg gtgggaatta aacagcacga acaagttctg cagccaagct tctcgaggat ccggctgcta acaaagcccg aaaggaagct gagttggctg ctgccaccgc tgagcaataa ctagcataac cccttggggc ctctaaacgg gtcttgaggg gttttttgct gaaaggagga actatatccg gatatccaca ggacgggtgt ggtcgccatg atcgcgtagt cgatagtggc tccaagtagc gaagcgagca ggactgggcg gcggccaaag cggtcggaca gtgctccgag aacgggtgcg catagaaatt gcatcaacgc atatagcgct agcagcacgc catagtgact ggcgatgctg tcggaatgga cgatatcccg caagaggccc ggcagtaccg gcataaccaa gcctatgcct acagcatcca gggtgacggt gccgaggatg acgatgagcg cattgttaga tttcatacac ggtgcctgac tgcgttagca atttaactgt gataaactac cgcattaaag cttatcgatg ataagctgtc aaacatgaga a