EnhiA.01486.b

Cysteine protease, putative

CENTER ID: EnhiA.01486.b
ORGANISM: Entamoeba histolytica HM-1:IMSS
ASSOCIATED DISEASE: Amoebic dysentery
CURRENT STATUS: expressed
COMMUNITY REQUEST: True
NIH RISK GROUP: 2
SELECT AGENT: False
NIH PRIORITY PATHOGENS CATEGORY: IIIB

Ordering Clones & Proteins

If materials are available for this target, they are listed below. Materials can be ordered from SSGCID using the button in the "order material" column; clicking the button adds the material to a virtual cart. You may order multiple materials at a time at no cost to you, as this contract is funded by NIAID. When you are ready to place your order, click the "Place Order" link, which appears in the top right corner of the page once the first item is in your cart.

Clones*

CENTER REFERENCE ID | DOMAIN/REGION DESCRIPTION | INFO | AA START | AA STOP | ORDER MATERIAL
EnhiA.01486.b.A2.GE33560 | soluble domain | | 24 | 386 |
* SSGCID clones represent un-induced expression constructs which have been verified by sequencing from vector primers. Clones may contain a tag, such as N-term 6xHis. Get sequence information using the button in the "info" column.

External Resources

RESOURCE REFERENCE ID
EuPathDB: AmoebaDB:EHI_097900
RefSeq: XP_651049.1
UniProt: C4M490

Sequences

These are the native gene sequences; the sequences of constructs derived from them may differ due to codon optimization or other protocols. To find the specific sequence of any material you have ordered, click the "info" button next to the name of that material.
AA Sequence
MIFFVIFFII NLALAEFDMN KDDITLFKEF MSTFQKRYET PSQKLTRFAL FKKNCANIRK WNAERTNERD AHFGITSRTD KLPMEYGLSR NLNDLASKTK ENDVIIDGGA YPPILPESDD LPESLSYCGD YVVNNTDHPK VNLCLTPYDQ GSCGSCYAAS TANLGQYLYA NLSYYYNVGN QSNIVIKNFT AQRWIDQKNN FYVRRCCGGN TKMMLLTQPT FSTEYEYPYV DIHTSENDIN GCRNRGDQNP NVAIHLRTQK ITVFGMDNTY SHSQKVTIIK KILHHYGPIS VSILVDVNQT NNAIKMANYQ GGIFKFPSTC NINKMGIDHQ VIIVGYGVED GEEYLIMRNS WGKWGEYDDG YMKISTETPL CGIGEIVENY SPSNYIIYAG NCILDRNCAS CNSKTLVCSE CKEGTTMDSR GMCLDNSYPS IPEDAVAPEE PNPDPDRLED PPAEDSVNSL VIFSIISLIV LLI
NT Sequence
atgatattct ttgttatatt ttttataata aacttagcat tagctgagtt tgatatgaat aaagatgaca ttactttatt caaagaattt atgtcaacat ttcaaaaaag atatgagact ccatctcaaa agttaacaag gtttgcattg tttaagaaga actgtgcaaa tattagaaag tggaatgctg aaagaacaaa tgaaagagat gctcattttg gtattacttc aagaacagat aaacttccca tggaatatgg attaagtcgt aatctgaatg atttggcaag taaaacaaaa gaaaatgatg ttataattga tggaggagca tacccaccaa tactaccaga atctgatgat ttaccagagt cgttatcata ctgtggtgat tatgttgtaa ataatactga ccacccaaaa gtaaatttgt gtttaacacc atatgatcaa ggaagttgtg gttcttgtta tgcagcatca acagcaaacc ttggacaata tttatatgcc aatttatcat attattataa tgtaggaaat caatctaata ttgtaattaa aaactttaca gcccaacgat ggatagatca aaaaaataat ttttacgtta gaagatgttg tggtggaaat actaaaatga tgttattaac tcaacctaca ttttctacag aatatgaata tccatatgtt gatattcata catcagaaaa tgatataaat ggttgtagaa atagaggaga ccaaaaccca aatgttgcta ttcatttaag aacacaaaaa attactgtat ttggaatgga taatacttac tctcattcac aaaaagtaac tattattaaa aagattctcc atcactatgg accaatttct gtttctattt tagttgatgt aaatcaaact aataatgcta ttaaaatggc taactatcaa ggaggtattt tcaaattccc aagtacatgt aatattaata aaatgggaat tgatcatcaa gttattatag ttggatatgg tgttgaagat ggagaagaat atttaattat gagaaattct tggggtaagt ggggagaata tgatgatgga tatatgaaaa tatctactga aacaccatta tgtggtatag gtgaaatagt tgaaaattat agtccatcaa attatattat ttatgctgga aattgtattc ttgatagaaa ttgtgcctct tgtaatagca aaactttggt atgctctgaa tgtaaagaag gaactaccat ggatagtcgt ggtatgtgtt tagataattc ttatccttca attccagaag atgctgttgc accagaagaa cctaatcctg accctgatcg tcttgaagac ccaccagcag aagactctgt taatagtctt gttatttttt caattatttc tttgattgtt cttcttattt ga
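The AA and NT sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how the two relate, the following Python snippet strips the spacing and translates the nucleotide sequence in frame 1. It assumes the standard genetic code applies to this gene; the helper names `clean` and `translate` are hypothetical and not part of any SSGCID tooling. Codons containing ambiguous bases such as `n` are reported as `X`.

```python
from itertools import product

# Standard genetic code (assumed here), codons ordered over the bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(nt: str) -> str:
    """Translate frame 1 of a nucleotide sequence; '*' is stop, 'X' is unknown."""
    nt = clean(nt)
    usable = len(nt) - len(nt) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(nt[i:i + 3], "X") for i in range(0, usable, 3))


# First three blocks of the NT Sequence listed above.
nt_start = "atgatattct ttgttatatt ttttataata"
print(translate(nt_start))  # MIFFVIFFII, the first ten residues of the AA Sequence
```

Running `translate` on the full NT Sequence and comparing the result with the AA Sequence is a quick consistency check before working with any derived construct.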
Details for EnhiA.01486.b.A2.GE33560
HARVESTED ON: 10/31/2011
SEQUENCED ON: 11/4/2011
EXPECTED MW: 43kDa
OBSERVED MW: 43kDa
ANTIBIOTIC MARKER: ampicillin
COUNT OF EXPRESSION COLONIES: Too many to count (100+)
TOTAL EXPRESSION LEVEL: Low Expression
SOLUBLE EXPRESSION LEVEL: No Expression
EXPRESSION HOST: data unavailable
SEQUENCING RESULT: pass with incomplete coverage
PERCENT IDENTITY: 82
PERCENT COVERAGE: 88
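The EXPECTED MW field can be approximated from the construct's Expected Protein Sequence (listed further down this page). The Python sketch below sums average residue masses and adds one water for the free termini. It is an illustration under stated assumptions, not the method SSGCID uses: the mass table holds standard average residue masses, the function name `molecular_weight` is hypothetical, and no post-translational modifications or disulfide bonds are considered.

```python
# Average residue masses in daltons (amino acid minus one water).
AVG_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528


def molecular_weight(seq: str) -> float:
    """Approximate average molecular weight (Da) of an unmodified peptide."""
    residues = "".join(seq.split()).upper()
    if not residues:
        return 0.0
    # Raises KeyError on non-standard letters such as 'X'.
    return sum(AVG_RESIDUE_MASS[r] for r in residues) + WATER


# Expected Protein Sequence of EnhiA.01486.b.A2.GE33560, copied from this page.
EXPECTED = """
MAHHHHHHMG TLEAQTQGPG SMITLFKEFM STFQKRYETP SQKLTRFALF KKNCANIRKW
NAERTNERDA HFGITSRTDK LPMEYGLSRN LNDLASKTKE NDVIIDGGAY PPILPESDDL
PESLSYCGDY VVNNTDHPKV NLCLTPYDQG SCGSCYAAST ANLGQYLYAN LSYYYNVGNQ
SNIVIKNFTA QRWIDQKNNF YVRRCCGGNT KMMLLTQPTF STEYEYPYVD IHTSENDING
CRNRGDQNPN VAIHLRTQKI TVFGMDNTYS HSQKVTIIKK ILHHYGPISV SILVDVNQTN
NAIKMANYQG GIFKFPSTCN INKMGIDHQV IIVGYGVEDG EEYLIMRNSW GKWGEYDDGY
MKISTETPLC GIGEIVENYS PSNYI
"""

print(f"{molecular_weight(EXPECTED) / 1000:.1f} kDa")
```

The printed value can be compared with the EXPECTED MW and OBSERVED MW fields above; small differences are expected because gel-based estimates are coarse.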
Validated AA Sequence
MAHHHHHHMG TLEAQTQGPG SMITLFKEFM STFQKRYETP SQKLTRFALF KKNCANIRKW NAERTNERDA HFGITSRTDK LPMEYGLSRN LNDLASKTKE NDVIIDGGAY PPILPESDDL PESLSYCGDY VVNNTDHPKV NLCLTPYDQG SCGSCYAAST ANLGQYLYAN LSYYYNVGNQ SNIVIKNFTA QRWIDQKNNF YVRRCCGGXT KMMLLTQPTF STEYEYPYVD IHTSENDING CRNRGXQNPN VAIHLRTXKI TXFGMXNTYS HXQKVTXIKK ILXHYGPXXX XXXXXVXXXX AXXXXYXXIX XXXXXXXXXX XQXXXXXXXX XXXXXXXXXX X
Validated NT Sequence
ttttntttaa ctttaagaag gagatatacc atggctcatc accatcacca tcatatgggt accctggaag ctcagaccca gggtcctggt tcgatgatta ctttattcaa agaatttatg tcaacatttc aaaaaagata tgagactcca tctcaaaagt taacaaggtt tgcattgttt aagaagaact gtgcaaatat tagaaagtgg aatgctgaaa gaacaaatga aagagatgct cattttggta ttacttcaag aacagataaa cttcccatgg aatatggatt aagtcgtaat ctgaatgatt tggcaagtaa aacaaaagaa aatgatgtta taattgatgg aggagcatac ccaccaatac taccagaatc tgatgattta ccagagtcgt tatcatactg tggtgattat gttgtaaata atactgacca cccaaaagta aatttgtgtt taacaccata tgatcaagga agttgtggtt cttgttatgc agcatcaaca gcaaaccttg gacaatattt atatgccaat ttatcatatt attataatgt aggaaatcaa tctaatattg taattaaaaa ctttacagcc caacgatgga tagatcaaaa aaataatttt tacgttagaa gatgttgtgg tgganatact aaaatgatgt tattaactca acctacattt tctacagaat atgaatatcc atatgttgat attcatacat cagaaaatga tataaatggt tgtagaaata gaggaganca aaacccaaat gttgctattc atttaagaac acnaaaaatt actgnatttg gnatggnnaa tacttactct catnnncaaa aagtaactat nattaaaaag attctccntc actatggacc nnnnncngnt ncnnnnttnn tngangtaan ncnaannaan gctnnnnnan gnnactatcn nnnnattnnn annnccnnnn cnngnatnna angggnnnnn cnncagtntn nnntgnnnng nnnntnnnnn nnnnnnnngn nnnggnnnnn nnnnngnnnt nnnnc
Expected Protein Sequence
MAHHHHHHMG TLEAQTQGPG SMITLFKEFM STFQKRYETP SQKLTRFALF KKNCANIRKW NAERTNERDA HFGITSRTDK LPMEYGLSRN LNDLASKTKE NDVIIDGGAY PPILPESDDL PESLSYCGDY VVNNTDHPKV NLCLTPYDQG SCGSCYAAST ANLGQYLYAN LSYYYNVGNQ SNIVIKNFTA QRWIDQKNNF YVRRCCGGNT KMMLLTQPTF STEYEYPYVD IHTSENDING CRNRGDQNPN VAIHLRTQKI TVFGMDNTYS HSQKVTIIKK ILHHYGPISV SILVDVNQTN NAIKMANYQG GIFKFPSTCN INKMGIDHQV IIVGYGVEDG EEYLIMRNSW GKWGEYDDGY MKISTETPLC GIGEIVENYS PSNYI
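The PERCENT IDENTITY and PERCENT COVERAGE fields summarize how well the sequencing read supports the expected construct. How SSGCID computes them is not stated on this page, so the following Python sketch uses assumed definitions: both sequences start at the same residue and are compared position by position without gaps, coverage is the length of the validated read relative to the expected sequence, identity is the share of compared positions that match, and `X` (an uncalled residue) never counts as a match. The function name `identity_and_coverage` is hypothetical.

```python
def clean(seq: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def identity_and_coverage(validated: str, expected: str, unknown: str = "X"):
    """Return (percent identity, percent coverage) for an ungapped comparison."""
    v, e = clean(validated), clean(expected)
    compared = min(len(v), len(e))
    matches = sum(1 for a, b in zip(v, e) if a == b and a != unknown)
    identity = 100.0 * matches / compared if compared else 0.0
    coverage = 100.0 * compared / len(e) if e else 0.0
    return identity, coverage


# Validated AA Sequence and Expected Protein Sequence, copied from this page.
VALIDATED = """
MAHHHHHHMG TLEAQTQGPG SMITLFKEFM STFQKRYETP SQKLTRFALF KKNCANIRKW
NAERTNERDA HFGITSRTDK LPMEYGLSRN LNDLASKTKE NDVIIDGGAY PPILPESDDL
PESLSYCGDY VVNNTDHPKV NLCLTPYDQG SCGSCYAAST ANLGQYLYAN LSYYYNVGNQ
SNIVIKNFTA QRWIDQKNNF YVRRCCGGXT KMMLLTQPTF STEYEYPYVD IHTSENDING
CRNRGXQNPN VAIHLRTXKI TXFGMXNTYS HXQKVTXIKK ILXHYGPXXX XXXXXVXXXX
AXXXXYXXIX XXXXXXXXXX XQXXXXXXXX XXXXXXXXXX X
"""
EXPECTED = """
MAHHHHHHMG TLEAQTQGPG SMITLFKEFM STFQKRYETP SQKLTRFALF KKNCANIRKW
NAERTNERDA HFGITSRTDK LPMEYGLSRN LNDLASKTKE NDVIIDGGAY PPILPESDDL
PESLSYCGDY VVNNTDHPKV NLCLTPYDQG SCGSCYAAST ANLGQYLYAN LSYYYNVGNQ
SNIVIKNFTA QRWIDQKNNF YVRRCCGGNT KMMLLTQPTF STEYEYPYVD IHTSENDING
CRNRGDQNPN VAIHLRTQKI TVFGMDNTYS HSQKVTIIKK ILHHYGPISV SILVDVNQTN
NAIKMANYQG GIFKFPSTCN INKMGIDHQV IIVGYGVEDG EEYLIMRNSW GKWGEYDDGY
MKISTETPLC GIGEIVENYS PSNYI
"""

identity, coverage = identity_and_coverage(VALIDATED, EXPECTED)
print(f"identity: {identity:.1f}%  coverage: {coverage:.1f}%")
```

The output can be set against the PERCENT IDENTITY and PERCENT COVERAGE values reported in the details above. A real pipeline would more likely use a gapped alignment, so treat this as a rough cross-check only.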
Full NT Sequence (Expression Vector + Insert)
ttcttgaaga cgaaagggcc tcgtgatacg cctattttta taggttaatg tcatgataat aatggtttct tagacgtcag gtggcacttt tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg tgttgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc tgcagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc actgattaag cattggtaac tgtcagacca agtttactca tatatacttt agattgattt aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg 
gggcggagcc tatggaaaaa cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc gcatatatgg tgcactctca gtacaatctg ctctgatgcc gcatagttaa gccagtatac actccgctat cgctacgtga ctgggtcatg gctgcgcccc gacacccgcc aacacccgct gacgcgccct gacgggcttg tctgctcccg gcatccgctt acagacaagc tgtgaccgtc tccgggagct gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc gaggcagctg cggtaaagct catcagcgtg gtcgtgaagc gattcacaga tgtctgcctg ttcatccgcg tccagctcgt tgagtttctc cagaagcgtt aatgtctggc ttctgataaa gcgggccatg ttaagggcgg ttttttcctg tttggtcact gatgcctccg tgtaaggggg atttctgttc atgggggtaa tgataccgat gaaacgagag aggatgctca cgatacgggt tactgatgat gaacatgccc ggttactgga acgttgtgag ggtaaacaac tggcggtatg gatgcggcgg gaccagagaa aaatcactca gggtcaatgc cagcgcttcg ttaatacaga tgtaggtgtt ccacagggta gccagcagca tcctgcgatg cagatccgga acataatggt gcagggcgct gacttccgcg tttccagact ttacgaaaca cggaaaccga agaccattca tgttgttgct caggtcgcag acgttttgca gcagcagtcg cttcacgttc gctcgcgtat cggtgattca ttctgctaac cagtaaggca accccgccag cctagccggg tcctcaacga caggagcacg atcatgcgca cccgtggcca ggacccaacg ctgcccgaga tgcgccgcgt gcggctgctg gagatggcgg acgcgatgga tatgttctgc caagggttgg tttgcgcatt cacagttctc cgcaagaatt gattggctcc aattcttgga gtggtgaatc cgttagcgag gtgccgccgg cttccattca ggtcgaggtg gcccggctcc atgcaccgcg acgcaacgcg gggaggcaga caaggtatag ggcggcgcct acaatccatg ccaacccgtt ccatgtgctc gccgaggcgg cataaatcgc cgtgacgatc agcggtccag tgatcgaagt taggctggta agagccgcga gcgatccttg aagctgtccc tgatggtcgt catctacctg cctggacagc atggcctgca acgcgggcat cccgatgccg ccggaagcga gaagaatcat aatggggaag gccatccagc ctcgcgtcgt gaacgccagc aagacgtagc ccagcgcgtc ggccgtaaca acaccattta aatggagtgg ttacaaatgg agtggttaat taacaacacc atttgtcgac gctctccctt atgcgactcc tgcattagga agcagcccag tagtaggttg aggccgttga gcaccgccgc cgcaaggaat ggtgcatgca aggagatggc gcccaacagt cccccggcca cggggcctgc 
caccataccc acgccgaaac aagcgctcat gagcccgaag tggcgagccc gatcttcccc atcggtgatg tcggcgatat aggcgccagc aaccgcacct gtggcgccgg tgatgccggc cacgatgcgt ccggcgtaga ggatcgagat ctcgatcccg cgaaattaat acgactcact atagggagac cacaacggtt tccctctaga aataattttg tttaacttta agaaggagat ataccatggc tcatcaccat caccatcata tgggtaccct ggaagctcag acccagggtc ctggttcgat gattacttta ttcaaagaat ttatgtcaac atttcaaaaa agatatgaga ctccatctca aaagttaaca aggtttgcat tgtttaagaa gaactgtgca aatattagaa agtggaatgc tgaaagaaca aatgaaagag atgctcattt tggtattact tcaagaacag ataaacttcc catggaatat ggattaagtc gtaatctgaa tgatttggca agtaaaacaa aagaaaatga tgttataatt gatggaggag catacccacc aatactacca gaatctgatg atttaccaga gtcgttatca tactgtggtg attatgttgt aaataatact gaccacccaa aagtaaattt gtgtttaaca ccatatgatc aaggaagttg tggttcttgt tatgcagcat caacagcaaa ccttggacaa tatttatatg ccaatttatc atattattat aatgtaggaa atcaatctaa tattgtaatt aaaaacttta cagcccaacg atggatagat caaaaaaata atttttacgt tagaagatgt tgtggtggaa atactaaaat gatgttatta actcaaccta cattttctac agaatatgaa tatccatatg ttgatattca tacatcagaa aatgatataa atggttgtag aaatagagga gaccaaaacc caaatgttgc tattcattta agaacacaaa aaattactgt atttggaatg gataatactt actctcattc acaaaaagta actattatta aaaagattct ccatcactat ggaccaattt ctgtttctat tttagttgat gtaaatcaaa ctaataatgc tattaaaatg gctaactatc aaggaggtat tttcaaattc ccaagtacat gtaatattaa taaaatggga attgatcatc aagttattat agttggatat ggtgttgaag atggagaaga atatttaatt atgagaaatt cttggggtaa gtggggagaa tatgatgatg gatatatgaa aatatctact gaaacaccat tatgtggtat aggtgaaata gttgaaaatt atagtccatc aaattatatt taaacagcac gaacaagttc tgcagccaag cttctcgagg atccggctgc taacaaagcc cgaaaggaag ctgagttggc tgctgccacc gctgagcaat aactagcata accccttggg gcctctaaac gggtcttgag gggttttttg ctgaaaggag gaactatatc cggatatcca caggacgggt gtggtcgcca tgatcgcgta gtcgatagtg gctccaagta gcgaagcgag caggactggg cggcggccaa agcggtcgga cagtgctccg agaacgggtg cgcatagaaa ttgcatcaac gcatatagcg ctagcagcac gccatagtga ctggcgatgc tgtcggaatg gacgatatcc cgcaagaggc ccggcagtac cggcataacc aagcctatgc 
ctacagcatc cagggtgacg gtgccgagga tgacgatgag cgcattgtta gatttcatac acggtgcctg actgcgttag caatttaact gtgataaact accgcattaa agcttatcga tgataagctg tcaaacatga gaa