EnhiA.17853.a

Cyst protein

CENTER ID: EnhiA.17853.a
ORGANISM: Entamoeba histolytica HM-1:IMSS
ASSOCIATED DISEASE: Amoebic dysentery
CURRENT STATUS: expressed
COMMUNITY REQUEST: True
NIH RISK GROUP: 2
SELECT AGENT: False
NIH PRIORITY
pathogens category:
IIIB

Ordering Clones & Proteins

If there are materials available for this target, they will be listed below. Materials can be ordered from SSGCID using the button in the "order material" column. Clicking the button will add the material to a virtual cart. You may order multiple materials at a time at no cost to you, as this contract is funded by NIAID. When you are ready to place your order, click the "Place Order" link which will appear in the top right corner of the page after you place your first item in your cart.

Clones*

CENTER
REFERENCE ID
DOMAIN/REGION
DESCRIPTION
INFO AA
START
AA
STOP
ORDER
MATERIAL
EnhiA.17853.a.A1.GE32247 full length 1 445
* SSGCID clones represent un-induced expression constructs which have been verified by sequencing from vector primers. Clones may contain a tag, such as N-term 6xHis. Get sequence information using the button in the "info" column.

External Resources

RESOURCE REFERENCE ID
EuPathDB: AmoebaDB:EHI_110780
RefSeq: XP_654276.1
UniProt: C4LUC5

Sequences

These sequences are the native gene sequence; sequences of constructs derived from these sequences may differ due to codon optimization or other protocols. To find the specific sequence of any material you may have ordered, click on the "info" button next to the name of that material.
AA Sequence
MDINQALQLI TKKITSHEIL MKEVIMELTH ISTQLDAVNY LTQPTVREPI KKILMEQFSL NDEIVTQYSR QFNQIIPEQA IHFDDVAQFE SYVIVMINNQ NNTYLEKLVE LVDPSSFKYI DVLKKYCYDK ILMKQNKDLS NKNDKLVAKI AVIESLLKDA NSEIKTLQTE NSSLKCSVQN LENKISDLSR TPTSVKKVLT EDPQLGYNES KTPQNNVNNS MDVLSTFFKK PMLNKYSTKS SSSSDEETDQ MTLEGRSVSL GFSFPSLPME ETIKMKIDDI VEQVMSDLVK LAGCKQGAMV MAVKKSKTLT DVFNATKGCQ KFMMIYQLNS GELFGTYHSV HPNTLGVAVQ DKHMFAFTLS NPFNVPTQKY NWLRPDEDAF KISDTEVFIG GLGVFTDKRN GYIYDENEPV DLYYSDMPIQ ARKIFSTRLA PKKFTWKEMV VVKLV
NT Sequence
atggatatta accaagcact tcaacttatt acaaaaaaaa tcacatcaca tgaaattttg atgaaggaag ttataatgga actaacacat atatcaacac aattagacgc agtcaattat cttacacagc caacagttcg tgagccaatc aaaaagatat tgatggaaca atttagtctt aatgatgaaa tagttaccca atattcaagg caattcaacc aaattattcc tgagcaagca attcatttcg atgacgtggc acaatttgag tcatatgtaa ttgttatgat aaataatcaa aacaacacat acttagagaa actcgttgaa ttagttgacc cgtcatcctt taaatatatt gatgtattaa agaaatactg ttatgacaaa atattaatga aacaaaataa agaccttagc aataagaatg ataagttagt agctaaaatt gctgtcatag agtcattatt aaaagatgct aattctgaaa ttaagactct tcaaacagaa aatagttcac tcaaatgttc tgttcaaaat ttagaaaata aaatatctga tttatcaaga actcctactt ctgtaaagaa agttttaaca gaagacccac aattaggtta taatgaaagt aaaacacctc aaaataatgt taataacagt atggatgttt taagtacatt tttcaagaaa cctatgttaa acaagtattc tactaaaagt agtagttcta gtgatgaaga aactgatcaa atgactttag aaggaaggtc tgtttcttta ggattttcat ttcctagttt acctatggaa gagacaataa aaatgaaaat tgatgatatt gtagaacagg tcatgagtga tttggttaaa ttagctggat gtaagcaagg agcaatggtt atggctgtta agaaaagtaa gacattgact gatgtgttta atgcaactaa agggtgtcaa aaatttatga tgatatatca attgaatagt ggtgaattat ttggtactta tcattcagtt catccaaata cacttggtgt tgcagttcaa gacaaacata tgtttgcttt tacactatca aacccattca atgttccaac acaaaaatac aattggttaa gacctgacga agatgctttt aaaataagtg atacagaagt atttattgga gggttaggtg tatttactga taagcgaaat gggtacattt atgatgaaaa cgaaccagtt gatttgtatt attcagacat gccaattcag gcaagaaaaa tattttctac tcgtttagct ccaaagaaat tcacttggaa agaaatggtt gtagtaaaac tggtataa
Details for EnhiA.17853.a.A1.GE32247
HARVESTED ON: 7/28/2011
SEQUENCED ON: 8/3/2011
EXPECTED MW: 53kDa
OBSERVED MW: 53kDa
ANTIBIOTIC MARKER: ampicillin
COUNT OF EXPRESSION COLONIES: Good (10-50)
TOTAL EXPRESSION LEVEL: Low Expression
SOLUBLE EXPRESSION LEVEL No Expression
EXPRESSION HOST: data unavailable
SEQUENCING RESULT: pass with incomplete coverage
PERCENT IDENTITY: 95
PERCENT COVERAGE: 70
Validated AA Sequence
MAHHHHHHMG TLEAQTQGPG SMDINQALQL ITKKITSHEI LMKEVIMELT HISTQLDAVN YLTQPTVREP IKKILMEQFS LNDEIVTQYS RQFNQIIPEQ AIHFDDVAQF ESYVIVMINN QNNTYLEKLV ELVDPSSFKY IDALKKYCYD KILMKQNKDL SNKNDKLVAK IAVIESLLKD ANSEIKTLQT ENSSLKCSVQ NLENKISDLS RTPTSVKKVL TEDPQLGYNE SKTPQNNVNN SMDVLSTFFK KPMLNKYSTK SSSSSDEETD QMTLEGRSVS LGXSFPSLPM EETIKMKIDX XVEQVMSDLV KLAGCKXXQW XWLXXKVRX
Validated NT Sequence
ttgtttactt taagaaggag atataccatg gctcatcacc atcaccatca tatgggtacc ctggaagctc agacccaggg tcctggttcg atggatatta accaagcact tcaacttatt acaaaaaaaa tcacatcaca tgaaattttg atgaaggaag ttataatgga actaacacat atatcaacac aattagacgc agtcaattat cttacacagc caacagttcg tgagccaatc aaaaagatat tgatggaaca atttagtctt aatgatgaaa tagttaccca atattcaagg caattcaacc aaattattcc tgagcaagca attcatttcg atgacgtggc acaatttgag tcatatgtaa ttgttatgat aaataatcaa aacaacacat acttagagaa actcgttgaa ttagttgacc cgtcatcctt taaatatatt gatgcattaa agaaatactg ttatgacaaa atattaatga aacaaaataa agaccttagc aataagaatg ataagttagt agctaaaatt gctgtcatag agtcattatt aaaagatgct aattctgaaa ttaagactct tcaaacagaa aatagttcac tcaaatgttc tgttcaaaat ttagaaaata aaatatctga tttatcaaga actcctactt ctgtaaagaa agttttaaca gaagacccac aattaggtta taatgaaagt aaaacacctc aaaataatgt taataacagt atggatgttt taagtacatt tttcaagaaa cctatgttaa acaagtattc tactaaaagt agtagttcta gtgatgaaga aactgatcaa atgactttag aaggaaggtc tgtttcttta ggantttcat ttcctagttt acctatggaa gagacaataa aaatgaaaat tgatganntt gtagaacagg tcatgagtga tttggttaaa ttagctggat gtaagcangn gcaatggnna tggctgttna naaaagtaag annttgactg angtgnttna tgnanctnaa ggnngtcaaa aantnangan gnnnnatcnn nnnnannnnn ngnnnnnnnn nttnggnann nnnnnnncan tnnntncnaa nnnnnnnnnn nnnnannnnn nnnttnnnnn ttttnnnngn nnnnnnncnc nnaaannnnn nnnnnnnnnn nnnnnnnnnn nttnnannnn nnnnnnnnnn aannntt
Expected Protein Sequence
MAHHHHHHMG TLEAQTQGPG SMDINQALQL ITKKITSHEI LMKEVIMELT HISTQLDAVN YLTQPTVREP IKKILMEQFS LNDEIVTQYS RQFNQIIPEQ AIHFDDVAQF ESYVIVMINN QNNTYLEKLV ELVDPSSFKY IDVLKKYCYD KILMKQNKDL SNKNDKLVAK IAVIESLLKD ANSEIKTLQT ENSSLKCSVQ NLENKISDLS RTPTSVKKVL TEDPQLGYNE SKTPQNNVNN SMDVLSTFFK KPMLNKYSTK SSSSSDEETD QMTLEGRSVS LGFSFPSLPM EETIKMKIDD IVEQVMSDLV KLAGCKQGAM VMAVKKSKTL TDVFNATKGC QKFMMIYQLN SGELFGTYHS VHPNTLGVAV QDKHMFAFTL SNPFNVPTQK YNWLRPDEDA FKISDTEVFI GGLGVFTDKR NGYIYDENEP VDLYYSDMPI QARKIFSTRL APKKFTWKEM VVVKLV
Full NT Sequence (Expression Vector + Insert)
ttcttgaaga cgaaagggcc tcgtgatacg cctattttta taggttaatg tcatgataat aatggtttct tagacgtcag gtggcacttt tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg tgttgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc tgcagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc actgattaag cattggtaac tgtcagacca agtttactca tatatacttt agattgattt aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc gcatatatgg tgcactctca gtacaatctg ctctgatgcc gcatagttaa gccagtatac actccgctat cgctacgtga ctgggtcatg gctgcgcccc gacacccgcc aacacccgct gacgcgccct gacgggcttg tctgctcccg gcatccgctt acagacaagc tgtgaccgtc tccgggagct gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc gaggcagctg cggtaaagct catcagcgtg gtcgtgaagc gattcacaga tgtctgcctg ttcatccgcg tccagctcgt tgagtttctc cagaagcgtt aatgtctggc ttctgataaa gcgggccatg ttaagggcgg ttttttcctg tttggtcact gatgcctccg tgtaaggggg atttctgttc atgggggtaa tgataccgat gaaacgagag aggatgctca cgatacgggt tactgatgat gaacatgccc ggttactgga acgttgtgag ggtaaacaac tggcggtatg gatgcggcgg gaccagagaa aaatcactca gggtcaatgc cagcgcttcg ttaatacaga tgtaggtgtt ccacagggta gccagcagca tcctgcgatg cagatccgga acataatggt gcagggcgct gacttccgcg tttccagact ttacgaaaca cggaaaccga agaccattca tgttgttgct caggtcgcag acgttttgca gcagcagtcg cttcacgttc gctcgcgtat cggtgattca ttctgctaac cagtaaggca accccgccag cctagccggg tcctcaacga caggagcacg atcatgcgca cccgtggcca ggacccaacg ctgcccgaga tgcgccgcgt gcggctgctg gagatggcgg acgcgatgga tatgttctgc caagggttgg tttgcgcatt cacagttctc cgcaagaatt gattggctcc aattcttgga gtggtgaatc cgttagcgag gtgccgccgg cttccattca ggtcgaggtg gcccggctcc atgcaccgcg acgcaacgcg gggaggcaga caaggtatag ggcggcgcct acaatccatg ccaacccgtt ccatgtgctc gccgaggcgg cataaatcgc cgtgacgatc agcggtccag tgatcgaagt taggctggta agagccgcga gcgatccttg aagctgtccc tgatggtcgt catctacctg cctggacagc atggcctgca acgcgggcat cccgatgccg ccggaagcga gaagaatcat aatggggaag gccatccagc ctcgcgtcgt gaacgccagc aagacgtagc ccagcgcgtc ggccgtaaca acaccattta aatggagtgg ttacaaatgg agtggttaat taacaacacc atttgtcgac gctctccctt atgcgactcc tgcattagga agcagcccag tagtaggttg aggccgttga gcaccgccgc cgcaaggaat ggtgcatgca aggagatggc gcccaacagt cccccggcca cggggcctgc caccataccc acgccgaaac aagcgctcat gagcccgaag tggcgagccc gatcttcccc atcggtgatg tcggcgatat aggcgccagc aaccgcacct gtggcgccgg tgatgccggc cacgatgcgt ccggcgtaga ggatcgagat ctcgatcccg cgaaattaat acgactcact atagggagac cacaacggtt tccctctaga aataattttg tttaacttta agaaggagat ataccatggc tcatcaccat caccatcata tgggtaccct ggaagctcag acccagggtc ctggttcgat ggatattaac caagcacttc aacttattac aaaaaaaatc acatcacatg aaattttgat gaaggaagtt ataatggaac taacacatat atcaacacaa ttagacgcag tcaattatct tacacagcca acagttcgtg agccaatcaa aaagatattg atggaacaat ttagtcttaa tgatgaaata gttacccaat attcaaggca attcaaccaa attattcctg agcaagcaat tcatttcgat gacgtggcac aatttgagtc atatgtaatt gttatgataa ataatcaaaa caacacatac ttagagaaac tcgttgaatt agttgacccg tcatccttta aatatattga tgtattaaag aaatactgtt atgacaaaat attaatgaaa caaaataaag accttagcaa taagaatgat aagttagtag ctaaaattgc tgtcatagag tcattattaa aagatgctaa ttctgaaatt aagactcttc aaacagaaaa tagttcactc aaatgttctg ttcaaaattt agaaaataaa atatctgatt tatcaagaac tcctacttct gtaaagaaag ttttaacaga agacccacaa ttaggttata atgaaagtaa aacacctcaa aataatgtta ataacagtat ggatgtttta agtacatttt tcaagaaacc tatgttaaac aagtattcta ctaaaagtag tagttctagt gatgaagaaa ctgatcaaat gactttagaa ggaaggtctg tttctttagg attttcattt cctagtttac ctatggaaga gacaataaaa atgaaaattg atgatattgt agaacaggtc atgagtgatt tggttaaatt agctggatgt aagcaaggag caatggttat ggctgttaag aaaagtaaga cattgactga tgtgtttaat gcaactaaag ggtgtcaaaa atttatgatg atatatcaat tgaatagtgg tgaattattt ggtacttatc attcagttca tccaaataca cttggtgttg cagttcaaga caaacatatg tttgctttta cactatcaaa cccattcaat gttccaacac aaaaatacaa ttggttaaga cctgacgaag atgcttttaa aataagtgat acagaagtat ttattggagg gttaggtgta tttactgata agcgaaatgg gtacatttat gatgaaaacg aaccagttga tttgtattat tcagacatgc caattcaggc aagaaaaata ttttctactc gtttagctcc aaagaaattc acttggaaag aaatggttgt agtaaaactg gtataaacag cacgaacaag ttctgcagcc aagcttctcg aggatccggc tgctaacaaa gcccgaaagg aagctgagtt ggctgctgcc accgctgagc aataactagc ataacccctt ggggcctcta aacgggtctt gaggggtttt ttgctgaaag gaggaactat atccggatat ccacaggacg ggtgtggtcg ccatgatcgc gtagtcgata gtggctccaa gtagcgaagc gagcaggact gggcggcggc caaagcggtc ggacagtgct ccgagaacgg gtgcgcatag aaattgcatc aacgcatata gcgctagcag cacgccatag tgactggcga tgctgtcgga atggacgata tcccgcaaga ggcccggcag taccggcata accaagccta tgcctacagc atccagggtg acggtgccga ggatgacgat gagcgcattg ttagatttca tacacggtgc ctgactgcgt tagcaattta actgtgataa actaccgcat taaagcttat cgatgataag ctgtcaaaca tgagaa