EnhiA.17135.c

Cyst protein

CENTER ID: EnhiA.17135.c
ORGANISM: Entamoeba histolytica HM-1:IMSS
ASSOCIATED DISEASE: Amoebic dysentery
CURRENT STATUS: expressed
COMMUNITY REQUEST: True
NIH RISK GROUP: 2
SELECT AGENT: False
NIH PRIORITY PATHOGENS CATEGORY: IIIB

Ordering Clones & Proteins

Any materials available for this target are listed below. Materials can be ordered from SSGCID using the button in the "order material" column; clicking the button adds the material to a virtual cart. You may order multiple materials at a time at no cost to you, as this contract is funded by NIAID. When you are ready to place your order, click the "Place Order" link, which appears in the top right corner of the page once the first item is in your cart.

Clones*

CENTER REFERENCE ID       | DOMAIN/REGION DESCRIPTION | INFO | AA START | AA STOP | ORDER MATERIAL
EnhiA.17135.c.A2.GE32226  | soluble domain            |      | 24       | 295     |
* SSGCID clones represent un-induced expression constructs which have been verified by sequencing from vector primers. Clones may contain a tag, such as N-term 6xHis. Get sequence information using the button in the "info" column.
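
As a rough illustration of how the AA START and AA STOP columns map onto the native protein, the Python sketch below slices residues 24 to 295 out of the native AA sequence listed under Sequences. It assumes the coordinates are 1-based and inclusive, which agrees with the Expected Protein Sequence in the details section but is not stated on this page; the helper name `construct_region` is hypothetical.

```python
# Native AA sequence copied from the "AA Sequence" entry on this page.
# The spaces are layout only and are removed before slicing.
NATIVE_AA = (
    "MNIIFICLLT GIVMSESIVI SLIKNLEGNT RIFTSIEFGK CYYTGTVTSY "
    "YYTHNGNNIT ISHYNTTTCS GDKEEKTVDI NDKEIKRFCI NKDTCTIEIK "
    "EIPDYVGTFS FGIDDDTCSH RNNMYLTYVT GVCGKCSEIG ETYCKYKEEN "
    "GVMYSNVYSN NQCDETGKEF GEEMWKCGLC ENGTMYKCGN KTMNQENSSD "
    "NQPNEKPNDQ PDEKQNDKSE GTKNTESSSN NNSQTEDTSK HNSEGSKSKD "
    "DDSKNQTDEK SMNHSEDGTK GHSENSFNAQ SDGNSKSDSE DASTHNIDSA "
    "PLSVFSFIVM GVILMI"
).replace(" ", "")


def construct_region(seq, start, stop):
    """Return residues start..stop, ASSUMING 1-based inclusive coordinates."""
    return seq[start - 1:stop]


region = construct_region(NATIVE_AA, 24, 295)
# Show the length of the cloned region and its first and last ten residues.
print(len(region), region[:10], region[-10:])
```

Under that assumption the slice begins with KNLEGNT and ends with ASTH, which is the portion of the Expected Protein Sequence that follows the N-terminal tag residues.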

External Resources

RESOURCE REFERENCE ID
EuPathDB: AmoebaDB:EHI_058920
RefSeq: XP_651253.1
UniProt: C4MAD9

Sequences

These are the native gene sequences; the sequences of constructs derived from them may differ because of codon optimization or other protocols. To find the specific sequence of any material you have ordered, click the "info" button next to the name of that material.
AA Sequence
MNIIFICLLT GIVMSESIVI SLIKNLEGNT RIFTSIEFGK CYYTGTVTSY YYTHNGNNIT ISHYNTTTCS GDKEEKTVDI NDKEIKRFCI NKDTCTIEIK EIPDYVGTFS FGIDDDTCSH RNNMYLTYVT GVCGKCSEIG ETYCKYKEEN GVMYSNVYSN NQCDETGKEF GEEMWKCGLC ENGTMYKCGN KTMNQENSSD NQPNEKPNDQ PDEKQNDKSE GTKNTESSSN NNSQTEDTSK HNSEGSKSKD DDSKNQTDEK SMNHSEDGTK GHSENSFNAQ SDGNSKSDSE DASTHNIDSA PLSVFSFIVM GVILMI
NT Sequence
atgaatataa tatttatttg tttattaact ggtattgtaa tgtcagaatc aatagttatt tctctaataa aaaatcttga aggaaatact cgtatattta caagtattga atttgggaaa tgttattaca caggcacagt aacatcatat tactatactc ataatgggaa taatattaca atttcacatt ataatacaac tacatgtagt ggagataaag aagaaaaaac ggttgatata aatgataaag agattaaaag gttttgtatt aacaaagaca catgtactat tgaaattaaa gaaataccag attatgtagg aacttttagt tttggtattg atgatgatac atgttcacat cgaaacaata tgtatttgac atatgtcacc ggtgtatgtg gtaaatgtag tgaaattggg gagacgtact gtaaatacaa agaagagaat ggagttatgt attcaaatgt ttattctaat aatcaatgtg atgaaacagg aaaagaattt ggagaagaaa tgtggaaatg tggattatgt gaaaatggaa ctatgtataa atgtgggaat aagactatga atcaagaaaa tagttctgat aatcaaccaa atgaaaaacc aaatgatcaa ccagatgaaa aacaaaatga caaatcagaa ggaactaaga ataccgaaag cagttctaat aataatagtc aaacagaaga tacttcaaaa cataattcag aaggatctaa gagcaaagat gatgattcta agaatcaaac agatgaaaag tcaatgaatc actctgaaga tggaacaaag ggacactcag aaaatagttt caatgcacaa tccgatggaa attctaaaag tgattcagaa gatgcatcaa cacataacat agacagtgca ccattatctg ttttctcttt tattgtgatg ggagttatat taatgattta a
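
To show how the NT sequence relates to the AA sequence, here is a minimal Python sketch that translates DNA codon by codon. It assumes the standard genetic code applies to this gene (the page does not say which translation table was used), and the function name `translate` is hypothetical. Codons containing an ambiguous base such as `n` are reported as `X`, which is one plausible way the `X` residues in a validated AA sequence can arise.

```python
from itertools import product

# Standard genetic code, codons ordered with bases t, c, a, g (first base slowest).
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(nt):
    """Translate DNA in frame 1; unknown or ambiguous codons become 'X'."""
    nt = "".join(nt.lower().split())  # drop the layout spaces between 10-mers
    codons = (nt[i:i + 3] for i in range(0, len(nt) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# First 30 bases of the native NT sequence above.
print(translate("atgaatataa tatttatttg tttattaact"))  # MNIIFICLLT
# A stretch containing an uncalled base 'n'.
print(translate("accgaangcagt"))  # TEXS
```

The first call reproduces the first ten residues of the AA sequence; a trailing partial codon, if any, is ignored.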
Details for EnhiA.17135.c.A2.GE32226
HARVESTED ON: 7/28/2011
SEQUENCED ON: 8/3/2011
EXPECTED MW: 32 kDa
OBSERVED MW: 32 kDa
ANTIBIOTIC MARKER: ampicillin
COUNT OF EXPRESSION COLONIES: Good (10-50)
TOTAL EXPRESSION LEVEL: Moderate Expression
SOLUBLE EXPRESSION LEVEL: No Expression
EXPRESSION HOST: data unavailable
SEQUENCING RESULT: pass with sequence variation
PERCENT IDENTITY: 98
PERCENT COVERAGE: 100
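
The page does not document how PERCENT IDENTITY and PERCENT COVERAGE are computed. As a hedged illustration only, the Python sketch below compares the Validated AA Sequence with the Expected Protein Sequence (both listed further down) position by position, counting each `X` as a mismatch, and takes coverage as the fraction of the expected length spanned by the validated sequence. The function names are hypothetical and no gapped alignment is attempted.

```python
# Sequences copied from the "Expected Protein Sequence" and
# "Validated AA Sequence" entries; spaces are layout only.
EXPECTED = (
    "MAHHHHHHMG TLEAQTQGPG SMKNLEGNTR IFTSIEFGKC YYTGTVTSYY "
    "YTHNGNNITI SHYNTTTCSG DKEEKTVDIN DKEIKRFCIN KDTCTIEIKE "
    "IPDYVGTFSF GIDDDTCSHR NNMYLTYVTG VCGKCSEIGE TYCKYKEENG "
    "VMYSNVYSNN QCDETGKEFG EEMWKCGLCE NGTMYKCGNK TMNQENSSDN "
    "QPNEKPNDQP DEKQNDKSEG TKNTESSSNN NSQTEDTSKH NSEGSKSKDD "
    "DSKNQTDEKS MNHSEDGTKG HSENSFNAQS DGNSKSDSED ASTH"
).replace(" ", "")
VALIDATED = (
    "MAHHHHHHMG TLEAQTQGPG SMKNLEGNTR IFTSIEFGKC YYTGTVTSYY "
    "YTHNGNNITI SHYNTTTCSG DKEEKTVDIN DKEIKRFCIN KDTCTIEIKE "
    "IPDYVGTFSF GIDDDTCSHR NNMYLTYVTG VCGKCSEIGE TYCKYKEENG "
    "VMYSNVYSNN QCDETGKEFG EEMWKCGLCE NGTMYKCGNK TMNQENSSDN "
    "QPNEKPNDQP DEKQNDKSEG TKNTEXSSNN NSQTEDTSKH NSEGSKSKDD "
    "DSKNQTDEKS MNHSEDGTKG HSENSFXAQS DGNSKSXSXX ASTH"
).replace(" ", "")


def percent_identity(validated, expected):
    """Share of positions with identical residues (ungapped comparison)."""
    if len(validated) != len(expected):
        raise ValueError("this sketch only compares equal-length sequences")
    matches = sum(v == e for v, e in zip(validated, expected))
    return 100.0 * matches / len(expected)


def percent_coverage(validated, expected):
    """Share of the expected length spanned by the validated sequence."""
    return 100.0 * min(len(validated), len(expected)) / len(expected)


identity = percent_identity(VALIDATED, EXPECTED)
coverage = percent_coverage(VALIDATED, EXPECTED)
print(f"identity {identity:.1f}%, coverage {coverage:.0f}%")
```

Under these assumptions the five `X` positions among 294 residues give an identity that rounds to the reported 98, with full coverage; the real pipeline may use a different method.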
Validated AA Sequence
MAHHHHHHMG TLEAQTQGPG SMKNLEGNTR IFTSIEFGKC YYTGTVTSYY YTHNGNNITI SHYNTTTCSG DKEEKTVDIN DKEIKRFCIN KDTCTIEIKE IPDYVGTFSF GIDDDTCSHR NNMYLTYVTG VCGKCSEIGE TYCKYKEENG VMYSNVYSNN QCDETGKEFG EEMWKCGLCE NGTMYKCGNK TMNQENSSDN QPNEKPNDQP DEKQNDKSEG TKNTEXSSNN NSQTEDTSKH NSEGSKSKDD DSKNQTDEKS MNHSEDGTKG HSENSFXAQS DGNSKSXSXX ASTH
Validated NT Sequence
tttgtttnnc tttaagaagg agatatacca tggctcatca ccatcaccat catatgggta ccctggaagc tcagacccag ggtcctggtt cgatgaaaaa tcttgaagga aatactcgta tattcacaag tattgaattt gggaaatgtt attacacagg cacagtaaca tcatattact atactcataa tgggaataat attacaattt cacattataa tacaactaca tgtagtggag ataaagaaga aaaaacggtt gatataaatg ataaagagat taaaaggttt tgtattaaca aagacacatg tactattgaa attaaagaaa taccagatta tgtaggaact tttagttttg gtattgatga tgatacatgt tcacatcgaa acaatatgta tttgacatat gtcaccggtg tatgtggtaa atgtagtgaa attggggaga cgtactgtaa atacaaagaa gagaatggag ttatgtattc aaatgtttat tctaataatc aatgtgatga aacaggaaaa gaatttggag aagaaatgtg gaaatgtgga ttatgtgaaa atggaactat gtataaatgt gggaataaga ctatgaatca agaaaatagt tctgataatc aaccaaatga aaaaccaaat gatcaaccag atgaaaaaca aaatgacaaa tcagaaggaa ctaagaatac cgaangcagt tctaataata atagtcaaac agaagatact tcnaaacata attcagaagg atctaagagc aaagatgatg attctaagaa tcaaacagat gaaaagtcaa tgaatcactc tgaagatgga acaaagggac actcagaaaa tagtttcnat gcacaatccg atggaaattc taaaagtgan tcanaanatg catcaacaca ttaataaaca gcacgaacaa gttctgcagc caagcttctc gaggatccnn ctgctaacaa agcccgaaag gaagctnant nnnnnnnnnn nnnnnnnnnn naaanaaann nnnnaaaaaa aaaaaaaann nntnnnnnnn nntnnnnnnc aatgnnngnn cntgcngnnn nnnnnnnnnn ccnnngnnnn nnnnnnnnnn nnnnnnnann nnngnnnnng gnnnnnnnnn nnnang
Expected Protein Sequence
MAHHHHHHMG TLEAQTQGPG SMKNLEGNTR IFTSIEFGKC YYTGTVTSYY YTHNGNNITI SHYNTTTCSG DKEEKTVDIN DKEIKRFCIN KDTCTIEIKE IPDYVGTFSF GIDDDTCSHR NNMYLTYVTG VCGKCSEIGE TYCKYKEENG VMYSNVYSNN QCDETGKEFG EEMWKCGLCE NGTMYKCGNK TMNQENSSDN QPNEKPNDQP DEKQNDKSEG TKNTESSSNN NSQTEDTSKH NSEGSKSKDD DSKNQTDEKS MNHSEDGTKG HSENSFNAQS DGNSKSDSED ASTH
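
For orientation, the EXPECTED MW value can be approximated from the Expected Protein Sequence by summing average residue masses and adding one water for the chain termini. The sketch below is an estimate under that textbook approach, not the calculation SSGCID uses (which this page does not describe); the name `approx_mw_kda` is hypothetical.

```python
# Average residue masses in daltons (free amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # one water added back for the free N- and C-termini

# Copied from the "Expected Protein Sequence" entry; spaces are layout only.
EXPECTED_PROTEIN = (
    "MAHHHHHHMG TLEAQTQGPG SMKNLEGNTR IFTSIEFGKC YYTGTVTSYY "
    "YTHNGNNITI SHYNTTTCSG DKEEKTVDIN DKEIKRFCIN KDTCTIEIKE "
    "IPDYVGTFSF GIDDDTCSHR NNMYLTYVTG VCGKCSEIGE TYCKYKEENG "
    "VMYSNVYSNN QCDETGKEFG EEMWKCGLCE NGTMYKCGNK TMNQENSSDN "
    "QPNEKPNDQP DEKQNDKSEG TKNTESSSNN NSQTEDTSKH NSEGSKSKDD "
    "DSKNQTDEKS MNHSEDGTKG HSENSFNAQS DGNSKSDSED ASTH"
)


def approx_mw_kda(seq):
    """Approximate molecular weight in kDa from average residue masses."""
    seq = "".join(seq.upper().split())
    return (sum(RESIDUE_MASS[aa] for aa in seq) + WATER) / 1000.0


mw = approx_mw_kda(EXPECTED_PROTEIN)
print(f"approximate MW: {mw:.1f} kDa")
```

For this construct the estimate lands in the low 30s of kDa, the same range as the 32 kDa listed above. Small differences are expected, since the result depends on details such as whether the initiator methionine is counted.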
Full NT Sequence (Expression Vector + Insert)
ttcttgaaga cgaaagggcc tcgtgatacg cctattttta taggttaatg tcatgataat aatggtttct tagacgtcag gtggcacttt tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg tgttgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc tgcagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc actgattaag cattggtaac tgtcagacca agtttactca tatatacttt agattgattt aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg 
gggcggagcc tatggaaaaa cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc gcatatatgg tgcactctca gtacaatctg ctctgatgcc gcatagttaa gccagtatac actccgctat cgctacgtga ctgggtcatg gctgcgcccc gacacccgcc aacacccgct gacgcgccct gacgggcttg tctgctcccg gcatccgctt acagacaagc tgtgaccgtc tccgggagct gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc gaggcagctg cggtaaagct catcagcgtg gtcgtgaagc gattcacaga tgtctgcctg ttcatccgcg tccagctcgt tgagtttctc cagaagcgtt aatgtctggc ttctgataaa gcgggccatg ttaagggcgg ttttttcctg tttggtcact gatgcctccg tgtaaggggg atttctgttc atgggggtaa tgataccgat gaaacgagag aggatgctca cgatacgggt tactgatgat gaacatgccc ggttactgga acgttgtgag ggtaaacaac tggcggtatg gatgcggcgg gaccagagaa aaatcactca gggtcaatgc cagcgcttcg ttaatacaga tgtaggtgtt ccacagggta gccagcagca tcctgcgatg cagatccgga acataatggt gcagggcgct gacttccgcg tttccagact ttacgaaaca cggaaaccga agaccattca tgttgttgct caggtcgcag acgttttgca gcagcagtcg cttcacgttc gctcgcgtat cggtgattca ttctgctaac cagtaaggca accccgccag cctagccggg tcctcaacga caggagcacg atcatgcgca cccgtggcca ggacccaacg ctgcccgaga tgcgccgcgt gcggctgctg gagatggcgg acgcgatgga tatgttctgc caagggttgg tttgcgcatt cacagttctc cgcaagaatt gattggctcc aattcttgga gtggtgaatc cgttagcgag gtgccgccgg cttccattca ggtcgaggtg gcccggctcc atgcaccgcg acgcaacgcg gggaggcaga caaggtatag ggcggcgcct acaatccatg ccaacccgtt ccatgtgctc gccgaggcgg cataaatcgc cgtgacgatc agcggtccag tgatcgaagt taggctggta agagccgcga gcgatccttg aagctgtccc tgatggtcgt catctacctg cctggacagc atggcctgca acgcgggcat cccgatgccg ccggaagcga gaagaatcat aatggggaag gccatccagc ctcgcgtcgt gaacgccagc aagacgtagc ccagcgcgtc ggccgtaaca acaccattta aatggagtgg ttacaaatgg agtggttaat taacaacacc atttgtcgac gctctccctt atgcgactcc tgcattagga agcagcccag tagtaggttg aggccgttga gcaccgccgc cgcaaggaat ggtgcatgca aggagatggc gcccaacagt cccccggcca cggggcctgc 
caccataccc acgccgaaac aagcgctcat gagcccgaag tggcgagccc gatcttcccc atcggtgatg tcggcgatat aggcgccagc aaccgcacct gtggcgccgg tgatgccggc cacgatgcgt ccggcgtaga ggatcgagat ctcgatcccg cgaaattaat acgactcact atagggagac cacaacggtt tccctctaga aataattttg tttaacttta agaaggagat ataccatggc tcatcaccat caccatcata tgggtaccct ggaagctcag acccagggtc ctggttcgat gaaaaatctt gaaggaaata ctcgtatatt tacaagtatt gaatttggga aatgttatta cacaggcaca gtaacatcat attactatac tcataatggg aataatatta caatttcaca ttataataca actacatgta gtggagataa agaagaaaaa acggttgata taaatgataa agagattaaa aggttttgta ttaacaaaga cacatgtact attgaaatta aagaaatacc agattatgta ggaactttta gttttggtat tgatgatgat acatgttcac atcgaaacaa tatgtatttg acatatgtca ccggtgtatg tggtaaatgt agtgaaattg gggagacgta ctgtaaatac aaagaagaga atggagttat gtattcaaat gtttattcta ataatcaatg tgatgaaaca ggaaaagaat ttggagaaga aatgtggaaa tgtggattat gtgaaaatgg aactatgtat aaatgtggga ataagactat gaatcaagaa aatagttctg ataatcaacc aaatgaaaaa ccaaatgatc aaccagatga aaaacaaaat gacaaatcag aaggaactaa gaataccgaa agcagttcta ataataatag tcaaacagaa gatacttcaa aacataattc agaaggatct aagagcaaag atgatgattc taagaatcaa acagatgaaa agtcaatgaa tcactctgaa gatggaacaa agggacactc agaaaatagt ttcaatgcac aatccgatgg aaattctaaa agtgattcag aagatgcatc aacacattaa acagcacgaa caagttctgc agccaagctt ctcgaggatc cggctgctaa caaagcccga aaggaagctg agttggctgc tgccaccgct gagcaataac tagcataacc ccttggggcc tctaaacggg tcttgagggg ttttttgctg aaaggaggaa ctatatccgg atatccacag gacgggtgtg gtcgccatga tcgcgtagtc gatagtggct ccaagtagcg aagcgagcag gactgggcgg cggccaaagc ggtcggacag tgctccgaga acgggtgcgc atagaaattg catcaacgca tatagcgcta gcagcacgcc atagtgactg gcgatgctgt cggaatggac gatatcccgc aagaggcccg gcagtaccgg cataaccaag cctatgccta cagcatccag ggtgacggtg ccgaggatga cgatgagcgc attgttagat ttcatacacg gtgcctgact gcgttagcaa tttaactgtg ataaactacc gcattaaagc ttatcgatga taagctgtca aacatgagaa