EnhiA.18483.a

Putative uncharacterized protein

CENTER ID: EnhiA.18483.a
ORGANISM: Entamoeba histolytica HM-1:IMSS
ASSOCIATED DISEASE: Amoebic dysentery
CURRENT STATUS: expressed
COMMUNITY REQUEST: True
NIH RISK GROUP: 2
SELECT AGENT: False
NIH PRIORITY
pathogens category:
IIIB

Ordering Clones & Proteins

If there are materials available for this target, they will be listed below. Materials can be ordered from SSGCID using the button in the "order material" column. Clicking the button will add the material to a virtual cart. You may order multiple materials at a time at no cost to you, as this contract is funded by NIAID. When you are ready to place your order, click the "Place Order" link which will appear in the top right corner of the page after you place your first item in your cart.

Clones*

CENTER
REFERENCE ID
DOMAIN/REGION
DESCRIPTION
INFO AA
START
AA
STOP
ORDER
MATERIAL
EnhiA.18483.a.B1.GE35794 full length 1 388
* SSGCID clones represent un-induced expression constructs which have been verified by sequencing from vector primers. Clones may contain a tag, such as N-term 6xHis. Get sequence information using the button in the "info" column.

External Resources

RESOURCE REFERENCE ID
EuPathDB: AmoebaDB:EHI_087690
RefSeq: XP_653277.1
UniProt: C4LXU9

Sequences

These sequences are the native gene sequence; sequences of constructs derived from these sequences may differ due to codon optimization or other protocols. To find the specific sequence of any material you may have ordered, click on the "info" button next to the name of that material.
AA Sequence
MAKRIDKVTT TPKEISPAET KMNSTNTRRM YILAGCYYLI TKGWSFLISK EQRRAEKLHH FFPIQKIWNE NKVIILDKEF EYEVVQEGKE AKLKFDKEGI PIKKEVTEKS GKQNKKKLVY NELVVMNIIN ECYLKYNELG NDNSLAHTKV SKNTISSQQY TKLPLVKKDG KCIGIFNKIE VLNTTGIQVY DFLCQYLKKE KSSEKEKYSQ TKKEKDSQTK YESVFIGDKD CLNETSYGNI LLLSKEKFLD QAIDESKFKC LDNQVSSSSQ QLNEDVEIQQ PQSPTNISSS MEDNLLYNDI GSPLFEYQST VANCSLLSVK ISVALPANFR EMLSQFECLS KNGVITQNNE SESCVDMNRY ENNEEYLQID HQSDMGFKEY GFVQDVCH
NT Sequence
atggcaaaaa gaatagataa agttacaaca actccaaagg aaatatcccc cgctgaaact aaaatgaatt cgacaaatac aagaagaatg tatatattag ctgggtgtta ctacttgata acaaaaggat ggtcatttct tatttcaaaa gaacaaagaa gagcggaaaa gttacaccac ttcttcccta ttcaaaagat atggaacgaa aataaggtaa taatattgga taaagagttt gagtatgaag tagttcaaga agggaaagaa gccaaattaa aatttgataa agaaggaatt ccaataaaga aagaagtaac tgagaaaagt ggaaaacaaa ataagaagaa attagtttac aatgaacttg ttgttatgaa catcattaat gaatgttatc ttaaatacaa tgaacttggg aatgataata gtttggcaca cacaaaagta tcaaaaaata caattagtag tcaacaatat acaaaacttc cattagtaaa gaaagatggg aaatgtattg gtatatttaa taaaatagaa gtattaaata caacaggaat tcaagtatat gactttttgt gtcaatatct caaaaaagaa aaaagttctg aaaaagaaaa atactctcaa acaaaaaaag aaaaagactc tcaaacaaag tatgaaagtg tttttattgg ggacaaagat tgtttaaatg aaacatcata tggaaacatt cttttgcttt caaaagaaaa gttcttggat caagctattg atgaaagtaa attcaaatgt cttgataacc aagttagttc atcatctcaa caattaaatg aagatgttga aattcaacaa ccacaatctc ctactaacat tagtagctca atggaagata acctattata taacgatata ggttctcctt tatttgagta tcaaagtaca gtggctaatt gtagtttgtt atcagttaaa ataagtgtag cacttcctgc taattttagg gaaatgcttt ctcagtttga atgtttaagt aaaaatggag ttattacaca aaataatgaa tctgagtcat gcgttgatat gaatagatat gaaaataatg aagaatactt acaaatcgat catcaatccg acatgggatt taaagaatat ggatttgttc aagacgtgtg ccactaa
Details for EnhiA.18483.a.B1.GE35794
HARVESTED ON: 7/31/2012
SEQUENCED ON: 8/3/2012
EXPECTED MW: 45kDa
OBSERVED MW: 45kDa
ANTIBIOTIC MARKER: ampicillin
COUNT OF EXPRESSION COLONIES: Too many to count (100+)
TOTAL EXPRESSION LEVEL: Low Expression
SOLUBLE EXPRESSION LEVEL No Expression
EXPRESSION HOST: data unavailable
SEQUENCING RESULT: pass with sequence variation
PERCENT IDENTITY: 95
PERCENT COVERAGE: 80
Validated AA Sequence
MAHHHHHHMA KRIDKVTTTP KEISPAETKM NSTNTRRMYI LAGCYYLITK GWSFLISKEQ RRAEKLHHFF PIQKIWNENK VIILDKEFEY EVVQEGKEAK LKFDKEGIPI KKEVTEKSGK QNKKKLVYNE LVVMNIINEC YLKYNELGSD NSLAHTKVSK NTISSQQYTK LPLVKKDGKC IGIFNKIEVL NTTGIQVYDF LCQYLKKEKS SEKEKDSQTK KEKDSQTKYE SVFIGDKDCL NETSYGNLLL LSKEKFLDQA IDESKFKCLD NQVSSSSQXL NEGVKILQPX SPTNISSSMX DNLLYNDXXS XLFEYXVQ
Validated NT Sequence
attttgttta ntttaagaag gagatatacc atggctcacc accaccacca ccatatggca aaaagaatag ataaagttac aacaactcca aaggaaatat cccccgctga aactaaaatg aattcgacaa atacaagaag aatgtatata ttagctgggt gttactactt gataacaaaa ggatggtcat ttcttatttc aaaagaacaa agaagagcgg aaaagttaca ccacttcttc cctattcaaa agatatggaa cgaaaataaa gtaataatat tggataaaga gtttgagtat gaagtagttc aagaagggaa agaagccaaa ttaaaatttg ataaagaagg aattccaata aagaaagaag taactgagaa aagtggaaaa caaaataaga agaaattagt ttacaatgaa cttgttgtta tgaacatcat taatgaatgt tatcttaaat acaatgaact tgggagtgat aatagtttgg cacacacaaa agtatcaaaa aatacaatta gtagtcaaca atatacaaaa cttccattag taaagaaaga tgggaaatgt attggtatat ttaataaaat agaagtatta aatacaacag gaattcaagt atatgacttt ttgtgtcaat atctcaaaaa agaaaaaagt tctgaaaaag aaaaagattc tcaaacaaaa aaagaaaaag actctcaaac aaagtatgaa agtgttttta ttggggacaa agattgttta aatgaaacat catatggaaa ccttcttttg ctttcaaaag aaaaattctt ggatcaagct attgatgaaa gtaaattcaa atgtcttgat aaccaagtta gttcatcatc tcaacnatta aatgaaggtg ttaaaattct acaaccacna tctcctacta acattagtag ctcaatggna gataacctat tatataacga tatnnnttct nctttatttg agtatcnngt acagtagcta atngnagtnc atnatcagtt naaataagtg tagcacttnc ngctantttt nngnaaangc tttnncnann nnngntnant naaaatggan tnnntanaaa nannancnna ntcnngnnnn nnnnnnnann ngannnnaaa nnnnnnnnnn nnnnncnnna ncnnnnnnnn ntnnnnnann nnnnattgnt nnnannnnnn nncnncnnnn nnnnna
Expected Protein Sequence
MAHHHHHHMA KRIDKVTTTP KEISPAETKM NSTNTRRMYI LAGCYYLITK GWSFLISKEQ RRAEKLHHFF PIQKIWNENK VIILDKEFEY EVVQEGKEAK LKFDKEGIPI KKEVTEKSGK QNKKKLVYNE LVVMNIINEC YLKYNELGND NSLAHTKVSK NTISSQQYTK LPLVKKDGKC IGIFNKIEVL NTTGIQVYDF LCQYLKKEKS SEKEKYSQTK KEKDSQTKYE SVFIGDKDCL NETSYGNILL LSKEKFLDQA IDESKFKCLD NQVSSSSQQL NEDVEIQQPQ SPTNISSSME DNLLYNDIGS PLFEYQSTVA NCSLLSVKIS VALPANFREM LSQFECLSKN GVITQNNESE SCVDMNRYEN NEEYLQIDHQ SDMGFKEYGF VQDVCH
Full NT Sequence (Expression Vector + Insert)
taatacgact cactataggg agaccacaac ggtttccctc tagaaataat tttgtttaac tttaagaagg agatatacca tggctcacca ccaccaccac catatggcaa aaagaataga taaagttaca acaactccaa aggaaatatc ccccgctgaa actaaaatga attcgacaaa tacaagaaga atgtatatat tagctgggtg ttactacttg ataacaaaag gatggtcatt tcttatttca aaagaacaaa gaagagcgga aaagttacac cacttcttcc ctattcaaaa gatatggaac gaaaataagg taataatatt ggataaagag tttgagtatg aagtagttca agaagggaaa gaagccaaat taaaatttga taaagaagga attccaataa agaaagaagt aactgagaaa agtggaaaac aaaataagaa gaaattagtt tacaatgaac ttgttgttat gaacatcatt aatgaatgtt atcttaaata caatgaactt gggaatgata atagtttggc acacacaaaa gtatcaaaaa atacaattag tagtcaacaa tatacaaaac ttccattagt aaagaaagat gggaaatgta ttggtatatt taataaaata gaagtattaa atacaacagg aattcaagta tatgactttt tgtgtcaata tctcaaaaaa gaaaaaagtt ctgaaaaaga aaaatactct caaacaaaaa aagaaaaaga ctctcaaaca aagtatgaaa gtgtttttat tggggacaaa gattgtttaa atgaaacatc atatggaaac attcttttgc tttcaaaaga aaagttcttg gatcaagcta ttgatgaaag taaattcaaa tgtcttgata accaagttag ttcatcatct caacaattaa atgaagatgt tgaaattcaa caaccacaat ctcctactaa cattagtagc tcaatggaag ataacctatt atataacgat ataggttctc ctttatttga gtatcaaagt acagtggcta attgtagttt gttatcagtt aaaataagtg tagcacttcc tgctaatttt agggaaatgc tttctcagtt tgaatgttta agtaaaaatg gagttattac acaaaataat gaatctgagt catgcgttga tatgaataga tatgaaaata atgaagaata cttacaaatc gatcatcaat ccgacatggg atttaaagaa tatggatttg ttcaagacgt gtgccactga gtaagatagg atccggctgc taacaaagcc cgaaaggaag ctgagttggc tgctgccacc gctgagcaat aactagcata accccttggg gcctctaaac gggtcttgag gggttttttg ctgaaaggag gaactatatc cggatatcca caggacgggt gtggtcgcca tgatcgcgta gtcgatagtg gctccaagta gcgaagcgag caggactggg cggcggccaa agcggtcgga cagtgctccg agaacgggtg cgcatagaaa ttgcatcaac gcatatagcg ctagcagcac gccatagtga ctggcgatgc tgtcggaatg gacgatatcc cgcaagaggc ccggcagtac cggcataacc aagcctatgc ctacagcatc cagggtgacg gtgccgagga tgacgatgag cgcattgtta gatttcatac acggtgcctg actgcgttag caatttaact gtgataaact accgcattaa agcttatcga tgataagctg tcaaacatga gaattcttga agacgaaagg gcctcgtgat acgcctattt ttataggtta atgtcatgat aataatggtt tcttagacgt caggtggcac ttttcgggga aatgtgcgcg gaacccctat ttgtttattt ttctaaatac attcaaatat gtatccgctc atgagacaat aaccctgata aatgcttcaa taatattgaa aaaggaagag tatgagtatt caacatttcc gtgtcgccct tattcccttt tttgcggcat tttgccttcc tgtttttgct cacccagaaa cgctggtgaa agtaaaagat gctgaagatc agttgggtgc acgagtgggt tacatcgaac tggatctcaa cagcggtaag atccttgaga gttttcgccc cgaagaacgt tttccaatga tgagcacttt taaagttctg ctatgtggcg cggtattatc ccgtgttgac gccgggcaag agcaactcgg tcgccgcata cactattctc agaatgactt ggttgagtac tcaccagtca cagaaaagca tcttacggat ggcatgacag taagagaatt atgcagtgct gccataacca tgagtgataa cactgcggcc aacttacttc tgacaacgat cggaggaccg aaggagctaa ccgctttttt gcacaacatg ggggatcatg taactcgcct tgatcgttgg gaaccggagc tgaatgaagc cataccaaac gacgagcgtg acaccacgat gcctgcagca atggcaacaa cgttgcgcaa actattaact ggcgaactac ttactctagc ttcccggcaa caattaatag actggatgga ggcggataaa gttgcaggac cacttctgcg ctcggccctt ccggctggct ggtttattgc tgataaatct ggagccggtg agcgtgggtc tcgcggtatc attgcagcac tggggccaga tggtaagccc tcccgtatcg tagttatcta cacgacgggg agtcaggcaa ctatggatga acgaaataga cagatcgctg agataggtgc ctcactgatt aagcattggt aactgtcaga ccaagtttac tcatatatac tttagattga tttaaaactt catttttaat ttaaaaggat ctaggtgaag atcctttttg ataatctcat gaccaaaatc ccttaacgtg agttttcgtt ccactgagcg tcagaccccg tagaaaagat caaaggatct tcttgagatc ctttttttct gcgcgtaatc tgctgcttgc aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc ggatcaagag ctaccaactc tttttccgaa ggtaactggc ttcagcagag cgcagatacc aaatactgtc cttctagtgt agccgtagtt aggccaccac ttcaagaact ctgtagcacc gcctacatac ctcgctctgc taatcctgtt accagtggct gctgccagtg gcgataagtc gtgtcttacc gggttggact caagacgata gttaccggat aaggcgcagc ggtcgggctg aacggggggt tcgtgcacac agcccagctt ggagcgaacg acctacaccg aactgagata cctacagcgt gagctatgag aaagcgccac gcttcccgaa gggagaaagg cggacaggta tccggtaagc ggcagggtcg gaacaggaga gcgcacgagg gagcttccag ggggaaacgc ctggtatctt tatagtcctg tcgggtttcg ccacctctga cttgagcgtc gatttttgtg atgctcgtca ggggggcgga gcctatggaa aaacgccagc aacgcggcct ttttacggtt cctggccttt tgctggcctt ttgctcacat gttctttcct gcgttatccc ctgattctgt ggataaccgt attaccgcct ttgagtgagc tgataccgct cgccgcagcc gaacgaccga gcgcagcgag tcagtgagcg aggaagcgga agagcgcctg atgcggtatt ttctccttac gcatctgtgc ggtatttcac accgcatata tggtgcactc tcagtacaat ctgctctgat gccgcatagt taagccagta tacactccgc tatcgctacg tgactgggtc atggctgcgc cccgacaccc gccaacaccc gctgacgcgc cctgacgggc ttgtctgctc ccggcatccg cttacagaca agctgtgacc gtctccggga gctgcatgtg tcagaggttt tcaccgtcat caccgaaacg cgcgaggcag ctgcggtaaa gctcatcagc gtggtcgtga agcgattcac agatgtctgc ctgttcatcc gcgtccagct cgttgagttt ctccagaagc gttaatgtct ggcttctgat aaagcgggcc atgttaaggg cggttttttc ctgtttggtc actgatgcct ccgtgtaagg gggatttctg ttcatggggg taatgatacc gatgaaacga gagaggatgc tcacgatacg ggttactgat gatgaacatg cccggttact ggaacgttgt gagggtaaac aactggcggt atggatgcgg cgggaccaga gaaaaatcac tcagggtcaa tgccagcgct tcgttaatac agatgtaggt gttccacagg gtagccagca gcatcctgcg atgcagatcc ggaacataat ggtgcagggc gctgacttcc gcgtttccag actttacgaa acacggaaac cgaagaccat tcatgttgtt gctcaggtcg cagacgtttt gcagcagcag tcgcttcacg ttcgctcgcg tatcggtgat tcattctgct aaccagtaag gcaaccccgc cagcctagcc gggtcctcaa cgacaggagc acgatcatgc gcacccgtgg ccaggaccca acgctgcccg agatgcgccg cgtgcggctg ctggagatgg cggacgcgat ggatatgttc tgccaagggt tggtttgcgc attcacagtt ctccgcaaga attgattggc tccaattctt ggagtggtga atccgttagc gaggtgccgc cggcttccat tcaggtcgag gtggcccggc tccatgcacc gcgacgcaac gcggggaggc agacaaggta tagggcggcg cctacaatcc atgccaaccc gttccatgtg ctcgccgagg cggcataaat cgccgtgacg atcagcggtc cagtgatcga agttaggctg gtaagagccg cgagcgatcc ttgaagctgt ccctgatggt cgtcatctac ctgcctggac agcatggcct gcaacgcggg catcccgatg ccgccggaag cgagaagaat cataatgggg aaggccatcc agcctcgcgt cgcgaacgcc agcaagacgt agcccagcgc gtcggccgcc atgccggcga taatggcctg cttctcgccg aaacgtttgg tggcgggacc agtgacgaag gcttgagcga gggcgtgcaa gattccgaat accgcaagcg acaggccgat catcgtcgcg ctccagcgaa agcggtcctc gccgaaaatg acccagagcg ctgccggcac ctgtcctacg agttgcatga taaagaagac agtcataagt gcggcgacga tagtcatgcc ccgcgcccac cggaaggagc tgactgggtt gaaggctctc aagggcatcg gtcgacgctc tcccttatgc gactcctgca ttaggaagca gcccagtagt aggttgaggc cgttgagcac cgccgccgca aggaatggtg catgcaagga gatggcgccc aacagtcccc cggccacggg gcctgccacc atacccacgc cgaaacaagc gctcatgagc ccgaagtggc gagcccgatc ttccccatcg gtgatgtcgg cgatataggc gccagcaacc gcacctgtgg cgccggtgat gccggccacg atgcgtccgg cgtagaggat cgagatctcg atcccgcgaa at