EnhiA.18472.a

Putative uncharacterized protein

CENTER ID: EnhiA.18472.a
ORGANISM: Entamoeba histolytica HM-1:IMSS
ASSOCIATED DISEASE: Amoebic dysentery
CURRENT STATUS: expressed
COMMUNITY REQUEST: True
NIH RISK GROUP: 2
SELECT AGENT: False
NIH PRIORITY PATHOGENS CATEGORY: IIIB

Ordering Clones & Proteins

If materials are available for this target, they are listed below. To order a material from SSGCID, use the button in the "order material" column, which adds the material to a virtual cart. You may order multiple materials at a time at no cost to you, as this contract is funded by NIAID. When you are ready to place your order, click the "Place Order" link, which appears in the top right corner of the page after you add your first item to your cart.

Clones*

CENTER REFERENCE ID: EnhiA.18472.a.B4.GE35769
DOMAIN/REGION DESCRIPTION: region3
AA START: 791
AA STOP: 926
* SSGCID clones represent un-induced expression constructs which have been verified by sequencing from vector primers. Clones may contain a tag, such as N-term 6xHis. Get sequence information using the button in the "info" column.

External Resources

RESOURCE: REFERENCE ID
EuPathDB: AmoebaDB:EHI_141950
RefSeq: XP_654633.2
UniProt: C4LW41

Sequences

These are the native gene sequences; the sequences of constructs derived from them may differ because of codon optimization or other protocols. To find the specific sequence of any material you may have ordered, click on the "info" button next to the name of that material.
AA Sequence
MNKETFYTII KQIRNENQIY KLIFGKDICF KEIKEKCSSL IQEEGCRNKG NSDSRHLLYR CKTCEKTPSS CICCQCFDET KHQGHIYYSY LSMSQYTCDC GNEQYWKKEG FCNKHQHKFN GNIKEIIPKE INYCLKSLAI LFQYSISFLM EKDNNGELNM IFKVLCQIDN IPYFHKLFIS LLNEKYLFPS DSLLYNKYNY HSLTLFQRYF ELSFVHQHLP DSILTFLISF LTNISEMKFD YTSIYDLALA SVYFDYSPNN GLESIFSGFI YDKSIAEYSL FKSGRFIQMI DIFINTIKIY SSGIIHDNVL KNVESLLDYI NGLINNGIKY QCSLEEFSKV LELICCCSHL FPFYIKTGEH DETDPDHFEY AFLLLYSTLL CFSLIDVFDV DDFYNNHFNF IHQTLMNHVK TYLNNVPLIN INGIKIHLRK VGINQQVSPL SGINYFYLLI LQKLINKNQL DKISISNEDA ILLCETSFIS LQFIHQFSSN VYVRNNIDVE YHYTFYINLF LPHMLNGDLN ISKLMLKYIT IDQFIYTLLV IFNVIELEPT NEIIDLNNSH FNTNNFGILL SVLIQMNRTP LIKKVFNDES FIKNIIAHII MTGESHPTNI SELIPFCELN INQLCDELMI KKGVLKEEWL NRINPLYPFL QPELQEQLKI NTLFGDKSLK NNRYYFEGLN WNKDLFNCIT SSLFWNIIKK GLSNENMIFI LMLITDINSE IQNNICSTED IKSLGIFNEE NVTSITEILK NNYTQIKDFE IFIKGINNIN VRKLLKFESC TVIPSPVKTT KLTKADLLKK KVLKKMNEKQ TTFTLNEHII EESSLEDQNE DFCVVCQSIK QSPLGRFIYY DPGFAVLIQR KKEGGKSDSE TVFTTCKHHI HEECFMELIN QYTQCPICKS KFELFLPLSN QISIINENTF NEKYSLIKIA FNNNKNKIFN VLMSCLISTL ESLELAQFEK VDIHEDDFDI LKSLFIALKS LTINLKSYLD DFIKYSNINS PLYLYIILRL YNQDIQPAYI QQVNNVMSII AKIKPSLLSH KQINFIKPRL NNMNINGLSD CLSCEDAVNK LSISFEYCIH LIEPIIKNIP MTDEVPFLKT INYIDHATGN DQYTFIPTIP RHIPFIELPK TFEGVLKKYC YSKCDSCDSF ISDVLLSRVC LYCGKVICTK RKCLEYHSHQ CSCGFNVMLD LYSGAFCLNY RGKTMALLYM KMNMEIVLIQ KNYSFQIFC
NT Sequence
atgaataaag aaacgtttta tacaattatt aaacaaatta gaaatgaaaa tcaaatttat aaattgattt ttggaaaaga catttgtttt aaagaaatta aagagaaatg ttcttctcta attcaagaag aaggatgtag aaataaagga aattctgatt caagacattt attatataga tgtaagacat gtgaaaaaac tccatcaagt tgtatatgtt gtcaatgttt tgatgaaaca aaacatcaag gacatattta ttattcatat ttatctatgt ctcaatatac atgtgattgt gggaatgaac aatattggaa aaaagaagga ttttgtaata aacatcaaca taaatttaat ggaaatatta aagaaataat tccaaaagaa ataaattatt gtttaaaatc acttgctata ttatttcaat attcaatttc attcttaatg gaaaaagata ataatggaga actaaatatg atatttaaag tgttatgtca aattgataat attccttatt ttcataaatt atttatttca ttattaaatg aaaaatattt atttccatca gactccttac tttataataa atataactat cattctttaa ctttatttca aagatatttt gaattatctt ttgttcatca acatcttcct gattctattc ttacttttct tatttcattt cttactaata tatctgaaat gaagtttgat tatacttcaa tttatgactt agctcttgca tcagtttatt ttgattattc accaaataat ggattagaat ctatattttc aggtttcatt tatgacaaat caattgctga atattctctt tttaaatcag gaagatttat tcaaatgata gacatcttta ttaatacaat taaaatttat tcttctggta taattcatga caatgtttta aaaaatgttg aatcattact tgattatatt aatggattaa taaataatgg aattaaatat caatgttcat tagaagaatt ctctaaagta ttagaactta tttgttgttg ttcacatctt tttccatttt atataaaaac aggagaacat gatgaaactg atccagatca ttttgaatat gcttttcttc ttttatactc aacattatta tgttttagtt taatagatgt ttttgatgtt gatgactttt ataataacca ttttaatttc attcatcaaa cacttatgaa tcatgttaaa acatatttaa ataatgttcc tcttattaat attaatggta ttaaaattca tttaagaaaa gttggaatta atcaacaagt atcaccatta tcaggaatta attatttcta tttattaatt cttcaaaaat taattaataa aaaccaatta gataaaattt caatttctaa tgaagatgca atattattat gtgaaacatc atttatttca ttacaattta ttcatcaatt ttcaagtaat gtttatgtaa gaaataatat tgatgttgaa tatcattata cattttatat taatcttttt ttacctcata tgttaaatgg agacttaaac atttctaaat taatgttaaa atatattaca atagatcaat tcatttatac attattagtt atatttaatg taatagaatt agaaccaaca aatgaaatta ttgatttaaa taattctcat tttaatacaa ataattttgg tatattatta agtgttttaa ttcaaatgaa tagaactcca ttaattaaaa aagtatttaa tgatgaatct ttcattaaaa atattatagc acatataata atgacaggag 
aatcccatcc aacaaacata tctgaactta ttcctttttg tgaattaaat attaatcaat tatgtgatga attaatgatt aaaaaaggtg ttttaaaaga agaatggtta aatagaataa atccattata tcctttctta caacctgaac ttcaagaaca attaaaaata aatacattat ttggtgataa atcattaaaa aataacagat attattttga aggattaaat tggaataaag atttgtttaa ttgtattaca tcatctcttt tctggaatat cattaaaaaa ggattatcta atgaaaatat gatatttatt cttatgttaa ttactgacat taattctgaa atacaaaata atatttgttc tacagaagat attaaaagtc ttggtatatt taatgaagaa aatgttactt caattacaga aatattaaaa aataattata ctcaaattaa agattttgaa atatttataa aaggaataaa taatataaat gtaagaaaat tattgaaatt tgagtcttgt acagttattc catcacctgt aaaaactact aaactaacta aagcagattt attgaaaaag aaggtattaa aaaaaatgaa tgaaaaacaa actacattta cactaaatga acatattatt gaagaatcat cattagaaga tcaaaatgaa gatttttgtg ttgtttgtca atcaattaaa caaagtccat taggaagatt tatttattat gatcctggat ttgctgtatt aattcaaaga aaaaaagaag gtggaaaaag tgattcagaa actgtattta caacatgtaa acaccacatt catgaagaat gttttatgga attaattaat caatatacac aatgtccaat atgtaaaagt aaatttgaat tatttttacc attaagtaat caaatttcaa ttattaatga aaatacattt aatgaaaaat atagtctaat taaaattgca tttaataata ataagaataa aatatttaat gtcttaatgt cttgtttgat ttcaacatta gaatcattag aattggctca atttgaaaaa gttgatattc atgaagatga ttttgatatt ctaaaatcac tatttattgc attaaaatca ttaacaatca atttaaaaag ttatttagat gatttcatta aatattctaa cataaattca ccactttatc tttatatcat tttacgactt tataaccaag atattcaacc agcatatatt caacaagtta ataatgttat gagtattatt gctaaaataa aaccatcttt attatctcat aaacaaatta acttcattaa acctagatta aataatatga atattaatgg attatctgat tgtcttagtt gtgaagacgc tgtcaataaa ctttctattt catttgaata ttgtattcat ttaatagaac caataatcaa aaatattcca atgactgatg aagtaccatt tttgaaaaca ataaattata ttgaccatgc aactggaaat gatcaatata catttatacc aaccattcca cgacacatcc catttattga attaccaaag acttttgaag gagtattaaa aaagtattgt tatagtaaat gtgatagttg tgattctttt atttcagacg ttcttttatc aagagtatgt ctttattgtg gaaaagttat ttgtactaaa agaaaatgtt tagaatatca ttcacatcaa tgttcttgtg gatttaatgt tatgttagac ttatatagtg gtgcgttttg tttaaactat cgtggtaaga caatggcatt 
attgtatatg aagatgaata tggagattgt tttgatccaa aaaaattact cattccaaat cttttgttaa
Details for EnhiA.18472.a.B4.GE35769
HARVESTED ON: 7/31/2012
SEQUENCED ON: 8/6/2012
EXPECTED MW: 17 kDa
OBSERVED MW: 17 kDa
ANTIBIOTIC MARKER: ampicillin
COUNT OF EXPRESSION COLONIES: Too many to count (100+)
TOTAL EXPRESSION LEVEL: Moderate Expression
SOLUBLE EXPRESSION LEVEL: No Expression
EXPRESSION HOST: data unavailable
SEQUENCING RESULT: pass with sequence variation
PERCENT IDENTITY: 98
PERCENT COVERAGE: 100
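The expected molecular weight listed above can be sanity-checked from the Expected Protein Sequence given in this record. The following is a minimal sketch, not SSGCID's own calculation: it assumes the weight is the sum of standard average residue masses plus one water molecule, and the names `AVG_RESIDUE_MASS`, `WATER_MASS`, and `average_mass` are illustrative only.

```python
# Average (not monoisotopic) amino acid residue masses in daltons.
# Assumption: SSGCID's "expected MW" is computed in a comparable way.
AVG_RESIDUE_MASS = {
    "A": 71.0788, "R": 156.1875, "N": 114.1038, "D": 115.0886, "C": 103.1388,
    "E": 129.1155, "Q": 128.1307, "G": 57.0519, "H": 137.1411, "I": 113.1594,
    "L": 113.1594, "K": 128.1741, "M": 131.1926, "F": 147.1766, "P": 97.1167,
    "S": 87.0782, "T": 101.1051, "W": 186.2132, "Y": 163.1760, "V": 99.1326,
}
WATER_MASS = 18.01528  # one water is added for the free N- and C-termini


def average_mass(sequence: str) -> float:
    """Return the average mass in daltons of a one-letter protein sequence."""
    residues = sequence.replace(" ", "").upper()
    return sum(AVG_RESIDUE_MASS[aa] for aa in residues) + WATER_MASS


# Expected Protein Sequence for EnhiA.18472.a.B4.GE35769, copied from this record.
EXPECTED_PROTEIN = (
    "MAHHHHHHMK LTKADLLKKK VLKKMNEKQT TFTLNEHIIE ESSLEDQNED "
    "FCVVCQSIKQ SPLGRFIYYD PGFAVLIQRK KEGGKSDSET VFTTCKHHIH "
    "EECFMELINQ YTQCPICKSK FELFLPLSNQ ISIINENTFN EKYSL"
)

mass_da = average_mass(EXPECTED_PROTEIN)
print(f"{len(EXPECTED_PROTEIN.replace(' ', ''))} residues, {mass_da / 1000:.1f} kDa")
```

Under these assumptions the result should land close to the 17 kDa listed for this clone.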
Validated AA Sequence
MAHHHHHHHM KLTKADLLKK KVLKKMNEKQ TTFTLNEHII EESSLEDQNE DFCVVCQSIK QSPLGRFIYY DPGFAVLIQR KKEGGKSDSE TVFTTCKHHI HEECFMELIN QYTQCPICKS KFELFLPLSN QISIINENTF NEKYSL
Validated NT Sequence
tttntttact ttaagaagga gnnataccat ggctcaccac caccaccacc accatatgaa actaactaaa gcagatttat tgaaaaagaa ggtattaaaa aaaatgaatg aaaaacaaac tacatttaca ctaaatgaac atattattga agaatcatca ttagaagatc aaaatgaaga tttttgtgtt gtttgtcaat caattaaaca aagtccatta ggaagattta tttattatga tcctggattt gctgtattaa ttcaaagaaa aaaagaaggt ggaaaaagtg attcagaaac tgtatttaca acatgtaaac accacattca tgaagaatgt tttatggaat taattaatca atatacacaa tgtccaatat gtaaaagtaa atttgaatta tttttaccat taagtaatca aatttcaatt attaatgaaa atacatttaa tgaaaaatat agtctataag tgagtaagat aggatccggc tgctaacann nnnnnaanga nnnnnnnnnn nnnnnnnnnn nnnnnttntn ttttctacnn nngntcgngt nntnntgcnc natnnnnnnn cngttgnnna nnnnnnnngt nnnnnnnnaa cnannnnnnn nnnnnnntta tnnncaatgn tnacnnantg anncnccnnc nnnntgnnnn tgnnaatcnn cncnncnnnn nnncnnnnnn cnnnnnnang ctnnnnnnnn nntttgngan nnnntnnnnn nnngntaatc gtncntatca tcnntgntnn nnnanattgn ctnnnngtna antgcnnaan tnnntaannn nnnnnnnnnn cncnnnnanc tnangnancn naacttnann ncnanntnnn nnnnngttnn ncanntnncn ncancnntnt acaannnacn nnnnnannnn ngnnnttnnn nnnnnnnnnn nnccntnnnn nnnaannnnn nnnnnncann cnttnncnan nnannnnnnn nnnnnannan nnttnnannn angg
Expected Protein Sequence
MAHHHHHHMK LTKADLLKKK VLKKMNEKQT TFTLNEHIIE ESSLEDQNED FCVVCQSIKQ SPLGRFIYYD PGFAVLIQRK KEGGKSDSET VFTTCKHHIH EECFMELINQ YTQCPICKSK FELFLPLSNQ ISIINENTFN EKYSL
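The Validated AA Sequence and the Expected Protein Sequence listed in this record can be compared directly. The sketch below splits each sequence at its histidine run and compares the pieces; the helper name `split_at_his_run` is hypothetical. The record does not state whether the difference it finds is the one behind the "pass with sequence variation" result.

```python
import re

# Both sequences are copied from this record; spaces are only grouping.
VALIDATED = (
    "MAHHHHHHHM KLTKADLLKK KVLKKMNEKQ TTFTLNEHII EESSLEDQNE "
    "DFCVVCQSIK QSPLGRFIYY DPGFAVLIQR KKEGGKSDSE TVFTTCKHHI "
    "HEECFMELIN QYTQCPICKS KFELFLPLSN QISIINENTF NEKYSL"
).replace(" ", "")
EXPECTED = (
    "MAHHHHHHMK LTKADLLKKK VLKKMNEKQT TFTLNEHIIE ESSLEDQNED "
    "FCVVCQSIKQ SPLGRFIYYD PGFAVLIQRK KEGGKSDSET VFTTCKHHIH "
    "EECFMELINQ YTQCPICKSK FELFLPLSNQ ISIINENTFN EKYSL"
).replace(" ", "")


def split_at_his_run(sequence: str) -> tuple[str, str, str]:
    """Split a sequence into (prefix, first run of 4+ His, remainder)."""
    match = re.search(r"H{4,}", sequence)
    if match is None:
        return sequence, "", ""
    return sequence[:match.start()], match.group(), sequence[match.end():]


v_prefix, v_tag, v_rest = split_at_his_run(VALIDATED)
e_prefix, e_tag, e_rest = split_at_his_run(EXPECTED)

print("validated length:", len(VALIDATED), "His run:", len(v_tag))
print("expected length: ", len(EXPECTED), "His run:", len(e_tag))
print("prefix identical:", v_prefix == e_prefix)
print("remainder identical:", v_rest == e_rest)
```

As listed here, the two sequences differ only in the length of the histidine run (seven residues in the validated sequence, six in the expected one); everything after the tag is identical.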
Full NT Sequence (Expression Vector + Insert)
taatacgact cactataggg agaccacaac ggtttccctc tagaaataat tttgtttaac tttaagaagg agatatacca tggctcacca ccaccaccac catatgaaac taactaaagc agatttattg aaaaagaagg tattaaaaaa aatgaatgaa aaacaaacta catttacact aaatgaacat attattgaag aatcatcatt agaagatcaa aatgaagatt tttgtgttgt ttgtcaatca attaaacaaa gtccattagg aagatttatt tattatgatc ctggatttgc tgtattaatt caaagaaaaa aagaaggtgg aaaaagtgat tcagaaactg tatttacaac atgtaaacac cacattcatg aagaatgttt tatggaatta attaatcaat atacacaatg tccaatatgt aaaagtaaat ttgaattatt tttaccatta agtaatcaaa tttcaattat taatgaaaat acatttaatg aaaaatatag tctatgagta agataggatc cggctgctaa caaagcccga aaggaagctg agttggctgc tgccaccgct gagcaataac tagcataacc ccttggggcc tctaaacggg tcttgagggg ttttttgctg aaaggaggaa ctatatccgg atatccacag gacgggtgtg gtcgccatga tcgcgtagtc gatagtggct ccaagtagcg aagcgagcag gactgggcgg cggccaaagc ggtcggacag tgctccgaga acgggtgcgc atagaaattg catcaacgca tatagcgcta gcagcacgcc atagtgactg gcgatgctgt cggaatggac gatatcccgc aagaggcccg gcagtaccgg cataaccaag cctatgccta cagcatccag ggtgacggtg ccgaggatga cgatgagcgc attgttagat ttcatacacg gtgcctgact gcgttagcaa tttaactgtg ataaactacc gcattaaagc ttatcgatga taagctgtca aacatgagaa ttcttgaaga cgaaagggcc tcgtgatacg cctattttta taggttaatg tcatgataat aatggtttct tagacgtcag gtggcacttt tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg tgttgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc tgcagcaatg gcaacaacgt 
tgcgcaaact attaactggc gaactactta ctctagcttc ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc actgattaag cattggtaac tgtcagacca agtttactca tatatacttt agattgattt aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc gcatatatgg tgcactctca gtacaatctg ctctgatgcc gcatagttaa gccagtatac actccgctat cgctacgtga ctgggtcatg gctgcgcccc gacacccgcc aacacccgct gacgcgccct gacgggcttg tctgctcccg gcatccgctt acagacaagc tgtgaccgtc tccgggagct gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc gaggcagctg cggtaaagct catcagcgtg gtcgtgaagc gattcacaga tgtctgcctg ttcatccgcg tccagctcgt tgagtttctc cagaagcgtt aatgtctggc ttctgataaa gcgggccatg ttaagggcgg ttttttcctg tttggtcact gatgcctccg tgtaaggggg atttctgttc atgggggtaa tgataccgat gaaacgagag aggatgctca cgatacgggt tactgatgat gaacatgccc ggttactgga acgttgtgag ggtaaacaac 
tggcggtatg gatgcggcgg gaccagagaa aaatcactca gggtcaatgc cagcgcttcg ttaatacaga tgtaggtgtt ccacagggta gccagcagca tcctgcgatg cagatccgga acataatggt gcagggcgct gacttccgcg tttccagact ttacgaaaca cggaaaccga agaccattca tgttgttgct caggtcgcag acgttttgca gcagcagtcg cttcacgttc gctcgcgtat cggtgattca ttctgctaac cagtaaggca accccgccag cctagccggg tcctcaacga caggagcacg atcatgcgca cccgtggcca ggacccaacg ctgcccgaga tgcgccgcgt gcggctgctg gagatggcgg acgcgatgga tatgttctgc caagggttgg tttgcgcatt cacagttctc cgcaagaatt gattggctcc aattcttgga gtggtgaatc cgttagcgag gtgccgccgg cttccattca ggtcgaggtg gcccggctcc atgcaccgcg acgcaacgcg gggaggcaga caaggtatag ggcggcgcct acaatccatg ccaacccgtt ccatgtgctc gccgaggcgg cataaatcgc cgtgacgatc agcggtccag tgatcgaagt taggctggta agagccgcga gcgatccttg aagctgtccc tgatggtcgt catctacctg cctggacagc atggcctgca acgcgggcat cccgatgccg ccggaagcga gaagaatcat aatggggaag gccatccagc ctcgcgtcgc gaacgccagc aagacgtagc ccagcgcgtc ggccgccatg ccggcgataa tggcctgctt ctcgccgaaa cgtttggtgg cgggaccagt gacgaaggct tgagcgaggg cgtgcaagat tccgaatacc gcaagcgaca ggccgatcat cgtcgcgctc cagcgaaagc ggtcctcgcc gaaaatgacc cagagcgctg ccggcacctg tcctacgagt tgcatgataa agaagacagt cataagtgcg gcgacgatag tcatgccccg cgcccaccgg aaggagctga ctgggttgaa ggctctcaag ggcatcggtc gacgctctcc cttatgcgac tcctgcatta ggaagcagcc cagtagtagg ttgaggccgt tgagcaccgc cgccgcaagg aatggtgcat gcaaggagat ggcgcccaac agtcccccgg ccacggggcc tgccaccata cccacgccga aacaagcgct catgagcccg aagtggcgag cccgatcttc cccatcggtg atgtcggcga tataggcgcc agcaaccgca cctgtggcgc cggtgatgcc ggccacgatg cgtccggcgt agaggatcga gatctcgatc ccgcgaaat
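To see how the Expected Protein Sequence relates to the Full NT Sequence, the start of the construct can be translated in frame. This is a minimal sketch assuming the standard genetic code and assuming the open reading frame begins at the first `atg` in the excerpt; the excerpt is copied from the full sequence above, and the names `CODON_TABLE` and `translate_orf` are illustrative.

```python
from itertools import product

# Standard genetic code, built in TCAG order (assumption: no alternative table).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_orf(dna: str) -> str:
    """Translate from the first ATG until a stop codon or the end of the input."""
    seq = dna.replace(" ", "").upper()
    start = seq.find("ATG")
    if start < 0:
        return ""
    protein = []
    for i in range(start, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the reading frame
            break
        protein.append(aa)
    return "".join(protein)


# Excerpt of the Full NT Sequence, from just upstream of the start codon.
EXCERPT = (
    "tttaagaagg agatatacca tggctcacca ccaccaccac catatgaaac "
    "taactaaagc agatttattg"
)

peptide = translate_orf(EXCERPT)
print(peptide)
```

The translated excerpt should match the first residues of the Expected Protein Sequence: the tag, followed by the start of region 791-926 of the native protein.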