EnhiA.18478.a

Putative uncharacterized protein

CENTER ID: EnhiA.18478.a
ORGANISM: Entamoeba histolytica HM-1:IMSS
ASSOCIATED DISEASE: Amoebic dysentery
CURRENT STATUS: soluble
COMMUNITY REQUEST: True
NIH RISK GROUP: 2
SELECT AGENT: False
NIH PRIORITY
pathogens category:
IIIB

Ordering Clones & Proteins

If there are materials available for this target, they will be listed below. Materials can be ordered from SSGCID using the button in the "order material" column. Clicking the button will add the material to a virtual cart. You may order multiple materials at a time at no cost to you, as this contract is funded by NIAID. When you are ready to place your order, click the "Place Order" link which will appear in the top right corner of the page after you place your first item in your cart.

Clones*

CENTER
REFERENCE ID
DOMAIN/REGION
DESCRIPTION
INFO AA
START
AA
STOP
ORDER
MATERIAL
EnhiA.18478.a.B1.GE35793 full length 1 251
* SSGCID clones represent un-induced expression constructs which have been verified by sequencing from vector primers. Clones may contain a tag, such as N-term 6xHis. Get sequence information using the button in the "info" column.

External Resources

RESOURCE REFERENCE ID
EuPathDB: AmoebaDB:EHI_192430
RefSeq: XP_656829.1
UniProt: C4M1G3

Sequences

These sequences are the native gene sequence; sequences of constructs derived from these sequences may differ due to codon optimization or other protocols. To find the specific sequence of any material you may have ordered, click on the "info" button next to the name of that material.
AA Sequence
METESHSAFH SVRLMCDSDG QLLLRPINSE PNSVDNEVNQ KNIKKELTSS KSKQNKRQET YQEEMLNAKR KSKNRDAAQQ SFLIGLLATF GYELKINKLY KKGKTTSQLF TINSIKKEGT LVYKSYDVVP EVEDLRDKRR CLDAVTNSKL LDLALSHPEV VAVEKKCRIS KYGHAIGMRR IKTLSLNGCS LRILDITKLG SQIHNMILEH MGEKDCLVNA QSYDITSVIQ NYVSHRIKEE SDDSDTVDDL L
NT Sequence
atggagacag aaagccatag tgcctttcat tcagttcgtc ttatgtgtga ctctgatgga caacttctct tacgtccaat taattctgag ccaaattctg tagataatga agttaatcaa aagaatatta aaaaagaatt aacttcaagt aagtctaaac aaaataaaag acaagaaaca taccaagaag aaatgcttaa tgctaagagg aagtcaaaga atagagatgc agctcaacag tcatttttaa tagggttact tgcaacattt ggatatgagt taaaaattaa taagttatac aaaaaaggaa aaacaacaag tcaattattc actattaatt caattaaaaa agaaggaact ttagtctata aaagttatga tgttgttcca gaagttgaag acttacgaga taaacgaaga tgtttagatg cagtaactaa ttctaaatta cttgacttag ctttaagtca tccagaagtg gttgctgttg agaagaaatg tagaatatca aaatatggac atgctattgg aatgagaaga attaagacat tatcattaaa tggttgtagt ttaagaattt tagacataac taaattaggt tctcaaattc ataatatgat tcttgagcat atgggagaaa aagattgtct tgttaatgcc caatcatatg acattacttc tgttattcaa aattatgtat ctcatagaat taaagaagaa agtgatgata gtgatactgt cgatgatttg ttgtaa
Details for EnhiA.18478.a.B1.GE35793
HARVESTED ON: 7/31/2012
SEQUENCED ON: 8/3/2012
EXPECTED MW: 29kDa
OBSERVED MW: 35kDa
ANTIBIOTIC MARKER: ampicillin
COUNT OF EXPRESSION COLONIES: Too many to count (100+)
TOTAL EXPRESSION LEVEL: Low Expression
SOLUBLE EXPRESSION LEVEL Low Expression
EXPRESSION HOST: data unavailable
SEQUENCING RESULT: pass
PERCENT IDENTITY: 100
PERCENT COVERAGE: 100
Validated AA Sequence
MAHHHHHHME TESHSAFHSV RLMCDSDGQL LLRPINSEPN SVDNEVNQKN IKKELTSSKS KQNKRQETYQ EEMLNAKRKS KNRDAAQQSF LIGLLATFGY ELKINKLYKK GKTTSQLFTI NSIKKEGTLV YKSYDVVPEV EDLRDKRRCL DAVTNSKLLD LALSHPEVVA VEKKCRISKY GHAIGMRRIK TLSLNGCSLR ILDITKLGSQ IHNMILEHMG EKDCLVNAQS YDITSVIQNY VSHRIKEESD DSDTVDDLL
Validated NT Sequence
atggctcacc accaccacca ccatatggag acagaaagcc atagtgcctt tcattcagtt cgtcttatgt gtgactctga tggacaactt ctcttacgtc caattaattc tgagccaaat tctgtagata atgaagttaa tcaaaagaat attaaaaaag aattaacttc aagtaagtct aaacaaaata aaagacaaga aacataccaa gaagaaatgc ttaatgctaa gaggaagtca aagaatagag atgcagctca acagtcattt ttaatagggt tacttgcaac atttggatat gagttaaaaa ttaataagtt atacaaaaaa ggaaaaacaa caagtcaatt attcactatt aattcaatta aaaaagaagg aactttagtc tataaaagtt atgatgttgt tccagaagtt gaagacttac gagataaacg aagatgttta gatgcagtaa ctaattctaa attacttgac ttagctttaa gtcatccaga agtggttgct gttgagaaga aatgtagaat atcaaaatat ggacatgcta ttggaatgag aagaattaag acattatcat taaatggttg tagtttaaga attttagaca taactaaatt aggttctcaa attcataata tgattcttga gcatatggga gaaaaagatt gtcttgttaa tgcccaatca tatgacatta cttctgttat tcaaaattat gtatctcata gaattaaaga agaaagtgat gatagtgata ctgtcgatga tttgttgtaa gtgagtaaga taggatccgg ctg
Expected Protein Sequence
MAHHHHHHME TESHSAFHSV RLMCDSDGQL LLRPINSEPN SVDNEVNQKN IKKELTSSKS KQNKRQETYQ EEMLNAKRKS KNRDAAQQSF LIGLLATFGY ELKINKLYKK GKTTSQLFTI NSIKKEGTLV YKSYDVVPEV EDLRDKRRCL DAVTNSKLLD LALSHPEVVA VEKKCRISKY GHAIGMRRIK TLSLNGCSLR ILDITKLGSQ IHNMILEHMG EKDCLVNAQS YDITSVIQNY VSHRIKEESD DSDTVDDLL
Full NT Sequence (Expression Vector + Insert)
taatacgact cactataggg agaccacaac ggtttccctc tagaaataat tttgtttaac tttaagaagg agatatacca tggctcacca ccaccaccac catatggaga cagaaagcca tagtgccttt cattcagttc gtcttatgtg tgactctgat ggacaacttc tcttacgtcc aattaattct gagccaaatt ctgtagataa tgaagttaat caaaagaata ttaaaaaaga attaacttca agtaagtcta aacaaaataa aagacaagaa acataccaag aagaaatgct taatgctaag aggaagtcaa agaatagaga tgcagctcaa cagtcatttt taatagggtt acttgcaaca tttggatatg agttaaaaat taataagtta tacaaaaaag gaaaaacaac aagtcaatta ttcactatta attcaattaa aaaagaagga actttagtct ataaaagtta tgatgttgtt ccagaagttg aagacttacg agataaacga agatgtttag atgcagtaac taattctaaa ttacttgact tagctttaag tcatccagaa gtggttgctg ttgagaagaa atgtagaata tcaaaatatg gacatgctat tggaatgaga agaattaaga cattatcatt aaatggttgt agtttaagaa ttttagacat aactaaatta ggttctcaaa ttcataatat gattcttgag catatgggag aaaaagattg tcttgttaat gcccaatcat atgacattac ttctgttatt caaaattatg tatctcatag aattaaagaa gaaagtgatg atagtgatac tgtcgatgat ttgttgtgag taagatagga tccggctgct aacaaagccc gaaaggaagc tgagttggct gctgccaccg ctgagcaata actagcataa ccccttgggg cctctaaacg ggtcttgagg ggttttttgc tgaaaggagg aactatatcc ggatatccac aggacgggtg tggtcgccat gatcgcgtag tcgatagtgg ctccaagtag cgaagcgagc aggactgggc ggcggccaaa gcggtcggac agtgctccga gaacgggtgc gcatagaaat tgcatcaacg catatagcgc tagcagcacg ccatagtgac tggcgatgct gtcggaatgg acgatatccc gcaagaggcc cggcagtacc ggcataacca agcctatgcc tacagcatcc agggtgacgg tgccgaggat gacgatgagc gcattgttag atttcataca cggtgcctga ctgcgttagc aatttaactg tgataaacta ccgcattaaa gcttatcgat gataagctgt caaacatgag aattcttgaa gacgaaaggg cctcgtgata cgcctatttt tataggttaa tgtcatgata ataatggttt cttagacgtc aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt tctaaataca ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat aatattgaaa aaggaagagt atgagtattc aacatttccg tgtcgccctt attccctttt ttgcggcatt ttgccttcct gtttttgctc acccagaaac gctggtgaaa gtaaaagatg ctgaagatca gttgggtgca cgagtgggtt acatcgaact ggatctcaac agcggtaaga tccttgagag ttttcgcccc gaagaacgtt ttccaatgat gagcactttt aaagttctgc tatgtggcgc ggtattatcc cgtgttgacg ccgggcaaga gcaactcggt cgccgcatac actattctca gaatgacttg gttgagtact caccagtcac agaaaagcat cttacggatg gcatgacagt aagagaatta tgcagtgctg ccataaccat gagtgataac actgcggcca acttacttct gacaacgatc ggaggaccga aggagctaac cgcttttttg cacaacatgg gggatcatgt aactcgcctt gatcgttggg aaccggagct gaatgaagcc ataccaaacg acgagcgtga caccacgatg cctgcagcaa tggcaacaac gttgcgcaaa ctattaactg gcgaactact tactctagct tcccggcaac aattaataga ctggatggag gcggataaag ttgcaggacc acttctgcgc tcggcccttc cggctggctg gtttattgct gataaatctg gagccggtga gcgtgggtct cgcggtatca ttgcagcact ggggccagat ggtaagccct cccgtatcgt agttatctac acgacgggga gtcaggcaac tatggatgaa cgaaatagac agatcgctga gataggtgcc tcactgatta agcattggta actgtcagac caagtttact catatatact ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga tcctttttga taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt cagaccccgt agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc taccaactct ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgtcc ttctagtgta gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc tcgctctgct aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg ggttggactc aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt cgtgcacaca gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg agctatgaga aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt gctggccttt tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta ttaccgcctt tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt cagtgagcga ggaagcggaa gagcgcctga tgcggtattt tctccttacg catctgtgcg gtatttcaca ccgcatatat ggtgcactct cagtacaatc tgctctgatg ccgcatagtt aagccagtat acactccgct atcgctacgt gactgggtca tggctgcgcc ccgacacccg ccaacacccg ctgacgcgcc ctgacgggct tgtctgctcc cggcatccgc ttacagacaa gctgtgaccg tctccgggag ctgcatgtgt cagaggtttt caccgtcatc accgaaacgc gcgaggcagc tgcggtaaag ctcatcagcg tggtcgtgaa gcgattcaca gatgtctgcc tgttcatccg cgtccagctc gttgagtttc tccagaagcg ttaatgtctg gcttctgata aagcgggcca tgttaagggc ggttttttcc tgtttggtca ctgatgcctc cgtgtaaggg ggatttctgt tcatgggggt aatgataccg atgaaacgag agaggatgct cacgatacgg gttactgatg atgaacatgc ccggttactg gaacgttgtg agggtaaaca actggcggta tggatgcggc gggaccagag aaaaatcact cagggtcaat gccagcgctt cgttaataca gatgtaggtg ttccacaggg tagccagcag catcctgcga tgcagatccg gaacataatg gtgcagggcg ctgacttccg cgtttccaga ctttacgaaa cacggaaacc gaagaccatt catgttgttg ctcaggtcgc agacgttttg cagcagcagt cgcttcacgt tcgctcgcgt atcggtgatt cattctgcta accagtaagg caaccccgcc agcctagccg ggtcctcaac gacaggagca cgatcatgcg cacccgtggc caggacccaa cgctgcccga gatgcgccgc gtgcggctgc tggagatggc ggacgcgatg gatatgttct gccaagggtt ggtttgcgca ttcacagttc tccgcaagaa ttgattggct ccaattcttg gagtggtgaa tccgttagcg aggtgccgcc ggcttccatt caggtcgagg tggcccggct ccatgcaccg cgacgcaacg cggggaggca gacaaggtat agggcggcgc ctacaatcca tgccaacccg ttccatgtgc tcgccgaggc ggcataaatc gccgtgacga tcagcggtcc agtgatcgaa gttaggctgg taagagccgc gagcgatcct tgaagctgtc cctgatggtc gtcatctacc tgcctggaca gcatggcctg caacgcgggc atcccgatgc cgccggaagc gagaagaatc ataatgggga aggccatcca gcctcgcgtc gcgaacgcca gcaagacgta gcccagcgcg tcggccgcca tgccggcgat aatggcctgc ttctcgccga aacgtttggt ggcgggacca gtgacgaagg cttgagcgag ggcgtgcaag attccgaata ccgcaagcga caggccgatc atcgtcgcgc tccagcgaaa gcggtcctcg ccgaaaatga cccagagcgc tgccggcacc tgtcctacga gttgcatgat aaagaagaca gtcataagtg cggcgacgat agtcatgccc cgcgcccacc ggaaggagct gactgggttg aaggctctca agggcatcgg tcgacgctct cccttatgcg actcctgcat taggaagcag cccagtagta ggttgaggcc gttgagcacc gccgccgcaa ggaatggtgc atgcaaggag atggcgccca acagtccccc ggccacgggg cctgccacca tacccacgcc gaaacaagcg ctcatgagcc cgaagtggcg agcccgatct tccccatcgg tgatgtcggc gatataggcg ccagcaaccg cacctgtggc gccggtgatg ccggccacga tgcgtccggc gtagaggatc gagatctcga tcccgcgaaa t