CrpaA.00785.a

Cryptopain

CENTER ID: CrpaA.00785.a
ORGANISM: Cryptosporidium parvum Iowa II
ASSOCIATED DISEASE: cryptosporidiosis - (diarrheal disease)
CURRENT STATUS: expressed
COMMUNITY REQUEST: True
NIH RISK GROUP: 2
SELECT AGENT: False
NIH PRIORITY
pathogens category:
IIIB

Ordering Clones & Proteins

If there are materials available for this target, they will be listed below. Materials can be ordered from SSGCID using the button in the "order material" column. Clicking the button will add the material to a virtual cart. You may order multiple materials at a time at no cost to you, as this contract is funded by NIAID. When you are ready to place your order, click the "Place Order" link which will appear in the top right corner of the page after you place your first item in your cart.

Clones*

CENTER
REFERENCE ID
DOMAIN/REGION
DESCRIPTION
INFO AA
START
AA
STOP
ORDER
MATERIAL
CrpaA.00785.a.B2.GE39193 soluble domain 59 401
CrpaA.00785.a.B3.GE39442 soluble domain #2 86 401
CrpaA.00785.a.B4.GE39443 soluble domain #3 177 401
* SSGCID clones represent un-induced expression constructs which have been verified by sequencing from vector primers. Clones may contain a tag, such as N-term 6xHis. Get sequence information using the button in the "info" column.

External Resources

RESOURCE REFERENCE ID
EuPathDB: CryptoDB:cgd6_4880
UniProt: Q7YZ24

Sequences

These sequences are the native gene sequence; sequences of constructs derived from these sequences may differ due to codon optimization or other protocols. To find the specific sequence of any material you may have ordered, click on the "info" button next to the name of that material.
AA Sequence
MDIGNNVEEH QEYISGPYIA LINGTNQQRE PNKKLKNIII ATLIAIFIVL VVTVSLYITN NTSDKIDDFV PGDYVDPATR EYRKSFEEFK KKYHKVYSSM EEENQRFEIY KQNMNFIKTT NSQGFSYVLE MNEFGDLSKE EFMARFTGYI KDSKDDERVF KSSRVSASES EEEFVPPNSI NWVEAGCVNP IRNQKNCGSC WAFSAVAALE GATCAQTNRG LPSLSEQQFV DCSKQNGNFG CDGGTMGLAF QYAIKNKYLC TNDDYPYFAE EKTCMDSFCE NYIEIPVKAY KYVFPRNINA LKTALAKYGP ISVAIQADQT PFQFYKSGVF DAPCGTKVNH GVVLVGYDMD EDTNKEYWLV RNSWGEAWGE KGYIKLALHS GKKGTCGILV EPVYPVINQS I
NT Sequence
atggacatag gaaacaacgt ggaagaacat caggaatata tttctggacc atacattgca ttaattaatg gcactaatca acaaagggaa ccgaataaaa agttgaaaaa cataataatt gcaacgttga ttgcaatctt tatagttttg gttgttactg tatctttgta tattactaat aacaccagtg acaaaattga cgatttcgta cctggtgatt atgttgatcc agcaactagg gagtatagaa agagttttga ggagttcaaa aagaaatacc acaaagtata tagctctatg gaggaggaaa atcaaagatt tgaaatttat aagcaaaata tgaactttat taaaacaaca aatagccaag gattcagtta tgtgttagaa atgaatgaat ttggtgattt gtcgaaagaa gagtttatgg caagattcac aggatatata aaagattcca aagatgatga aagggtattt aagtcaagta gagtctcagc aagcgaatca gaagaggaat ttgttccccc aaattctatt aattgggtgg aagctggatg cgtgaaccca ataagaaatc aaaagaattg tgggtcatgt tgggctttct ctgctgttgc agctttggag ggagcaacgt gtgctcaaac aaaccgagga ttaccaagct tgagtgaaca gcaatttgtt gattgcagta aacaaaatgg caactttgga tgtgatggag gaacaatggg attggctttt cagtatgcaa ttaagaacaa atatttatgt actaatgatg attaccctta ctttgctgag gaaaaaacat gtatggattc attttgcgag aattatatag agattcctgt aaaagcctac aaatatgtat ttccaagaaa tattaatgca ttaaagactg ctttggctaa gtatggacca atttcagttg caattcaggc cgatcaaacc cctttccagt tttataaaag tggagtattc gatgctcctt gtggaaccaa ggttaatcat ggagttgttc tagttggata tgatatggat gaagatacta ataaagaata ttggctagta agaaatagct ggggtgaagc gtggggagag aaaggataca tcaaactagc tcttcattct ggaaagaagg gaacatgtgg tatattggtt gagccagtgt atccagtgat taatcaatca atataa
Details for CrpaA.00785.a.B2.GE39193
HARVESTED ON: 5/27/2015
SEQUENCED ON: 6/26/2015
EXPECTED MW: 40kDa
OBSERVED MW: 40kDa
ANTIBIOTIC MARKER: ampicillin
COUNT OF EXPRESSION COLONIES: Too many to count (100+)
TOTAL EXPRESSION LEVEL: High Expression
SOLUBLE EXPRESSION LEVEL Insoluble
EXPRESSION HOST: data unavailable
SEQUENCING RESULT:
PERCENT IDENTITY: 100
PERCENT COVERAGE: 97
Validated AA Sequence
MAHHHHHHMT NNTSDKIDDF VPGDYVDPAT REYRKSFEEF KKKYHKVYSS MEEENQRFEI YKQNMNFIKT TNSQGFSYVL EMNEFGDLSK EEFMARFTGY IKDSKDDERV FKSSRVSASE SEEEFVPPNS INWVEAGCVN PIRNQKNCGS CWAFSAVAAL EGATCAQTNR GLPSLSEQQF VDCSKQNGNF GCDGGTMGLA FQYAIKNKYL CTNDDYPYFA EEKTCMDSFC ENYIEIPVKA YKYVFPRNIN ALKTALAKYG PISVAIQADQ TPFQFYKSGV FDAPCGTKVN HGVVLVGYDM DEDTNKEYWL VRNSWGEAWG EKGYIKLALH SGKKGTCGIL VEP
Validated NT Sequence
actggctcaa ccaatatacc acatgttccc ttctttccag aatgaagagc tagtttgatg tatcctttct ctccccacgc ttcaccccag ctatttctta ctagccaata ttctttatta gtatcttcat ccatatcata tccaactaga acaactccat gattaacctt ggttccacaa ggagcatcga atactccact tttataaaac tggaaagggg tttgatcggc ctgaattgca actgaaattg gtccatactt agccaaagca gtctttaatg cattaatatt tctcggaaat acatatttgt aggcttttac aggaatctct atataattct cgcaaaatga atccatacat gttttttcct cagcaaagta agggtaatca tcattagtac ataaatattt gttcttaatt gcatactgaa aagccaatcc cattgttcct ccatcacatc caaagttgcc attttgttta ctgcaatcaa caaattgctg ttcactcaag cttggtaatc ctcggtttgt ttgagcacac gttgctccct ccaaagctgc aacagcagag aaagcccaac atgacccaca attcttttga tttcttattg ggttcacgca tccagcttcc acccaattaa tagaatttgg gggaacaaat tcctcttctg attcgcttgc tgagactcta cttgacttaa ataccctttc atcatctttg gaatctttta tatatcctgt gaatcttgcc ataaactctt ctttcgacaa atcaccaaat tcattcattt ctaacacata actgaatcct tggctatttg ttgttttaat aaagttcata ttttgcttat aaatttcaaa tctttgattt tcctcctcca tagagctata tactttgtgg tatttctttt tgaactcctc aaaactcttt ctatactccc tagttgctgg atcaacataa tcaccaggta cgaaatcgtc aattttgtca ctggtgttat tagtcatatg gtggtggtgg tggtgagcca t
Expected Protein Sequence
MAHHHHHHTN NTSDKIDDFV PGDYVDPATR EYRKSFEEFK KKYHKVYSSM EEENQRFEIY KQNMNFIKTT NSQGFSYVLE MNEFGDLSKE EFMARFTGYI KDSKDDERVF KSSRVSASES EEEFVPPNSI NWVEAGCVNP IRNQKNCGSC WAFSAVAALE GATCAQTNRG LPSLSEQQFV DCSKQNGNFG CDGGTMGLAF QYAIKNKYLC TNDDYPYFAE EKTCMDSFCE NYIEIPVKAY KYVFPRNINA LKTALAKYGP ISVAIQADQT PFQFYKSGVF DAPCGTKVNH GVVLVGYDMD EDTNKEYWLV RNSWGEAWGE KGYIKLALHS GKKGTCGILV EPVYPVINQS I
Full NT Sequence (Expression Vector + Insert)
taatacgact cactataggg agaccacaac ggtttccctc tagaaataat tttgtttaac tttaagaagg agatatacca tggctcacca ccaccaccac catactaata acaccagtga caaaattgac gatttcgtac ctggtgatta tgttgatcca gcaactaggg agtatagaaa gagttttgag gagttcaaaa agaaatacca caaagtatat agctctatgg aggaggaaaa tcaaagattt gaaatttata agcaaaatat gaactttatt aaaacaacaa atagccaagg attcagttat gtgttagaaa tgaatgaatt tggtgatttg tcgaaagaag agtttatggc aagattcaca ggatatataa aagattccaa agatgatgaa agggtattta agtcaagtag agtctcagca agcgaatcag aagaggaatt tgttccccca aattctatta attgggtgga agctggatgc gtgaacccaa taagaaatca aaagaattgt gggtcatgtt gggctttctc tgctgttgca gctttggagg gagcaacgtg tgctcaaaca aaccgaggat taccaagctt gagtgaacag caatttgttg attgcagtaa acaaaatggc aactttggat gtgatggagg aacaatggga ttggcttttc agtatgcaat taagaacaaa tatttatgta ctaatgatga ttacccttac tttgctgagg aaaaaacatg tatggattca ttttgcgaga attatataga gattcctgta aaagcctaca aatatgtatt tccaagaaat attaatgcat taaagactgc tttggctaag tatggaccaa tttcagttgc aattcaggcc gatcaaaccc ctttccagtt ttataaaagt ggagtattcg atgctccttg tggaaccaag gttaatcatg gagttgttct agttggatat gatatggatg aagatactaa taaagaatat tggctagtaa gaaatagctg gggtgaagcg tggggagaga aaggatacat caaactagct cttcattctg gaaagaaggg aacatgtggt atattggttg agccagtgta tccagtgatt aatcaatcaa tatgagtaag ataggatccg gctgctaaca aagcccgaaa ggaagctgag ttggctgctg ccaccgctga gcaataacta gcataacccc ttggggcctc taaacgggtc ttgaggggtt ttttgctgaa aggaggaact atatccggat atccacagga cgggtgtggt cgccatgatc gcgtagtcga tagtggctcc aagtagcgaa gcgagcagga ctgggcggcg gccaaagcgg tcggacagtg ctccgagaac gggtgcgcat agaaattgca tcaacgcata tagcgctagc agcacgccat agtgactggc gatgctgtcg gaatggacga tatcccgcaa gaggcccggc agtaccggca taaccaagcc tatgcctaca gcatccaggg tgacggtgcc gaggatgacg atgagcgcat tgttagattt catacacggt gcctgactgc gttagcaatt taactgtgat aaactaccgc attaaagctt atcgatgata agctgtcaaa catgagaatt cttgaagacg aaagggcctc gtgatacgcc tatttttata ggttaatgtc atgataataa tggtttctta gacgtcaggt ggcacttttc ggggaaatgt gcgcggaacc cctatttgtt tatttttcta aatacattca aatatgtatc cgctcatgag acaataaccc tgataaatgc ttcaataata ttgaaaaagg aagagtatga gtattcaaca tttccgtgtc gcccttattc ccttttttgc ggcattttgc cttcctgttt ttgctcaccc agaaacgctg gtgaaagtaa aagatgctga agatcagttg ggtgcacgag tgggttacat cgaactggat ctcaacagcg gtaagatcct tgagagtttt cgccccgaag aacgttttcc aatgatgagc acttttaaag ttctgctatg tggcgcggta ttatcccgtg ttgacgccgg gcaagagcaa ctcggtcgcc gcatacacta ttctcagaat gacttggttg agtactcacc agtcacagaa aagcatctta cggatggcat gacagtaaga gaattatgca gtgctgccat aaccatgagt gataacactg cggccaactt acttctgaca acgatcggag gaccgaagga gctaaccgct tttttgcaca acatggggga tcatgtaact cgccttgatc gttgggaacc ggagctgaat gaagccatac caaacgacga gcgtgacacc acgatgcctg cagcaatggc aacaacgttg cgcaaactat taactggcga actacttact ctagcttccc ggcaacaatt aatagactgg atggaggcgg ataaagttgc aggaccactt ctgcgctcgg cccttccggc tggctggttt attgctgata aatctggagc cggtgagcgt gggtctcgcg gtatcattgc agcactgggg ccagatggta agccctcccg tatcgtagtt atctacacga cggggagtca ggcaactatg gatgaacgaa atagacagat cgctgagata ggtgcctcac tgattaagca ttggtaactg tcagaccaag tttactcata tatactttag attgatttaa aacttcattt ttaatttaaa aggatctagg tgaagatcct ttttgataat ctcatgacca aaatccctta acgtgagttt tcgttccact gagcgtcaga ccccgtagaa aagatcaaag gatcttcttg agatcctttt tttctgcgcg taatctgctg cttgcaaaca aaaaaaccac cgctaccagc ggtggtttgt ttgccggatc aagagctacc aactcttttt ccgaaggtaa ctggcttcag cagagcgcag ataccaaata ctgtccttct agtgtagccg tagttaggcc accacttcaa gaactctgta gcaccgccta catacctcgc tctgctaatc ctgttaccag tggctgctgc cagtggcgat aagtcgtgtc ttaccgggtt ggactcaaga cgatagttac cggataaggc gcagcggtcg ggctgaacgg ggggttcgtg cacacagccc agcttggagc gaacgaccta caccgaactg agatacctac agcgtgagct atgagaaagc gccacgcttc ccgaagggag aaaggcggac aggtatccgg taagcggcag ggtcggaaca ggagagcgca cgagggagct tccaggggga aacgcctggt atctttatag tcctgtcggg tttcgccacc tctgacttga gcgtcgattt ttgtgatgct cgtcaggggg gcggagccta tggaaaaacg ccagcaacgc ggccttttta cggttcctgg ccttttgctg gccttttgct cacatgttct ttcctgcgtt atcccctgat tctgtggata accgtattac cgcctttgag tgagctgata ccgctcgccg cagccgaacg accgagcgca gcgagtcagt gagcgaggaa gcggaagagc gcctgatgcg gtattttctc cttacgcatc tgtgcggtat ttcacaccgc atatatggtg cactctcagt acaatctgct ctgatgccgc atagttaagc cagtatacac tccgctatcg ctacgtgact gggtcatggc tgcgccccga cacccgccaa cacccgctga cgcgccctga cgggcttgtc tgctcccggc atccgcttac agacaagctg tgaccgtctc cgggagctgc atgtgtcaga ggttttcacc gtcatcaccg aaacgcgcga ggcagctgcg gtaaagctca tcagcgtggt cgtgaagcga ttcacagatg tctgcctgtt catccgcgtc cagctcgttg agtttctcca gaagcgttaa tgtctggctt ctgataaagc gggccatgtt aagggcggtt ttttcctgtt tggtcactga tgcctccgtg taagggggat ttctgttcat gggggtaatg ataccgatga aacgagagag gatgctcacg atacgggtta ctgatgatga acatgcccgg ttactggaac gttgtgaggg taaacaactg gcggtatgga tgcggcggga ccagagaaaa atcactcagg gtcaatgcca gcgcttcgtt aatacagatg taggtgttcc acagggtagc cagcagcatc ctgcgatgca gatccggaac ataatggtgc agggcgctga cttccgcgtt tccagacttt acgaaacacg gaaaccgaag accattcatg ttgttgctca ggtcgcagac gttttgcagc agcagtcgct tcacgttcgc tcgcgtatcg gtgattcatt ctgctaacca gtaaggcaac cccgccagcc tagccgggtc ctcaacgaca ggagcacgat catgcgcacc cgtggccagg acccaacgct gcccgagatg cgccgcgtgc ggctgctgga gatggcggac gcgatggata tgttctgcca agggttggtt tgcgcattca cagttctccg caagaattga ttggctccaa ttcttggagt ggtgaatccg ttagcgaggt gccgccggct tccattcagg tcgaggtggc ccggctccat gcaccgcgac gcaacgcggg gaggcagaca aggtataggg cggcgcctac aatccatgcc aacccgttcc atgtgctcgc cgaggcggca taaatcgccg tgacgatcag cggtccagtg atcgaagtta ggctggtaag agccgcgagc gatccttgaa gctgtccctg atggtcgtca tctacctgcc tggacagcat ggcctgcaac gcgggcatcc cgatgccgcc ggaagcgaga agaatcataa tggggaaggc catccagcct cgcgtcgcga acgccagcaa gacgtagccc agcgcgtcgg ccgccatgcc ggcgataatg gcctgcttct cgccgaaacg tttggtggcg ggaccagtga cgaaggcttg agcgagggcg tgcaagattc cgaataccgc aagcgacagg ccgatcatcg tcgcgctcca gcgaaagcgg tcctcgccga aaatgaccca gagcgctgcc ggcacctgtc ctacgagttg catgataaag aagacagtca taagtgcggc gacgatagtc atgccccgcg cccaccggaa ggagctgact gggttgaagg ctctcaaggg catcggtcga cgctctccct tatgcgactc ctgcattagg aagcagccca gtagtaggtt gaggccgttg agcaccgccg ccgcaaggaa tggtgcatgc aaggagatgg cgcccaacag tcccccggcc acggggcctg ccaccatacc cacgccgaaa caagcgctca tgagcccgaa gtggcgagcc cgatcttccc catcggtgat gtcggcgata taggcgccag caaccgcacc tgtggcgccg gtgatgccgg ccacgatgcg tccggcgtag aggatcgaga tctcgatccc gcgaaat
Details for CrpaA.00785.a.B3.GE39442
HARVESTED ON: 8/5/2015
SEQUENCED ON: 8/13/2015
EXPECTED MW: 37kDa
OBSERVED MW: 37kDa
ANTIBIOTIC MARKER: ampicillin
COUNT OF EXPRESSION COLONIES: Too many to count (100+)
TOTAL EXPRESSION LEVEL: Moderate Expression
SOLUBLE EXPRESSION LEVEL Insoluble
EXPRESSION HOST: data unavailable
SEQUENCING RESULT:
PERCENT IDENTITY: 99
PERCENT COVERAGE: 100
Validated AA Sequence
MAHHHHHHMF EEFKKKYHKV YSSMEEENQR FEIYKQNMNF IKTTNSQGFS YVLEMNEFGD LSKEEFMARF TGYIKDSKDD ERVFKSSRVS ASESEEEFVP PNSINWVEAG CVNPIRNQKN CGSCWAFSAV AALEGATCAQ TNRGLPSLSE QQFVDCSKQN GNFGCDGGTM GLAFQYAIKN KYLCTNDDYP YFAEEKTCMD SFCENYIEIP VKAYKYVFPR NINALKTALA KYGPISVAIQ ADQTPFQFYK SGVFDAPCGT KVNHGVVLVG YDMDEDTNKE YWLVRNSWGE AWGEKGYIKL ALHSGKKGTC GILVEPVXXX INQSI
Validated NT Sequence
ttactcactt atattgattg attaatcann nnnnacactg gctcaaccaa tataccacat gttcccttct ttccagaatg aagagctagt ttgatgtatc ctttctctcc ccacgcttca ccccagctat ttcttactag ccaatattct ttattagtat cttcatccat atcatatcca actagaacaa ctccatgatt aaccttggtt ccacaaggag catcgaatac tccactttta taaaactgga aaggggtttg atcggcctga attgcaactg aaattggtcc atacttagcc aaagcagtct ttaatgcatt aatatttctc ggaaatacat atttgtaggc ttttacagga atctctatat aattctcgca aaatgaatcc atacatgttt tttcctcagc aaagtaaggg taatcatcat tagtacataa atatttgttc ttaattgcat actgaaaagc caatcccatt gttcctccat cacatccaaa gttgccattt tgtttactgc aatcaacaaa ttgctgttca ctcaagcttg gtaatcctcg gtttgtttga gcacacgttg ctccctccaa agctgcaaca gcagagaaag cccaacatga cccacaattc ttttgatttc ttattgggtt cacgcatcca gcttccaccc aattaataga atttggggga acaaattcct cttctgattc gcttgctgag actctacttg acttaaatac cctttcatca tctttggaat cttttatata tcctgtgaat cttgccataa actcttcttt cgacaaatca ccaaattcat tcatttctaa cacataactg aatccttggc tatttgttgt tttaataaag ttcatatttt gcttataaat ttcaaatctt tgattttcct cctccataga gctatatact ttgtggtatt tctttttgaa ctcctcaaac atatggtggt ggtggtggtg agccat
Expected Protein Sequence
MAHHHHHHFE EFKKKYHKVY SSMEEENQRF EIYKQNMNFI KTTNSQGFSY VLEMNEFGDL SKEEFMARFT GYIKDSKDDE RVFKSSRVSA SESEEEFVPP NSINWVEAGC VNPIRNQKNC GSCWAFSAVA ALEGATCAQT NRGLPSLSEQ QFVDCSKQNG NFGCDGGTMG LAFQYAIKNK YLCTNDDYPY FAEEKTCMDS FCENYIEIPV KAYKYVFPRN INALKTALAK YGPISVAIQA DQTPFQFYKS GVFDAPCGTK VNHGVVLVGY DMDEDTNKEY WLVRNSWGEA WGEKGYIKLA LHSGKKGTCG ILVEPVYPVI NQSI
Full NT Sequence (Expression Vector + Insert)
taatacgact cactataggg agaccacaac ggtttccctc tagaaataat tttgtttaac tttaagaagg agatatacca tggctcacca ccaccaccac cattttgagg agttcaaaaa gaaataccac aaagtatata gctctatgga ggaggaaaat caaagatttg aaatttataa gcaaaatatg aactttatta aaacaacaaa tagccaagga ttcagttatg tgttagaaat gaatgaattt ggtgatttgt cgaaagaaga gtttatggca agattcacag gatatataaa agattccaaa gatgatgaaa gggtatttaa gtcaagtaga gtctcagcaa gcgaatcaga agaggaattt gttcccccaa attctattaa ttgggtggaa gctggatgcg tgaacccaat aagaaatcaa aagaattgtg ggtcatgttg ggctttctct gctgttgcag ctttggaggg agcaacgtgt gctcaaacaa accgaggatt accaagcttg agtgaacagc aatttgttga ttgcagtaaa caaaatggca actttggatg tgatggagga acaatgggat tggcttttca gtatgcaatt aagaacaaat atttatgtac taatgatgat tacccttact ttgctgagga aaaaacatgt atggattcat tttgcgagaa ttatatagag attcctgtaa aagcctacaa atatgtattt ccaagaaata ttaatgcatt aaagactgct ttggctaagt atggaccaat ttcagttgca attcaggccg atcaaacccc tttccagttt tataaaagtg gagtattcga tgctccttgt ggaaccaagg ttaatcatgg agttgttcta gttggatatg atatggatga agatactaat aaagaatatt ggctagtaag aaatagctgg ggtgaagcgt ggggagagaa aggatacatc aaactagctc ttcattctgg aaagaaggga acatgtggta tattggttga gccagtgtat ccagtgatta atcaatcaat atgagtaaga taggatccgg ctgctaacaa agcccgaaag gaagctgagt tggctgctgc caccgctgag caataactag cataacccct tggggcctct aaacgggtct tgaggggttt tttgctgaaa ggaggaacta tatccggata tccacaggac gggtgtggtc gccatgatcg cgtagtcgat agtggctcca agtagcgaag cgagcaggac tgggcggcgg ccaaagcggt cggacagtgc tccgagaacg ggtgcgcata gaaattgcat caacgcatat agcgctagca gcacgccata gtgactggcg atgctgtcgg aatggacgat atcccgcaag aggcccggca gtaccggcat aaccaagcct atgcctacag catccagggt gacggtgccg aggatgacga tgagcgcatt gttagatttc atacacggtg cctgactgcg ttagcaattt aactgtgata aactaccgca ttaaagctta tcgatgataa gctgtcaaac atgagaattc ttgaagacga aagggcctcg tgatacgcct atttttatag gttaatgtca tgataataat ggtttcttag acgtcaggtg gcacttttcg gggaaatgtg cgcggaaccc ctatttgttt atttttctaa atacattcaa atatgtatcc gctcatgaga caataaccct gataaatgct tcaataatat tgaaaaagga agagtatgag tattcaacat ttccgtgtcg cccttattcc cttttttgcg gcattttgcc ttcctgtttt tgctcaccca gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg gtgcacgagt gggttacatc gaactggatc tcaacagcgg taagatcctt gagagttttc gccccgaaga acgttttcca atgatgagca cttttaaagt tctgctatgt ggcgcggtat tatcccgtgt tgacgccggg caagagcaac tcggtcgccg catacactat tctcagaatg acttggttga gtactcacca gtcacagaaa agcatcttac ggatggcatg acagtaagag aattatgcag tgctgccata accatgagtg ataacactgc ggccaactta cttctgacaa cgatcggagg accgaaggag ctaaccgctt ttttgcacaa catgggggat catgtaactc gccttgatcg ttgggaaccg gagctgaatg aagccatacc aaacgacgag cgtgacacca cgatgcctgc agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc acatgttctt tcctgcgtta tcccctgatt ctgtggataa ccgtattacc gcctttgagt gagctgatac cgctcgccgc agccgaacga ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg cctgatgcgg tattttctcc ttacgcatct gtgcggtatt tcacaccgca tatatggtgc actctcagta caatctgctc tgatgccgca tagttaagcc agtatacact ccgctatcgc tacgtgactg ggtcatggct gcgccccgac acccgccaac acccgctgac gcgccctgac gggcttgtct gctcccggca tccgcttaca gacaagctgt gaccgtctcc gggagctgca tgtgtcagag gttttcaccg tcatcaccga aacgcgcgag gcagctgcgg taaagctcat cagcgtggtc gtgaagcgat tcacagatgt ctgcctgttc atccgcgtcc agctcgttga gtttctccag aagcgttaat gtctggcttc tgataaagcg ggccatgtta agggcggttt tttcctgttt ggtcactgat gcctccgtgt aagggggatt tctgttcatg ggggtaatga taccgatgaa acgagagagg atgctcacga tacgggttac tgatgatgaa catgcccggt tactggaacg ttgtgagggt aaacaactgg cggtatggat gcggcgggac cagagaaaaa tcactcaggg tcaatgccag cgcttcgtta atacagatgt aggtgttcca cagggtagcc agcagcatcc tgcgatgcag atccggaaca taatggtgca gggcgctgac ttccgcgttt ccagacttta cgaaacacgg aaaccgaaga ccattcatgt tgttgctcag gtcgcagacg ttttgcagca gcagtcgctt cacgttcgct cgcgtatcgg tgattcattc tgctaaccag taaggcaacc ccgccagcct agccgggtcc tcaacgacag gagcacgatc atgcgcaccc gtggccagga cccaacgctg cccgagatgc gccgcgtgcg gctgctggag atggcggacg cgatggatat gttctgccaa gggttggttt gcgcattcac agttctccgc aagaattgat tggctccaat tcttggagtg gtgaatccgt tagcgaggtg ccgccggctt ccattcaggt cgaggtggcc cggctccatg caccgcgacg caacgcgggg aggcagacaa ggtatagggc ggcgcctaca atccatgcca acccgttcca tgtgctcgcc gaggcggcat aaatcgccgt gacgatcagc ggtccagtga tcgaagttag gctggtaaga gccgcgagcg atccttgaag ctgtccctga tggtcgtcat ctacctgcct ggacagcatg gcctgcaacg cgggcatccc gatgccgccg gaagcgagaa gaatcataat ggggaaggcc atccagcctc gcgtcgcgaa cgccagcaag acgtagccca gcgcgtcggc cgccatgccg gcgataatgg cctgcttctc gccgaaacgt ttggtggcgg gaccagtgac gaaggcttga gcgagggcgt gcaagattcc gaataccgca agcgacaggc cgatcatcgt cgcgctccag cgaaagcggt cctcgccgaa aatgacccag agcgctgccg gcacctgtcc tacgagttgc atgataaaga agacagtcat aagtgcggcg acgatagtca tgccccgcgc ccaccggaag gagctgactg ggttgaaggc tctcaagggc atcggtcgac gctctccctt atgcgactcc tgcattagga agcagcccag tagtaggttg aggccgttga gcaccgccgc cgcaaggaat ggtgcatgca aggagatggc gcccaacagt cccccggcca cggggcctgc caccataccc acgccgaaac aagcgctcat gagcccgaag tggcgagccc gatcttcccc atcggtgatg tcggcgatat aggcgccagc aaccgcacct gtggcgccgg tgatgccggc cacgatgcgt ccggcgtaga ggatcgagat ctcgatcccg cgaaat
Details for CrpaA.00785.a.B4.GE39443
HARVESTED ON: 8/5/2015
SEQUENCED ON: 8/13/2015
EXPECTED MW: 26kDa
OBSERVED MW: 28kDa
ANTIBIOTIC MARKER: ampicillin
COUNT OF EXPRESSION COLONIES: Too many to count (100+)
TOTAL EXPRESSION LEVEL: Low Expression
SOLUBLE EXPRESSION LEVEL Insoluble
EXPRESSION HOST: data unavailable
SEQUENCING RESULT:
PERCENT IDENTITY: 99
PERCENT COVERAGE: 100
Validated AA Sequence
MAHHHHHHMP NSINWVEAGC VNPIRNQKNC GSCWAFSAVA ALEGATCAQT NRGLPSLSEQ QFVDCSKQNG NFGCDGGTMG LAFQYAIKNK YLCTNDDYPY FAEEKTCMDS FCENYIEIPV KAYKYVFPRN INALKTALAK YGPISVAIQA DQTPFQFYKS GVFDAPCGTK VNHGVVLVGY DMDEDTNKEY WLVRNSWGEA WGEKGYIKLA LHSGKKGTCG ILVEPVXPVI NQSI
Validated NT Sequence
atcttactca cttatattga ttgattaatc actggnnaca ctggctcaac caatatacca catgttccct tctttccaga atgaagagct agtttgatgt atcctttctc tccccacgct tcaccccagc tatttcttac tagccaatat tctttattag tatcttcatc catatcatat ccaactagaa caactccatg attaaccttg gttccacaag gagcatcgaa tactccactt ttataaaact ggaaaggggt ttgatcggcc tgaattgcaa ctgaaattgg tccatactta gccaaagcag tctttaatgc attaatattt ctcggaaata catatttgta ggcttttaca ggaatctcta tataattctc gcaaaatgaa tccatacatg ttttttcctc agcaaagtaa gggtaatcat cattagtaca taaatatttg ttcttaattg catactgaaa agccaatccc attgttcctc catcacatcc aaagttgcca ttttgtttac tgcaatcaac aaattgctgt tcactcaagc ttggtaatcc tcggtttgtt tgagcacacg ttgctccctc caaagctgca acagcagaga aagcccaaca tgacccacaa ttcttttgat ttcttattgg gttcacgcat ccagcttcca cccaattaat agaatttggc atatggtggt ggtggtggtg agccat
Expected Protein Sequence
MAHHHHHHPN SINWVEAGCV NPIRNQKNCG SCWAFSAVAA LEGATCAQTN RGLPSLSEQQ FVDCSKQNGN FGCDGGTMGL AFQYAIKNKY LCTNDDYPYF AEEKTCMDSF CENYIEIPVK AYKYVFPRNI NALKTALAKY GPISVAIQAD QTPFQFYKSG VFDAPCGTKV NHGVVLVGYD MDEDTNKEYW LVRNSWGEAW GEKGYIKLAL HSGKKGTCGI LVEPVYPVIN QSI
Full NT Sequence (Expression Vector + Insert)
taatacgact cactataggg agaccacaac ggtttccctc tagaaataat tttgtttaac tttaagaagg agatatacca tggctcacca ccaccaccac catccaaatt ctattaattg ggtggaagct ggatgcgtga acccaataag aaatcaaaag aattgtgggt catgttgggc tttctctgct gttgcagctt tggagggagc aacgtgtgct caaacaaacc gaggattacc aagcttgagt gaacagcaat ttgttgattg cagtaaacaa aatggcaact ttggatgtga tggaggaaca atgggattgg cttttcagta tgcaattaag aacaaatatt tatgtactaa tgatgattac ccttactttg ctgaggaaaa aacatgtatg gattcatttt gcgagaatta tatagagatt cctgtaaaag cctacaaata tgtatttcca agaaatatta atgcattaaa gactgctttg gctaagtatg gaccaatttc agttgcaatt caggccgatc aaaccccttt ccagttttat aaaagtggag tattcgatgc tccttgtgga accaaggtta atcatggagt tgttctagtt ggatatgata tggatgaaga tactaataaa gaatattggc tagtaagaaa tagctggggt gaagcgtggg gagagaaagg atacatcaaa ctagctcttc attctggaaa gaagggaaca tgtggtatat tggttgagcc agtgtatcca gtgattaatc aatcaatatg agtaagatag gatccggctg ctaacaaagc ccgaaaggaa gctgagttgg ctgctgccac cgctgagcaa taactagcat aaccccttgg ggcctctaaa cgggtcttga ggggtttttt gctgaaagga ggaactatat ccggatatcc acaggacggg tgtggtcgcc atgatcgcgt agtcgatagt ggctccaagt agcgaagcga gcaggactgg gcggcggcca aagcggtcgg acagtgctcc gagaacgggt gcgcatagaa attgcatcaa cgcatatagc gctagcagca cgccatagtg actggcgatg ctgtcggaat ggacgatatc ccgcaagagg cccggcagta ccggcataac caagcctatg cctacagcat ccagggtgac ggtgccgagg atgacgatga gcgcattgtt agatttcata cacggtgcct gactgcgtta gcaatttaac tgtgataaac taccgcatta aagcttatcg atgataagct gtcaaacatg agaattcttg aagacgaaag ggcctcgtga tacgcctatt tttataggtt aatgtcatga taataatggt ttcttagacg tcaggtggca cttttcgggg aaatgtgcgc ggaaccccta tttgtttatt tttctaaata cattcaaata tgtatccgct catgagacaa taaccctgat aaatgcttca ataatattga aaaaggaaga gtatgagtat tcaacatttc cgtgtcgccc ttattccctt ttttgcggca ttttgccttc ctgtttttgc tcacccagaa acgctggtga aagtaaaaga tgctgaagat cagttgggtg cacgagtggg ttacatcgaa ctggatctca acagcggtaa gatccttgag agttttcgcc ccgaagaacg ttttccaatg atgagcactt ttaaagttct gctatgtggc gcggtattat cccgtgttga cgccgggcaa gagcaactcg gtcgccgcat acactattct cagaatgact tggttgagta ctcaccagtc acagaaaagc atcttacgga tggcatgaca gtaagagaat tatgcagtgc tgccataacc atgagtgata acactgcggc caacttactt ctgacaacga tcggaggacc gaaggagcta accgcttttt tgcacaacat gggggatcat gtaactcgcc ttgatcgttg ggaaccggag ctgaatgaag ccataccaaa cgacgagcgt gacaccacga tgcctgcagc aatggcaaca acgttgcgca aactattaac tggcgaacta cttactctag cttcccggca acaattaata gactggatgg aggcggataa agttgcagga ccacttctgc gctcggccct tccggctggc tggtttattg ctgataaatc tggagccggt gagcgtgggt ctcgcggtat cattgcagca ctggggccag atggtaagcc ctcccgtatc gtagttatct acacgacggg gagtcaggca actatggatg aacgaaatag acagatcgct gagataggtg cctcactgat taagcattgg taactgtcag accaagttta ctcatatata ctttagattg atttaaaact tcatttttaa tttaaaagga tctaggtgaa gatccttttt gataatctca tgaccaaaat cccttaacgt gagttttcgt tccactgagc gtcagacccc gtagaaaaga tcaaaggatc ttcttgagat cctttttttc tgcgcgtaat ctgctgcttg caaacaaaaa aaccaccgct accagcggtg gtttgtttgc cggatcaaga gctaccaact ctttttccga aggtaactgg cttcagcaga gcgcagatac caaatactgt ccttctagtg tagccgtagt taggccacca cttcaagaac tctgtagcac cgcctacata cctcgctctg ctaatcctgt taccagtggc tgctgccagt ggcgataagt cgtgtcttac cgggttggac tcaagacgat agttaccgga taaggcgcag cggtcgggct gaacgggggg ttcgtgcaca cagcccagct tggagcgaac gacctacacc gaactgagat acctacagcg tgagctatga gaaagcgcca cgcttcccga agggagaaag gcggacaggt atccggtaag cggcagggtc ggaacaggag agcgcacgag ggagcttcca gggggaaacg cctggtatct ttatagtcct gtcgggtttc gccacctctg acttgagcgt cgatttttgt gatgctcgtc aggggggcgg agcctatgga aaaacgccag caacgcggcc tttttacggt tcctggcctt ttgctggcct tttgctcaca tgttctttcc tgcgttatcc cctgattctg tggataaccg tattaccgcc tttgagtgag ctgataccgc tcgccgcagc cgaacgaccg agcgcagcga gtcagtgagc gaggaagcgg aagagcgcct gatgcggtat tttctcctta cgcatctgtg cggtatttca caccgcatat atggtgcact ctcagtacaa tctgctctga tgccgcatag ttaagccagt atacactccg ctatcgctac gtgactgggt catggctgcg ccccgacacc cgccaacacc cgctgacgcg ccctgacggg cttgtctgct cccggcatcc gcttacagac aagctgtgac cgtctccggg agctgcatgt gtcagaggtt ttcaccgtca tcaccgaaac gcgcgaggca gctgcggtaa agctcatcag cgtggtcgtg aagcgattca cagatgtctg cctgttcatc cgcgtccagc tcgttgagtt tctccagaag cgttaatgtc tggcttctga taaagcgggc catgttaagg gcggtttttt cctgtttggt cactgatgcc tccgtgtaag ggggatttct gttcatgggg gtaatgatac cgatgaaacg agagaggatg ctcacgatac gggttactga tgatgaacat gcccggttac tggaacgttg tgagggtaaa caactggcgg tatggatgcg gcgggaccag agaaaaatca ctcagggtca atgccagcgc ttcgttaata cagatgtagg tgttccacag ggtagccagc agcatcctgc gatgcagatc cggaacataa tggtgcaggg cgctgacttc cgcgtttcca gactttacga aacacggaaa ccgaagacca ttcatgttgt tgctcaggtc gcagacgttt tgcagcagca gtcgcttcac gttcgctcgc gtatcggtga ttcattctgc taaccagtaa ggcaaccccg ccagcctagc cgggtcctca acgacaggag cacgatcatg cgcacccgtg gccaggaccc aacgctgccc gagatgcgcc gcgtgcggct gctggagatg gcggacgcga tggatatgtt ctgccaaggg ttggtttgcg cattcacagt tctccgcaag aattgattgg ctccaattct tggagtggtg aatccgttag cgaggtgccg ccggcttcca ttcaggtcga ggtggcccgg ctccatgcac cgcgacgcaa cgcggggagg cagacaaggt atagggcggc gcctacaatc catgccaacc cgttccatgt gctcgccgag gcggcataaa tcgccgtgac gatcagcggt ccagtgatcg aagttaggct ggtaagagcc gcgagcgatc cttgaagctg tccctgatgg tcgtcatcta cctgcctgga cagcatggcc tgcaacgcgg gcatcccgat gccgccggaa gcgagaagaa tcataatggg gaaggccatc cagcctcgcg tcgcgaacgc cagcaagacg tagcccagcg cgtcggccgc catgccggcg ataatggcct gcttctcgcc gaaacgtttg gtggcgggac cagtgacgaa ggcttgagcg agggcgtgca agattccgaa taccgcaagc gacaggccga tcatcgtcgc gctccagcga aagcggtcct cgccgaaaat gacccagagc gctgccggca cctgtcctac gagttgcatg ataaagaaga cagtcataag tgcggcgacg atagtcatgc cccgcgccca ccggaaggag ctgactgggt tgaaggctct caagggcatc ggtcgacgct ctcccttatg cgactcctgc attaggaagc agcccagtag taggttgagg ccgttgagca ccgccgccgc aaggaatggt gcatgcaagg agatggcgcc caacagtccc ccggccacgg ggcctgccac catacccacg ccgaaacaag cgctcatgag cccgaagtgg cgagcccgat cttccccatc ggtgatgtcg gcgatatagg cgccagcaac cgcacctgtg gcgccggtga tgccggccac gatgcgtccg gcgtagagga tcgagatctc gatcccgcga aat