RiprA.10100.a

PROBABLE PERIPLASMIC SERINE PROTEASE DO-LIKE PRECURSOR (htrA)

CENTER ID: RiprA.10100.a
ORGANISM: Rickettsia prowazekii str. Madrid E
ASSOCIATED DISEASE: Endemic typhus
CURRENT STATUS: crystallized
COMMUNITY REQUEST: False
NIH RISK GROUP: 3
SELECT AGENT: True
NIH PRIORITY
pathogens category:
IIIB/C

Ordering Clones & Proteins

If there are materials available for this target, they will be listed below. Materials can be ordered from SSGCID using the button in the "order material" column. Clicking the button will add the material to a virtual cart. You may order multiple materials at a time at no cost to you, as this contract is funded by NIAID. When you are ready to place your order, click the "Place Order" link which will appear in the top right corner of the page after you place your first item in your cart.

Clones*

CENTER
REFERENCE ID
DOMAIN/REGION
DESCRIPTION
INFO AA
START
AA
STOP
ORDER
MATERIAL
RiprA.10100.a.A2.GE27748 Mature protein( RiprA.10100.a ) 21 513
* SSGCID clones represent un-induced expression constructs which have been verified by sequencing from vector primers. Clones may contain a tag, such as N-term 6xHis. Get sequence information using the button in the "info" column.

Proteins

CENTER
REFERENCE ID
DOMAIN/REGION
DESCRIPTION
INFO AA
START
AA
STOP
ORDER
MATERIAL
RiprA.10100.a.A2.PS00478 Mature protein( RiprA.10100.a ) 21 513

External Resources

RESOURCE REFERENCE ID
BV-BRC: fig|272947.5.peg.126
RefSeq: NP_220516.1
UniProt: O05942

Sequences

These sequences are the native gene sequence; sequences of constructs derived from these sequences may differ due to codon optimization or other protocols. To find the specific sequence of any material you may have ordered, click on the "info" button next to the name of that material.
AA Sequence
MVNLKIFLIV IVLMFNNIIL AKENSNALKV VDQEENEFTA INSAPLKISE AARYSFADIV EPLIPAVVNI STIEYVNDKS ENSEKDLLQE NKHLGFMSDV LEKLNIPLNL EEIAKTPKSI PLGSGFIIAP NGLIVTNYHV IANVEKINIK LADNTEFLAK LIGSDSKTDL ALLKIDSEEP LPFVEFGDSN DARVGDWVIA IGNPFGNLGG TVTSGIISSK GRDIDVDTDN IVDNFIQTDA AINNGNSGGP MFNLDQKVIG VNTAIFSPLG TNIGIGFAIP SNTAKPIIER LKKDGKVSRG RLGVTIQDLT EEISEVLGFK GTNGVLVSKV QENGPGYKAG IKKGDIIIKF GDRLVKNTKK LRVIIADTPI NQEVKLKILR DAQELELPIK VTADNEEVIN DSTEETNKAV IINKKENNLS ITKNNITFSN LTEELRKKYD IPQDKTGIVI INIDEEESVF KLGDLITNIN HDSIDDIRKL EVLYENAKKL EKQNILLLIE RGDTSVFIPL SVS
NT Sequence
ATGGTTAATC TAAAAATATT TCTTATTGTA ATAGTTTTAA TGTTTAATAA CATTATTCTA GCAAAAGAAA ATAGTAATGC TTTAAAAGTA GTAGATCAAG AAGAGAATGA ATTTACTGCA ATAAATTCGG CTCCTTTGAA AATAAGTGAA GCAGCACGGT ATAGCTTTGC CGATATAGTG GAGCCGTTAA TACCTGCAGT CGTTAATATT TCAACAATAG AATATGTTAA TGATAAGTCC GAAAATTCTG AGAAAGATCT TTTGCAAGAA AATAAGCATT TAGGCTTCAT GAGTGATGTT CTAGAAAAAC TTAATATACC GCTGAATTTA GAAGAGATCG CAAAAACTCC TAAAAGTATT CCGCTTGGTT CAGGATTTAT TATTGCACCT AATGGTTTAA TAGTGACAAA CTATCATGTA ATTGCAAATG TAGAAAAAAT TAATATAAAA CTTGCAGATA ATACAGAATT TTTAGCTAAA TTAATAGGTA GTGATTCTAA AACCGATTTA GCCCTTTTAA AAATAGATAG TGAAGAACCG CTGCCTTTTG TTGAGTTTGG AGATTCAAAT GATGCAAGAG TAGGCGACTG GGTTATTGCA ATCGGTAATC CTTTCGGTAA CCTAGGCGGT ACAGTGACAA GCGGTATTAT ATCCTCTAAA GGGCGTGATA TTGATGTAGA TACGGACAAT ATAGTTGATA ATTTTATTCA AACTGATGCT GCAATTAATA ATGGTAATTC TGGTGGTCCT ATGTTTAATT TGGATCAGAA AGTAATTGGC GTAAATACGG CAATTTTCTC ACCACTTGGT ACTAATATAG GTATCGGCTT TGCTATTCCT TCAAATACTG CAAAGCCTAT AATTGAACGT CTAAAAAAAG ATGGTAAAGT AAGTAGAGGT CGCCTTGGAG TAACAATACA AGATTTAACT GAAGAAATTT CTGAAGTTCT AGGGTTTAAA GGTACTAATG GTGTTTTAGT ATCTAAAGTA CAAGAAAATG GTCCAGGTTA TAAAGCAGGT ATTAAAAAAG GTGATATAAT AATAAAGTTT GGAGATAGAT TAGTTAAAAA TACTAAAAAG TTACGTGTAA TTATAGCCGA TACCCCTATT AATCAAGAAG TAAAATTAAA AATACTACGT GATGCACAAG AGCTTGAATT ACCAATTAAA GTTACTGCAG ATAATGAAGA AGTTATTAAT GATTCTACAG AAGAGACTAA TAAAGCAGTA ATAATAAACA AAAAAGAAAA TAATCTATCT ATTACTAAAA ATAATATTAC TTTTAGTAAT TTAACTGAAG AATTAAGAAA AAAATATGAT ATTCCTCAGG ATAAAACAGG AATCGTTATA ATTAATATTG ATGAAGAAGA AAGTGTATTT AAGCTTGGTG ATTTAATAAC CAATATTAAT CATGATAGCA TCGATGATAT AAGAAAGTTA GAAGTATTAT ATGAAAATGC TAAAAAATTA GAAAAACAAA ATATTTTGCT CTTAATTGAA AGGGGTGATA CGAGTGTGTT TATCCCGTTA TCGGTTTCA
Details for RiprA.10100.a.A2.GE27748
HARVESTED ON: 11/16/2009
SEQUENCED ON: 11/23/2009
EXPECTED MW: 56kDa
OBSERVED MW: 64kDa
ANTIBIOTIC MARKER: ampicillin
COUNT OF EXPRESSION COLONIES: Too many to count (100+)
TOTAL EXPRESSION LEVEL: Low Expression
SOLUBLE EXPRESSION LEVEL Low Expression
EXPRESSION HOST: data unavailable
SEQUENCING RESULT: pass
PERCENT IDENTITY: 100
PERCENT COVERAGE: 100
Validated AA Sequence
MAHHHHHHMG TLEAQTQGPG SMAKENSNAL KVVDQEENEF TAINSAPLKI SEAARYSFAD IVEPLIPAVV NISTIEYVND KSENSEKDLL QENKHLGFMS DVLEKLNIPL NLEEIAKTPK SIPLGSGFII APNGLIVTNY HVIANVEKIN IKLADNTEFL AKLIGSDSKT DLALLKIDSE EPLPFVEFGD SNDARVGDWV IAIGNPFGNL GGTVTSGIIS SKGRDIDVDT DNIVDNFIQT DAAINNGNSG GPMFNLDQKV IGVNTAIFSP LGTNIGIGFA IPSNTAKPII ERLKKDGKVS RGRLGVTIQD LTEEISEVLG FKGTNGVLVS KVQENGPGYK AGIKKGDIII KFGDRLVKNT KKLRVIIADT PINQEVKLKI LRDAQELELP IKVTADNEEV INDSTEETNK AVIINKKENN LSITKNNITF SNLTEELRKK YDIPQDKTGI VIINIDEEES VFKLGDLITN INHDSIDDIR KLEVLYENAK KLEKQNILLL IERGDTSVFI PLSVS
Validated NT Sequence
atggctcatc accatcacca tcatatgggt accctggaag ctcagaccca gggtcctggt tcgatggcaa aagaaaatag taatgcttta aaagtagtag atcaagaaga gaatgaattt actgcaataa attcggctcc tttgaaaata agtgaagcag cacggtatag ctttgccgat atagtggagc cgttaatacc tgcagtcgtt aatatttcaa caatagaata tgttaatgat aagtccgaaa attctgagaa agatcttttg caagaaaata agcatttagg cttcatgagt gatgttctag aaaaacttaa tataccgctg aatttagaag agatcgcaaa aactcctaaa agtattccgc ttggttcagg atttattatt gcacctaatg gtttaatagt gacaaactat catgtaattg caaatgtaga aaaaattaat ataaaacttg cagataatac agaattttta gctaaattaa taggtagtga ttctaaaacc gatttagccc ttttaaaaat agatagtgaa gaaccgctgc cttttgttga gtttggagat tcaaatgatg caagagtagg cgactgggtt attgcaatcg gtaatccttt cggtaaccta ggcggtacag tgacaagcgg tattatatcc tctaaagggc gtgatattga tgtagatacg gacaatatag ttgataattt tattcaaact gatgctgcaa ttaataatgg taattctggt ggtcctatgt ttaatttgga tcagaaagta attggcgtaa atacggcaat tttctcacca cttggtacta atataggtat cggctttgct attccttcaa atactgcaaa gcctataatt gaacgtctaa aaaaagatgg taaagtaagt agaggtcgcc ttggagtaac aatacaagat ttaactgaag aaatttctga agttctaggg tttaaaggta ctaatggtgt tttagtatct aaagtacaag aaaatggtcc aggttataaa gcaggtatta aaaaaggtga tataataata aagtttggag atagattagt taaaaatact aaaaagttac gtgtaattat agccgatacc cctattaatc aagaagtaaa attaaaaata ctacgtgatg cacaagagct tgaattacca attaaagtta ctgcagataa tgaagaagtt attaatgatt ctacagaaga gactaataaa gcagtaataa taaacaaaaa agaaaataat ctatctatta ctaaaaataa tattactttt agtaatttaa ctgaagaatt aagaaaaaaa tatgatattc ctcaggataa aacaggaatc gttataatta atattgatga agaagaaagt gtatttaagc ttggtgattt aataaccaat attaatcatg atagcatcga tgatataaga aagttagaag tattatatga aaatgctaaa aaattagaaa aacaaaatat tttgctctta attgaaaggg gtgatacgag tgtgtttatc ccgttatcgg tttcataa
Expected Protein Sequence
MAHHHHHHMG TLEAQTQGPG SMAKENSNAL KVVDQEENEF TAINSAPLKI SEAARYSFAD IVEPLIPAVV NISTIEYVND KSENSEKDLL QENKHLGFMS DVLEKLNIPL NLEEIAKTPK SIPLGSGFII APNGLIVTNY HVIANVEKIN IKLADNTEFL AKLIGSDSKT DLALLKIDSE EPLPFVEFGD SNDARVGDWV IAIGNPFGNL GGTVTSGIIS SKGRDIDVDT DNIVDNFIQT DAAINNGNSG GPMFNLDQKV IGVNTAIFSP LGTNIGIGFA IPSNTAKPII ERLKKDGKVS RGRLGVTIQD LTEEISEVLG FKGTNGVLVS KVQENGPGYK AGIKKGDIII KFGDRLVKNT KKLRVIIADT PINQEVKLKI LRDAQELELP IKVTADNEEV INDSTEETNK AVIINKKENN LSITKNNITF SNLTEELRKK YDIPQDKTGI VIINIDEEES VFKLGDLITN INHDSIDDIR KLEVLYENAK KLEKQNILLL IERGDTSVFI PLSVS
Full NT Sequence (Expression Vector + Insert)
ttcttgaaga cgaaagggcc tcgtgatacg cctattttta taggttaatg tcatgataat aatggtttct tagacgtcag gtggcacttt tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg tgttgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc tgcagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc actgattaag cattggtaac tgtcagacca agtttactca tatatacttt agattgattt aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc gcatatatgg tgcactctca gtacaatctg ctctgatgcc gcatagttaa gccagtatac actccgctat cgctacgtga ctgggtcatg gctgcgcccc gacacccgcc aacacccgct gacgcgccct gacgggcttg tctgctcccg gcatccgctt acagacaagc tgtgaccgtc tccgggagct gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc gaggcagctg cggtaaagct catcagcgtg gtcgtgaagc gattcacaga tgtctgcctg ttcatccgcg tccagctcgt tgagtttctc cagaagcgtt aatgtctggc ttctgataaa gcgggccatg ttaagggcgg ttttttcctg tttggtcact gatgcctccg tgtaaggggg atttctgttc atgggggtaa tgataccgat gaaacgagag aggatgctca cgatacgggt tactgatgat gaacatgccc ggttactgga acgttgtgag ggtaaacaac tggcggtatg gatgcggcgg gaccagagaa aaatcactca gggtcaatgc cagcgcttcg ttaatacaga tgtaggtgtt ccacagggta gccagcagca tcctgcgatg cagatccgga acataatggt gcagggcgct gacttccgcg tttccagact ttacgaaaca cggaaaccga agaccattca tgttgttgct caggtcgcag acgttttgca gcagcagtcg cttcacgttc gctcgcgtat cggtgattca ttctgctaac cagtaaggca accccgccag cctagccggg tcctcaacga caggagcacg atcatgcgca cccgtggcca ggacccaacg ctgcccgaga tgcgccgcgt gcggctgctg gagatggcgg acgcgatgga tatgttctgc caagggttgg tttgcgcatt cacagttctc cgcaagaatt gattggctcc aattcttgga gtggtgaatc cgttagcgag gtgccgccgg cttccattca ggtcgaggtg gcccggctcc atgcaccgcg acgcaacgcg gggaggcaga caaggtatag ggcggcgcct acaatccatg ccaacccgtt ccatgtgctc gccgaggcgg cataaatcgc cgtgacgatc agcggtccag tgatcgaagt taggctggta agagccgcga gcgatccttg aagctgtccc tgatggtcgt catctacctg cctggacagc atggcctgca acgcgggcat cccgatgccg ccggaagcga gaagaatcat aatggggaag gccatccagc ctcgcgtcgt gaacgccagc aagacgtagc ccagcgcgtc ggccgtaaca acaccattta aatggagtgg ttacaaatgg agtggttaat taacaacacc atttgtcgac gctctccctt atgcgactcc tgcattagga agcagcccag tagtaggttg aggccgttga gcaccgccgc cgcaaggaat ggtgcatgca aggagatggc gcccaacagt cccccggcca cggggcctgc caccataccc acgccgaaac aagcgctcat gagcccgaag tggcgagccc gatcttcccc atcggtgatg tcggcgatat aggcgccagc aaccgcacct gtggcgccgg tgatgccggc cacgatgcgt ccggcgtaga ggatcgagat ctcgatcccg cgaaattaat acgactcact atagggagac cacaacggtt tccctctaga aataattttg tttaacttta agaaggagat ataccatggc tcatcaccat caccatcata tgggtaccct ggaagctcag acccagggtc ctggttcgat ggcaaaagaa aatagtaatg ctttaaaagt agtagatcaa gaagagaatg aatttactgc aataaattcg gctcctttga aaataagtga agcagcacgg tatagctttg ccgatatagt ggagccgtta atacctgcag tcgttaatat ttcaacaata gaatatgtta atgataagtc cgaaaattct gagaaagatc ttttgcaaga aaataagcat ttaggcttca tgagtgatgt tctagaaaaa cttaatatac cgctgaattt agaagagatc gcaaaaactc ctaaaagtat tccgcttggt tcaggattta ttattgcacc taatggttta atagtgacaa actatcatgt aattgcaaat gtagaaaaaa ttaatataaa acttgcagat aatacagaat ttttagctaa attaataggt agtgattcta aaaccgattt agccctttta aaaatagata gtgaagaacc gctgcctttt gttgagtttg gagattcaaa tgatgcaaga gtaggcgact gggttattgc aatcggtaat cctttcggta acctaggcgg tacagtgaca agcggtatta tatcctctaa agggcgtgat attgatgtag atacggacaa tatagttgat aattttattc aaactgatgc tgcaattaat aatggtaatt ctggtggtcc tatgtttaat ttggatcaga aagtaattgg cgtaaatacg gcaattttct caccacttgg tactaatata ggtatcggct ttgctattcc ttcaaatact gcaaagccta taattgaacg tctaaaaaaa gatggtaaag taagtagagg tcgccttgga gtaacaatac aagatttaac tgaagaaatt tctgaagttc tagggtttaa aggtactaat ggtgttttag tatctaaagt acaagaaaat ggtccaggtt ataaagcagg tattaaaaaa ggtgatataa taataaagtt tggagataga ttagttaaaa atactaaaaa gttacgtgta attatagccg atacccctat taatcaagaa gtaaaattaa aaatactacg tgatgcacaa gagcttgaat taccaattaa agttactgca gataatgaag aagttattaa tgattctaca gaagagacta ataaagcagt aataataaac aaaaaagaaa ataatctatc tattactaaa aataatatta cttttagtaa tttaactgaa gaattaagaa aaaaatatga tattcctcag gataaaacag gaatcgttat aattaatatt gatgaagaag aaagtgtatt taagcttggt gatttaataa ccaatattaa tcatgatagc atcgatgata taagaaagtt agaagtatta tatgaaaatg ctaaaaaatt agaaaaacaa aatattttgc tcttaattga aaggggtgat acgagtgtgt ttatcccgtt atcggtttca aaacagcacg aacaagttct gcagccaagc ttctcgagga tccggctgct aacaaagccc gaaaggaagc tgagttggct gctgccaccg ctgagcaata actagcataa ccccttgggg cctctaaacg ggtcttgagg ggttttttgc tgaaaggagg aactatatcc ggatatccac aggacgggtg tggtcgccat gatcgcgtag tcgatagtgg ctccaagtag cgaagcgagc aggactgggc ggcggccaaa gcggtcggac agtgctccga gaacgggtgc gcatagaaat tgcatcaacg catatagcgc tagcagcacg ccatagtgac tggcgatgct gtcggaatgg acgatatccc gcaagaggcc cggcagtacc ggcataacca agcctatgcc tacagcatcc agggtgacgg tgccgaggat gacgatgagc gcattgttag atttcataca cggtgcctga ctgcgttagc aatttaactg tgataaacta ccgcattaaa gcttatcgat gataagctgt caaacatgag aa
Details for RiprA.10100.a.A2.PS00478
PURIFICATION DATe: 2/12/2010
CONCENTRATION: 15.81mg/ml
OBSERVED MW: data unavailable
EXPRESSION LEVEL: Low Expression
PROTEIN PURIFICATION BUFFER: 20 mM HEPES, pH 7.0, 300 mM NaCl, 5% glycerol and 1 mM TCEP
EXPRESSION HOST: data unavailable
VIAL COUNT (approx.): 8
VIAL VOLUME: 200µl
PERCENT IDENTITY: 100
PERCENT COVERAGE: 100
Protocol Notes
notes unavailable
Validated AA Sequence
MAHHHHHHMG TLEAQTQGPG SMAKENSNAL KVVDQEENEF TAINSAPLKI SEAARYSFAD IVEPLIPAVV NISTIEYVND KSENSEKDLL QENKHLGFMS DVLEKLNIPL NLEEIAKTPK SIPLGSGFII APNGLIVTNY HVIANVEKIN IKLADNTEFL AKLIGSDSKT DLALLKIDSE EPLPFVEFGD SNDARVGDWV IAIGNPFGNL GGTVTSGIIS SKGRDIDVDT DNIVDNFIQT DAAINNGNSG GPMFNLDQKV IGVNTAIFSP LGTNIGIGFA IPSNTAKPII ERLKKDGKVS RGRLGVTIQD LTEEISEVLG FKGTNGVLVS KVQENGPGYK AGIKKGDIII KFGDRLVKNT KKLRVIIADT PINQEVKLKI LRDAQELELP IKVTADNEEV INDSTEETNK AVIINKKENN LSITKNNITF SNLTEELRKK YDIPQDKTGI VIINIDEEES VFKLGDLITN INHDSIDDIR KLEVLYENAK KLEKQNILLL IERGDTSVFI PLSVS
Validated NT Sequence
atggctcatc accatcacca tcatatgggt accctggaag ctcagaccca gggtcctggt tcgatggcaa aagaaaatag taatgcttta aaagtagtag atcaagaaga gaatgaattt actgcaataa attcggctcc tttgaaaata agtgaagcag cacggtatag ctttgccgat atagtggagc cgttaatacc tgcagtcgtt aatatttcaa caatagaata tgttaatgat aagtccgaaa attctgagaa agatcttttg caagaaaata agcatttagg cttcatgagt gatgttctag aaaaacttaa tataccgctg aatttagaag agatcgcaaa aactcctaaa agtattccgc ttggttcagg atttattatt gcacctaatg gtttaatagt gacaaactat catgtaattg caaatgtaga aaaaattaat ataaaacttg cagataatac agaattttta gctaaattaa taggtagtga ttctaaaacc gatttagccc ttttaaaaat agatagtgaa gaaccgctgc cttttgttga gtttggagat tcaaatgatg caagagtagg cgactgggtt attgcaatcg gtaatccttt cggtaaccta ggcggtacag tgacaagcgg tattatatcc tctaaagggc gtgatattga tgtagatacg gacaatatag ttgataattt tattcaaact gatgctgcaa ttaataatgg taattctggt ggtcctatgt ttaatttgga tcagaaagta attggcgtaa atacggcaat tttctcacca cttggtacta atataggtat cggctttgct attccttcaa atactgcaaa gcctataatt gaacgtctaa aaaaagatgg taaagtaagt agaggtcgcc ttggagtaac aatacaagat ttaactgaag aaatttctga agttctaggg tttaaaggta ctaatggtgt tttagtatct aaagtacaag aaaatggtcc aggttataaa gcaggtatta aaaaaggtga tataataata aagtttggag atagattagt taaaaatact aaaaagttac gtgtaattat agccgatacc cctattaatc aagaagtaaa attaaaaata ctacgtgatg cacaagagct tgaattacca attaaagtta ctgcagataa tgaagaagtt attaatgatt ctacagaaga gactaataaa gcagtaataa taaacaaaaa agaaaataat ctatctatta ctaaaaataa tattactttt agtaatttaa ctgaagaatt aagaaaaaaa tatgatattc ctcaggataa aacaggaatc gttataatta atattgatga agaagaaagt gtatttaagc ttggtgattt aataaccaat attaatcatg atagcatcga tgatataaga aagttagaag tattatatga aaatgctaaa aaattagaaa aacaaaatat tttgctctta attgaaaggg gtgatacgag tgtgtttatc ccgttatcgg tttcataa
Expressed Protein Sequence
MAHHHHHHMG TLEAQTQGPG SMAKENSNAL KVVDQEENEF TAINSAPLKI SEAARYSFAD IVEPLIPAVV NISTIEYVND KSENSEKDLL QENKHLGFMS DVLEKLNIPL NLEEIAKTPK SIPLGSGFII APNGLIVTNY HVIANVEKIN IKLADNTEFL AKLIGSDSKT DLALLKIDSE EPLPFVEFGD SNDARVGDWV IAIGNPFGNL GGTVTSGIIS SKGRDIDVDT DNIVDNFIQT DAAINNGNSG GPMFNLDQKV IGVNTAIFSP LGTNIGIGFA IPSNTAKPII ERLKKDGKVS RGRLGVTIQD LTEEISEVLG FKGTNGVLVS KVQENGPGYK AGIKKGDIII KFGDRLVKNT KKLRVIIADT PINQEVKLKI LRDAQELELP IKVTADNEEV INDSTEETNK AVIINKKENN LSITKNNITF SNLTEELRKK YDIPQDKTGI VIINIDEEES VFKLGDLITN INHDSIDDIR KLEVLYENAK KLEKQNILLL IERGDTSVFI PLSVS
Full NT Sequence (Expression Vector + Insert)
TTCTTGAAGA CGAAAGGGCC TCGTGATACG CCTATTTTTA TAGGTTAATG TCATGATAAT AATGGTTTCT TAGACGTCAG GTGGCACTTT TCGGGGAAAT GTGCGCGGAA CCCCTATTTG TTTATTTTTC TAAATACATT CAAATATGTA TCCGCTCATG AGACAATAAC CCTGATAAAT GCTTCAATAA TATTGAAAAA GGAAGAGTAT GAGTATTCAA CATTTCCGTG TCGCCCTTAT TCCCTTTTTT GCGGCATTTT GCCTTCCTGT TTTTGCTCAC CCAGAAACGC TGGTGAAAGT AAAAGATGCT GAAGATCAGT TGGGTGCACG AGTGGGTTAC ATCGAACTGG ATCTCAACAG CGGTAAGATC CTTGAGAGTT TTCGCCCCGA AGAACGTTTT CCAATGATGA GCACTTTTAA AGTTCTGCTA TGTGGCGCGG TATTATCCCG TGTTGACGCC GGGCAAGAGC AACTCGGTCG CCGCATACAC TATTCTCAGA ATGACTTGGT TGAGTACTCA CCAGTCACAG AAAAGCATCT TACGGATGGC ATGACAGTAA GAGAATTATG CAGTGCTGCC ATAACCATGA GTGATAACAC TGCGGCCAAC TTACTTCTGA CAACGATCGG AGGACCGAAG GAGCTAACCG CTTTTTTGCA CAACATGGGG GATCATGTAA CTCGCCTTGA TCGTTGGGAA CCGGAGCTGA ATGAAGCCAT ACCAAACGAC GAGCGTGACA CCACGATGCC TGCAGCAATG GCAACAACGT TGCGCAAACT ATTAACTGGC GAACTACTTA CTCTAGCTTC CCGGCAACAA TTAATAGACT GGATGGAGGC GGATAAAGTT GCAGGACCAC TTCTGCGCTC GGCCCTTCCG GCTGGCTGGT TTATTGCTGA TAAATCTGGA GCCGGTGAGC GTGGGTCTCG CGGTATCATT GCAGCACTGG GGCCAGATGG TAAGCCCTCC CGTATCGTAG TTATCTACAC GACGGGGAGT CAGGCAACTA TGGATGAACG AAATAGACAG ATCGCTGAGA TAGGTGCCTC ACTGATTAAG CATTGGTAAC TGTCAGACCA AGTTTACTCA TATATACTTT AGATTGATTT AAAACTTCAT TTTTAATTTA AAAGGATCTA GGTGAAGATC CTTTTTGATA ATCTCATGAC CAAAATCCCT TAACGTGAGT TTTCGTTCCA CTGAGCGTCA GACCCCGTAG AAAAGATCAA AGGATCTTCT TGAGATCCTT TTTTTCTGCG CGTAATCTGC TGCTTGCAAA CAAAAAAACC ACCGCTACCA GCGGTGGTTT GTTTGCCGGA TCAAGAGCTA CCAACTCTTT TTCCGAAGGT AACTGGCTTC AGCAGAGCGC AGATACCAAA TACTGTCCTT CTAGTGTAGC CGTAGTTAGG CCACCACTTC AAGAACTCTG TAGCACCGCC TACATACCTC GCTCTGCTAA TCCTGTTACC AGTGGCTGCT GCCAGTGGCG ATAAGTCGTG TCTTACCGGG TTGGACTCAA GACGATAGTT ACCGGATAAG GCGCAGCGGT CGGGCTGAAC GGGGGGTTCG TGCACACAGC CCAGCTTGGA GCGAACGACC TACACCGAAC TGAGATACCT ACAGCGTGAG CTATGAGAAA GCGCCACGCT TCCCGAAGGG AGAAAGGCGG ACAGGTATCC GGTAAGCGGC AGGGTCGGAA CAGGAGAGCG CACGAGGGAG CTTCCAGGGG GAAACGCCTG GTATCTTTAT AGTCCTGTCG GGTTTCGCCA CCTCTGACTT GAGCGTCGAT TTTTGTGATG CTCGTCAGGG GGGCGGAGCC TATGGAAAAA CGCCAGCAAC GCGGCCTTTT TACGGTTCCT GGCCTTTTGC TGGCCTTTTG CTCACATGTT CTTTCCTGCG TTATCCCCTG ATTCTGTGGA TAACCGTATT ACCGCCTTTG AGTGAGCTGA TACCGCTCGC CGCAGCCGAA CGACCGAGCG CAGCGAGTCA GTGAGCGAGG AAGCGGAAGA GCGCCTGATG CGGTATTTTC TCCTTACGCA TCTGTGCGGT ATTTCACACC GCATATATGG TGCACTCTCA GTACAATCTG CTCTGATGCC GCATAGTTAA GCCAGTATAC ACTCCGCTAT CGCTACGTGA CTGGGTCATG GCTGCGCCCC GACACCCGCC AACACCCGCT GACGCGCCCT GACGGGCTTG TCTGCTCCCG GCATCCGCTT ACAGACAAGC TGTGACCGTC TCCGGGAGCT GCATGTGTCA GAGGTTTTCA CCGTCATCAC CGAAACGCGC GAGGCAGCTG CGGTAAAGCT CATCAGCGTG GTCGTGAAGC GATTCACAGA TGTCTGCCTG TTCATCCGCG TCCAGCTCGT TGAGTTTCTC CAGAAGCGTT AATGTCTGGC TTCTGATAAA GCGGGCCATG TTAAGGGCGG TTTTTTCCTG TTTGGTCACT GATGCCTCCG TGTAAGGGGG ATTTCTGTTC ATGGGGGTAA TGATACCGAT GAAACGAGAG AGGATGCTCA CGATACGGGT TACTGATGAT GAACATGCCC GGTTACTGGA ACGTTGTGAG GGTAAACAAC TGGCGGTATG GATGCGGCGG GACCAGAGAA AAATCACTCA GGGTCAATGC CAGCGCTTCG TTAATACAGA TGTAGGTGTT CCACAGGGTA GCCAGCAGCA TCCTGCGATG CAGATCCGGA ACATAATGGT GCAGGGCGCT GACTTCCGCG TTTCCAGACT TTACGAAACA CGGAAACCGA AGACCATTCA TGTTGTTGCT CAGGTCGCAG ACGTTTTGCA GCAGCAGTCG CTTCACGTTC GCTCGCGTAT CGGTGATTCA TTCTGCTAAC CAGTAAGGCA ACCCCGCCAG CCTAGCCGGG TCCTCAACGA CAGGAGCACG ATCATGCGCA CCCGTGGCCA GGACCCAACG CTGCCCGAGA TGCGCCGCGT GCGGCTGCTG GAGATGGCGG ACGCGATGGA TATGTTCTGC CAAGGGTTGG TTTGCGCATT CACAGTTCTC CGCAAGAATT GATTGGCTCC AATTCTTGGA GTGGTGAATC CGTTAGCGAG GTGCCGCCGG CTTCCATTCA GGTCGAGGTG GCCCGGCTCC ATGCACCGCG ACGCAACGCG GGGAGGCAGA CAAGGTATAG GGCGGCGCCT ACAATCCATG CCAACCCGTT CCATGTGCTC GCCGAGGCGG CATAAATCGC CGTGACGATC AGCGGTCCAG TGATCGAAGT TAGGCTGGTA AGAGCCGCGA GCGATCCTTG AAGCTGTCCC TGATGGTCGT CATCTACCTG CCTGGACAGC ATGGCCTGCA ACGCGGGCAT CCCGATGCCG CCGGAAGCGA GAAGAATCAT AATGGGGAAG GCCATCCAGC CTCGCGTCGT GAACGCCAGC AAGACGTAGC CCAGCGCGTC GGCCGTAACA ACACCATTTA AATGGAGTGG TTACAAATGG AGTGGTTAAT TAACAACACC ATTTGTCGAC GCTCTCCCTT ATGCGACTCC TGCATTAGGA AGCAGCCCAG TAGTAGGTTG AGGCCGTTGA GCACCGCCGC CGCAAGGAAT GGTGCATGCA AGGAGATGGC GCCCAACAGT CCCCCGGCCA CGGGGCCTGC CACCATACCC ACGCCGAAAC AAGCGCTCAT GAGCCCGAAG TGGCGAGCCC GATCTTCCCC ATCGGTGATG TCGGCGATAT AGGCGCCAGC AACCGCACCT GTGGCGCCGG TGATGCCGGC CACGATGCGT CCGGCGTAGA GGATCGAGAT CTCGATCCCG CGAAATTAAT ACGACTCACT ATAGGGAGAC CACAACGGTT TCCCTCTAGA AATAATTTTG TTTAACTTTA AGAAGGAGAT ATACCATGGC TCATCACCAT CACCATCATA TGGGTACCCT GGAAGCTCAG ACCCAGGGTC CTGGTTCGAT GGCAAAAGAA AATAGTAATG CTTTAAAAGT AGTAGATCAA GAAGAGAATG AATTTACTGC AATAAATTCG GCTCCTTTGA AAATAAGTGA AGCAGCACGG TATAGCTTTG CCGATATAGT GGAGCCGTTA ATACCTGCAG TCGTTAATAT TTCAACAATA GAATATGTTA ATGATAAGTC CGAAAATTCT GAGAAAGATC TTTTGCAAGA AAATAAGCAT TTAGGCTTCA TGAGTGATGT TCTAGAAAAA CTTAATATAC CGCTGAATTT AGAAGAGATC GCAAAAACTC CTAAAAGTAT TCCGCTTGGT TCAGGATTTA TTATTGCACC TAATGGTTTA ATAGTGACAA ACTATCATGT AATTGCAAAT GTAGAAAAAA TTAATATAAA ACTTGCAGAT AATACAGAAT TTTTAGCTAA ATTAATAGGT AGTGATTCTA AAACCGATTT AGCCCTTTTA AAAATAGATA GTGAAGAACC GCTGCCTTTT GTTGAGTTTG GAGATTCAAA TGATGCAAGA GTAGGCGACT GGGTTATTGC AATCGGTAAT CCTTTCGGTA ACCTAGGCGG TACAGTGACA AGCGGTATTA TATCCTCTAA AGGGCGTGAT ATTGATGTAG ATACGGACAA TATAGTTGAT AATTTTATTC AAACTGATGC TGCAATTAAT AATGGTAATT CTGGTGGTCC TATGTTTAAT TTGGATCAGA AAGTAATTGG CGTAAATACG GCAATTTTCT CACCACTTGG TACTAATATA GGTATCGGCT TTGCTATTCC TTCAAATACT GCAAAGCCTA TAATTGAACG TCTAAAAAAA GATGGTAAAG TAAGTAGAGG TCGCCTTGGA GTAACAATAC AAGATTTAAC TGAAGAAATT TCTGAAGTTC TAGGGTTTAA AGGTACTAAT GGTGTTTTAG TATCTAAAGT ACAAGAAAAT GGTCCAGGTT ATAAAGCAGG TATTAAAAAA GGTGATATAA TAATAAAGTT TGGAGATAGA TTAGTTAAAA ATACTAAAAA GTTACGTGTA ATTATAGCCG ATACCCCTAT TAATCAAGAA GTAAAATTAA AAATACTACG TGATGCACAA GAGCTTGAAT TACCAATTAA AGTTACTGCA GATAATGAAG AAGTTATTAA TGATTCTACA GAAGAGACTA ATAAAGCAGT AATAATAAAC AAAAAAGAAA ATAATCTATC TATTACTAAA AATAATATTA CTTTTAGTAA TTTAACTGAA GAATTAAGAA AAAAATATGA TATTCCTCAG GATAAAACAG GAATCGTTAT AATTAATATT GATGAAGAAG AAAGTGTATT TAAGCTTGGT GATTTAATAA CCAATATTAA TCATGATAGC ATCGATGATA TAAGAAAGTT AGAAGTATTA TATGAAAATG CTAAAAAATT AGAAAAACAA AATATTTTGC TCTTAATTGA AAGGGGTGAT ACGAGTGTGT TTATCCCGTT ATCGGTTTCA AAACAGCACG AACAAGTTCT GCAGCCAAGC TTCTCGAGGA TCCGGCTGCT AACAAAGCCC GAAAGGAAGC TGAGTTGGCT GCTGCCACCG CTGAGCAATA ACTAGCATAA CCCCTTGGGG CCTCTAAACG GGTCTTGAGG GGTTTTTTGC TGAAAGGAGG AACTATATCC GGATATCCAC AGGACGGGTG TGGTCGCCAT GATCGCGTAG TCGATAGTGG CTCCAAGTAG CGAAGCGAGC AGGACTGGGC GGCGGCCAAA GCGGTCGGAC AGTGCTCCGA GAACGGGTGC GCATAGAAAT TGCATCAACG CATATAGCGC TAGCAGCACG CCATAGTGAC TGGCGATGCT GTCGGAATGG ACGATATCCC GCAAGAGGCC CGGCAGTACC GGCATAACCA AGCCTATGCC TACAGCATCC AGGGTGACGG TGCCGAGGAT GACGATGAGC GCATTGTTAG ATTTCATACA CGGTGCCTGA CTGCGTTAGC AATTTAACTG TGATAAACTA CCGCATTAAA GCTTATCGAT GATAAGCTGT CAAACATGAG AA