HepyC.01032.a

Tyrosine--tRNA ligase (EC 6.1.1.1) (Tyrosyl-tRNA synthetase)

CENTER ID: HepyC.01032.a
ORGANISM: Helicobacter pylori G27
ASSOCIATED DISEASE:
CURRENT STATUS: in PDB
COMMUNITY REQUEST: True
NIH RISK GROUP: 2
SELECT AGENT: False
NIH PRIORITY PATHOGENS CATEGORY: I

Ordering Clones & Proteins

If materials are available for this target, they are listed below. Materials can be ordered from SSGCID using the button in the "order material" column. Clicking the button adds the material to a virtual cart. You may order multiple materials at a time at no cost to you, as this contract is funded by NIAID. When you are ready to place your order, click the "Place Order" link, which appears in the top right corner of the page after you place your first item in your cart.

Clones*

CENTER REFERENCE ID | DOMAIN/REGION DESCRIPTION | INFO | AA START | AA STOP | ORDER MATERIAL
HepyC.01032.a.B1.GE40952 | full length | | 1 | 402 |
* SSGCID clones represent un-induced expression constructs which have been verified by sequencing from vector primers. Clones may contain a tag, such as N-term 6xHis. Get sequence information using the button in the "info" column.
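
As an illustration of the tag note above, the following is a minimal sketch of how an N-terminal His tag might be detected and stripped from a construct sequence. The prefix strings are copied from the expected protein sequence and the native AA sequence listed for this target; the regular expression and the helper name `split_his_tag` are assumptions made for this example, not part of any SSGCID tooling.

```python
import re
from typing import Tuple

# First residues of the expected protein sequence for clone
# HepyC.01032.a.B1.GE40952, and of the native AA sequence.
expected_prefix = "MAHHHHHHMEQKISVALKEI"
native_prefix = "MEQKISVALKEI"

# Hypothetical pattern: a short leader (0 to 4 residues) followed by a run
# of six or more His residues at the N-terminus. Real tags vary.
HIS_TAG = re.compile(r"^(?P<tag>[A-Z]{0,4}?H{6,})")


def split_his_tag(protein: str) -> Tuple[str, str]:
    """Return (tag, remainder); tag is empty when no N-terminal His run is found."""
    match = HIS_TAG.match(protein)
    if match is None:
        return "", protein
    tag = match.group("tag")
    return tag, protein[len(tag):]


tag, remainder = split_his_tag(expected_prefix)
print(tag)        # MAHHHHHH
print(remainder)  # MEQKISVALKEI
```

For this construct the remainder after the tag begins with the native start methionine, so the tag is simply prepended to the full-length sequence.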

Proteins

CENTER REFERENCE ID | DOMAIN/REGION DESCRIPTION | INFO | AA START | AA STOP | ORDER MATERIAL
HepyC.01032.a.B1.PS38283 | full length | | 1 | 402 |
HepyC.01032.a.B1.PW38234 | full length | | 1 | 402 |
HepyC.01032.a.B1.PW38235 | full length | | 1 | 402 |

Structures

6BYQ
DEPOSITED: 12/21/2017
DETERMINATION: XRay
CLONE: HepyC.01032.a.B1.GE40952
PROTEIN: HepyC.01032.a.B1.PS38283

External Resources

RESOURCE REFERENCE ID
PATRIC ID: fig|563041.6.peg.786
UniProt: B5Z7D7

Sequences

These are the native gene sequences; the sequences of constructs derived from them may differ due to codon optimization or other protocols. To find the specific sequence of any material you may have ordered, click on the "info" button next to the name of that material.
AA Sequence
MEQKISVALK EIKRGANEII GLEYIEKLVR KYYETNERFI VKAGFDPTAP DLHLGHTVLI QKLALLQQYG ARVKFLIGDF TAMIGDPTGK NETRKPLNRE QVLENAKTYE EQIYKILDQK HTEVCFNSTW LDALGAKGMI ELCAKFSVAR MLERDDFAKR HKENRPISIV EFLYPLLQGY DSVAMGADIE LGGNDQKFNL LVGRFLQRAY GLNKEQSIIT MPLLEGLDGV QKMSKSLGNY VGITEEPNAM FGKIMSVSDD LMWRYYTLLS AKTLEEIEDL KHGILNQTLH PKAVKEDLAG EIVARYYDND QAFKAKEQFS KVFSANLLPE ILSESDFDEG VGILDVLKQI GFCPSTSQAR RDIQGGGVKI NQEVIKDESY RFVKGNYVIQ LGKKRFMKLN IN
NT Sequence
atggaacaaa aaatcagtgt ggccttaaaa gagatcaaaa gaggtgctaa tgaaatcatt ggattagaat acattgaaaa gctggtgagg aaatattatg aaaccaatga acgctttatc gttaaagccg gttttgatcc taccgctccc gatttgcatt tagggcatac ggtgttgatc caaaaattgg ctttgttgca gcaatatggg gctagggtta agtttttgat tggggatttt accgctatga taggcgatcc tacgggtaag aatgaaacga gaaaaccctt aaaccgggag caagtcttag aaaacgctaa aacttatgaa gagcaaatct ataagatttt agatcaaaaa cacaccgaag tgtgctttaa ttccacttgg ttggatgctt taggcgcaaa gggcatgata gaattgtgcg cgaagttttc agtcgctaga atgttagaaa gggacgattt tgctaaacgc cataaagaaa accgccccat tagcatcgtg gaatttttat accctttgtt gcaaggctat gattcagtgg cgatgggtgc ggatattgag cttgggggca atgatcaaaa gtttaatttg ctggtggggc gctttttgca acgagcttat ggcttgaata aagagcagtc tattattacc atgcctttat tagaagggct tgatggggtg caaaaaatga gtaaaagctt ggggaattat gtggggatca ctgaagagcc taatgcgatg tttgggaaga tcatgagcgt gagcgatgat ctcatgtggc gctactacac ccttttgagc gctaagactt tagaagaaat tgaagactta aaacatggta ttttaaacca aaccttgcac cctaaagccg ttaaagagga tctcgctggt gaaatcgtgg ctcgttatta tgataatgat caagcattca aggctaaaga gcaattttct aaagtgttta gcgcaaacct tttgcctgaa attttatcag agagcgattt tgatgaaggg gttgggattt tagatgtttt aaaacagatt ggcttttgcc catccacttc acaagccagg cgtgatattc aagggggagg ggtaaagatt aatcaagaag tgataaaaga tgagagttat cgttttgtta aaggaaatta tgttatacag cttggtaaga aaagatttat gaaattaaat atcaactaa
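
The native NT and AA sequences above can be cross-checked by translating the nucleotide sequence. The sketch below is a minimal, dependency-free example that assumes the standard codon assignments; it translates only the first 30 and last 9 nucleotides copied from the NT sequence above, and the function name `translate` is chosen for illustration. Codons containing ambiguous bases are reported as `X`.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nt: str) -> str:
    """Translate a nucleotide string in frame 1, ignoring whitespace.

    Unknown codons (for example, ones containing 'n') become 'X';
    stop codons become '*'. A trailing partial codon is dropped.
    """
    seq = "".join(nt.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


nt_start = "atggaacaaa aaatcagtgt ggccttaaaa"  # first 30 nt of the NT sequence
nt_end = "atcaactaa"                            # last 9 nt of the NT sequence

print(translate(nt_start))  # MEQKISVALK
print(translate(nt_end))    # IN*
```

The first output matches the first ten residues of the AA sequence, and the second matches its last two residues followed by the stop codon.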
Details for HepyC.01032.a.B1.GE40952
HARVESTED ON: 12/20/2016
SEQUENCED ON: 12/27/2016
EXPECTED MW: 47kDa
OBSERVED MW: 47kDa
ANTIBIOTIC MARKER: ampicillin
COUNT OF EXPRESSION COLONIES: Many (50-100)
TOTAL EXPRESSION LEVEL: Moderate Expression
SOLUBLE EXPRESSION LEVEL: Moderate Expression
EXPRESSION HOST: BL 21 (DE3) Rosetta
SEQUENCING RESULT:
PERCENT IDENTITY: 98
PERCENT COVERAGE: 97
Validated AA Sequence
MAHHHHHHME QKIXVALKEI KRGANEIIGX XYIEKLVRKY YETNERFIVK AGFDPTAPDL HLGHTVLIQK LALLQQYGAR VKFLIGDFTA MIGDPTGKNE TRKPLNREQV XENAKTYEEQ IYKILDQKHT EVCFNSTWLD ALGAKGMIEL CAKFSVARML ERDDFAKRHK ENRPISIVEF LYPLLQGYDS VAMGADIELG GNDQKFNLLV GRFLQRAYGL NKEQSIITMP LLEGLDGVQK MSKSLGNYVG ITEEPNAMFG KIMSVSDDLM WRYYTLLSAK TLEEIEDLKH GILNQTLHPK AVKEDLAGEI VARYYDNDQA FKAKEQFXKV FSANLLPEIL SESDFDEGVG ILDVLKQIGF CPSTSQARRD IQGGGVKINQ EVIKDESYRF VKGNYVIQLG K
Validated NT Sequence
tttcttacca agctgtataa cataatttcc tttaacaaaa cgataactct catcttttat cacttcttga ttaatcttta cccctccccc ttgaatatca cgcctggctt gtgaagtgga tgggcaaaag ccaatctgtt ttaaaacatc taaaatccca accccttcat caaaatcgct ctctgataaa atttcaggca aaaggtttgc gctaaacact ttngnaaatt gctctttngc cttgaatgct tgatcattat cataataacg agccacgatt tcaccagcga gatcctcttt aacggcttta gggtgcaagg tttggtttaa aataccatgt tttaagtctt caatttcttc taaagtctta gcgctcaaaa gggtgtagta gcgccacatg agatcatcgc tcacgctcat gatcttccca aacatcgcat taggctcttc agtgatcccc acataattcc ccaagctttt actcattttt tgcaccccat caagcccttc taataaaggc atggtaataa tagactgctc tttattcaag ccataagctc gttgcaaaaa gcgccccacc agcaaattaa acttttgatc attgccccca agctcaatat ccgcacccat cgccactgaa tcatagcctt gcaacaaagg gtataaaaat tccacgatgc taatggggcg gttttcttta tggcgtttag caaaatcgtc cctttctaac attctagcga ctgaaaactt cgcgcacaat tctatcatgc cctttgcgcc taaagcatcc aaccaagtgg aattaaagca cacttcggtg tgtttttgat ctaaaatctt atagatttgc tcttcataag ttttagcgtt ttcnaagact tgctcccggt ttaagggttt tctcgtttca ttcttacccg taggatcgcc tatcatagcg gtaaaatccc caatcaaaaa cttaacccta gccccatatt gctgcaacaa agccaatttt tggatcaaca ccgtatgccc taaatgcaaa tcgggagcgg taggatcaaa accggcttta acgataaagc gttcattggt ttcataatat ttcctcacca gcttttcaat gtattnnaat ccaatgattt cattagcacc tcttttgatc tcttttaagg ccacnntgat tttttgttcc atatggtggt ggtggtggtg agccat
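
The validated NT sequence above appears to be reported on the strand opposite to the coding strand: its last bases are the reverse complement of the start of the tagged open reading frame in the full vector sequence. A minimal sketch of that check follows, using the last 27 bases of the read and the corresponding 27 bases from the vector sequence; the function name `reverse_complement` is chosen for illustration.

```python
# Complement map covering the four bases plus the ambiguity code 'n'.
COMPLEMENT = str.maketrans("acgtnACGTN", "tgcanTGCAN")


def reverse_complement(nt: str) -> str:
    """Return the reverse complement of a nucleotide string, ignoring whitespace."""
    return "".join(nt.split()).translate(COMPLEMENT)[::-1]


# Last 27 bases of the validated NT sequence (spacing removed).
read_tail = "catatggtggtggtggtggtgagccat"

# Start of the tagged ORF in the full vector sequence (spacing removed):
# atg gct, five cac codons, cat, then the native atg.
vector_orf_start = "atggctcaccaccaccaccaccatatg"

print(reverse_complement(read_tail))
print(reverse_complement(read_tail) == vector_orf_start)  # True
```

Reverse-complementing the whole read before translating it should therefore give a sequence directly comparable to the expected protein sequence, with `n` bases yielding the ambiguous `X` residues.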
Expected Protein Sequence
MAHHHHHHME QKISVALKEI KRGANEIIGL EYIEKLVRKY YETNERFIVK AGFDPTAPDL HLGHTVLIQK LALLQQYGAR VKFLIGDFTA MIGDPTGKNE TRKPLNREQV LENAKTYEEQ IYKILDQKHT EVCFNSTWLD ALGAKGMIEL CAKFSVARML ERDDFAKRHK ENRPISIVEF LYPLLQGYDS VAMGADIELG GNDQKFNLLV GRFLQRAYGL NKEQSIITMP LLEGLDGVQK MSKSLGNYVG ITEEPNAMFG KIMSVSDDLM WRYYTLLSAK TLEEIEDLKH GILNQTLHPK AVKEDLAGEI VARYYDNDQA FKAKEQFSKV FSANLLPEIL SESDFDEGVG ILDVLKQIGF CPSTSQARRD IQGGGVKINQ EVIKDESYRF VKGNYVIQLG KKRFMKLNIN
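
The validated and expected protein sequences can be compared position by position to locate the ambiguous (`X`) residues. The sketch below uses only the first 40 residues of each sequence as listed above. How SSGCID computes PERCENT IDENTITY and PERCENT COVERAGE is not stated in this record, so the two formulas in the code are assumptions (ungapped identity over the overlapping region, and overlap length relative to the expected length), and `compare` is a hypothetical helper name.

```python
from typing import List, Tuple

# First 40 residues of the validated and expected protein sequences.
validated = "MAHHHHHHMEQKIXVALKEIKRGANEIIGXXYIEKLVRKY"
expected = "MAHHHHHHMEQKISVALKEIKRGANEIIGLEYIEKLVRKY"


def compare(
    validated: str, expected: str
) -> Tuple[List[Tuple[int, str, str]], float, float]:
    """Ungapped comparison from the N-terminus.

    Returns (differences, identity_percent, coverage_percent), where each
    difference is (1-based position, expected residue, validated residue).
    Both percentages use assumed definitions; see the text above.
    """
    overlap = min(len(validated), len(expected))
    diffs = [
        (i + 1, expected[i], validated[i])
        for i in range(overlap)
        if validated[i] != expected[i]
    ]
    identity = 100.0 * (overlap - len(diffs)) / overlap
    coverage = 100.0 * overlap / len(expected)
    return diffs, identity, coverage


diffs, identity, coverage = compare(validated, expected)
print(diffs)     # [(14, 'S', 'X'), (30, 'L', 'X'), (31, 'E', 'X')]
print(identity)  # 92.5
print(coverage)  # 100.0
```

On the full sequences the validated read also stops short of the C-terminus of the expected protein, which under these assumed definitions lowers coverage rather than identity.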
Full NT Sequence (Expression Vector + Insert)
taatacgact cactataggg agaccacaac ggtttccctc tagaaataat tttgtttaac tttaagaagg agatatacca tggctcacca ccaccaccac catatggaac aaaaaatcag tgtggcctta aaagagatca aaagaggtgc taatgaaatc attggattag aatacattga aaagctggtg aggaaatatt atgaaaccaa tgaacgcttt atcgttaaag ccggttttga tcctaccgct cccgatttgc atttagggca tacggtgttg atccaaaaat tggctttgtt gcagcaatat ggggctaggg ttaagttttt gattggggat tttaccgcta tgataggcga tcctacgggt aagaatgaaa cgagaaaacc cttaaaccgg gagcaagtct tagaaaacgc taaaacttat gaagagcaaa tctataagat tttagatcaa aaacacaccg aagtgtgctt taattccact tggttggatg ctttaggcgc aaagggcatg atagaattgt gcgcgaagtt ttcagtcgct agaatgttag aaagggacga ttttgctaaa cgccataaag aaaaccgccc cattagcatc gtggaatttt tatacccttt gttgcaaggc tatgattcag tggcgatggg tgcggatatt gagcttgggg gcaatgatca aaagtttaat ttgctggtgg ggcgcttttt gcaacgagct tatggcttga ataaagagca gtctattatt accatgcctt tattagaagg gcttgatggg gtgcaaaaaa tgagtaaaag cttggggaat tatgtgggga tcactgaaga gcctaatgcg atgtttggga agatcatgag cgtgagcgat gatctcatgt ggcgctacta cacccttttg agcgctaaga ctttagaaga aattgaagac ttaaaacatg gtattttaaa ccaaaccttg caccctaaag ccgttaaaga ggatctcgct ggtgaaatcg tggctcgtta ttatgataat gatcaagcat tcaaggctaa agagcaattt tctaaagtgt ttagcgcaaa ccttttgcct gaaattttat cagagagcga ttttgatgaa ggggttggga ttttagatgt tttaaaacag attggctttt gcccatccac ttcacaagcc aggcgtgata ttcaaggggg aggggtaaag attaatcaag aagtgataaa agatgagagt tatcgttttg ttaaaggaaa ttatgttata cagcttggta agaaaagatt tatgaaatta aatatcaact gagtaagata ggatccggct gctaacaaag cccgaaagga agctgagttg gctgctgcca ccgctgagca ataactagca taaccccttg gggcctctaa acgggtcttg aggggttttt tgctgaaagg aggaactata tccggatatc cacaggacgg gtgtggtcgc catgatcgcg tagtcgatag tggctccaag tagcgaagcg agcaggactg ggcggcggcc aaagcggtcg gacagtgctc cgagaacggg tgcgcataga aattgcatca acgcatatag cgctagcagc acgccatagt gactggcgat gctgtcggaa tggacgatat cccgcaagag gcccggcagt accggcataa ccaagcctat gcctacagca tccagggtga cggtgccgag gatgacgatg agcgcattgt tagatttcat acacggtgcc tgactgcgtt agcaatttaa ctgtgataaa ctaccgcatt aaagcttatc 
gatgataagc tgtcaaacat gagaattctt gaagacgaaa gggcctcgtg atacgcctat ttttataggt taatgtcatg ataataatgg tttcttagac gtcaggtggc acttttcggg gaaatgtgcg cggaacccct atttgtttat ttttctaaat acattcaaat atgtatccgc tcatgagaca ataaccctga taaatgcttc aataatattg aaaaaggaag agtatgagta ttcaacattt ccgtgtcgcc cttattccct tttttgcggc attttgcctt cctgtttttg ctcacccaga aacgctggtg aaagtaaaag atgctgaaga tcagttgggt gcacgagtgg gttacatcga actggatctc aacagcggta agatccttga gagttttcgc cccgaagaac gttttccaat gatgagcact tttaaagttc tgctatgtgg cgcggtatta tcccgtgttg acgccgggca agagcaactc ggtcgccgca tacactattc tcagaatgac ttggttgagt actcaccagt cacagaaaag catcttacgg atggcatgac agtaagagaa ttatgcagtg ctgccataac catgagtgat aacactgcgg ccaacttact tctgacaacg atcggaggac cgaaggagct aaccgctttt ttgcacaaca tgggggatca tgtaactcgc cttgatcgtt gggaaccgga gctgaatgaa gccataccaa acgacgagcg tgacaccacg atgcctgcag caatggcaac aacgttgcgc aaactattaa ctggcgaact acttactcta gcttcccggc aacaattaat agactggatg gaggcggata aagttgcagg accacttctg cgctcggccc ttccggctgg ctggtttatt gctgataaat ctggagccgg tgagcgtggg tctcgcggta tcattgcagc actggggcca gatggtaagc cctcccgtat cgtagttatc tacacgacgg ggagtcaggc aactatggat gaacgaaata gacagatcgc tgagataggt gcctcactga ttaagcattg gtaactgtca gaccaagttt actcatatat actttagatt gatttaaaac ttcattttta atttaaaagg atctaggtga agatcctttt tgataatctc atgaccaaaa tcccttaacg tgagttttcg ttccactgag cgtcagaccc cgtagaaaag atcaaaggat cttcttgaga tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa aaaccaccgc taccagcggt ggtttgtttg ccggatcaag agctaccaac tctttttccg aaggtaactg gcttcagcag agcgcagata ccaaatactg tccttctagt gtagccgtag ttaggccacc acttcaagaa ctctgtagca ccgcctacat acctcgctct gctaatcctg ttaccagtgg ctgctgccag tggcgataag tcgtgtctta ccgggttgga ctcaagacga tagttaccgg ataaggcgca gcggtcgggc tgaacggggg gttcgtgcac acagcccagc ttggagcgaa cgacctacac cgaactgaga tacctacagc gtgagctatg agaaagcgcc acgcttcccg aagggagaaa ggcggacagg tatccggtaa gcggcagggt cggaacagga gagcgcacga gggagcttcc agggggaaac gcctggtatc tttatagtcc tgtcgggttt cgccacctct gacttgagcg 
tcgatttttg tgatgctcgt caggggggcg gagcctatgg aaaaacgcca gcaacgcggc ctttttacgg ttcctggcct tttgctggcc ttttgctcac atgttctttc ctgcgttatc ccctgattct gtggataacc gtattaccgc ctttgagtga gctgataccg ctcgccgcag ccgaacgacc gagcgcagcg agtcagtgag cgaggaagcg gaagagcgcc tgatgcggta ttttctcctt acgcatctgt gcggtatttc acaccgcata tatggtgcac tctcagtaca atctgctctg atgccgcata gttaagccag tatacactcc gctatcgcta cgtgactggg tcatggctgc gccccgacac ccgccaacac ccgctgacgc gccctgacgg gcttgtctgc tcccggcatc cgcttacaga caagctgtga ccgtctccgg gagctgcatg tgtcagaggt tttcaccgtc atcaccgaaa cgcgcgaggc agctgcggta aagctcatca gcgtggtcgt gaagcgattc acagatgtct gcctgttcat ccgcgtccag ctcgttgagt ttctccagaa gcgttaatgt ctggcttctg ataaagcggg ccatgttaag ggcggttttt tcctgtttgg tcactgatgc ctccgtgtaa gggggatttc tgttcatggg ggtaatgata ccgatgaaac gagagaggat gctcacgata cgggttactg atgatgaaca tgcccggtta ctggaacgtt gtgagggtaa acaactggcg gtatggatgc ggcgggacca gagaaaaatc actcagggtc aatgccagcg cttcgttaat acagatgtag gtgttccaca gggtagccag cagcatcctg cgatgcagat ccggaacata atggtgcagg gcgctgactt ccgcgtttcc agactttacg aaacacggaa accgaagacc attcatgttg ttgctcaggt cgcagacgtt ttgcagcagc agtcgcttca cgttcgctcg cgtatcggtg attcattctg ctaaccagta aggcaacccc gccagcctag ccgggtcctc aacgacagga gcacgatcat gcgcacccgt ggccaggacc caacgctgcc cgagatgcgc cgcgtgcggc tgctggagat ggcggacgcg atggatatgt tctgccaagg gttggtttgc gcattcacag ttctccgcaa gaattgattg gctccaattc ttggagtggt gaatccgtta gcgaggtgcc gccggcttcc attcaggtcg aggtggcccg gctccatgca ccgcgacgca acgcggggag gcagacaagg tatagggcgg cgcctacaat ccatgccaac ccgttccatg tgctcgccga ggcggcataa atcgccgtga cgatcagcgg tccagtgatc gaagttaggc tggtaagagc cgcgagcgat ccttgaagct gtccctgatg gtcgtcatct acctgcctgg acagcatggc ctgcaacgcg ggcatcccga tgccgccgga agcgagaaga atcataatgg ggaaggccat ccagcctcgc gtcgcgaacg ccagcaagac gtagcccagc gcgtcggccg ccatgccggc gataatggcc tgcttctcgc cgaaacgttt ggtggcggga ccagtgacga aggcttgagc gagggcgtgc aagattccga ataccgcaag cgacaggccg atcatcgtcg cgctccagcg aaagcggtcc tcgccgaaaa tgacccagag cgctgccggc 
acctgtccta cgagttgcat gataaagaag acagtcataa gtgcggcgac gatagtcatg ccccgcgccc accggaagga gctgactggg ttgaaggctc tcaagggcat cggtcgacgc tctcccttat gcgactcctg cattaggaag cagcccagta gtaggttgag gccgttgagc accgccgccg caaggaatgg tgcatgcaag gagatggcgc ccaacagtcc cccggccacg gggcctgcca ccatacccac gccgaaacaa gcgctcatga gcccgaagtg gcgagcccga tcttccccat cggtgatgtc ggcgatatag gcgccagcaa ccgcacctgt ggcgccggtg atgccggcca cgatgcgtcc ggcgtagagg atcgagatct cgatcccgcg aaat
Details for HepyC.01032.a.B1.PS38283
PURIFICATION DATE: 5/9/2017
CONCENTRATION: 56.53mg/ml
OBSERVED MW: 45kDa
EXPRESSION LEVEL: Moderate Expression
PROTEIN PURIFICATION BUFFER: SEC: 20 mM HEPES pH 7.0, 300 mM NaCl, 5% glycerol, 1 mM TCEP
EXPRESSION HOST: BL 21 (DE3) Rosetta
VIAL COUNT (approx.): 7
VIAL VOLUME: 200µl
PERCENT IDENTITY: 98
PERCENT COVERAGE: 97
Protocol Notes
notes unavailable
Validated AA Sequence
MAHHHHHHME QKIXVALKEI KRGANEIIGX XYIEKLVRKY YETNERFIVK AGFDPTAPDL HLGHTVLIQK LALLQQYGAR VKFLIGDFTA MIGDPTGKNE TRKPLNREQV XENAKTYEEQ IYKILDQKHT EVCFNSTWLD ALGAKGMIEL CAKFSVARML ERDDFAKRHK ENRPISIVEF LYPLLQGYDS VAMGADIELG GNDQKFNLLV GRFLQRAYGL NKEQSIITMP LLEGLDGVQK MSKSLGNYVG ITEEPNAMFG KIMSVSDDLM WRYYTLLSAK TLEEIEDLKH GILNQTLHPK AVKEDLAGEI VARYYDNDQA FKAKEQFXKV FSANLLPEIL SESDFDEGVG ILDVLKQIGF CPSTSQARRD IQGGGVKINQ EVIKDESYRF VKGNYVIQLG K
Validated NT Sequence
tttcttacca agctgtataa cataatttcc tttaacaaaa cgataactct catcttttat cacttcttga ttaatcttta cccctccccc ttgaatatca cgcctggctt gtgaagtgga tgggcaaaag ccaatctgtt ttaaaacatc taaaatccca accccttcat caaaatcgct ctctgataaa atttcaggca aaaggtttgc gctaaacact ttngnaaatt gctctttngc cttgaatgct tgatcattat cataataacg agccacgatt tcaccagcga gatcctcttt aacggcttta gggtgcaagg tttggtttaa aataccatgt tttaagtctt caatttcttc taaagtctta gcgctcaaaa gggtgtagta gcgccacatg agatcatcgc tcacgctcat gatcttccca aacatcgcat taggctcttc agtgatcccc acataattcc ccaagctttt actcattttt tgcaccccat caagcccttc taataaaggc atggtaataa tagactgctc tttattcaag ccataagctc gttgcaaaaa gcgccccacc agcaaattaa acttttgatc attgccccca agctcaatat ccgcacccat cgccactgaa tcatagcctt gcaacaaagg gtataaaaat tccacgatgc taatggggcg gttttcttta tggcgtttag caaaatcgtc cctttctaac attctagcga ctgaaaactt cgcgcacaat tctatcatgc cctttgcgcc taaagcatcc aaccaagtgg aattaaagca cacttcggtg tgtttttgat ctaaaatctt atagatttgc tcttcataag ttttagcgtt ttcnaagact tgctcccggt ttaagggttt tctcgtttca ttcttacccg taggatcgcc tatcatagcg gtaaaatccc caatcaaaaa cttaacccta gccccatatt gctgcaacaa agccaatttt tggatcaaca ccgtatgccc taaatgcaaa tcgggagcgg taggatcaaa accggcttta acgataaagc gttcattggt ttcataatat ttcctcacca gcttttcaat gtattnnaat ccaatgattt cattagcacc tcttttgatc tcttttaagg ccacnntgat tttttgttcc atatggtggt ggtggtggtg agccat
Expressed Protein Sequence
MAHHHHHHME QKISVALKEI KRGANEIIGL EYIEKLVRKY YETNERFIVK AGFDPTAPDL HLGHTVLIQK LALLQQYGAR VKFLIGDFTA MIGDPTGKNE TRKPLNREQV LENAKTYEEQ IYKILDQKHT EVCFNSTWLD ALGAKGMIEL CAKFSVARML ERDDFAKRHK ENRPISIVEF LYPLLQGYDS VAMGADIELG GNDQKFNLLV GRFLQRAYGL NKEQSIITMP LLEGLDGVQK MSKSLGNYVG ITEEPNAMFG KIMSVSDDLM WRYYTLLSAK TLEEIEDLKH GILNQTLHPK AVKEDLAGEI VARYYDNDQA FKAKEQFSKV FSANLLPEIL SESDFDEGVG ILDVLKQIGF CPSTSQARRD IQGGGVKINQ EVIKDESYRF VKGNYVIQLG KKRFMKLNIN
Full NT Sequence (Expression Vector + Insert)
TAATACGACT CACTATAGGG AGACCACAAC GGTTTCCCTC TAGAAATAAT TTTGTTTAAC TTTAAGAAGG AGATATACCA TGGCTCACCA CCACCACCAC CATATGGAAC AAAAAATCAG TGTGGCCTTA AAAGAGATCA AAAGAGGTGC TAATGAAATC ATTGGATTAG AATACATTGA AAAGCTGGTG AGGAAATATT ATGAAACCAA TGAACGCTTT ATCGTTAAAG CCGGTTTTGA TCCTACCGCT CCCGATTTGC ATTTAGGGCA TACGGTGTTG ATCCAAAAAT TGGCTTTGTT GCAGCAATAT GGGGCTAGGG TTAAGTTTTT GATTGGGGAT TTTACCGCTA TGATAGGCGA TCCTACGGGT AAGAATGAAA CGAGAAAACC CTTAAACCGG GAGCAAGTCT TAGAAAACGC TAAAACTTAT GAAGAGCAAA TCTATAAGAT TTTAGATCAA AAACACACCG AAGTGTGCTT TAATTCCACT TGGTTGGATG CTTTAGGCGC AAAGGGCATG ATAGAATTGT GCGCGAAGTT TTCAGTCGCT AGAATGTTAG AAAGGGACGA TTTTGCTAAA CGCCATAAAG AAAACCGCCC CATTAGCATC GTGGAATTTT TATACCCTTT GTTGCAAGGC TATGATTCAG TGGCGATGGG TGCGGATATT GAGCTTGGGG GCAATGATCA AAAGTTTAAT TTGCTGGTGG GGCGCTTTTT GCAACGAGCT TATGGCTTGA ATAAAGAGCA GTCTATTATT ACCATGCCTT TATTAGAAGG GCTTGATGGG GTGCAAAAAA TGAGTAAAAG CTTGGGGAAT TATGTGGGGA TCACTGAAGA GCCTAATGCG ATGTTTGGGA AGATCATGAG CGTGAGCGAT GATCTCATGT GGCGCTACTA CACCCTTTTG AGCGCTAAGA CTTTAGAAGA AATTGAAGAC TTAAAACATG GTATTTTAAA CCAAACCTTG CACCCTAAAG CCGTTAAAGA GGATCTCGCT GGTGAAATCG TGGCTCGTTA TTATGATAAT GATCAAGCAT TCAAGGCTAA AGAGCAATTT TCTAAAGTGT TTAGCGCAAA CCTTTTGCCT GAAATTTTAT CAGAGAGCGA TTTTGATGAA GGGGTTGGGA TTTTAGATGT TTTAAAACAG ATTGGCTTTT GCCCATCCAC TTCACAAGCC AGGCGTGATA TTCAAGGGGG AGGGGTAAAG ATTAATCAAG AAGTGATAAA AGATGAGAGT TATCGTTTTG TTAAAGGAAA TTATGTTATA CAGCTTGGTA AGAAAAGATT TATGAAATTA AATATCAACT GAGTAAGATA GGATCCGGCT GCTAACAAAG CCCGAAAGGA AGCTGAGTTG GCTGCTGCCA CCGCTGAGCA ATAACTAGCA TAACCCCTTG GGGCCTCTAA ACGGGTCTTG AGGGGTTTTT TGCTGAAAGG AGGAACTATA TCCGGATATC CACAGGACGG GTGTGGTCGC CATGATCGCG TAGTCGATAG TGGCTCCAAG TAGCGAAGCG AGCAGGACTG GGCGGCGGCC AAAGCGGTCG GACAGTGCTC CGAGAACGGG TGCGCATAGA AATTGCATCA ACGCATATAG CGCTAGCAGC ACGCCATAGT GACTGGCGAT GCTGTCGGAA TGGACGATAT CCCGCAAGAG GCCCGGCAGT ACCGGCATAA CCAAGCCTAT GCCTACAGCA TCCAGGGTGA CGGTGCCGAG GATGACGATG AGCGCATTGT TAGATTTCAT ACACGGTGCC TGACTGCGTT AGCAATTTAA CTGTGATAAA CTACCGCATT AAAGCTTATC 
GATGATAAGC TGTCAAACAT GAGAATTCTT GAAGACGAAA GGGCCTCGTG ATACGCCTAT TTTTATAGGT TAATGTCATG ATAATAATGG TTTCTTAGAC GTCAGGTGGC ACTTTTCGGG GAAATGTGCG CGGAACCCCT ATTTGTTTAT TTTTCTAAAT ACATTCAAAT ATGTATCCGC TCATGAGACA ATAACCCTGA TAAATGCTTC AATAATATTG AAAAAGGAAG AGTATGAGTA TTCAACATTT CCGTGTCGCC CTTATTCCCT TTTTTGCGGC ATTTTGCCTT CCTGTTTTTG CTCACCCAGA AACGCTGGTG AAAGTAAAAG ATGCTGAAGA TCAGTTGGGT GCACGAGTGG GTTACATCGA ACTGGATCTC AACAGCGGTA AGATCCTTGA GAGTTTTCGC CCCGAAGAAC GTTTTCCAAT GATGAGCACT TTTAAAGTTC TGCTATGTGG CGCGGTATTA TCCCGTGTTG ACGCCGGGCA AGAGCAACTC GGTCGCCGCA TACACTATTC TCAGAATGAC TTGGTTGAGT ACTCACCAGT CACAGAAAAG CATCTTACGG ATGGCATGAC AGTAAGAGAA TTATGCAGTG CTGCCATAAC CATGAGTGAT AACACTGCGG CCAACTTACT TCTGACAACG ATCGGAGGAC CGAAGGAGCT AACCGCTTTT TTGCACAACA TGGGGGATCA TGTAACTCGC CTTGATCGTT GGGAACCGGA GCTGAATGAA GCCATACCAA ACGACGAGCG TGACACCACG ATGCCTGCAG CAATGGCAAC AACGTTGCGC AAACTATTAA CTGGCGAACT ACTTACTCTA GCTTCCCGGC AACAATTAAT AGACTGGATG GAGGCGGATA AAGTTGCAGG ACCACTTCTG CGCTCGGCCC TTCCGGCTGG CTGGTTTATT GCTGATAAAT CTGGAGCCGG TGAGCGTGGG TCTCGCGGTA TCATTGCAGC ACTGGGGCCA GATGGTAAGC CCTCCCGTAT CGTAGTTATC TACACGACGG GGAGTCAGGC AACTATGGAT GAACGAAATA GACAGATCGC TGAGATAGGT GCCTCACTGA TTAAGCATTG GTAACTGTCA GACCAAGTTT ACTCATATAT ACTTTAGATT GATTTAAAAC TTCATTTTTA ATTTAAAAGG ATCTAGGTGA AGATCCTTTT TGATAATCTC ATGACCAAAA TCCCTTAACG TGAGTTTTCG TTCCACTGAG CGTCAGACCC CGTAGAAAAG ATCAAAGGAT CTTCTTGAGA TCCTTTTTTT CTGCGCGTAA TCTGCTGCTT GCAAACAAAA AAACCACCGC TACCAGCGGT GGTTTGTTTG CCGGATCAAG AGCTACCAAC TCTTTTTCCG AAGGTAACTG GCTTCAGCAG AGCGCAGATA CCAAATACTG TCCTTCTAGT GTAGCCGTAG TTAGGCCACC ACTTCAAGAA CTCTGTAGCA CCGCCTACAT ACCTCGCTCT GCTAATCCTG TTACCAGTGG CTGCTGCCAG TGGCGATAAG TCGTGTCTTA CCGGGTTGGA CTCAAGACGA TAGTTACCGG ATAAGGCGCA GCGGTCGGGC TGAACGGGGG GTTCGTGCAC ACAGCCCAGC TTGGAGCGAA CGACCTACAC CGAACTGAGA TACCTACAGC GTGAGCTATG AGAAAGCGCC ACGCTTCCCG AAGGGAGAAA GGCGGACAGG TATCCGGTAA GCGGCAGGGT CGGAACAGGA GAGCGCACGA GGGAGCTTCC AGGGGGAAAC GCCTGGTATC TTTATAGTCC TGTCGGGTTT CGCCACCTCT GACTTGAGCG 
TCGATTTTTG TGATGCTCGT CAGGGGGGCG GAGCCTATGG AAAAACGCCA GCAACGCGGC CTTTTTACGG TTCCTGGCCT TTTGCTGGCC TTTTGCTCAC ATGTTCTTTC CTGCGTTATC CCCTGATTCT GTGGATAACC GTATTACCGC CTTTGAGTGA GCTGATACCG CTCGCCGCAG CCGAACGACC GAGCGCAGCG AGTCAGTGAG CGAGGAAGCG GAAGAGCGCC TGATGCGGTA TTTTCTCCTT ACGCATCTGT GCGGTATTTC ACACCGCATA TATGGTGCAC TCTCAGTACA ATCTGCTCTG ATGCCGCATA GTTAAGCCAG TATACACTCC GCTATCGCTA CGTGACTGGG TCATGGCTGC GCCCCGACAC CCGCCAACAC CCGCTGACGC GCCCTGACGG GCTTGTCTGC TCCCGGCATC CGCTTACAGA CAAGCTGTGA CCGTCTCCGG GAGCTGCATG TGTCAGAGGT TTTCACCGTC ATCACCGAAA CGCGCGAGGC AGCTGCGGTA AAGCTCATCA GCGTGGTCGT GAAGCGATTC ACAGATGTCT GCCTGTTCAT CCGCGTCCAG CTCGTTGAGT TTCTCCAGAA GCGTTAATGT CTGGCTTCTG ATAAAGCGGG CCATGTTAAG GGCGGTTTTT TCCTGTTTGG TCACTGATGC CTCCGTGTAA GGGGGATTTC TGTTCATGGG GGTAATGATA CCGATGAAAC GAGAGAGGAT GCTCACGATA CGGGTTACTG ATGATGAACA TGCCCGGTTA CTGGAACGTT GTGAGGGTAA ACAACTGGCG GTATGGATGC GGCGGGACCA GAGAAAAATC ACTCAGGGTC AATGCCAGCG CTTCGTTAAT ACAGATGTAG GTGTTCCACA GGGTAGCCAG CAGCATCCTG CGATGCAGAT CCGGAACATA ATGGTGCAGG GCGCTGACTT CCGCGTTTCC AGACTTTACG AAACACGGAA ACCGAAGACC ATTCATGTTG TTGCTCAGGT CGCAGACGTT TTGCAGCAGC AGTCGCTTCA CGTTCGCTCG CGTATCGGTG ATTCATTCTG CTAACCAGTA AGGCAACCCC GCCAGCCTAG CCGGGTCCTC AACGACAGGA GCACGATCAT GCGCACCCGT GGCCAGGACC CAACGCTGCC CGAGATGCGC CGCGTGCGGC TGCTGGAGAT GGCGGACGCG ATGGATATGT TCTGCCAAGG GTTGGTTTGC GCATTCACAG TTCTCCGCAA GAATTGATTG GCTCCAATTC TTGGAGTGGT GAATCCGTTA GCGAGGTGCC GCCGGCTTCC ATTCAGGTCG AGGTGGCCCG GCTCCATGCA CCGCGACGCA ACGCGGGGAG GCAGACAAGG TATAGGGCGG CGCCTACAAT CCATGCCAAC CCGTTCCATG TGCTCGCCGA GGCGGCATAA ATCGCCGTGA CGATCAGCGG TCCAGTGATC GAAGTTAGGC TGGTAAGAGC CGCGAGCGAT CCTTGAAGCT GTCCCTGATG GTCGTCATCT ACCTGCCTGG ACAGCATGGC CTGCAACGCG GGCATCCCGA TGCCGCCGGA AGCGAGAAGA ATCATAATGG GGAAGGCCAT CCAGCCTCGC GTCGCGAACG CCAGCAAGAC GTAGCCCAGC GCGTCGGCCG CCATGCCGGC GATAATGGCC TGCTTCTCGC CGAAACGTTT GGTGGCGGGA CCAGTGACGA AGGCTTGAGC GAGGGCGTGC AAGATTCCGA ATACCGCAAG CGACAGGCCG ATCATCGTCG CGCTCCAGCG AAAGCGGTCC TCGCCGAAAA TGACCCAGAG CGCTGCCGGC 
ACCTGTCCTA CGAGTTGCAT GATAAAGAAG ACAGTCATAA GTGCGGCGAC GATAGTCATG CCCCGCGCCC ACCGGAAGGA GCTGACTGGG TTGAAGGCTC TCAAGGGCAT CGGTCGACGC TCTCCCTTAT GCGACTCCTG CATTAGGAAG CAGCCCAGTA GTAGGTTGAG GCCGTTGAGC ACCGCCGCCG CAAGGAATGG TGCATGCAAG GAGATGGCGC CCAACAGTCC CCCGGCCACG GGGCCTGCCA CCATACCCAC GCCGAAACAA GCGCTCATGA GCCCGAAGTG GCGAGCCCGA TCTTCCCCAT CGGTGATGTC GGCGATATAG GCGCCAGCAA CCGCACCTGT GGCGCCGGTG ATGCCGGCCA CGATGCGTCC GGCGTAGAGG ATCGAGATCT CGATCCCGCG AAAT
Details for HepyC.01032.a.B1.PW38234
PURIFICATION DATE: 3/27/2017
CONCENTRATION: 37mg/ml
OBSERVED MW: 47kDa
EXPRESSION LEVEL: High Expression
PROTEIN PURIFICATION BUFFER: 25 mM HEPES pH 8.0, 500 mM NaCl, 5% glycerol, 2 mM DTT, and 0.025% azide
EXPRESSION HOST: BL 21 (DE3) Rosetta
VIAL COUNT (approx.): 2
VIAL VOLUME: 200µl
PERCENT IDENTITY: 98
PERCENT COVERAGE: 97
Protocol Notes
notes unavailable
Validated AA Sequence
MAHHHHHHME QKIXVALKEI KRGANEIIGX XYIEKLVRKY YETNERFIVK AGFDPTAPDL HLGHTVLIQK LALLQQYGAR VKFLIGDFTA MIGDPTGKNE TRKPLNREQV XENAKTYEEQ IYKILDQKHT EVCFNSTWLD ALGAKGMIEL CAKFSVARML ERDDFAKRHK ENRPISIVEF LYPLLQGYDS VAMGADIELG GNDQKFNLLV GRFLQRAYGL NKEQSIITMP LLEGLDGVQK MSKSLGNYVG ITEEPNAMFG KIMSVSDDLM WRYYTLLSAK TLEEIEDLKH GILNQTLHPK AVKEDLAGEI VARYYDNDQA FKAKEQFXKV FSANLLPEIL SESDFDEGVG ILDVLKQIGF CPSTSQARRD IQGGGVKINQ EVIKDESYRF VKGNYVIQLG K
Validated NT Sequence
tttcttacca agctgtataa cataatttcc tttaacaaaa cgataactct catcttttat cacttcttga ttaatcttta cccctccccc ttgaatatca cgcctggctt gtgaagtgga tgggcaaaag ccaatctgtt ttaaaacatc taaaatccca accccttcat caaaatcgct ctctgataaa atttcaggca aaaggtttgc gctaaacact ttngnaaatt gctctttngc cttgaatgct tgatcattat cataataacg agccacgatt tcaccagcga gatcctcttt aacggcttta gggtgcaagg tttggtttaa aataccatgt tttaagtctt caatttcttc taaagtctta gcgctcaaaa gggtgtagta gcgccacatg agatcatcgc tcacgctcat gatcttccca aacatcgcat taggctcttc agtgatcccc acataattcc ccaagctttt actcattttt tgcaccccat caagcccttc taataaaggc atggtaataa tagactgctc tttattcaag ccataagctc gttgcaaaaa gcgccccacc agcaaattaa acttttgatc attgccccca agctcaatat ccgcacccat cgccactgaa tcatagcctt gcaacaaagg gtataaaaat tccacgatgc taatggggcg gttttcttta tggcgtttag caaaatcgtc cctttctaac attctagcga ctgaaaactt cgcgcacaat tctatcatgc cctttgcgcc taaagcatcc aaccaagtgg aattaaagca cacttcggtg tgtttttgat ctaaaatctt atagatttgc tcttcataag ttttagcgtt ttcnaagact tgctcccggt ttaagggttt tctcgtttca ttcttacccg taggatcgcc tatcatagcg gtaaaatccc caatcaaaaa cttaacccta gccccatatt gctgcaacaa agccaatttt tggatcaaca ccgtatgccc taaatgcaaa tcgggagcgg taggatcaaa accggcttta acgataaagc gttcattggt ttcataatat ttcctcacca gcttttcaat gtattnnaat ccaatgattt cattagcacc tcttttgatc tcttttaagg ccacnntgat tttttgttcc atatggtggt ggtggtggtg agccat
Expressed Protein Sequence
MAHHHHHHME QKISVALKEI KRGANEIIGL EYIEKLVRKY YETNERFIVK AGFDPTAPDL HLGHTVLIQK LALLQQYGAR VKFLIGDFTA MIGDPTGKNE TRKPLNREQV LENAKTYEEQ IYKILDQKHT EVCFNSTWLD ALGAKGMIEL CAKFSVARML ERDDFAKRHK ENRPISIVEF LYPLLQGYDS VAMGADIELG GNDQKFNLLV GRFLQRAYGL NKEQSIITMP LLEGLDGVQK MSKSLGNYVG ITEEPNAMFG KIMSVSDDLM WRYYTLLSAK TLEEIEDLKH GILNQTLHPK AVKEDLAGEI VARYYDNDQA FKAKEQFSKV FSANLLPEIL SESDFDEGVG ILDVLKQIGF CPSTSQARRD IQGGGVKINQ EVIKDESYRF VKGNYVIQLG KKRFMKLNIN
Full NT Sequence (Expression Vector + Insert)
TAATACGACT CACTATAGGG AGACCACAAC GGTTTCCCTC TAGAAATAAT TTTGTTTAAC TTTAAGAAGG AGATATACCA TGGCTCACCA CCACCACCAC CATATGGAAC AAAAAATCAG TGTGGCCTTA AAAGAGATCA AAAGAGGTGC TAATGAAATC ATTGGATTAG AATACATTGA AAAGCTGGTG AGGAAATATT ATGAAACCAA TGAACGCTTT ATCGTTAAAG CCGGTTTTGA TCCTACCGCT CCCGATTTGC ATTTAGGGCA TACGGTGTTG ATCCAAAAAT TGGCTTTGTT GCAGCAATAT GGGGCTAGGG TTAAGTTTTT GATTGGGGAT TTTACCGCTA TGATAGGCGA TCCTACGGGT AAGAATGAAA CGAGAAAACC CTTAAACCGG GAGCAAGTCT TAGAAAACGC TAAAACTTAT GAAGAGCAAA TCTATAAGAT TTTAGATCAA AAACACACCG AAGTGTGCTT TAATTCCACT TGGTTGGATG CTTTAGGCGC AAAGGGCATG ATAGAATTGT GCGCGAAGTT TTCAGTCGCT AGAATGTTAG AAAGGGACGA TTTTGCTAAA CGCCATAAAG AAAACCGCCC CATTAGCATC GTGGAATTTT TATACCCTTT GTTGCAAGGC TATGATTCAG TGGCGATGGG TGCGGATATT GAGCTTGGGG GCAATGATCA AAAGTTTAAT TTGCTGGTGG GGCGCTTTTT GCAACGAGCT TATGGCTTGA ATAAAGAGCA GTCTATTATT ACCATGCCTT TATTAGAAGG GCTTGATGGG GTGCAAAAAA TGAGTAAAAG CTTGGGGAAT TATGTGGGGA TCACTGAAGA GCCTAATGCG ATGTTTGGGA AGATCATGAG CGTGAGCGAT GATCTCATGT GGCGCTACTA CACCCTTTTG AGCGCTAAGA CTTTAGAAGA AATTGAAGAC TTAAAACATG GTATTTTAAA CCAAACCTTG CACCCTAAAG CCGTTAAAGA GGATCTCGCT GGTGAAATCG TGGCTCGTTA TTATGATAAT GATCAAGCAT TCAAGGCTAA AGAGCAATTT TCTAAAGTGT TTAGCGCAAA CCTTTTGCCT GAAATTTTAT CAGAGAGCGA TTTTGATGAA GGGGTTGGGA TTTTAGATGT TTTAAAACAG ATTGGCTTTT GCCCATCCAC TTCACAAGCC AGGCGTGATA TTCAAGGGGG AGGGGTAAAG ATTAATCAAG AAGTGATAAA AGATGAGAGT TATCGTTTTG TTAAAGGAAA TTATGTTATA CAGCTTGGTA AGAAAAGATT TATGAAATTA AATATCAACT GAGTAAGATA GGATCCGGCT GCTAACAAAG CCCGAAAGGA AGCTGAGTTG GCTGCTGCCA CCGCTGAGCA ATAACTAGCA TAACCCCTTG GGGCCTCTAA ACGGGTCTTG AGGGGTTTTT TGCTGAAAGG AGGAACTATA TCCGGATATC CACAGGACGG GTGTGGTCGC CATGATCGCG TAGTCGATAG TGGCTCCAAG TAGCGAAGCG AGCAGGACTG GGCGGCGGCC AAAGCGGTCG GACAGTGCTC CGAGAACGGG TGCGCATAGA AATTGCATCA ACGCATATAG CGCTAGCAGC ACGCCATAGT GACTGGCGAT GCTGTCGGAA TGGACGATAT CCCGCAAGAG GCCCGGCAGT ACCGGCATAA CCAAGCCTAT GCCTACAGCA TCCAGGGTGA CGGTGCCGAG GATGACGATG AGCGCATTGT TAGATTTCAT ACACGGTGCC TGACTGCGTT AGCAATTTAA CTGTGATAAA CTACCGCATT AAAGCTTATC 
GATGATAAGC TGTCAAACAT GAGAATTCTT GAAGACGAAA GGGCCTCGTG ATACGCCTAT TTTTATAGGT TAATGTCATG ATAATAATGG TTTCTTAGAC GTCAGGTGGC ACTTTTCGGG GAAATGTGCG CGGAACCCCT ATTTGTTTAT TTTTCTAAAT ACATTCAAAT ATGTATCCGC TCATGAGACA ATAACCCTGA TAAATGCTTC AATAATATTG AAAAAGGAAG AGTATGAGTA TTCAACATTT CCGTGTCGCC CTTATTCCCT TTTTTGCGGC ATTTTGCCTT CCTGTTTTTG CTCACCCAGA AACGCTGGTG AAAGTAAAAG ATGCTGAAGA TCAGTTGGGT GCACGAGTGG GTTACATCGA ACTGGATCTC AACAGCGGTA AGATCCTTGA GAGTTTTCGC CCCGAAGAAC GTTTTCCAAT GATGAGCACT TTTAAAGTTC TGCTATGTGG CGCGGTATTA TCCCGTGTTG ACGCCGGGCA AGAGCAACTC GGTCGCCGCA TACACTATTC TCAGAATGAC TTGGTTGAGT ACTCACCAGT CACAGAAAAG CATCTTACGG ATGGCATGAC AGTAAGAGAA TTATGCAGTG CTGCCATAAC CATGAGTGAT AACACTGCGG CCAACTTACT TCTGACAACG ATCGGAGGAC CGAAGGAGCT AACCGCTTTT TTGCACAACA TGGGGGATCA TGTAACTCGC CTTGATCGTT GGGAACCGGA GCTGAATGAA GCCATACCAA ACGACGAGCG TGACACCACG ATGCCTGCAG CAATGGCAAC AACGTTGCGC AAACTATTAA CTGGCGAACT ACTTACTCTA GCTTCCCGGC AACAATTAAT AGACTGGATG GAGGCGGATA AAGTTGCAGG ACCACTTCTG CGCTCGGCCC TTCCGGCTGG CTGGTTTATT GCTGATAAAT CTGGAGCCGG TGAGCGTGGG TCTCGCGGTA TCATTGCAGC ACTGGGGCCA GATGGTAAGC CCTCCCGTAT CGTAGTTATC TACACGACGG GGAGTCAGGC AACTATGGAT GAACGAAATA GACAGATCGC TGAGATAGGT GCCTCACTGA TTAAGCATTG GTAACTGTCA GACCAAGTTT ACTCATATAT ACTTTAGATT GATTTAAAAC TTCATTTTTA ATTTAAAAGG ATCTAGGTGA AGATCCTTTT TGATAATCTC ATGACCAAAA TCCCTTAACG TGAGTTTTCG TTCCACTGAG CGTCAGACCC CGTAGAAAAG ATCAAAGGAT CTTCTTGAGA TCCTTTTTTT CTGCGCGTAA TCTGCTGCTT GCAAACAAAA AAACCACCGC TACCAGCGGT GGTTTGTTTG CCGGATCAAG AGCTACCAAC TCTTTTTCCG AAGGTAACTG GCTTCAGCAG AGCGCAGATA CCAAATACTG TCCTTCTAGT GTAGCCGTAG TTAGGCCACC ACTTCAAGAA CTCTGTAGCA CCGCCTACAT ACCTCGCTCT GCTAATCCTG TTACCAGTGG CTGCTGCCAG TGGCGATAAG TCGTGTCTTA CCGGGTTGGA CTCAAGACGA TAGTTACCGG ATAAGGCGCA GCGGTCGGGC TGAACGGGGG GTTCGTGCAC ACAGCCCAGC TTGGAGCGAA CGACCTACAC CGAACTGAGA TACCTACAGC GTGAGCTATG AGAAAGCGCC ACGCTTCCCG AAGGGAGAAA GGCGGACAGG TATCCGGTAA GCGGCAGGGT CGGAACAGGA GAGCGCACGA GGGAGCTTCC AGGGGGAAAC GCCTGGTATC TTTATAGTCC TGTCGGGTTT CGCCACCTCT GACTTGAGCG 
TCGATTTTTG TGATGCTCGT CAGGGGGGCG GAGCCTATGG AAAAACGCCA GCAACGCGGC CTTTTTACGG TTCCTGGCCT TTTGCTGGCC TTTTGCTCAC ATGTTCTTTC CTGCGTTATC CCCTGATTCT GTGGATAACC GTATTACCGC CTTTGAGTGA GCTGATACCG CTCGCCGCAG CCGAACGACC GAGCGCAGCG AGTCAGTGAG CGAGGAAGCG GAAGAGCGCC TGATGCGGTA TTTTCTCCTT ACGCATCTGT GCGGTATTTC ACACCGCATA TATGGTGCAC TCTCAGTACA ATCTGCTCTG ATGCCGCATA GTTAAGCCAG TATACACTCC GCTATCGCTA CGTGACTGGG TCATGGCTGC GCCCCGACAC CCGCCAACAC CCGCTGACGC GCCCTGACGG GCTTGTCTGC TCCCGGCATC CGCTTACAGA CAAGCTGTGA CCGTCTCCGG GAGCTGCATG TGTCAGAGGT TTTCACCGTC ATCACCGAAA CGCGCGAGGC AGCTGCGGTA AAGCTCATCA GCGTGGTCGT GAAGCGATTC ACAGATGTCT GCCTGTTCAT CCGCGTCCAG CTCGTTGAGT TTCTCCAGAA GCGTTAATGT CTGGCTTCTG ATAAAGCGGG CCATGTTAAG GGCGGTTTTT TCCTGTTTGG TCACTGATGC CTCCGTGTAA GGGGGATTTC TGTTCATGGG GGTAATGATA CCGATGAAAC GAGAGAGGAT GCTCACGATA CGGGTTACTG ATGATGAACA TGCCCGGTTA CTGGAACGTT GTGAGGGTAA ACAACTGGCG GTATGGATGC GGCGGGACCA GAGAAAAATC ACTCAGGGTC AATGCCAGCG CTTCGTTAAT ACAGATGTAG GTGTTCCACA GGGTAGCCAG CAGCATCCTG CGATGCAGAT CCGGAACATA ATGGTGCAGG GCGCTGACTT CCGCGTTTCC AGACTTTACG AAACACGGAA ACCGAAGACC ATTCATGTTG TTGCTCAGGT CGCAGACGTT TTGCAGCAGC AGTCGCTTCA CGTTCGCTCG CGTATCGGTG ATTCATTCTG CTAACCAGTA AGGCAACCCC GCCAGCCTAG CCGGGTCCTC AACGACAGGA GCACGATCAT GCGCACCCGT GGCCAGGACC CAACGCTGCC CGAGATGCGC CGCGTGCGGC TGCTGGAGAT GGCGGACGCG ATGGATATGT TCTGCCAAGG GTTGGTTTGC GCATTCACAG TTCTCCGCAA GAATTGATTG GCTCCAATTC TTGGAGTGGT GAATCCGTTA GCGAGGTGCC GCCGGCTTCC ATTCAGGTCG AGGTGGCCCG GCTCCATGCA CCGCGACGCA ACGCGGGGAG GCAGACAAGG TATAGGGCGG CGCCTACAAT CCATGCCAAC CCGTTCCATG TGCTCGCCGA GGCGGCATAA ATCGCCGTGA CGATCAGCGG TCCAGTGATC GAAGTTAGGC TGGTAAGAGC CGCGAGCGAT CCTTGAAGCT GTCCCTGATG GTCGTCATCT ACCTGCCTGG ACAGCATGGC CTGCAACGCG GGCATCCCGA TGCCGCCGGA AGCGAGAAGA ATCATAATGG GGAAGGCCAT CCAGCCTCGC GTCGCGAACG CCAGCAAGAC GTAGCCCAGC GCGTCGGCCG CCATGCCGGC GATAATGGCC TGCTTCTCGC CGAAACGTTT GGTGGCGGGA CCAGTGACGA AGGCTTGAGC GAGGGCGTGC AAGATTCCGA ATACCGCAAG CGACAGGCCG ATCATCGTCG CGCTCCAGCG AAAGCGGTCC TCGCCGAAAA TGACCCAGAG CGCTGCCGGC 
ACCTGTCCTA CGAGTTGCAT GATAAAGAAG ACAGTCATAA GTGCGGCGAC GATAGTCATG CCCCGCGCCC ACCGGAAGGA GCTGACTGGG TTGAAGGCTC TCAAGGGCAT CGGTCGACGC TCTCCCTTAT GCGACTCCTG CATTAGGAAG CAGCCCAGTA GTAGGTTGAG GCCGTTGAGC ACCGCCGCCG CAAGGAATGG TGCATGCAAG GAGATGGCGC CCAACAGTCC CCCGGCCACG GGGCCTGCCA CCATACCCAC GCCGAAACAA GCGCTCATGA GCCCGAAGTG GCGAGCCCGA TCTTCCCCAT CGGTGATGTC GGCGATATAG GCGCCAGCAA CCGCACCTGT GGCGCCGGTG ATGCCGGCCA CGATGCGTCC GGCGTAGAGG ATCGAGATCT CGATCCCGCG AAAT
Details for HepyC.01032.a.B1.PW38235
PURIFICATION DATE: 3/27/2017
CONCENTRATION: 27mg/ml
OBSERVED MW: 47kDa
EXPRESSION LEVEL: High Expression
PROTEIN PURIFICATION BUFFER: 25 mM HEPES pH 8.0, 500 mM NaCl, 5% glycerol, 2 mM DTT, and 0.025% azide
EXPRESSION HOST: BL 21 (DE3) Rosetta
VIAL COUNT (approx.): 3
VIAL VOLUME: 110µl
PERCENT IDENTITY: 98
PERCENT COVERAGE: 97
Protocol Notes
notes unavailable
Validated AA Sequence
MAHHHHHHME QKIXVALKEI KRGANEIIGX XYIEKLVRKY YETNERFIVK AGFDPTAPDL HLGHTVLIQK LALLQQYGAR VKFLIGDFTA MIGDPTGKNE TRKPLNREQV XENAKTYEEQ IYKILDQKHT EVCFNSTWLD ALGAKGMIEL CAKFSVARML ERDDFAKRHK ENRPISIVEF LYPLLQGYDS VAMGADIELG GNDQKFNLLV GRFLQRAYGL NKEQSIITMP LLEGLDGVQK MSKSLGNYVG ITEEPNAMFG KIMSVSDDLM WRYYTLLSAK TLEEIEDLKH GILNQTLHPK AVKEDLAGEI VARYYDNDQA FKAKEQFXKV FSANLLPEIL SESDFDEGVG ILDVLKQIGF CPSTSQARRD IQGGGVKINQ EVIKDESYRF VKGNYVIQLG K
Validated NT Sequence
tttcttacca agctgtataa cataatttcc tttaacaaaa cgataactct catcttttat cacttcttga ttaatcttta cccctccccc ttgaatatca cgcctggctt gtgaagtgga tgggcaaaag ccaatctgtt ttaaaacatc taaaatccca accccttcat caaaatcgct ctctgataaa atttcaggca aaaggtttgc gctaaacact ttngnaaatt gctctttngc cttgaatgct tgatcattat cataataacg agccacgatt tcaccagcga gatcctcttt aacggcttta gggtgcaagg tttggtttaa aataccatgt tttaagtctt caatttcttc taaagtctta gcgctcaaaa gggtgtagta gcgccacatg agatcatcgc tcacgctcat gatcttccca aacatcgcat taggctcttc agtgatcccc acataattcc ccaagctttt actcattttt tgcaccccat caagcccttc taataaaggc atggtaataa tagactgctc tttattcaag ccataagctc gttgcaaaaa gcgccccacc agcaaattaa acttttgatc attgccccca agctcaatat ccgcacccat cgccactgaa tcatagcctt gcaacaaagg gtataaaaat tccacgatgc taatggggcg gttttcttta tggcgtttag caaaatcgtc cctttctaac attctagcga ctgaaaactt cgcgcacaat tctatcatgc cctttgcgcc taaagcatcc aaccaagtgg aattaaagca cacttcggtg tgtttttgat ctaaaatctt atagatttgc tcttcataag ttttagcgtt ttcnaagact tgctcccggt ttaagggttt tctcgtttca ttcttacccg taggatcgcc tatcatagcg gtaaaatccc caatcaaaaa cttaacccta gccccatatt gctgcaacaa agccaatttt tggatcaaca ccgtatgccc taaatgcaaa tcgggagcgg taggatcaaa accggcttta acgataaagc gttcattggt ttcataatat ttcctcacca gcttttcaat gtattnnaat ccaatgattt cattagcacc tcttttgatc tcttttaagg ccacnntgat tttttgttcc atatggtggt ggtggtggtg agccat
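The validated read above appears to be listed in the reverse-complement orientation relative to the expression construct: its last 24 nucleotides, reverse-complemented, give the start of the tagged open reading frame (ATG GCT followed by the His codons). The following is a minimal sketch of that check, not the pipeline SSGCID uses. The helper names `reverse_complement` and `translate` are hypothetical, and ambiguous bases (`n`) are assumed to translate to `X`.

```python
# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def reverse_complement(seq):
    """Return the reverse complement, ignoring the spaces between 10-mers."""
    cleaned = "".join(seq.split())
    return cleaned.translate(COMPLEMENT)[::-1]


def translate(seq):
    """Translate in frame 1; codons containing N become X; stop at the first stop codon."""
    cleaned = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(cleaned) - 2, 3):
        residue = CODON_TABLE.get(cleaned[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# Last 24 nucleotides of the validated read listed above.
read_tail = "atggtggtggtggtggtgagccat"
forward = reverse_complement(read_tail)
print(forward.upper())     # forward-strand start of the construct
print(translate(forward))  # N-terminal residues encoded by that stretch
```

Applying the same two functions to the whole read would be one way to compare it against the Validated AA Sequence, where uncalled bases would show up as X.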
Expressed Protein Sequence
MAHHHHHHME QKISVALKEI KRGANEIIGL EYIEKLVRKY YETNERFIVK AGFDPTAPDL HLGHTVLIQK LALLQQYGAR VKFLIGDFTA MIGDPTGKNE TRKPLNREQV LENAKTYEEQ IYKILDQKHT EVCFNSTWLD ALGAKGMIEL CAKFSVARML ERDDFAKRHK ENRPISIVEF LYPLLQGYDS VAMGADIELG GNDQKFNLLV GRFLQRAYGL NKEQSIITMP LLEGLDGVQK MSKSLGNYVG ITEEPNAMFG KIMSVSDDLM WRYYTLLSAK TLEEIEDLKH GILNQTLHPK AVKEDLAGEI VARYYDNDQA FKAKEQFSKV FSANLLPEIL SESDFDEGVG ILDVLKQIGF CPSTSQARRD IQGGGVKINQ EVIKDESYRF VKGNYVIQLG KKRFMKLNIN
Full NT Sequence (Expression Vector + Insert)
TAATACGACT CACTATAGGG AGACCACAAC GGTTTCCCTC TAGAAATAAT TTTGTTTAAC TTTAAGAAGG AGATATACCA TGGCTCACCA CCACCACCAC CATATGGAAC AAAAAATCAG TGTGGCCTTA AAAGAGATCA AAAGAGGTGC TAATGAAATC ATTGGATTAG AATACATTGA AAAGCTGGTG AGGAAATATT ATGAAACCAA TGAACGCTTT ATCGTTAAAG CCGGTTTTGA TCCTACCGCT CCCGATTTGC ATTTAGGGCA TACGGTGTTG ATCCAAAAAT TGGCTTTGTT GCAGCAATAT GGGGCTAGGG TTAAGTTTTT GATTGGGGAT TTTACCGCTA TGATAGGCGA TCCTACGGGT AAGAATGAAA CGAGAAAACC CTTAAACCGG GAGCAAGTCT TAGAAAACGC TAAAACTTAT GAAGAGCAAA TCTATAAGAT TTTAGATCAA AAACACACCG AAGTGTGCTT TAATTCCACT TGGTTGGATG CTTTAGGCGC AAAGGGCATG ATAGAATTGT GCGCGAAGTT TTCAGTCGCT AGAATGTTAG AAAGGGACGA TTTTGCTAAA CGCCATAAAG AAAACCGCCC CATTAGCATC GTGGAATTTT TATACCCTTT GTTGCAAGGC TATGATTCAG TGGCGATGGG TGCGGATATT GAGCTTGGGG GCAATGATCA AAAGTTTAAT TTGCTGGTGG GGCGCTTTTT GCAACGAGCT TATGGCTTGA ATAAAGAGCA GTCTATTATT ACCATGCCTT TATTAGAAGG GCTTGATGGG GTGCAAAAAA TGAGTAAAAG CTTGGGGAAT TATGTGGGGA TCACTGAAGA GCCTAATGCG ATGTTTGGGA AGATCATGAG CGTGAGCGAT GATCTCATGT GGCGCTACTA CACCCTTTTG AGCGCTAAGA CTTTAGAAGA AATTGAAGAC TTAAAACATG GTATTTTAAA CCAAACCTTG CACCCTAAAG CCGTTAAAGA GGATCTCGCT GGTGAAATCG TGGCTCGTTA TTATGATAAT GATCAAGCAT TCAAGGCTAA AGAGCAATTT TCTAAAGTGT TTAGCGCAAA CCTTTTGCCT GAAATTTTAT CAGAGAGCGA TTTTGATGAA GGGGTTGGGA TTTTAGATGT TTTAAAACAG ATTGGCTTTT GCCCATCCAC TTCACAAGCC AGGCGTGATA TTCAAGGGGG AGGGGTAAAG ATTAATCAAG AAGTGATAAA AGATGAGAGT TATCGTTTTG TTAAAGGAAA TTATGTTATA CAGCTTGGTA AGAAAAGATT TATGAAATTA AATATCAACT GAGTAAGATA GGATCCGGCT GCTAACAAAG CCCGAAAGGA AGCTGAGTTG GCTGCTGCCA CCGCTGAGCA ATAACTAGCA TAACCCCTTG GGGCCTCTAA ACGGGTCTTG AGGGGTTTTT TGCTGAAAGG AGGAACTATA TCCGGATATC CACAGGACGG GTGTGGTCGC CATGATCGCG TAGTCGATAG TGGCTCCAAG TAGCGAAGCG AGCAGGACTG GGCGGCGGCC AAAGCGGTCG GACAGTGCTC CGAGAACGGG TGCGCATAGA AATTGCATCA ACGCATATAG CGCTAGCAGC ACGCCATAGT GACTGGCGAT GCTGTCGGAA TGGACGATAT CCCGCAAGAG GCCCGGCAGT ACCGGCATAA CCAAGCCTAT GCCTACAGCA TCCAGGGTGA CGGTGCCGAG GATGACGATG AGCGCATTGT TAGATTTCAT ACACGGTGCC TGACTGCGTT AGCAATTTAA CTGTGATAAA CTACCGCATT AAAGCTTATC 
GATGATAAGC TGTCAAACAT GAGAATTCTT GAAGACGAAA GGGCCTCGTG ATACGCCTAT TTTTATAGGT TAATGTCATG ATAATAATGG TTTCTTAGAC GTCAGGTGGC ACTTTTCGGG GAAATGTGCG CGGAACCCCT ATTTGTTTAT TTTTCTAAAT ACATTCAAAT ATGTATCCGC TCATGAGACA ATAACCCTGA TAAATGCTTC AATAATATTG AAAAAGGAAG AGTATGAGTA TTCAACATTT CCGTGTCGCC CTTATTCCCT TTTTTGCGGC ATTTTGCCTT CCTGTTTTTG CTCACCCAGA AACGCTGGTG AAAGTAAAAG ATGCTGAAGA TCAGTTGGGT GCACGAGTGG GTTACATCGA ACTGGATCTC AACAGCGGTA AGATCCTTGA GAGTTTTCGC CCCGAAGAAC GTTTTCCAAT GATGAGCACT TTTAAAGTTC TGCTATGTGG CGCGGTATTA TCCCGTGTTG ACGCCGGGCA AGAGCAACTC GGTCGCCGCA TACACTATTC TCAGAATGAC TTGGTTGAGT ACTCACCAGT CACAGAAAAG CATCTTACGG ATGGCATGAC AGTAAGAGAA TTATGCAGTG CTGCCATAAC CATGAGTGAT AACACTGCGG CCAACTTACT TCTGACAACG ATCGGAGGAC CGAAGGAGCT AACCGCTTTT TTGCACAACA TGGGGGATCA TGTAACTCGC CTTGATCGTT GGGAACCGGA GCTGAATGAA GCCATACCAA ACGACGAGCG TGACACCACG ATGCCTGCAG CAATGGCAAC AACGTTGCGC AAACTATTAA CTGGCGAACT ACTTACTCTA GCTTCCCGGC AACAATTAAT AGACTGGATG GAGGCGGATA AAGTTGCAGG ACCACTTCTG CGCTCGGCCC TTCCGGCTGG CTGGTTTATT GCTGATAAAT CTGGAGCCGG TGAGCGTGGG TCTCGCGGTA TCATTGCAGC ACTGGGGCCA GATGGTAAGC CCTCCCGTAT CGTAGTTATC TACACGACGG GGAGTCAGGC AACTATGGAT GAACGAAATA GACAGATCGC TGAGATAGGT GCCTCACTGA TTAAGCATTG GTAACTGTCA GACCAAGTTT ACTCATATAT ACTTTAGATT GATTTAAAAC TTCATTTTTA ATTTAAAAGG ATCTAGGTGA AGATCCTTTT TGATAATCTC ATGACCAAAA TCCCTTAACG TGAGTTTTCG TTCCACTGAG CGTCAGACCC CGTAGAAAAG ATCAAAGGAT CTTCTTGAGA TCCTTTTTTT CTGCGCGTAA TCTGCTGCTT GCAAACAAAA AAACCACCGC TACCAGCGGT GGTTTGTTTG CCGGATCAAG AGCTACCAAC TCTTTTTCCG AAGGTAACTG GCTTCAGCAG AGCGCAGATA CCAAATACTG TCCTTCTAGT GTAGCCGTAG TTAGGCCACC ACTTCAAGAA CTCTGTAGCA CCGCCTACAT ACCTCGCTCT GCTAATCCTG TTACCAGTGG CTGCTGCCAG TGGCGATAAG TCGTGTCTTA CCGGGTTGGA CTCAAGACGA TAGTTACCGG ATAAGGCGCA GCGGTCGGGC TGAACGGGGG GTTCGTGCAC ACAGCCCAGC TTGGAGCGAA CGACCTACAC CGAACTGAGA TACCTACAGC GTGAGCTATG AGAAAGCGCC ACGCTTCCCG AAGGGAGAAA GGCGGACAGG TATCCGGTAA GCGGCAGGGT CGGAACAGGA GAGCGCACGA GGGAGCTTCC AGGGGGAAAC GCCTGGTATC TTTATAGTCC TGTCGGGTTT CGCCACCTCT GACTTGAGCG 
TCGATTTTTG TGATGCTCGT CAGGGGGGCG GAGCCTATGG AAAAACGCCA GCAACGCGGC CTTTTTACGG TTCCTGGCCT TTTGCTGGCC TTTTGCTCAC ATGTTCTTTC CTGCGTTATC CCCTGATTCT GTGGATAACC GTATTACCGC CTTTGAGTGA GCTGATACCG CTCGCCGCAG CCGAACGACC GAGCGCAGCG AGTCAGTGAG CGAGGAAGCG GAAGAGCGCC TGATGCGGTA TTTTCTCCTT ACGCATCTGT GCGGTATTTC ACACCGCATA TATGGTGCAC TCTCAGTACA ATCTGCTCTG ATGCCGCATA GTTAAGCCAG TATACACTCC GCTATCGCTA CGTGACTGGG TCATGGCTGC GCCCCGACAC CCGCCAACAC CCGCTGACGC GCCCTGACGG GCTTGTCTGC TCCCGGCATC CGCTTACAGA CAAGCTGTGA CCGTCTCCGG GAGCTGCATG TGTCAGAGGT TTTCACCGTC ATCACCGAAA CGCGCGAGGC AGCTGCGGTA AAGCTCATCA GCGTGGTCGT GAAGCGATTC ACAGATGTCT GCCTGTTCAT CCGCGTCCAG CTCGTTGAGT TTCTCCAGAA GCGTTAATGT CTGGCTTCTG ATAAAGCGGG CCATGTTAAG GGCGGTTTTT TCCTGTTTGG TCACTGATGC CTCCGTGTAA GGGGGATTTC TGTTCATGGG GGTAATGATA CCGATGAAAC GAGAGAGGAT GCTCACGATA CGGGTTACTG ATGATGAACA TGCCCGGTTA CTGGAACGTT GTGAGGGTAA ACAACTGGCG GTATGGATGC GGCGGGACCA GAGAAAAATC ACTCAGGGTC AATGCCAGCG CTTCGTTAAT ACAGATGTAG GTGTTCCACA GGGTAGCCAG CAGCATCCTG CGATGCAGAT CCGGAACATA ATGGTGCAGG GCGCTGACTT CCGCGTTTCC AGACTTTACG AAACACGGAA ACCGAAGACC ATTCATGTTG TTGCTCAGGT CGCAGACGTT TTGCAGCAGC AGTCGCTTCA CGTTCGCTCG CGTATCGGTG ATTCATTCTG CTAACCAGTA AGGCAACCCC GCCAGCCTAG CCGGGTCCTC AACGACAGGA GCACGATCAT GCGCACCCGT GGCCAGGACC CAACGCTGCC CGAGATGCGC CGCGTGCGGC TGCTGGAGAT GGCGGACGCG ATGGATATGT TCTGCCAAGG GTTGGTTTGC GCATTCACAG TTCTCCGCAA GAATTGATTG GCTCCAATTC TTGGAGTGGT GAATCCGTTA GCGAGGTGCC GCCGGCTTCC ATTCAGGTCG AGGTGGCCCG GCTCCATGCA CCGCGACGCA ACGCGGGGAG GCAGACAAGG TATAGGGCGG CGCCTACAAT CCATGCCAAC CCGTTCCATG TGCTCGCCGA GGCGGCATAA ATCGCCGTGA CGATCAGCGG TCCAGTGATC GAAGTTAGGC TGGTAAGAGC CGCGAGCGAT CCTTGAAGCT GTCCCTGATG GTCGTCATCT ACCTGCCTGG ACAGCATGGC CTGCAACGCG GGCATCCCGA TGCCGCCGGA AGCGAGAAGA ATCATAATGG GGAAGGCCAT CCAGCCTCGC GTCGCGAACG CCAGCAAGAC GTAGCCCAGC GCGTCGGCCG CCATGCCGGC GATAATGGCC TGCTTCTCGC CGAAACGTTT GGTGGCGGGA CCAGTGACGA AGGCTTGAGC GAGGGCGTGC AAGATTCCGA ATACCGCAAG CGACAGGCCG ATCATCGTCG CGCTCCAGCG AAAGCGGTCC TCGCCGAAAA TGACCCAGAG CGCTGCCGGC 
ACCTGTCCTA CGAGTTGCAT GATAAAGAAG ACAGTCATAA GTGCGGCGAC GATAGTCATG CCCCGCGCCC ACCGGAAGGA GCTGACTGGG TTGAAGGCTC TCAAGGGCAT CGGTCGACGC TCTCCCTTAT GCGACTCCTG CATTAGGAAG CAGCCCAGTA GTAGGTTGAG GCCGTTGAGC ACCGCCGCCG CAAGGAATGG TGCATGCAAG GAGATGGCGC CCAACAGTCC CCCGGCCACG GGGCCTGCCA CCATACCCAC GCCGAAACAA GCGCTCATGA GCCCGAAGTG GCGAGCCCGA TCTTCCCCAT CGGTGATGTC GGCGATATAG GCGCCAGCAA CCGCACCTGT GGCGCCGGTG ATGCCGGCCA CGATGCGTCC GGCGTAGAGG ATCGAGATCT CGATCCCGCG AAAT