HepyC.00852.a

GTPase Der (GTP-binding protein EngA)

CENTER ID: HepyC.00852.a
ORGANISM: Helicobacter pylori G27
ASSOCIATED DISEASE:
CURRENT STATUS: crystallized
COMMUNITY REQUEST: True
NIH RISK GROUP: 2
SELECT AGENT: False
NIH PRIORITY PATHOGENS CATEGORY: I

Ordering Clones & Proteins

If materials are available for this target, they are listed below. Materials can be ordered from SSGCID using the button in the "order material" column, which adds the material to a virtual cart. You may order multiple materials at a time at no cost to you, as this contract is funded by NIAID. When you are ready to place your order, click the "Place Order" link, which appears in the top right corner of the page after you place your first item in your cart.

Clones*

CENTER REFERENCE ID          DOMAIN/REGION DESCRIPTION   AA START   AA STOP
HepyC.00852.a.AE1.GE44057    full length                 1          460
HepyC.00852.a.B1.GE40944     full length                 1          460
* SSGCID clones represent un-induced expression constructs which have been verified by sequencing from vector primers. Clones may contain a tag, such as N-term 6xHis. Get sequence information using the button in the "info" column.
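As a minimal sketch of how the tag mentioned in the footnote above could be located in a construct sequence, the snippet below searches for a run of histidines. The helper name `find_his_tag`, the six-residue threshold, and the rule used to label the terminus are illustrative assumptions, not part of the SSGCID record.

```python
import re


# Hypothetical helper: the name, the six-residue threshold, and the
# "first half vs. second half" terminus rule are illustrative assumptions.
def find_his_tag(protein: str, min_len: int = 6):
    """Return (start, end, terminus) for the first run of >= min_len His, or None."""
    # Drop the ten-residue block spacing used on this page.
    seq = "".join(protein.split()).upper()
    match = re.search("H{%d,}" % min_len, seq)
    if match is None:
        return None
    start, end = match.start(), match.end()
    # Label the tag by which half of the sequence the run starts in.
    terminus = "N-term" if start < len(seq) / 2 else "C-term"
    return start, end, terminus


if __name__ == "__main__":
    print(find_his_tag("MAHHHHHHMN TSHKTLKTIA"))
    print(find_his_tag("AKDKKSAQQN GHHHHHH"))
```

The two example fragments are the first residues of the B1 expected protein sequence and the last residues of the AE1 expected protein sequence listed further down this page.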

Proteins

CENTER REFERENCE ID          DOMAIN/REGION DESCRIPTION   AA START   AA STOP
HepyC.00852.a.B1.PS38262     full length                 1          460
HepyC.00852.a.AE1.PS38657    full length                 1          460

External Resources

RESOURCE REFERENCE ID
BV-BRC: fig|563041.6.peg.851
UniProt: B5Z7J9

Sequences

These are the native gene sequences; the sequences of constructs derived from them may differ due to codon optimization or other protocols. To find the specific sequence of any material you have ordered, click the "info" button next to the name of that material.
AA Sequence
MNTSHKTLKT IAILGQPNVG KSSLFNRLAR ERIAITSDFA GTTRDINKRK IALNGHEVEL LDTGGMAKDA LLSKEIKALN LKAAQMSDLI LYVVDGKSIP SDEDIKLFRE VFKTNPNCFL VINKIDNDKE KERAYAFSSF GMPKSFNISV SHNRGISALI DAVLNALSLN QIIEQDLDAD ILESLENNAP EEETKEEIIQ VGIIGRVNVG KSSLLNALTK KERSLVSSVA GTTIDPIDET ILIGDQKICF VDTAGIRHRG KILGIEKYAL ERTQKALEKS HIALLVLDVS APFVELDEKI SSLADKHSLG IILILNKWDI RYAPYEEIMA TLKRKFRFLE YAPVITTSCL KARHIDEIKH KIIEVYECFS RRIPTSLLNS VITQATQKHP LPSDGGKLVK VYYATQFATK PPQISLIMNR PKALHFSYKR YLINTLRKEF NFLGTPLILN AKDKKSAQQN
NT Sequence
atgaatacaa gccataaaac tttaaaaacc attgcgattt taggccagcc taatgtgggg aaaagctcgt tatttaaccg cctggctaga gaaaggatcg ctatcacttc agattttgca ggcactacac gagacattaa caaacgaaaa atcgcattga atggccatga agtggaattg ctagatacag ggggcatggc taaagacgct cttttgtcta aagaaatcaa agcccttaat ttaaaagccg ctcaaatgag cgatttgatt ttgtatgttg tggatggcaa gtctatccct agcgatgaag acatcaagct ttttagagag gtttttaaaa ccaaccctaa ctgcttttta gtgatcaata aaattgataa cgataaagaa aaagagcgag cttatgcgtt ttcttctttt ggcatgccaa agagttttaa tatttccgtt tcgcacaata gaggcattag tgcattgatt gatgcggtat tgaacgcgct gagtttaaac caaatcatag agcaagattt ggatgcggat attttagaaa gcttagaaaa taacgcacca gaagaagaaa ctaaagaaga gatcattcaa gtaggcatca ttgggagggt gaatgtgggc aaaagctcgc tcttaaacgc gctcacgaaa aaagaaagga gccttgtttc tagcgtggct ggcacgacca ttgaccccat agatgaaacc attctcatag gcgatcaaaa aatttgcttt gtggataccg ctggcatcag gcataggggt aagattttag gcattgaaaa atacgcacta gaacgcacgc aaaaagcctt agaaaaatcc cacattgcgc ttttagtttt agacgtgagc gctccttttg tggaattgga cgaaaagatc agctccttag cggataaaca ctctttaggc atcattctta ttctaaacaa atgggacatc cgctacgccc cttatgaaga aatcatggca actctaaaaa ggaaattccg ctttttagaa tacgcccctg tgatcacaac cagctgctta aaagcgcgcc atatagatga aatcaagcat aaaatcatag aagtctatga gtgtttttcc agacgtattc ccacgagcct actcaatagc gtgatcactc aagccaccca aaaacacccc ttaccaagcg atggagggaa attagtgaaa gtgtattacg ccacgcaatt tgccaccaaa ccccctcaaa tctctcttat catgaatcgc cctaaagcct tgcatttcag ttacaaacgc tatttgatca acaccttaag gaaagaattt aattttttag gcacgccttt aatccttaac gctaaagata aaaaaagcgc ccaacaaaat taa
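The relationship between the NT and AA sequences above can be checked with a short translation routine. This is a sketch only: it assumes the standard codon assignments, strips the ten-character block spacing, and reports any codon containing an ambiguous base as X. The function name `translate` is illustrative.

```python
# Standard codon assignments, listed in T, C, A, G order for each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(nt: str) -> str:
    """Translate a nucleotide string in frame 1, stopping at the first stop codon."""
    # Remove block spacing and normalise case.
    seq = "".join(nt.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        # Codons with ambiguous bases (for example n) are reported as X.
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the NT sequence above.
    print(translate("atgaatacaa gccataaaac tttaaaaacc"))
```

Applied to the first three blocks of the NT sequence, this yields the first ten residues of the AA sequence, MNTSHKTLKT.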
Details for HepyC.00852.a.AE1.GE44057
HARVESTED ON: 1/7/2021
SEQUENCED ON: 1/8/2021
EXPECTED MW: 52 kDa
OBSERVED MW: 52 kDa
ANTIBIOTIC MARKER: ampicillin
COUNT OF EXPRESSION COLONIES: Good (10-50)
TOTAL EXPRESSION LEVEL: High Expression
SOLUBLE EXPRESSION LEVEL: High Expression
EXPRESSION HOST: BL 21 (DE3) Rosetta
SEQUENCING RESULT:
PERCENT IDENTITY: 90
PERCENT COVERAGE: 100
Validated AA Sequence
MNTXHXTLKT IXXLGQPXXG KSSLFNRLAR ERIAITSDFA GTTRDINKRK IALNGHEVEL LDTGGMAKDA LLXKEIXALN LKAAQMSDLI LYVXDGKSIP SDEDIXLFRE VFKTNPNCFL VINKIDNDKE KERAYAFSSF GMPKSFNISX SHXXGIXALI DAVLNALSLN XIIXQXLDAD ILESLXNNAP EXXXXXEIIQ VGIIGRXNVG KSSLLNALTK KERSLXSSVA GTTIDPIDET ILIGDQKICF VDTAGIRHRG KILGIEKYAL ERTQKXLEXS HIALLVLXVS APFVEXDEKI XSLADKXXLG IIXIXNKWDI RYAPYEEIMA XXKRKFRFLE YAPVITTXCL KARHIDEIKH KIIEVYECFS RRIPTSLLNX VITQATQKHP XXSDGGKLVK VYYATQFATK PPQIXXIMNR XKALHFSYKR YLINTLRKEF NFLGTPLILN AKDKKSAQQN XGHHHHHH
Validated NT Sequence
tccgttaatg atgatgatgg tggtgcccan cattttgttg ggcgcttttt ttatctttag cgttaaggat taaaggcgtg cctaaaaaat taaattcttt ccttaaggtg ttgatcaaat ancgtttgta actgaaatgc aaggctttan ggcgattcat gataananag atttgagggg gtttggtggc aaattgcgtg gcgtaataca ctttcactaa tttccctcca tcgcttgnna angggtgttt ttgggtggct tgagtgatca cnctattgag taggctcgtg ggaatacgtc tggaaaaaca ctcatanact tctatgattt tatgcttgat ttcatctata tggcgcgctt ttaagcanct ggttgtgatc acaggggcgt attctaaaaa gcggaatttc ctttttanan ttgccatgat ttcttcataa ggggcgtanc ggatgtccca tttgtttana ataanaatga tgcctaaana ntgtttatcc gctaanganc tgatcttttc gtccnattcc acaaaaggag cgctcacgnc taaaactaaa agcgcaatgt ggganttttc taagnctttt tgcgtgcgtt ctagtgcgta tttttcaatg cctaaaatct tacccctatg cctgatgcca gcggtatcca caaagcaaat tttttgatcg cctatgagaa tggtttcatc tatggggtca atggtcgtgc cagccacgct agaancaagg ctcctttctt ttttcgtgag cgcgtttaag agcgagcttt tgcccacatt cnccctccca atgatgccta cttgaatgat ctctnnttna gnttnttntt cnggngcgtt attttntaag ctttctaaaa tatccgcatc caantcttgc tntatgattn ggtttaaact cagcgcgttc aataccgcat caatcaatgc antaatgcct ntatngtgng aaanggaaat attaaaactc tttggcatgc caaaagaaga aaacgcataa gctcgctctt tttctttatc gttatcaatt ttattgatca ctaaaaagca gttagggttg gttttaaaaa cctctctaaa aagcnngatg tcttcatcgc tagggataga cttgccatcc ncaacataca aaatcaaatc gctcatttga gcggctttta aattaagggc tnngatttct ttagncaaaa gagcgtcttt agccatgccc cctgtatcta gcaattccac ttcatggcca ttcaatgcga tttttcgttt gttaatgtct cgtgtagtgc ctgcaaaatc tgaagtgata gcgatccttt ctctagccag gcggttaaat aacgagcttt tccccncant aggctggcct aanntnncaa tggtttttaa agtnntatgg ntngtattca tgggagaga
Expected Protein Sequence
MNTSHKTLKT IAILGQPNVG KSSLFNRLAR ERIAITSDFA GTTRDINKRK IALNGHEVEL LDTGGMAKDA LLSKEIKALN LKAAQMSDLI LYVVDGKSIP SDEDIKLFRE VFKTNPNCFL VINKIDNDKE KERAYAFSSF GMPKSFNISV SHNRGISALI DAVLNALSLN QIIEQDLDAD ILESLENNAP EEETKEEIIQ VGIIGRVNVG KSSLLNALTK KERSLVSSVA GTTIDPIDET ILIGDQKICF VDTAGIRHRG KILGIEKYAL ERTQKALEKS HIALLVLDVS APFVELDEKI SSLADKHSLG IILILNKWDI RYAPYEEIMA TLKRKFRFLE YAPVITTSCL KARHIDEIKH KIIEVYECFS RRIPTSLLNS VITQATQKHP LPSDGGKLVK VYYATQFATK PPQISLIMNR PKALHFSYKR YLINTLRKEF NFLGTPLILN AKDKKSAQQN GHHHHHH
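An expected molecular weight such as the EXPECTED MW field above can be estimated from a protein sequence by summing average residue masses and adding one water. The record does not say how SSGCID computes its value, so the sketch below is an assumption about the general approach; the function name is illustrative and the masses are standard average residue masses in daltons.

```python
# Average residue masses in daltons (amino acid minus one water).
RESIDUE_MASS = {
    "A": 71.0788, "R": 156.1875, "N": 114.1038, "D": 115.0886,
    "C": 103.1388, "E": 129.1155, "Q": 128.1307, "G": 57.0519,
    "H": 137.1411, "I": 113.1594, "L": 113.1594, "K": 128.1741,
    "M": 131.1926, "F": 147.1766, "P": 97.1167, "S": 87.0782,
    "T": 101.1051, "W": 186.2132, "Y": 163.1760, "V": 99.1326,
}
WATER = 18.01528  # one water for the free N- and C-termini


def molecular_weight_kda(protein: str) -> float:
    """Estimate the average molecular weight of an unmodified protein in kDa."""
    # Remove block spacing and normalise case.
    seq = "".join(protein.split()).upper()
    unknown = set(seq) - set(RESIDUE_MASS)
    if unknown:
        # Ambiguous residues such as X have no defined mass.
        raise ValueError("unsupported residues: %s" % sorted(unknown))
    return (sum(RESIDUE_MASS[aa] for aa in seq) + WATER) / 1000.0


if __name__ == "__main__":
    print(round(molecular_weight_kda("MNTSHKTLKT IAILGQPNVG"), 3))
```

Sequences containing X, such as the validated AA sequence of this clone, are rejected by this sketch, so the expected protein sequence is the one to pass in.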
Full NT Sequence (Expression Vector + Insert)
tggcgaatgg gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc acattaacgc ttacaattta ggtggcactt ttcggggaaa tgtgcgcgga acccctattt gtttattttt ctaaatacat tcaaatatgt atccgctcat gagacaataa ccctgataaa tgcttcaata atttgaaaaa ggaagagtat gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg tattgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc tgcagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc ggcccttccg gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc actgattaag cattggtaac tgtcagacca agtttactca tatatacttt agattgattt aaaacttcat ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac caaaatccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga gcgaacgacc tacaccgaac tgagatacct acagcgtgag 
ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc gcatatatgg tgcactctca gtacaatctg ctctgatgcc gcatagttaa gccagtatac actccgctat cgctacgtga ctgggtcatg gctgcgcccc gacacccgcc aacacccgct gacgcgccct gacgggcttg tctgctcccg gcatccgctt acagacaagc tgtgaccgtc tccgggagct gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc gaggcagctg cggtaaagct catcagcgtg gtcgtgaagc gattcacaga tgtctgcctg ttcatccgcg tccagctcgt tgagtttctc cagaagcgtt aatgtctggc ttctgataaa gcgggccatg ttaagggcgg ttttttcctg tttggtcact gatgcctccg tgtaaggggg atttctgttc atgggggtaa tgataccgat gaaacgagag aggatgctca cgatacgggt tactgatgat gaacatgccc ggttactgga acgttgtgag ggtaaacaac tggcggtatg gatgcggcgg gaccagagaa aaatcactca gggtcaatgc cagcgcttcg ttaatacaga tgtaggtgtt ccacagggta gccagcagca tcctgcgatg cagatccgga acataatggt gcagggcgct gacttccgcg tttccagact ttacgaaaca cggaaaccga agaccattca tgttgttgct caggtcgcag acgttttgca gcagcagtcg cttcacgttc gctcgcgtat cggtgattca ttctgctaac cagtaaggca accccgccag cctagccggg tcctcaacga caggagcacg atcatgcgca cccgtggggc cgccatgccg gcgataatgg cctgcttctc gccgaaacgt ttggtggcgg gaccagtgac gaaggcttga gcgagggcgt gcaagattcc gaataccgca agcgacaggc cgatcatcgt cgcgctccag cgaaagcggt cctcgccgaa aatgacccag agcgctgccg gcacctgtcc tacgagttgc atgataaaga agacagtcat aagtgcggcg acgatagtca tgccccgcgc ccaccggaag gagctgactg ggttgaaggc tctcaagggc atcggtcgag atcccggtgc ctaatgagtg agctaactta cattaattgc gttgcgctca ctgcccgctt tccagtcggg aaacctgtcg tgccagctgc attaatgaat cggccaacgc gcggggagag gcggtttgcg tattgggcgc cagggtggtt tttcttttca ccagtgagac gggcaacagc tgattgccct tcaccgcctg gccctgagag agttgcagca agcggtccac 
gctggtttgc cccagcaggc gaaaatcctg tttgatggtg gttaacggcg ggatataaca tgagctgtct tcggtatcgt cgtatcccac taccgagata tccgcaccaa cgcgcagccc ggactcggta atggcgcgca ttgcgcccag cgccatctga tcgttggcaa ccagcatcgc agtgggaacg atgccctcat tcagcatttg catggtttgt tgaaaaccgg acatggcact ccagtcgcct tcccgttccg ctatcggctg aatttgattg cgagtgagat atttatgcca gccagccaga cgcagacgcg ccgagacaga acttaatggg cccgctaaca gcgcgatttg ctggtgaccc aatgcgacca gatgctccac gcccagtcgc gtaccgtctt catgggagaa aataatactg ttgatgggtg tctggtcaga gacatcaaga aataacgccg gaacattagt gcaggcagct tccacagcaa tggcatcctg gtcatccagc ggatagttaa tgatcagccc actgacgcgt tgcgcgagaa gattgtgcac cgccgcttta caggcttcga cgccgcttcg ttctaccatc gacaccacca cgctggcacc cagttgatcg gcgcgagatt taatcgccgc gacaatttgc gacggcgcgt gcagggccag actggaggtg gcaacgccaa tcagcaacga ctgtttgccc gccagttgtt gtgccacgcg gttgggaatg taattcagct ccgccatcgc cgcttccact ttttcccgcg ttttcgcaga aacgtggctg gcctggttca ccacgcggga aacggtctga taagagacac cggcatactc tgcgacatcg tataacgtta ctggtttcac attcaccacc ctgaattgac tctcttccgg gcgctatcat gccataccgc gaaaggtttt gcgccattcg atggtgtccg ggatctcgac gctctccctt atgcgactcc tgcattagga agcagcccag tagtaggttg aggccgttga gcaccgccgc cgcaaggaat ggtgcatgca aggagatggc gcccaacagt cccccggcca cggggcctgc caccataccc acgccgaaac aagcgctcat gagcccgaag tggcgagccc gatcttcccc atcggtgatg tcggcgatat aggcgccagc aaccgcacct gtggcgccgg tgatgccggc cacgatgcgt ccggcgtaga ggatcgagat cgatctcgat cccgcgaaat taatacgact cactataggg gaattgtgag cggataacaa ttcccctcta gaaataattt tgtttaactt taagaaggag tctctcccat gaatacaagc cataaaactt taaaaaccat tgcgatttta ggccagccta atgtggggaa aagctcgtta tttaaccgcc tggctagaga aaggatcgct atcacttcag attttgcagg cactacacga gacattaaca aacgaaaaat cgcattgaat ggccatgaag tggaattgct agatacaggg ggcatggcta aagacgctct tttgtctaaa gaaatcaaag cccttaattt aaaagccgct caaatgagcg atttgatttt gtatgttgtg gatggcaagt ctatccctag cgatgaagac atcaagcttt ttagagaggt ttttaaaacc aaccctaact gctttttagt gatcaataaa attgataacg ataaagaaaa agagcgagct tatgcgtttt cttcttttgg catgccaaag 
agttttaata tttccgtttc gcacaataga ggcattagtg cattgattga tgcggtattg aacgcgctga gtttaaacca aatcatagag caagatttgg atgcggatat tttagaaagc ttagaaaata acgcaccaga agaagaaact aaagaagaga tcattcaagt aggcatcatt gggagggtga atgtgggcaa aagctcgctc ttaaacgcgc tcacgaaaaa agaaaggagc cttgtttcta gcgtggctgg cacgaccatt gaccccatag atgaaaccat tctcataggc gatcaaaaaa tttgctttgt ggataccgct ggcatcaggc ataggggtaa gattttaggc attgaaaaat acgcactaga acgcacgcaa aaagccttag aaaaatccca cattgcgctt ttagttttag acgtgagcgc tccttttgtg gaattggacg aaaagatcag ctccttagcg gataaacact ctttaggcat cattcttatt ctaaacaaat gggacatccg ctacgcccct tatgaagaaa tcatggcaac tctaaaaagg aaattccgct ttttagaata cgcccctgtg atcacaacca gctgcttaaa agcgcgccat atagatgaaa tcaagcataa aatcatagaa gtctatgagt gtttttccag acgtattccc acgagcctac tcaatagcgt gatcactcaa gccacccaaa aacacccctt accaagcgat ggagggaaat tagtgaaagt gtattacgcc acgcaatttg ccaccaaacc ccctcaaatc tctcttatca tgaatcgccc taaagccttg catttcagtt acaaacgcta tttgatcaac accttaagga aagaatttaa ttttttaggc acgcctttaa tccttaacgc taaagataaa aaaagcgccc aacaaaatgg gcaccaccat catcatcatt aacggatccg aattcgagct ccgtcgacaa gcttgcggcc gcactcgagc accaccacca ccaccactga gatccggctg ctaacaaagc ccgaaaggaa gctgagttgg ctgctgccac cgctgagcaa taactagcat aaccccttgg ggcctctaaa cgggtcttga ggggtttttt gctgaaagga ggaactatat ccggat
Details for HepyC.00852.a.B1.GE40944
HARVESTED ON: 12/20/2016
SEQUENCED ON: 12/27/2016
EXPECTED MW: 53 kDa
OBSERVED MW: 53 kDa
ANTIBIOTIC MARKER: ampicillin
COUNT OF EXPRESSION COLONIES: Many (50-100)
TOTAL EXPRESSION LEVEL: High Expression
SOLUBLE EXPRESSION LEVEL: High Expression
EXPRESSION HOST: BL 21 (DE3) Rosetta
SEQUENCING RESULT:
PERCENT IDENTITY: 100
PERCENT COVERAGE: 98
Validated AA Sequence
MAHHHHHHMN TSHKTLKTIA ILGQPNVGKS SLFNRLARER IAITSDFAGT TRDINKRKIA LNGHEVELLD TGGMAKDALL SKEIKALNLK AAQMSDLILY VVDGKSIPSD EDIKLFREVF KTNPNCFLVI NKIDNDKEKE RAYAFSSFGM PKSFNISVSH NRGISALIDA VLNALSLNQI IEQDLDADIL ESLENNAPEE ETKEEIIQVG IIGRVNVGKS SLLNALTKKE RSLVSSVAGT TIDPIDETIL IGDQKICFVD TAGIRHRGKI LGIEKYALER TQKALEKSHI ALLVLDVSAP FVELDEKISS LADKHSLGII LILNKWDIRY APYEEIMATL KRKFRFLEYA PVITTSCLKA RHIDEIKHKI IEVYECFSRR IPTSLLNSVI TQATQKHPLP SDGGKLVKVY YATQFATKPP QISLIMNRPK ALHFSYKRYL INTLRKEFNF LGTPLILNAK
Validated NT Sequence
ctttagcgtt aaggattaaa ggcgtgccta aaaaattaaa ttctttcctt aaggtgttga tcaaatagcg tttgtaactg aaatgcaagg ctttagggcg attcatgata agagagattt gagggggttt ggtggcaaat tgcgtggcgt aatacacttt cactaatttc cctccatcgc ttggtaaggg gtgtttttgg gtggcttgag tgatcacgct attgagtagg ctcgtgggaa tacgtctgga aaaacactca tagacttcta tgattttatg cttgatttca tctatatggc gcgcttttaa gcagctggtt gtgatcacag gggcgtattc taaaaagcgg aatttccttt ttagagttgc catgatttct tcataagggg cgtagcggat gtcccatttg tttagaataa gaatgatgcc taaagagtgt ttatccgcta aggagctgat cttttcgtcc aattccacaa aaggagcgct cacgtctaaa actaaaagcg caatgtggga tttttctaag gctttttgcg tgcgttctag tgcgtatttt tcaatgccta aaatcttacc cctatgcctg atgccagcgg tatccacaaa gcaaattttt tgatcgccta tgagaatggt ttcatctatg gggtcaatgg tcgtgccagc cacgctagaa acaaggctcc tttctttttt cgtgagcgcg tttaagagcg agcttttgcc cacattcacc ctcccaatga tgcctacttg aatgatctct tctttagttt cttcttctgg tgcgttattt tctaagcttt ctaaaatatc cgcatccaaa tcttgctcta tgatttggtt taaactcagc gcgttcaata ccgcatcaat caatgcacta atgcctctat tgtgcgaaac ggaaatatta aaactctttg gcatgccaaa agaagaaaac gcataagctc gctctttttc tttatcgtta tcaattttat tgatcactaa aaagcagtta gggttggttt taaaaacctc tctaaaaagc ttgatgtctt catcgctagg gatagacttg ccatccacaa catacaaaat caaatcgctc atttgagcgg cttttaaatt aagggctttg atttctttag acaaaagagc gtctttagcc atgccccctg tatctagcaa ttccacttca tggccattca atgcgatttt tcgtttgtta atgtctcgtg tagtgcctgc aaaatctgaa gtgatagcga tcctttctct agccaggcgg ttaaataacg agcttttccc cacattaggc tggcctaaaa tcgcaatggt ttttaaagtt ttatggcttg tattcatatg gtggtggtgg tggtgagcca t
Expected Protein Sequence
MAHHHHHHMN TSHKTLKTIA ILGQPNVGKS SLFNRLARER IAITSDFAGT TRDINKRKIA LNGHEVELLD TGGMAKDALL SKEIKALNLK AAQMSDLILY VVDGKSIPSD EDIKLFREVF KTNPNCFLVI NKIDNDKEKE RAYAFSSFGM PKSFNISVSH NRGISALIDA VLNALSLNQI IEQDLDADIL ESLENNAPEE ETKEEIIQVG IIGRVNVGKS SLLNALTKKE RSLVSSVAGT TIDPIDETIL IGDQKICFVD TAGIRHRGKI LGIEKYALER TQKALEKSHI ALLVLDVSAP FVELDEKISS LADKHSLGII LILNKWDIRY APYEEIMATL KRKFRFLEYA PVITTSCLKA RHIDEIKHKI IEVYECFSRR IPTSLLNSVI TQATQKHPLP SDGGKLVKVY YATQFATKPP QISLIMNRPK ALHFSYKRYL INTLRKEFNF LGTPLILNAK DKKSAQQN
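The PERCENT IDENTITY and PERCENT COVERAGE fields above compare the validated sequence with the expected one. The record does not define either metric, so the following is only a sketch under two assumptions: coverage is the fraction of the expected sequence spanned by the validated read, and identity is the fraction of matching positions within that span, compared without gaps. The function name is illustrative.

```python
def identity_and_coverage(validated: str, expected: str):
    """Return (percent identity, percent coverage) for an ungapped comparison.

    Assumed definitions, not taken from the record:
      coverage = overlap length / expected length
      identity = matching positions / overlap length
    """
    # Remove block spacing and normalise case.
    v = "".join(validated.split()).upper()
    e = "".join(expected.split()).upper()
    overlap = min(len(v), len(e))
    # zip stops at the shorter sequence, so only the overlap is compared.
    matches = sum(1 for a, b in zip(v, e) if a == b)
    identity = 100.0 * matches / overlap if overlap else 0.0
    coverage = 100.0 * overlap / len(e) if e else 0.0
    return identity, coverage


if __name__ == "__main__":
    # Toy fragments: a truncated read, and a read with ambiguous (X) residues.
    print(identity_and_coverage("MAHHHH", "MAHHHHHH"))
    print(identity_and_coverage("MNTXHX", "MNTSHK"))
```

Under these definitions a read that matches the expected sequence exactly but stops short, for example 460 of 468 residues, would come out at 100% identity and roughly 98% coverage. A read with insertions or deletions relative to the expected sequence would need a gapped alignment instead.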
Full NT Sequence (Expression Vector + Insert)
taatacgact cactataggg agaccacaac ggtttccctc tagaaataat tttgtttaac tttaagaagg agatatacca tggctcacca ccaccaccac catatgaata caagccataa aactttaaaa accattgcga ttttaggcca gcctaatgtg gggaaaagct cgttatttaa ccgcctggct agagaaagga tcgctatcac ttcagatttt gcaggcacta cacgagacat taacaaacga aaaatcgcat tgaatggcca tgaagtggaa ttgctagata cagggggcat ggctaaagac gctcttttgt ctaaagaaat caaagccctt aatttaaaag ccgctcaaat gagcgatttg attttgtatg ttgtggatgg caagtctatc cctagcgatg aagacatcaa gctttttaga gaggttttta aaaccaaccc taactgcttt ttagtgatca ataaaattga taacgataaa gaaaaagagc gagcttatgc gttttcttct tttggcatgc caaagagttt taatatttcc gtttcgcaca atagaggcat tagtgcattg attgatgcgg tattgaacgc gctgagttta aaccaaatca tagagcaaga tttggatgcg gatattttag aaagcttaga aaataacgca ccagaagaag aaactaaaga agagatcatt caagtaggca tcattgggag ggtgaatgtg ggcaaaagct cgctcttaaa cgcgctcacg aaaaaagaaa ggagccttgt ttctagcgtg gctggcacga ccattgaccc catagatgaa accattctca taggcgatca aaaaatttgc tttgtggata ccgctggcat caggcatagg ggtaagattt taggcattga aaaatacgca ctagaacgca cgcaaaaagc cttagaaaaa tcccacattg cgcttttagt tttagacgtg agcgctcctt ttgtggaatt ggacgaaaag atcagctcct tagcggataa acactcttta ggcatcattc ttattctaaa caaatgggac atccgctacg ccccttatga agaaatcatg gcaactctaa aaaggaaatt ccgcttttta gaatacgccc ctgtgatcac aaccagctgc ttaaaagcgc gccatataga tgaaatcaag cataaaatca tagaagtcta tgagtgtttt tccagacgta ttcccacgag cctactcaat agcgtgatca ctcaagccac ccaaaaacac cccttaccaa gcgatggagg gaaattagtg aaagtgtatt acgccacgca atttgccacc aaaccccctc aaatctctct tatcatgaat cgccctaaag ccttgcattt cagttacaaa cgctatttga tcaacacctt aaggaaagaa tttaattttt taggcacgcc tttaatcctt aacgctaaag ataaaaaaag cgcccaacaa aattgagtaa gataggatcc ggctgctaac aaagcccgaa aggaagctga gttggctgct gccaccgctg agcaataact agcataaccc cttggggcct ctaaacgggt cttgaggggt tttttgctga aaggaggaac tatatccgga tatccacagg acgggtgtgg tcgccatgat cgcgtagtcg atagtggctc caagtagcga agcgagcagg actgggcggc ggccaaagcg gtcggacagt gctccgagaa cgggtgcgca tagaaattgc atcaacgcat atagcgctag cagcacgcca tagtgactgg cgatgctgtc 
ggaatggacg atatcccgca agaggcccgg cagtaccggc ataaccaagc ctatgcctac agcatccagg gtgacggtgc cgaggatgac gatgagcgca ttgttagatt tcatacacgg tgcctgactg cgttagcaat ttaactgtga taaactaccg cattaaagct tatcgatgat aagctgtcaa acatgagaat tcttgaagac gaaagggcct cgtgatacgc ctatttttat aggttaatgt catgataata atggtttctt agacgtcagg tggcactttt cggggaaatg tgcgcggaac ccctatttgt ttatttttct aaatacattc aaatatgtat ccgctcatga gacaataacc ctgataaatg cttcaataat attgaaaaag gaagagtatg agtattcaac atttccgtgt cgcccttatt cccttttttg cggcattttg ccttcctgtt tttgctcacc cagaaacgct ggtgaaagta aaagatgctg aagatcagtt gggtgcacga gtgggttaca tcgaactgga tctcaacagc ggtaagatcc ttgagagttt tcgccccgaa gaacgttttc caatgatgag cacttttaaa gttctgctat gtggcgcggt attatcccgt gttgacgccg ggcaagagca actcggtcgc cgcatacact attctcagaa tgacttggtt gagtactcac cagtcacaga aaagcatctt acggatggca tgacagtaag agaattatgc agtgctgcca taaccatgag tgataacact gcggccaact tacttctgac aacgatcgga ggaccgaagg agctaaccgc ttttttgcac aacatggggg atcatgtaac tcgccttgat cgttgggaac cggagctgaa tgaagccata ccaaacgacg agcgtgacac cacgatgcct gcagcaatgg caacaacgtt gcgcaaacta ttaactggcg aactacttac tctagcttcc cggcaacaat taatagactg gatggaggcg gataaagttg caggaccact tctgcgctcg gcccttccgg ctggctggtt tattgctgat aaatctggag ccggtgagcg tgggtctcgc ggtatcattg cagcactggg gccagatggt aagccctccc gtatcgtagt tatctacacg acggggagtc aggcaactat ggatgaacga aatagacaga tcgctgagat aggtgcctca ctgattaagc attggtaact gtcagaccaa gtttactcat atatacttta gattgattta aaacttcatt tttaatttaa aaggatctag gtgaagatcc tttttgataa tctcatgacc aaaatccctt aacgtgagtt ttcgttccac tgagcgtcag accccgtaga aaagatcaaa ggatcttctt gagatccttt ttttctgcgc gtaatctgct gcttgcaaac aaaaaaacca ccgctaccag cggtggtttg tttgccggat caagagctac caactctttt tccgaaggta actggcttca gcagagcgca gataccaaat actgtccttc tagtgtagcc gtagttaggc caccacttca agaactctgt agcaccgcct acatacctcg ctctgctaat cctgttacca gtggctgctg ccagtggcga taagtcgtgt cttaccgggt tggactcaag acgatagtta ccggataagg cgcagcggtc gggctgaacg gggggttcgt gcacacagcc cagcttggag cgaacgacct acaccgaact 
gagataccta cagcgtgagc tatgagaaag cgccacgctt cccgaaggga gaaaggcgga caggtatccg gtaagcggca gggtcggaac aggagagcgc acgagggagc ttccaggggg aaacgcctgg tatctttata gtcctgtcgg gtttcgccac ctctgacttg agcgtcgatt tttgtgatgc tcgtcagggg ggcggagcct atggaaaaac gccagcaacg cggccttttt acggttcctg gccttttgct ggccttttgc tcacatgttc tttcctgcgt tatcccctga ttctgtggat aaccgtatta ccgcctttga gtgagctgat accgctcgcc gcagccgaac gaccgagcgc agcgagtcag tgagcgagga agcggaagag cgcctgatgc ggtattttct ccttacgcat ctgtgcggta tttcacaccg catatatggt gcactctcag tacaatctgc tctgatgccg catagttaag ccagtataca ctccgctatc gctacgtgac tgggtcatgg ctgcgccccg acacccgcca acacccgctg acgcgccctg acgggcttgt ctgctcccgg catccgctta cagacaagct gtgaccgtct ccgggagctg catgtgtcag aggttttcac cgtcatcacc gaaacgcgcg aggcagctgc ggtaaagctc atcagcgtgg tcgtgaagcg attcacagat gtctgcctgt tcatccgcgt ccagctcgtt gagtttctcc agaagcgtta atgtctggct tctgataaag cgggccatgt taagggcggt tttttcctgt ttggtcactg atgcctccgt gtaaggggga tttctgttca tgggggtaat gataccgatg aaacgagaga ggatgctcac gatacgggtt actgatgatg aacatgcccg gttactggaa cgttgtgagg gtaaacaact ggcggtatgg atgcggcggg accagagaaa aatcactcag ggtcaatgcc agcgcttcgt taatacagat gtaggtgttc cacagggtag ccagcagcat cctgcgatgc agatccggaa cataatggtg cagggcgctg acttccgcgt ttccagactt tacgaaacac ggaaaccgaa gaccattcat gttgttgctc aggtcgcaga cgttttgcag cagcagtcgc ttcacgttcg ctcgcgtatc ggtgattcat tctgctaacc agtaaggcaa ccccgccagc ctagccgggt cctcaacgac aggagcacga tcatgcgcac ccgtggccag gacccaacgc tgcccgagat gcgccgcgtg cggctgctgg agatggcgga cgcgatggat atgttctgcc aagggttggt ttgcgcattc acagttctcc gcaagaattg attggctcca attcttggag tggtgaatcc gttagcgagg tgccgccggc ttccattcag gtcgaggtgg cccggctcca tgcaccgcga cgcaacgcgg ggaggcagac aaggtatagg gcggcgccta caatccatgc caacccgttc catgtgctcg ccgaggcggc ataaatcgcc gtgacgatca gcggtccagt gatcgaagtt aggctggtaa gagccgcgag cgatccttga agctgtccct gatggtcgtc atctacctgc ctggacagca tggcctgcaa cgcgggcatc ccgatgccgc cggaagcgag aagaatcata atggggaagg ccatccagcc tcgcgtcgcg aacgccagca agacgtagcc cagcgcgtcg 
gccgccatgc cggcgataat ggcctgcttc tcgccgaaac gtttggtggc gggaccagtg acgaaggctt gagcgagggc gtgcaagatt ccgaataccg caagcgacag gccgatcatc gtcgcgctcc agcgaaagcg gtcctcgccg aaaatgaccc agagcgctgc cggcacctgt cctacgagtt gcatgataaa gaagacagtc ataagtgcgg cgacgatagt catgccccgc gcccaccgga aggagctgac tgggttgaag gctctcaagg gcatcggtcg acgctctccc ttatgcgact cctgcattag gaagcagccc agtagtaggt tgaggccgtt gagcaccgcc gccgcaagga atggtgcatg caaggagatg gcgcccaaca gtcccccggc cacggggcct gccaccatac ccacgccgaa acaagcgctc atgagcccga agtggcgagc ccgatcttcc ccatcggtga tgtcggcgat ataggcgcca gcaaccgcac ctgtggcgcc ggtgatgccg gccacgatgc gtccggcgta gaggatcgag atctcgatcc cgcgaaat
Details for HepyC.00852.a.B1.PS38262
PURIFICATION DATE: 4/18/2017
CONCENTRATION: 31.27 mg/ml
OBSERVED MW: 55 kDa
EXPRESSION LEVEL: Moderate Expression
PROTEIN PURIFICATION BUFFER: SEC: 20 mM HEPES pH 7.0, 300 mM NaCl, 5% glycerol, 1 mM TCEP
EXPRESSION HOST: BL 21 (DE3) Rosetta
VIAL COUNT (approx.): 4
VIAL VOLUME: 200 µl
PERCENT IDENTITY: 100
PERCENT COVERAGE: 98
Protocol Notes
notes unavailable
Validated AA Sequence
MAHHHHHHMN TSHKTLKTIA ILGQPNVGKS SLFNRLARER IAITSDFAGT TRDINKRKIA LNGHEVELLD TGGMAKDALL SKEIKALNLK AAQMSDLILY VVDGKSIPSD EDIKLFREVF KTNPNCFLVI NKIDNDKEKE RAYAFSSFGM PKSFNISVSH NRGISALIDA VLNALSLNQI IEQDLDADIL ESLENNAPEE ETKEEIIQVG IIGRVNVGKS SLLNALTKKE RSLVSSVAGT TIDPIDETIL IGDQKICFVD TAGIRHRGKI LGIEKYALER TQKALEKSHI ALLVLDVSAP FVELDEKISS LADKHSLGII LILNKWDIRY APYEEIMATL KRKFRFLEYA PVITTSCLKA RHIDEIKHKI IEVYECFSRR IPTSLLNSVI TQATQKHPLP SDGGKLVKVY YATQFATKPP QISLIMNRPK ALHFSYKRYL INTLRKEFNF LGTPLILNAK
Validated NT Sequence
ctttagcgtt aaggattaaa ggcgtgccta aaaaattaaa ttctttcctt aaggtgttga tcaaatagcg tttgtaactg aaatgcaagg ctttagggcg attcatgata agagagattt gagggggttt ggtggcaaat tgcgtggcgt aatacacttt cactaatttc cctccatcgc ttggtaaggg gtgtttttgg gtggcttgag tgatcacgct attgagtagg ctcgtgggaa tacgtctgga aaaacactca tagacttcta tgattttatg cttgatttca tctatatggc gcgcttttaa gcagctggtt gtgatcacag gggcgtattc taaaaagcgg aatttccttt ttagagttgc catgatttct tcataagggg cgtagcggat gtcccatttg tttagaataa gaatgatgcc taaagagtgt ttatccgcta aggagctgat cttttcgtcc aattccacaa aaggagcgct cacgtctaaa actaaaagcg caatgtggga tttttctaag gctttttgcg tgcgttctag tgcgtatttt tcaatgccta aaatcttacc cctatgcctg atgccagcgg tatccacaaa gcaaattttt tgatcgccta tgagaatggt ttcatctatg gggtcaatgg tcgtgccagc cacgctagaa acaaggctcc tttctttttt cgtgagcgcg tttaagagcg agcttttgcc cacattcacc ctcccaatga tgcctacttg aatgatctct tctttagttt cttcttctgg tgcgttattt tctaagcttt ctaaaatatc cgcatccaaa tcttgctcta tgatttggtt taaactcagc gcgttcaata ccgcatcaat caatgcacta atgcctctat tgtgcgaaac ggaaatatta aaactctttg gcatgccaaa agaagaaaac gcataagctc gctctttttc tttatcgtta tcaattttat tgatcactaa aaagcagtta gggttggttt taaaaacctc tctaaaaagc ttgatgtctt catcgctagg gatagacttg ccatccacaa catacaaaat caaatcgctc atttgagcgg cttttaaatt aagggctttg atttctttag acaaaagagc gtctttagcc atgccccctg tatctagcaa ttccacttca tggccattca atgcgatttt tcgtttgtta atgtctcgtg tagtgcctgc aaaatctgaa gtgatagcga tcctttctct agccaggcgg ttaaataacg agcttttccc cacattaggc tggcctaaaa tcgcaatggt ttttaaagtt ttatggcttg tattcatatg gtggtggtgg tggtgagcca t
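The validated NT sequence above appears to be reported on the strand opposite to the coding strand: the reverse complement of its last blocks matches the start of the tagged open reading frame in the full vector sequence. A minimal sketch for converting such a read back to the coding orientation follows; the function name is illustrative, and ambiguous n calls are kept as n.

```python
# Complement map for unambiguous bases plus n, in both cases.
COMPLEMENT = str.maketrans("acgtnACGTN", "tgcanTGCAN")


def reverse_complement(nt: str) -> str:
    """Return the reverse complement of a nucleotide string, preserving case."""
    # Remove the ten-base block spacing before processing.
    seq = "".join(nt.split())
    return seq.translate(COMPLEMENT)[::-1]


if __name__ == "__main__":
    # Last four blocks of the validated read above.
    print(reverse_complement("tattcatatg gtggtggtgg tggtgagcca t"))
```

For the last four blocks of the read, the result is atggctcaccaccaccaccaccatatgaata, which begins with the codons for the MAHHHHHHMN start of the expected protein sequence.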
Expressed Protein Sequence
MAHHHHHHMN TSHKTLKTIA ILGQPNVGKS SLFNRLARER IAITSDFAGT TRDINKRKIA LNGHEVELLD TGGMAKDALL SKEIKALNLK AAQMSDLILY VVDGKSIPSD EDIKLFREVF KTNPNCFLVI NKIDNDKEKE RAYAFSSFGM PKSFNISVSH NRGISALIDA VLNALSLNQI IEQDLDADIL ESLENNAPEE ETKEEIIQVG IIGRVNVGKS SLLNALTKKE RSLVSSVAGT TIDPIDETIL IGDQKICFVD TAGIRHRGKI LGIEKYALER TQKALEKSHI ALLVLDVSAP FVELDEKISS LADKHSLGII LILNKWDIRY APYEEIMATL KRKFRFLEYA PVITTSCLKA RHIDEIKHKI IEVYECFSRR IPTSLLNSVI TQATQKHPLP SDGGKLVKVY YATQFATKPP QISLIMNRPK ALHFSYKRYL INTLRKEFNF LGTPLILNAK DKKSAQQN
Full NT Sequence (Expression Vector + Insert)
TAATACGACT CACTATAGGG AGACCACAAC GGTTTCCCTC TAGAAATAAT TTTGTTTAAC TTTAAGAAGG AGATATACCA TGGCTCACCA CCACCACCAC CATATGAATA CAAGCCATAA AACTTTAAAA ACCATTGCGA TTTTAGGCCA GCCTAATGTG GGGAAAAGCT CGTTATTTAA CCGCCTGGCT AGAGAAAGGA TCGCTATCAC TTCAGATTTT GCAGGCACTA CACGAGACAT TAACAAACGA AAAATCGCAT TGAATGGCCA TGAAGTGGAA TTGCTAGATA CAGGGGGCAT GGCTAAAGAC GCTCTTTTGT CTAAAGAAAT CAAAGCCCTT AATTTAAAAG CCGCTCAAAT GAGCGATTTG ATTTTGTATG TTGTGGATGG CAAGTCTATC CCTAGCGATG AAGACATCAA GCTTTTTAGA GAGGTTTTTA AAACCAACCC TAACTGCTTT TTAGTGATCA ATAAAATTGA TAACGATAAA GAAAAAGAGC GAGCTTATGC GTTTTCTTCT TTTGGCATGC CAAAGAGTTT TAATATTTCC GTTTCGCACA ATAGAGGCAT TAGTGCATTG ATTGATGCGG TATTGAACGC GCTGAGTTTA AACCAAATCA TAGAGCAAGA TTTGGATGCG GATATTTTAG AAAGCTTAGA AAATAACGCA CCAGAAGAAG AAACTAAAGA AGAGATCATT CAAGTAGGCA TCATTGGGAG GGTGAATGTG GGCAAAAGCT CGCTCTTAAA CGCGCTCACG AAAAAAGAAA GGAGCCTTGT TTCTAGCGTG GCTGGCACGA CCATTGACCC CATAGATGAA ACCATTCTCA TAGGCGATCA AAAAATTTGC TTTGTGGATA CCGCTGGCAT CAGGCATAGG GGTAAGATTT TAGGCATTGA AAAATACGCA CTAGAACGCA CGCAAAAAGC CTTAGAAAAA TCCCACATTG CGCTTTTAGT TTTAGACGTG AGCGCTCCTT TTGTGGAATT GGACGAAAAG ATCAGCTCCT TAGCGGATAA ACACTCTTTA GGCATCATTC TTATTCTAAA CAAATGGGAC ATCCGCTACG CCCCTTATGA AGAAATCATG GCAACTCTAA AAAGGAAATT CCGCTTTTTA GAATACGCCC CTGTGATCAC AACCAGCTGC TTAAAAGCGC GCCATATAGA TGAAATCAAG CATAAAATCA TAGAAGTCTA TGAGTGTTTT TCCAGACGTA TTCCCACGAG CCTACTCAAT AGCGTGATCA CTCAAGCCAC CCAAAAACAC CCCTTACCAA GCGATGGAGG GAAATTAGTG AAAGTGTATT ACGCCACGCA ATTTGCCACC AAACCCCCTC AAATCTCTCT TATCATGAAT CGCCCTAAAG CCTTGCATTT CAGTTACAAA CGCTATTTGA TCAACACCTT AAGGAAAGAA TTTAATTTTT TAGGCACGCC TTTAATCCTT AACGCTAAAG ATAAAAAAAG CGCCCAACAA AATTGAGTAA GATAGGATCC GGCTGCTAAC AAAGCCCGAA AGGAAGCTGA GTTGGCTGCT GCCACCGCTG AGCAATAACT AGCATAACCC CTTGGGGCCT CTAAACGGGT CTTGAGGGGT TTTTTGCTGA AAGGAGGAAC TATATCCGGA TATCCACAGG ACGGGTGTGG TCGCCATGAT CGCGTAGTCG ATAGTGGCTC CAAGTAGCGA AGCGAGCAGG ACTGGGCGGC GGCCAAAGCG GTCGGACAGT GCTCCGAGAA CGGGTGCGCA TAGAAATTGC ATCAACGCAT ATAGCGCTAG CAGCACGCCA TAGTGACTGG CGATGCTGTC 
GGAATGGACG ATATCCCGCA AGAGGCCCGG CAGTACCGGC ATAACCAAGC CTATGCCTAC AGCATCCAGG GTGACGGTGC CGAGGATGAC GATGAGCGCA TTGTTAGATT TCATACACGG TGCCTGACTG CGTTAGCAAT TTAACTGTGA TAAACTACCG CATTAAAGCT TATCGATGAT AAGCTGTCAA ACATGAGAAT TCTTGAAGAC GAAAGGGCCT CGTGATACGC CTATTTTTAT AGGTTAATGT CATGATAATA ATGGTTTCTT AGACGTCAGG TGGCACTTTT CGGGGAAATG TGCGCGGAAC CCCTATTTGT TTATTTTTCT AAATACATTC AAATATGTAT CCGCTCATGA GACAATAACC CTGATAAATG CTTCAATAAT ATTGAAAAAG GAAGAGTATG AGTATTCAAC ATTTCCGTGT CGCCCTTATT CCCTTTTTTG CGGCATTTTG CCTTCCTGTT TTTGCTCACC CAGAAACGCT GGTGAAAGTA AAAGATGCTG AAGATCAGTT GGGTGCACGA GTGGGTTACA TCGAACTGGA TCTCAACAGC GGTAAGATCC TTGAGAGTTT TCGCCCCGAA GAACGTTTTC CAATGATGAG CACTTTTAAA GTTCTGCTAT GTGGCGCGGT ATTATCCCGT GTTGACGCCG GGCAAGAGCA ACTCGGTCGC CGCATACACT ATTCTCAGAA TGACTTGGTT GAGTACTCAC CAGTCACAGA AAAGCATCTT ACGGATGGCA TGACAGTAAG AGAATTATGC AGTGCTGCCA TAACCATGAG TGATAACACT GCGGCCAACT TACTTCTGAC AACGATCGGA GGACCGAAGG AGCTAACCGC TTTTTTGCAC AACATGGGGG ATCATGTAAC TCGCCTTGAT CGTTGGGAAC CGGAGCTGAA TGAAGCCATA CCAAACGACG AGCGTGACAC CACGATGCCT GCAGCAATGG CAACAACGTT GCGCAAACTA TTAACTGGCG AACTACTTAC TCTAGCTTCC CGGCAACAAT TAATAGACTG GATGGAGGCG GATAAAGTTG CAGGACCACT TCTGCGCTCG GCCCTTCCGG CTGGCTGGTT TATTGCTGAT AAATCTGGAG CCGGTGAGCG TGGGTCTCGC GGTATCATTG CAGCACTGGG GCCAGATGGT AAGCCCTCCC GTATCGTAGT TATCTACACG ACGGGGAGTC AGGCAACTAT GGATGAACGA AATAGACAGA TCGCTGAGAT AGGTGCCTCA CTGATTAAGC ATTGGTAACT GTCAGACCAA GTTTACTCAT ATATACTTTA GATTGATTTA AAACTTCATT TTTAATTTAA AAGGATCTAG GTGAAGATCC TTTTTGATAA TCTCATGACC AAAATCCCTT AACGTGAGTT TTCGTTCCAC TGAGCGTCAG ACCCCGTAGA AAAGATCAAA GGATCTTCTT GAGATCCTTT TTTTCTGCGC GTAATCTGCT GCTTGCAAAC AAAAAAACCA CCGCTACCAG CGGTGGTTTG TTTGCCGGAT CAAGAGCTAC CAACTCTTTT TCCGAAGGTA ACTGGCTTCA GCAGAGCGCA GATACCAAAT ACTGTCCTTC TAGTGTAGCC GTAGTTAGGC CACCACTTCA AGAACTCTGT AGCACCGCCT ACATACCTCG CTCTGCTAAT CCTGTTACCA GTGGCTGCTG CCAGTGGCGA TAAGTCGTGT CTTACCGGGT TGGACTCAAG ACGATAGTTA CCGGATAAGG CGCAGCGGTC GGGCTGAACG GGGGGTTCGT GCACACAGCC CAGCTTGGAG CGAACGACCT ACACCGAACT 
GAGATACCTA CAGCGTGAGC TATGAGAAAG CGCCACGCTT CCCGAAGGGA GAAAGGCGGA CAGGTATCCG GTAAGCGGCA GGGTCGGAAC AGGAGAGCGC ACGAGGGAGC TTCCAGGGGG AAACGCCTGG TATCTTTATA GTCCTGTCGG GTTTCGCCAC CTCTGACTTG AGCGTCGATT TTTGTGATGC TCGTCAGGGG GGCGGAGCCT ATGGAAAAAC GCCAGCAACG CGGCCTTTTT ACGGTTCCTG GCCTTTTGCT GGCCTTTTGC TCACATGTTC TTTCCTGCGT TATCCCCTGA TTCTGTGGAT AACCGTATTA CCGCCTTTGA GTGAGCTGAT ACCGCTCGCC GCAGCCGAAC GACCGAGCGC AGCGAGTCAG TGAGCGAGGA AGCGGAAGAG CGCCTGATGC GGTATTTTCT CCTTACGCAT CTGTGCGGTA TTTCACACCG CATATATGGT GCACTCTCAG TACAATCTGC TCTGATGCCG CATAGTTAAG CCAGTATACA CTCCGCTATC GCTACGTGAC TGGGTCATGG CTGCGCCCCG ACACCCGCCA ACACCCGCTG ACGCGCCCTG ACGGGCTTGT CTGCTCCCGG CATCCGCTTA CAGACAAGCT GTGACCGTCT CCGGGAGCTG CATGTGTCAG AGGTTTTCAC CGTCATCACC GAAACGCGCG AGGCAGCTGC GGTAAAGCTC ATCAGCGTGG TCGTGAAGCG ATTCACAGAT GTCTGCCTGT TCATCCGCGT CCAGCTCGTT GAGTTTCTCC AGAAGCGTTA ATGTCTGGCT TCTGATAAAG CGGGCCATGT TAAGGGCGGT TTTTTCCTGT TTGGTCACTG ATGCCTCCGT GTAAGGGGGA TTTCTGTTCA TGGGGGTAAT GATACCGATG AAACGAGAGA GGATGCTCAC GATACGGGTT ACTGATGATG AACATGCCCG GTTACTGGAA CGTTGTGAGG GTAAACAACT GGCGGTATGG ATGCGGCGGG ACCAGAGAAA AATCACTCAG GGTCAATGCC AGCGCTTCGT TAATACAGAT GTAGGTGTTC CACAGGGTAG CCAGCAGCAT CCTGCGATGC AGATCCGGAA CATAATGGTG CAGGGCGCTG ACTTCCGCGT TTCCAGACTT TACGAAACAC GGAAACCGAA GACCATTCAT GTTGTTGCTC AGGTCGCAGA CGTTTTGCAG CAGCAGTCGC TTCACGTTCG CTCGCGTATC GGTGATTCAT TCTGCTAACC AGTAAGGCAA CCCCGCCAGC CTAGCCGGGT CCTCAACGAC AGGAGCACGA TCATGCGCAC CCGTGGCCAG GACCCAACGC TGCCCGAGAT GCGCCGCGTG CGGCTGCTGG AGATGGCGGA CGCGATGGAT ATGTTCTGCC AAGGGTTGGT TTGCGCATTC ACAGTTCTCC GCAAGAATTG ATTGGCTCCA ATTCTTGGAG TGGTGAATCC GTTAGCGAGG TGCCGCCGGC TTCCATTCAG GTCGAGGTGG CCCGGCTCCA TGCACCGCGA CGCAACGCGG GGAGGCAGAC AAGGTATAGG GCGGCGCCTA CAATCCATGC CAACCCGTTC CATGTGCTCG CCGAGGCGGC ATAAATCGCC GTGACGATCA GCGGTCCAGT GATCGAAGTT AGGCTGGTAA GAGCCGCGAG CGATCCTTGA AGCTGTCCCT GATGGTCGTC ATCTACCTGC CTGGACAGCA TGGCCTGCAA CGCGGGCATC CCGATGCCGC CGGAAGCGAG AAGAATCATA ATGGGGAAGG CCATCCAGCC TCGCGTCGCG AACGCCAGCA AGACGTAGCC CAGCGCGTCG 
GCCGCCATGC CGGCGATAAT GGCCTGCTTC TCGCCGAAAC GTTTGGTGGC GGGACCAGTG ACGAAGGCTT GAGCGAGGGC GTGCAAGATT CCGAATACCG CAAGCGACAG GCCGATCATC GTCGCGCTCC AGCGAAAGCG GTCCTCGCCG AAAATGACCC AGAGCGCTGC CGGCACCTGT CCTACGAGTT GCATGATAAA GAAGACAGTC ATAAGTGCGG CGACGATAGT CATGCCCCGC GCCCACCGGA AGGAGCTGAC TGGGTTGAAG GCTCTCAAGG GCATCGGTCG ACGCTCTCCC TTATGCGACT CCTGCATTAG GAAGCAGCCC AGTAGTAGGT TGAGGCCGTT GAGCACCGCC GCCGCAAGGA ATGGTGCATG CAAGGAGATG GCGCCCAACA GTCCCCCGGC CACGGGGCCT GCCACCATAC CCACGCCGAA ACAAGCGCTC ATGAGCCCGA AGTGGCGAGC CCGATCTTCC CCATCGGTGA TGTCGGCGAT ATAGGCGCCA GCAACCGCAC CTGTGGCGCC GGTGATGCCG GCCACGATGC GTCCGGCGTA GAGGATCGAG ATCTCGATCC CGCGAAAT
Details for HepyC.00852.a.AE1.PS38657
PURIFICATION DATE: 2/9/2021
CONCENTRATION: 20.9 mg/ml
OBSERVED MW: 53 kDa
EXPRESSION LEVEL: High Expression
PROTEIN PURIFICATION BUFFER: SEC: 20 mM HEPES pH 7.0, 300 mM NaCl, 5% glycerol, 1 mM TCEP
EXPRESSION HOST: BL21(DE3) Rosetta
VIAL COUNT (approx.): 4
VIAL VOLUME: 200 µl
PERCENT IDENTITY: 90
PERCENT COVERAGE: 100
Protocol Notes
notes unavailable
Validated AA Sequence
MNTXHXTLKT IXXLGQPXXG KSSLFNRLAR ERIAITSDFA GTTRDINKRK IALNGHEVEL LDTGGMAKDA LLXKEIXALN LKAAQMSDLI LYVXDGKSIP SDEDIXLFRE VFKTNPNCFL VINKIDNDKE KERAYAFSSF GMPKSFNISX SHXXGIXALI DAVLNALSLN XIIXQXLDAD ILESLXNNAP EXXXXXEIIQ VGIIGRXNVG KSSLLNALTK KERSLXSSVA GTTIDPIDET ILIGDQKICF VDTAGIRHRG KILGIEKYAL ERTQKXLEXS HIALLVLXVS APFVEXDEKI XSLADKXXLG IIXIXNKWDI RYAPYEEIMA XXKRKFRFLE YAPVITTXCL KARHIDEIKH KIIEVYECFS RRIPTSLLNX VITQATQKHP XXSDGGKLVK VYYATQFATK PPQIXXIMNR XKALHFSYKR YLINTLRKEF NFLGTPLILN AKDKKSAQQN XGHHHHHH
Validated NT Sequence
tccgttaatg atgatgatgg tggtgcccan cattttgttg ggcgcttttt ttatctttag cgttaaggat taaaggcgtg cctaaaaaat taaattcttt ccttaaggtg ttgatcaaat ancgtttgta actgaaatgc aaggctttan ggcgattcat gataananag atttgagggg gtttggtggc aaattgcgtg gcgtaataca ctttcactaa tttccctcca tcgcttgnna angggtgttt ttgggtggct tgagtgatca cnctattgag taggctcgtg ggaatacgtc tggaaaaaca ctcatanact tctatgattt tatgcttgat ttcatctata tggcgcgctt ttaagcanct ggttgtgatc acaggggcgt attctaaaaa gcggaatttc ctttttanan ttgccatgat ttcttcataa ggggcgtanc ggatgtccca tttgtttana ataanaatga tgcctaaana ntgtttatcc gctaanganc tgatcttttc gtccnattcc acaaaaggag cgctcacgnc taaaactaaa agcgcaatgt ggganttttc taagnctttt tgcgtgcgtt ctagtgcgta tttttcaatg cctaaaatct tacccctatg cctgatgcca gcggtatcca caaagcaaat tttttgatcg cctatgagaa tggtttcatc tatggggtca atggtcgtgc cagccacgct agaancaagg ctcctttctt ttttcgtgag cgcgtttaag agcgagcttt tgcccacatt cnccctccca atgatgccta cttgaatgat ctctnnttna gnttnttntt cnggngcgtt attttntaag ctttctaaaa tatccgcatc caantcttgc tntatgattn ggtttaaact cagcgcgttc aataccgcat caatcaatgc antaatgcct ntatngtgng aaanggaaat attaaaactc tttggcatgc caaaagaaga aaacgcataa gctcgctctt tttctttatc gttatcaatt ttattgatca ctaaaaagca gttagggttg gttttaaaaa cctctctaaa aagcnngatg tcttcatcgc tagggataga cttgccatcc ncaacataca aaatcaaatc gctcatttga gcggctttta aattaagggc tnngatttct ttagncaaaa gagcgtcttt agccatgccc cctgtatcta gcaattccac ttcatggcca ttcaatgcga tttttcgttt gttaatgtct cgtgtagtgc ctgcaaaatc tgaagtgata gcgatccttt ctctagccag gcggttaaat aacgagcttt tccccncant aggctggcct aanntnncaa tggtttttaa agtnntatgg ntngtattca tgggagaga
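The validated read above appears to be reported on the strand opposite the coding sequence: its first bases (tccgttaatg atgatgatgg tggtgccc) are the reverse complement of the GGGCACCACCATCATCATCATTAA tag-and-stop region that ends the insert in the full vector sequence. A minimal Python sketch for reorienting such a read before comparing it with the insert follows. The function names are illustrative assumptions, not part of any SSGCID tooling, and the ambiguity code n is simply carried through as n.

```python
# Complement table for the four bases plus the ambiguity code "n" (both cases).
_COMPLEMENT = str.maketrans("acgtnACGTN", "tgcanTGCAN")


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split())


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a grouped or ungrouped DNA string."""
    return clean(seq).translate(_COMPLEMENT)[::-1]


# First 28 bases of the validated read, copied from the record above.
read_start = "tccgttaatg atgatgatgg tggtgccc"
print(reverse_complement(read_start).upper())
# prints GGGCACCACCATCATCATCATTAACGGA
```

Read in frame, the reoriented bases are GGG CAC CAC CAT CAT CAT CAT TAA, which corresponds to the GHHHHHH tag followed by a stop codon.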
Expressed Protein Sequence
MNTSHKTLKT IAILGQPNVG KSSLFNRLAR ERIAITSDFA GTTRDINKRK IALNGHEVEL LDTGGMAKDA LLSKEIKALN LKAAQMSDLI LYVVDGKSIP SDEDIKLFRE VFKTNPNCFL VINKIDNDKE KERAYAFSSF GMPKSFNISV SHNRGISALI DAVLNALSLN QIIEQDLDAD ILESLENNAP EEETKEEIIQ VGIIGRVNVG KSSLLNALTK KERSLVSSVA GTTIDPIDET ILIGDQKICF VDTAGIRHRG KILGIEKYAL ERTQKALEKS HIALLVLDVS APFVELDEKI SSLADKHSLG IILILNKWDI RYAPYEEIMA TLKRKFRFLE YAPVITTSCL KARHIDEIKH KIIEVYECFS RRIPTSLLNS VITQATQKHP LPSDGGKLVK VYYATQFATK PPQISLIMNR PKALHFSYKR YLINTLRKEF NFLGTPLILN AKDKKSAQQN GHHHHHH
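In the validated AA sequence, X marks residues that could not be called from the sequencing read. The sketch below shows one way to tally matches, ambiguous positions, and mismatches between a validated fragment and the corresponding expressed fragment. It is an illustration only: how the PERCENT IDENTITY field is computed is not stated in this record, the function names are hypothetical, and the comparison is purely position by position. The full validated sequence listed here carries one extra X before the tag, so a whole-sequence comparison would need an alignment step first.

```python
def strip_groups(seq: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of ten."""
    return "".join(seq.split())


def compare_residues(validated: str, expected: str) -> dict:
    """Count matching, ambiguous (X), and mismatching positions.

    Both inputs must have the same length after whitespace removal;
    no alignment is attempted.
    """
    v = strip_groups(validated).upper()
    e = strip_groups(expected).upper()
    if len(v) != len(e):
        raise ValueError("sequences differ in length; align them first")
    counts = {"match": 0, "ambiguous": 0, "mismatch": 0}
    for a, b in zip(v, e):
        if a == "X":
            counts["ambiguous"] += 1   # uncalled residue in the validated read
        elif a == b:
            counts["match"] += 1
        else:
            counts["mismatch"] += 1
    return counts


# First 20 residues of the validated and expressed sequences from this record.
result = compare_residues("MNTXHXTLKT IXXLGQPXXG", "MNTSHKTLKT IAILGQPNVG")
print(result)
# prints {'match': 14, 'ambiguous': 6, 'mismatch': 0}
```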
Full NT Sequence (Expression Vector + Insert)
TGGCGAATGG GACGCGCCCT GTAGCGGCGC ATTAAGCGCG GCGGGTGTGG TGGTTACGCG CAGCGTGACC GCTACACTTG CCAGCGCCCT AGCGCCCGCT CCTTTCGCTT TCTTCCCTTC CTTTCTCGCC ACGTTCGCCG GCTTTCCCCG TCAAGCTCTA AATCGGGGGC TCCCTTTAGG GTTCCGATTT AGTGCTTTAC GGCACCTCGA CCCCAAAAAA CTTGATTAGG GTGATGGTTC ACATTAACGC TTACAATTTA GGTGGCACTT TTCGGGGAAA TGTGCGCGGA ACCCCTATTT GTTTATTTTT CTAAATACAT TCAAATATGT ATCCGCTCAT GAGACAATAA CCCTGATAAA TGCTTCAATA ATTTGAAAAA GGAAGAGTAT GAGTATTCAA CATTTCCGTG TCGCCCTTAT TCCCTTTTTT GCGGCATTTT GCCTTCCTGT TTTTGCTCAC CCAGAAACGC TGGTGAAAGT AAAAGATGCT GAAGATCAGT TGGGTGCACG AGTGGGTTAC ATCGAACTGG ATCTCAACAG CGGTAAGATC CTTGAGAGTT TTCGCCCCGA AGAACGTTTT CCAATGATGA GCACTTTTAA AGTTCTGCTA TGTGGCGCGG TATTATCCCG TATTGACGCC GGGCAAGAGC AACTCGGTCG CCGCATACAC TATTCTCAGA ATGACTTGGT TGAGTACTCA CCAGTCACAG AAAAGCATCT TACGGATGGC ATGACAGTAA GAGAATTATG CAGTGCTGCC ATAACCATGA GTGATAACAC TGCGGCCAAC TTACTTCTGA CAACGATCGG AGGACCGAAG GAGCTAACCG CTTTTTTGCA CAACATGGGG GATCATGTAA CTCGCCTTGA TCGTTGGGAA CCGGAGCTGA ATGAAGCCAT ACCAAACGAC GAGCGTGACA CCACGATGCC TGCAGCAATG GCAACAACGT TGCGCAAACT ATTAACTGGC GAACTACTTA CTCTAGCTTC CCGGCAACAA TTAATAGACT GGATGGAGGC GGATAAAGTT GCAGGACCAC TTCTGCGCTC GGCCCTTCCG GCTGGCTGGT TTATTGCTGA TAAATCTGGA GCCGGTGAGC GTGGGTCTCG CGGTATCATT GCAGCACTGG GGCCAGATGG TAAGCCCTCC CGTATCGTAG TTATCTACAC GACGGGGAGT CAGGCAACTA TGGATGAACG AAATAGACAG ATCGCTGAGA TAGGTGCCTC ACTGATTAAG CATTGGTAAC TGTCAGACCA AGTTTACTCA TATATACTTT AGATTGATTT AAAACTTCAT TTTTAATTTA AAAGGATCTA GGTGAAGATC CTTTTTGATA ATCTCATGAC CAAAATCCCT TAACGTGAGT TTTCGTTCCA CTGAGCGTCA GACCCCGTAG AAAAGATCAA AGGATCTTCT TGAGATCCTT TTTTTCTGCG CGTAATCTGC TGCTTGCAAA CAAAAAAACC ACCGCTACCA GCGGTGGTTT GTTTGCCGGA TCAAGAGCTA CCAACTCTTT TTCCGAAGGT AACTGGCTTC AGCAGAGCGC AGATACCAAA TACTGTCCTT CTAGTGTAGC CGTAGTTAGG CCACCACTTC AAGAACTCTG TAGCACCGCC TACATACCTC GCTCTGCTAA TCCTGTTACC AGTGGCTGCT GCCAGTGGCG ATAAGTCGTG TCTTACCGGG TTGGACTCAA GACGATAGTT ACCGGATAAG GCGCAGCGGT CGGGCTGAAC GGGGGGTTCG TGCACACAGC CCAGCTTGGA GCGAACGACC TACACCGAAC TGAGATACCT ACAGCGTGAG 
CTATGAGAAA GCGCCACGCT TCCCGAAGGG AGAAAGGCGG ACAGGTATCC GGTAAGCGGC AGGGTCGGAA CAGGAGAGCG CACGAGGGAG CTTCCAGGGG GAAACGCCTG GTATCTTTAT AGTCCTGTCG GGTTTCGCCA CCTCTGACTT GAGCGTCGAT TTTTGTGATG CTCGTCAGGG GGGCGGAGCC TATGGAAAAA CGCCAGCAAC GCGGCCTTTT TACGGTTCCT GGCCTTTTGC TGGCCTTTTG CTCACATGTT CTTTCCTGCG TTATCCCCTG ATTCTGTGGA TAACCGTATT ACCGCCTTTG AGTGAGCTGA TACCGCTCGC CGCAGCCGAA CGACCGAGCG CAGCGAGTCA GTGAGCGAGG AAGCGGAAGA GCGCCTGATG CGGTATTTTC TCCTTACGCA TCTGTGCGGT ATTTCACACC GCATATATGG TGCACTCTCA GTACAATCTG CTCTGATGCC GCATAGTTAA GCCAGTATAC ACTCCGCTAT CGCTACGTGA CTGGGTCATG GCTGCGCCCC GACACCCGCC AACACCCGCT GACGCGCCCT GACGGGCTTG TCTGCTCCCG GCATCCGCTT ACAGACAAGC TGTGACCGTC TCCGGGAGCT GCATGTGTCA GAGGTTTTCA CCGTCATCAC CGAAACGCGC GAGGCAGCTG CGGTAAAGCT CATCAGCGTG GTCGTGAAGC GATTCACAGA TGTCTGCCTG TTCATCCGCG TCCAGCTCGT TGAGTTTCTC CAGAAGCGTT AATGTCTGGC TTCTGATAAA GCGGGCCATG TTAAGGGCGG TTTTTTCCTG TTTGGTCACT GATGCCTCCG TGTAAGGGGG ATTTCTGTTC ATGGGGGTAA TGATACCGAT GAAACGAGAG AGGATGCTCA CGATACGGGT TACTGATGAT GAACATGCCC GGTTACTGGA ACGTTGTGAG GGTAAACAAC TGGCGGTATG GATGCGGCGG GACCAGAGAA AAATCACTCA GGGTCAATGC CAGCGCTTCG TTAATACAGA TGTAGGTGTT CCACAGGGTA GCCAGCAGCA TCCTGCGATG CAGATCCGGA ACATAATGGT GCAGGGCGCT GACTTCCGCG TTTCCAGACT TTACGAAACA CGGAAACCGA AGACCATTCA TGTTGTTGCT CAGGTCGCAG ACGTTTTGCA GCAGCAGTCG CTTCACGTTC GCTCGCGTAT CGGTGATTCA TTCTGCTAAC CAGTAAGGCA ACCCCGCCAG CCTAGCCGGG TCCTCAACGA CAGGAGCACG ATCATGCGCA CCCGTGGGGC CGCCATGCCG GCGATAATGG CCTGCTTCTC GCCGAAACGT TTGGTGGCGG GACCAGTGAC GAAGGCTTGA GCGAGGGCGT GCAAGATTCC GAATACCGCA AGCGACAGGC CGATCATCGT CGCGCTCCAG CGAAAGCGGT CCTCGCCGAA AATGACCCAG AGCGCTGCCG GCACCTGTCC TACGAGTTGC ATGATAAAGA AGACAGTCAT AAGTGCGGCG ACGATAGTCA TGCCCCGCGC CCACCGGAAG GAGCTGACTG GGTTGAAGGC TCTCAAGGGC ATCGGTCGAG ATCCCGGTGC CTAATGAGTG AGCTAACTTA CATTAATTGC GTTGCGCTCA CTGCCCGCTT TCCAGTCGGG AAACCTGTCG TGCCAGCTGC ATTAATGAAT CGGCCAACGC GCGGGGAGAG GCGGTTTGCG TATTGGGCGC CAGGGTGGTT TTTCTTTTCA CCAGTGAGAC GGGCAACAGC TGATTGCCCT TCACCGCCTG GCCCTGAGAG AGTTGCAGCA AGCGGTCCAC 
GCTGGTTTGC CCCAGCAGGC GAAAATCCTG TTTGATGGTG GTTAACGGCG GGATATAACA TGAGCTGTCT TCGGTATCGT CGTATCCCAC TACCGAGATA TCCGCACCAA CGCGCAGCCC GGACTCGGTA ATGGCGCGCA TTGCGCCCAG CGCCATCTGA TCGTTGGCAA CCAGCATCGC AGTGGGAACG ATGCCCTCAT TCAGCATTTG CATGGTTTGT TGAAAACCGG ACATGGCACT CCAGTCGCCT TCCCGTTCCG CTATCGGCTG AATTTGATTG CGAGTGAGAT ATTTATGCCA GCCAGCCAGA CGCAGACGCG CCGAGACAGA ACTTAATGGG CCCGCTAACA GCGCGATTTG CTGGTGACCC AATGCGACCA GATGCTCCAC GCCCAGTCGC GTACCGTCTT CATGGGAGAA AATAATACTG TTGATGGGTG TCTGGTCAGA GACATCAAGA AATAACGCCG GAACATTAGT GCAGGCAGCT TCCACAGCAA TGGCATCCTG GTCATCCAGC GGATAGTTAA TGATCAGCCC ACTGACGCGT TGCGCGAGAA GATTGTGCAC CGCCGCTTTA CAGGCTTCGA CGCCGCTTCG TTCTACCATC GACACCACCA CGCTGGCACC CAGTTGATCG GCGCGAGATT TAATCGCCGC GACAATTTGC GACGGCGCGT GCAGGGCCAG ACTGGAGGTG GCAACGCCAA TCAGCAACGA CTGTTTGCCC GCCAGTTGTT GTGCCACGCG GTTGGGAATG TAATTCAGCT CCGCCATCGC CGCTTCCACT TTTTCCCGCG TTTTCGCAGA AACGTGGCTG GCCTGGTTCA CCACGCGGGA AACGGTCTGA TAAGAGACAC CGGCATACTC TGCGACATCG TATAACGTTA CTGGTTTCAC ATTCACCACC CTGAATTGAC TCTCTTCCGG GCGCTATCAT GCCATACCGC GAAAGGTTTT GCGCCATTCG ATGGTGTCCG GGATCTCGAC GCTCTCCCTT ATGCGACTCC TGCATTAGGA AGCAGCCCAG TAGTAGGTTG AGGCCGTTGA GCACCGCCGC CGCAAGGAAT GGTGCATGCA AGGAGATGGC GCCCAACAGT CCCCCGGCCA CGGGGCCTGC CACCATACCC ACGCCGAAAC AAGCGCTCAT GAGCCCGAAG TGGCGAGCCC GATCTTCCCC ATCGGTGATG TCGGCGATAT AGGCGCCAGC AACCGCACCT GTGGCGCCGG TGATGCCGGC CACGATGCGT CCGGCGTAGA GGATCGAGAT CGATCTCGAT CCCGCGAAAT TAATACGACT CACTATAGGG GAATTGTGAG CGGATAACAA TTCCCCTCTA GAAATAATTT TGTTTAACTT TAAGAAGGAG TCTCTCCCAT GAATACAAGC CATAAAACTT TAAAAACCAT TGCGATTTTA GGCCAGCCTA ATGTGGGGAA AAGCTCGTTA TTTAACCGCC TGGCTAGAGA AAGGATCGCT ATCACTTCAG ATTTTGCAGG CACTACACGA GACATTAACA AACGAAAAAT CGCATTGAAT GGCCATGAAG TGGAATTGCT AGATACAGGG GGCATGGCTA AAGACGCTCT TTTGTCTAAA GAAATCAAAG CCCTTAATTT AAAAGCCGCT CAAATGAGCG ATTTGATTTT GTATGTTGTG GATGGCAAGT CTATCCCTAG CGATGAAGAC ATCAAGCTTT TTAGAGAGGT TTTTAAAACC AACCCTAACT GCTTTTTAGT GATCAATAAA ATTGATAACG ATAAAGAAAA AGAGCGAGCT TATGCGTTTT CTTCTTTTGG CATGCCAAAG 
AGTTTTAATA TTTCCGTTTC GCACAATAGA GGCATTAGTG CATTGATTGA TGCGGTATTG AACGCGCTGA GTTTAAACCA AATCATAGAG CAAGATTTGG ATGCGGATAT TTTAGAAAGC TTAGAAAATA ACGCACCAGA AGAAGAAACT AAAGAAGAGA TCATTCAAGT AGGCATCATT GGGAGGGTGA ATGTGGGCAA AAGCTCGCTC TTAAACGCGC TCACGAAAAA AGAAAGGAGC CTTGTTTCTA GCGTGGCTGG CACGACCATT GACCCCATAG ATGAAACCAT TCTCATAGGC GATCAAAAAA TTTGCTTTGT GGATACCGCT GGCATCAGGC ATAGGGGTAA GATTTTAGGC ATTGAAAAAT ACGCACTAGA ACGCACGCAA AAAGCCTTAG AAAAATCCCA CATTGCGCTT TTAGTTTTAG ACGTGAGCGC TCCTTTTGTG GAATTGGACG AAAAGATCAG CTCCTTAGCG GATAAACACT CTTTAGGCAT CATTCTTATT CTAAACAAAT GGGACATCCG CTACGCCCCT TATGAAGAAA TCATGGCAAC TCTAAAAAGG AAATTCCGCT TTTTAGAATA CGCCCCTGTG ATCACAACCA GCTGCTTAAA AGCGCGCCAT ATAGATGAAA TCAAGCATAA AATCATAGAA GTCTATGAGT GTTTTTCCAG ACGTATTCCC ACGAGCCTAC TCAATAGCGT GATCACTCAA GCCACCCAAA AACACCCCTT ACCAAGCGAT GGAGGGAAAT TAGTGAAAGT GTATTACGCC ACGCAATTTG CCACCAAACC CCCTCAAATC TCTCTTATCA TGAATCGCCC TAAAGCCTTG CATTTCAGTT ACAAACGCTA TTTGATCAAC ACCTTAAGGA AAGAATTTAA TTTTTTAGGC ACGCCTTTAA TCCTTAACGC TAAAGATAAA AAAAGCGCCC AACAAAATGG GCACCACCAT CATCATCATT AACGGATCCG AATTCGAGCT CCGTCGACAA GCTTGCGGCC GCACTCGAGC ACCACCACCA CCACCACTGA GATCCGGCTG CTAACAAAGC CCGAAAGGAA GCTGAGTTGG CTGCTGCCAC CGCTGAGCAA TAACTAGCAT AACCCCTTGG GGCCTCTAAA CGGGTCTTGA GGGGTTTTTT GCTGAAAGGA GGAACTATAT CCGGAT