NegoA.17981.a

prolyl-tRNA synthetase

CENTER ID: NegoA.17981.a
ORGANISM: Neisseria gonorrhoeae NCCP11945
ASSOCIATED DISEASE:
CURRENT STATUS: purified
COMMUNITY REQUEST: True
NIH RISK GROUP: 2
SELECT AGENT: False
NIH PRIORITY PATHOGENS CATEGORY: I/II

Ordering Clones & Proteins

Any materials available for this target are listed below. To order a material from SSGCID, click the button in its "order material" column; this adds the material to a virtual cart. You may order multiple materials at a time at no cost to you, as this contract is funded by NIAID. When you are ready to place your order, click the "Place Order" link, which appears in the top right corner of the page once the first item is in your cart.

Clones*

CENTER REFERENCE ID | DOMAIN/REGION DESCRIPTION | INFO | AA START | AA STOP | ORDER MATERIAL
NegoA.17981.a.B1.GE40085 | full length | | 1 | 570 |
* SSGCID clones represent un-induced expression constructs which have been verified by sequencing from vector primers. Clones may contain a tag, such as N-term 6xHis. Get sequence information using the button in the "info" column.
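The footnote above mentions that a clone may carry a tag such as an N-terminal 6xHis. A minimal sketch of how one might check a construct sequence for such a tag is given here; the function name `find_his_tag`, the 20-residue search window, and the six-histidine threshold are illustrative assumptions, not part of any SSGCID tool.

```python
import re

def find_his_tag(protein, min_his=6, window=20):
    """Return the (start, end) span of a His run near the N-terminus, or None.

    Hypothetical helper: `min_his` and `window` are arbitrary illustrative
    defaults. Whitespace is removed so that sequences written in blocks of
    ten residues can be pasted in directly.
    """
    seq = "".join(protein.split()).upper()
    match = re.search("H{%d,}" % min_his, seq[:window])
    return match.span() if match else None

# First 20 residues of the tagged construct and of the native protein,
# as listed in this record.
print(find_his_tag("MAHHHHHHMK ASQFFISTLK"))
print(find_his_tag("MKASQFFIST LKEAPAEAAF"))
```

The tagged construct should report a run covering the six histidines after the leading `MA`, while the native sequence should report no tag.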

Proteins

CENTER REFERENCE ID | DOMAIN/REGION DESCRIPTION | INFO | AA START | AA STOP | ORDER MATERIAL
NegoA.17981.a.B1.PW37920 | full length | | 1 | 570 |
Molecular models
METHOD RESULTS
comparative modelling Robetta_85049
External Resources
RESOURCE REFERENCE ID
PATRIC ID: fig|521006.8.peg.1457
UniProt: B4RMJ8
Sequences
These are the native gene sequences; the sequences of constructs derived from them may differ because of codon optimization or other protocols. To find the specific sequence of any material you have ordered, click the "info" button next to the name of that material.
AA Sequence
MKASQFFIST LKEAPAEAAF ASHKLMIRAG LIKANASGLY TWMPMGLRVL RKVENVVREE MARAGSVELL MPVVQPAELW QESGRWEFYG KELLRLKDRH ERDFCMGPTC EEVIADIVRK EINSYKQLPK NFYHIQTKFR DEVRPRFGVM RAREFVMKDA YSFHADYASL QATYDAMYDA YCRIFTRLGL AFRPVAADTG SIGGTGSHEF QVLAESGEDV IAYSDTSDYA ANIELAPTLP LKGERAAAQA VLTKVHTPNV KTIESLVEFL NIPVEQTLKS IVVEGENEGE LVLLLLRGDH EFNDIKAEKL AGVKSPLTMA SPAAIVEQFG ANGGSLGPVG FTGKVYADFA TEKGADWVIG ANEDDYHYTG FNFGRDAAEP EFVDLRNVVE GDESPDGQGR LKLARGIEVG HVFQLRGKYT QAMNVSFLDN NGKSQIMEMG CYGIGITRVV AAAIEQNNDE KGIIWTKAMA PFEVVIVPMN YKKSDTVREA ADRIYAELLA AGADVLLDDR DERAGVLLND SELLGIPHRI VIGDRALKEG NVEYAERRDN EAQAVAIGEI VARVTASLNA
NT Sequence
ATGAAAGCCA GCCAATTCTT TATCTCTACT TTAAAAGAAG CCCCTGCCGA AGCCGCGTTT GCCAGCCACA AGCTGATGAT TCGCGCCGGT CTGATTAAAG CCAACGCGTC CGGTCTTTAT ACTTGGATGC CGATGGGGCT GCGCGTGTTA CGCAAAGTCG AAAACGTCGT GCGCGAGGAA ATGGCGCGCG CGGGCAGCGT GGAGCTGCTG ATGCCGGTGG TGCAGCCTGC CGAACTGTGG CAGGAATCCG GCCGCTGGGA GTTTTACGGT AAAGAACTGC TGCGCCTGAA AGACCGCCAC GAACGCGATT TCTGCATGGG CCCGACCTGC GAGGAAGTCA TCGCCGACAT CGTGCGCAAA GAAATCAACA GCTACAAACA ACTGCCGAAA AATTTTTACC ACATCCAAAC CAAATTCCGC GACGAAGTGC GCCCGCGTTT CGGCGTGATG CGCGCGCGCG AATTTGTGAT GAAAGATGCT TATTCCTTCC ACGCCGACTA CGCCTCGCTT CAGGCGACCT ATGATGCCAT GTATGACGCT TACTGCCGCA TCTTTACCCG TCTGGGTTTG GCGTTCCGCC CCGTCGCCGC AGACACCGGC AGCATCGGCG GTACGGGTTC GCACGAGTTT CAAGTGTTGG CGGAAAGCGG CGAAGATGTG ATTGCATACA GCGACACTTC CGATTACGCC GCCAATATCG AGTTGGCACC GACCTTGCCG CTTAAAGGTG AACGTGCCGC CGCTCAGGCT GTGTTGACCA AAGTACATAC ACCAAACGTC AAAACCATTG AGTCTTTGGT TGAATTCCTG AATATTCCGG TTGAACAAAC CCTCAAATCC ATCGTGGTTG AAGGCGAAAA CGAAGGCGAA CTCGTCCTAC TGCTGTTGCG TGGCGACCAT GAGTTTAACG ACATCAAGGC AGAAAAACTG GCGGGCGTAA AATCGCCACT GACTATGGCA AGCCCTGCCG CGATTGTTGA ACAATTCGGC GCAAACGGCG GTTCGCTCGG CCCCGTCGGC TTCACAGGCA AAGTCTATGC CGATTTCGCT ACCGAAAAAG GCGCGGACTG GGTTATCGGC GCAAACGAAG ACGACTACCA CTATACCGGC TTCAACTTCG GCCGCGATGC CGCCGAGCCT GAATTCGTCG ATTTGCGTAA TGTGGTCGAA GGCGACGAAA GCCCCGACGG GCAAGGCCGT CTGAAACTGG CGCGCGGCAT CGAAGTCGGA CATGTCTTCC AGTTGCGCGG CAAATATACC CAAGCTATGA ACGTCAGCTT CCTCGACAAT AACGGCAAAT CGCAAATCAT GGAAATGGGC TGCTACGGCA TCGGCATCAC CCGCGTCGTT GCCGCCGCCA TCGAGCAGAA TAACGACGAA AAAGGCATCA TTTGGACCAA AGCGATGGCG CCGTTTGAAG TCGTCATCGT GCCGATGAAC TACAAAAAAT CCGACACTGT GCGTGAAGCC GCCGACAGAA TCTATGCCGA ATTGCTGGCG GCAGGCGCGG ATGTGCTGCT GGACGACCGC GACGAACGCG CAGGCGTATT GCTGAACGAT TCCGAGCTTC TCGGTATCCC GCACCGCATC GTCATCGGCG ACCGCGCCTT GAAAGAAGGC AATGTCGAAT ACGCCGAACG CCGCGACAAC GAAGCGCAGG CAGTTGCAAT CGGAGAAATT GTTGCGCGTG TAACAGCTTC ATTAAATGCG TAA
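To confirm that an NT sequence encodes the listed AA sequence, the nucleotides can be translated codon by codon. The sketch below assumes the standard genetic code and uses hypothetical helper names (`clean`, `translate`); it strips the ten-character block spacing used on this page before translating and stops at the first stop codon.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove whitespace and upper-case a sequence copied from the page."""
    return "".join(seq.split()).upper()

def translate(nt):
    """Translate from the first base, stopping at the first stop codon.

    Codons containing ambiguous bases (for example N) become 'X'.
    """
    nt = clean(nt)
    protein = []
    for i in range(0, len(nt) - 2, 3):
        aa = CODON_TABLE.get(nt[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First six codons of the NT sequence above.
print(translate("ATGAAAGCCA GCCAATTC"))
```

Pasting the full NT sequence into `translate` and comparing the result with `clean` applied to the AA sequence is one way to check the two listings against each other.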
Details for NegoA.17981.a.B1.GE40085
HARVESTED ON: 3/24/2016
SEQUENCED ON: 3/25/2016
EXPECTED MW: 64 kDa
OBSERVED MW: 70 kDa
ANTIBIOTIC MARKER: ampicillin
COUNT OF EXPRESSION COLONIES: Many (50-100)
TOTAL EXPRESSION LEVEL: Moderate Expression
SOLUBLE EXPRESSION LEVEL: Low Expression
EXPRESSION HOST: data unavailable
SEQUENCING RESULT:
PERCENT IDENTITY: 97
PERCENT COVERAGE: 98
Validated AA Sequence
MAHHHHHHHM KASQFFISTL KEAPAEAAFA SHKLMIRAGL IKANASGLYT WMPMGLRVLR KVENVVREEM ARAGSVELLM PVVQPAELWQ ESGRWEFYGK ELLRLKDRHE RDFCMGPTCE EVIADIVRKE INSYKQLPKN FYHIQTKFRD EVRPRFGVMR AREFVMKDAY SFHADYASLQ ATYDAMYDAY CRIFTRLGLA FRPVAADTGS IGGTGSHEFQ VLAXXGEDVI AYSDTSDYAA NIELAPTLPL XXERAAAXXV LTKVHTPNVK TIESLVEFLN IPVEQTLKSX XXXXENEGEL VLLLLRGDHE XXDIKAEKLA GVKSPLTMAS PAAIVEQFGA NGGSLGPVGF TGKVYADFAT EKGADWVIGA NEDDYHYTGF NFGRDAAEPE FVDLRNVVEG DESPDGQGRL KLARGIEVGH VFQLRGKYTQ AMNVSFLDNN GKSQIMEMGC YGIGITRVVA AAIEQNNDEK GIIWTKAMAP FEVVIVPMNY KKSDTVREAA DRIYAELLAA GADVLLDDRD ERAGVLLNDS ELLGIPHRIV IGDRALKEGN VEYAERRGNE AQAVAIGEIV
Validated NT Sequence
caacaatttc tccgattgca actgcctgcg cttcgttgcc gcggcgttcg gcgtattcga cattgccttc tttcaaggcg cggtcgccga tgacgatgcg gtgcgggata ccgagaagct cggaatcgtt cagcaatacg cctgcgcgtt cgtcgcggtc gtccagcagc acatccgcgc ctgccgccag caattcggca tagattctgt cggcggcttc acgcacagtg tcggattttt tgtagttcat cggcacgatg acgacttcaa acggcgccat cgctttggtc caaatgatgc ctttttcgtc gttattctgc tcgatggcgg cggcaacgac gcgggtgatg ccgatgccgt agcagcccat ttccatgatt tgcgatttgc cgttattgtc gaggaagctg acgttcatag cttgggtata tttgccgcgc aactggaaga catgtccgac ttcgatgccg cgcgccagtt tcagacggcc ttgcccgtcg gggctttcgt cgccttcgac cacattacgc aaatcgacga attcaggctc ggcggcatcg cggccgaagt tgaagccggt atagtggtag tcgtcttcgt ttgcgccgat aacccagtcc gcgccttttt cggtagcgaa atcggcatag actttgcctg tgaagccgac ggggccgagc gaaccgccgt ttgcgccgaa ttgttcaaca atcgcggcag ggcttgccat agtcagtggc gattttacgc ccgccagttt ttctgccttg atgtcgtnna actcatggtc gccacgcaac agcagtagga cgagttcgcc ttcgttttcn nnnnnnnnnn nnntggattt gagggtttgt tcaaccggaa tattcaggaa ttcaaccaaa gactcaatgg ttttgacgtt tggtgtatgt actttggtca acacagnnng agcggcggca cgttcacnnt taagcggcaa ggtcggtgcc aactcgatat tggcggcgta atcggaagtg tcgctgtatg caatcacatc ttcgccgnnn nnngccaaca cttgaaactc gtgcgaaccc gtaccgccga tgctgccggt gtctgcggcg acggggcgga acgccaaacc cagacgggta aagatgcggc agtaagcgtc atacatggca tcataggtcg cctgaagcga ggcgtagtcg gcgtggaagg aataagcatc tttcatcaca aattcgcgcg cgcgcatcac gccgaaacgc gggcgcactt cgtcgcggaa tttggtttgg atgtggtaaa aatttttcgg cagttgtttg tagctgttga tttctttgcg cacgatgtcg gcgatgactt cctcgcaggt cgggcccatg cagaaatcgc gttcgtggcg gtctttcagg cgcagcagtt ctttaccgta aaactcccag cggccggatt cctgccacag ttcggcaggc tgcaccaccg gcatcagcag ctccacgctg cccgcgcgcg ccatttcctc gcgcacgacg ttttcgactt tgcgtaacac gcgcagcccc atcggcatcc aagtataaag accggacgcg ttggctttaa tcagaccggc gcgaatcatc agcttgtggc tggcaaacgc ggcttcggca ggggcttctt ttaaagtaga gataaagaat tggctggctt tcatatggtg gtggtggtgg tggtgagcca t
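The validated read above ends in `...gtgagccat`, which is the reverse complement of an `atggctcac...` start, so it appears to be reported on the strand opposite the open reading frame. A small sketch for flipping such a read into the coding orientation follows; `reverse_complement` is a hypothetical helper, and uncalled bases (`n`) are kept as `n`.

```python
# Translation table for complementing bases; n/N (uncalled base) maps to itself.
_COMPLEMENT = str.maketrans("acgtnACGTN", "tgcanTGCAN")

def reverse_complement(seq):
    """Return the reverse complement, ignoring whitespace between blocks."""
    return "".join(seq.split()).translate(_COMPLEMENT)[::-1]

# Last 30 bases of the validated read, copied with the page's block spacing.
tail = "catatggtg gtggtggtgg tggtgagcca t"
print(reverse_complement(tail))
```

Applied to the tail of the read, this yields a sequence beginning `atggctcac`, which lines up with the start of the tagged open reading frame.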
Expected Protein Sequence
MAHHHHHHMK ASQFFISTLK EAPAEAAFAS HKLMIRAGLI KANASGLYTW MPMGLRVLRK VENVVREEMA RAGSVELLMP VVQPAELWQE SGRWEFYGKE LLRLKDRHER DFCMGPTCEE VIADIVRKEI NSYKQLPKNF YHIQTKFRDE VRPRFGVMRA REFVMKDAYS FHADYASLQA TYDAMYDAYC RIFTRLGLAF RPVAADTGSI GGTGSHEFQV LAESGEDVIA YSDTSDYAAN IELAPTLPLK GERAAAQAVL TKVHTPNVKT IESLVEFLNI PVEQTLKSIV VEGENEGELV LLLLRGDHEF NDIKAEKLAG VKSPLTMASP AAIVEQFGAN GGSLGPVGFT GKVYADFATE KGADWVIGAN EDDYHYTGFN FGRDAAEPEF VDLRNVVEGD ESPDGQGRLK LARGIEVGHV FQLRGKYTQA MNVSFLDNNG KSQIMEMGCY GIGITRVVAA AIEQNNDEKG IIWTKAMAPF EVVIVPMNYK KSDTVREAAD RIYAELLAAG ADVLLDDRDE RAGVLLNDSE LLGIPHRIVI GDRALKEGNV EYAERRDNEA QAVAIGEIVA RVTASLNA
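The record reports a percent identity and percent coverage for the validated sequence but does not state how they are calculated. As an illustration only, the sketch below aligns two protein sequences with a basic Needleman-Wunsch global alignment and reports identity as matching columns divided by alignment length. The scoring values (+1 match, -1 mismatch, -1 gap), the function names, and the identity definition are all assumptions; `X` (unresolved) residues simply count as mismatches here, so the output is not expected to reproduce the reported figures.

```python
def global_align(a, b, match=1, mismatch=-1, gap=-1):
    """Needleman-Wunsch global alignment; returns the two gapped strings."""
    n, m = len(a), len(b)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            diag = score[i - 1][j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            score[i][j] = max(diag, score[i - 1][j] + gap, score[i][j - 1] + gap)

    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0:
            step = match if a[i - 1] == b[j - 1] else mismatch
            if score[i][j] == score[i - 1][j - 1] + step:
                out_a.append(a[i - 1])
                out_b.append(b[j - 1])
                i, j = i - 1, j - 1
                continue
        if i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))

def percent_identity(aligned_a, aligned_b):
    """Identical, non-gap columns as a percentage of all alignment columns."""
    same = sum(1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-")
    return 100.0 * same / len(aligned_a)

# N-terminal fragments of the expected and validated sequences in this record.
exp, val = global_align("MAHHHHHHMKASQ", "MAHHHHHHHMKASQ")
print(exp)
print(val)
print(round(percent_identity(exp, val), 1))
```

On these fragments the only difference is the additional histidine in the validated sequence, which the alignment represents as a single gap in the expected sequence.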
Full NT Sequence (Expression Vector + Insert)
taatacgact cactataggg agaccacaac ggtttccctc tagaaataat tttgtttaac tttaagaagg agatatacca tggctcacca ccaccaccac catatgaaag ccagccaatt ctttatctct actttaaaag aagcccctgc cgaagccgcg tttgccagcc acaagctgat gattcgcgcc ggtctgatta aagccaacgc gtccggtctt tatacttgga tgccgatggg gctgcgcgtg ttacgcaaag tcgaaaacgt cgtgcgcgag gaaatggcgc gcgcgggcag cgtggagctg ctgatgccgg tggtgcagcc tgccgaactg tggcaggaat ccggccgctg ggagttttac ggtaaagaac tgctgcgcct gaaagaccgc cacgaacgcg atttctgcat gggcccgacc tgcgaggaag tcatcgccga catcgtgcgc aaagaaatca acagctacaa acaactgccg aaaaattttt accacatcca aaccaaattc cgcgacgaag tgcgcccgcg tttcggcgtg atgcgcgcgc gcgaatttgt gatgaaagat gcttattcct tccacgccga ctacgcctcg cttcaggcga cctatgatgc catgtatgac gcttactgcc gcatctttac ccgtctgggt ttggcgttcc gccccgtcgc cgcagacacc ggcagcatcg gcggtacggg ttcgcacgag tttcaagtgt tggcggaaag cggcgaagat gtgattgcat acagcgacac ttccgattac gccgccaata tcgagttggc accgaccttg ccgcttaaag gtgaacgtgc cgccgctcag gctgtgttga ccaaagtaca tacaccaaac gtcaaaacca ttgagtcttt ggttgaattc ctgaatattc cggttgaaca aaccctcaaa tccatcgtgg ttgaaggcga aaacgaaggc gaactcgtcc tactgctgtt gcgtggcgac catgagttta acgacatcaa ggcagaaaaa ctggcgggcg taaaatcgcc actgactatg gcaagccctg ccgcgattgt tgaacaattc ggcgcaaacg gcggttcgct cggccccgtc ggcttcacag gcaaagtcta tgccgatttc gctaccgaaa aaggcgcgga ctgggttatc ggcgcaaacg aagacgacta ccactatacc ggcttcaact tcggccgcga tgccgccgag cctgaattcg tcgatttgcg taatgtggtc gaaggcgacg aaagccccga cgggcaaggc cgtctgaaac tggcgcgcgg catcgaagtc ggacatgtct tccagttgcg cggcaaatat acccaagcta tgaacgtcag cttcctcgac aataacggca aatcgcaaat catggaaatg ggctgctacg gcatcggcat cacccgcgtc gttgccgccg ccatcgagca gaataacgac gaaaaaggca tcatttggac caaagcgatg gcgccgtttg aagtcgtcat cgtgccgatg aactacaaaa aatccgacac tgtgcgtgaa gccgccgaca gaatctatgc cgaattgctg gcggcaggcg cggatgtgct gctggacgac cgcgacgaac gcgcaggcgt attgctgaac gattccgagc ttctcggtat cccgcaccgc atcgtcatcg gcgaccgcgc cttgaaagaa ggcaatgtcg aatacgccga acgccgcgac aacgaagcgc aggcagttgc aatcggagaa attgttgcgc gtgtaacagc ttcattaaat 
gcgtgagtaa gataggatcc ggctgctaac aaagcccgaa aggaagctga gttggctgct gccaccgctg agcaataact agcataaccc cttggggcct ctaaacgggt cttgaggggt tttttgctga aaggaggaac tatatccgga tatccacagg acgggtgtgg tcgccatgat cgcgtagtcg atagtggctc caagtagcga agcgagcagg actgggcggc ggccaaagcg gtcggacagt gctccgagaa cgggtgcgca tagaaattgc atcaacgcat atagcgctag cagcacgcca tagtgactgg cgatgctgtc ggaatggacg atatcccgca agaggcccgg cagtaccggc ataaccaagc ctatgcctac agcatccagg gtgacggtgc cgaggatgac gatgagcgca ttgttagatt tcatacacgg tgcctgactg cgttagcaat ttaactgtga taaactaccg cattaaagct tatcgatgat aagctgtcaa acatgagaat tcttgaagac gaaagggcct cgtgatacgc ctatttttat aggttaatgt catgataata atggtttctt agacgtcagg tggcactttt cggggaaatg tgcgcggaac ccctatttgt ttatttttct aaatacattc aaatatgtat ccgctcatga gacaataacc ctgataaatg cttcaataat attgaaaaag gaagagtatg agtattcaac atttccgtgt cgcccttatt cccttttttg cggcattttg ccttcctgtt tttgctcacc cagaaacgct ggtgaaagta aaagatgctg aagatcagtt gggtgcacga gtgggttaca tcgaactgga tctcaacagc ggtaagatcc ttgagagttt tcgccccgaa gaacgttttc caatgatgag cacttttaaa gttctgctat gtggcgcggt attatcccgt gttgacgccg ggcaagagca actcggtcgc cgcatacact attctcagaa tgacttggtt gagtactcac cagtcacaga aaagcatctt acggatggca tgacagtaag agaattatgc agtgctgcca taaccatgag tgataacact gcggccaact tacttctgac aacgatcgga ggaccgaagg agctaaccgc ttttttgcac aacatggggg atcatgtaac tcgccttgat cgttgggaac cggagctgaa tgaagccata ccaaacgacg agcgtgacac cacgatgcct gcagcaatgg caacaacgtt gcgcaaacta ttaactggcg aactacttac tctagcttcc cggcaacaat taatagactg gatggaggcg gataaagttg caggaccact tctgcgctcg gcccttccgg ctggctggtt tattgctgat aaatctggag ccggtgagcg tgggtctcgc ggtatcattg cagcactggg gccagatggt aagccctccc gtatcgtagt tatctacacg acggggagtc aggcaactat ggatgaacga aatagacaga tcgctgagat aggtgcctca ctgattaagc attggtaact gtcagaccaa gtttactcat atatacttta gattgattta aaacttcatt tttaatttaa aaggatctag gtgaagatcc tttttgataa tctcatgacc aaaatccctt aacgtgagtt ttcgttccac tgagcgtcag accccgtaga aaagatcaaa ggatcttctt gagatccttt ttttctgcgc gtaatctgct gcttgcaaac 
aaaaaaacca ccgctaccag cggtggtttg tttgccggat caagagctac caactctttt tccgaaggta actggcttca gcagagcgca gataccaaat actgtccttc tagtgtagcc gtagttaggc caccacttca agaactctgt agcaccgcct acatacctcg ctctgctaat cctgttacca gtggctgctg ccagtggcga taagtcgtgt cttaccgggt tggactcaag acgatagtta ccggataagg cgcagcggtc gggctgaacg gggggttcgt gcacacagcc cagcttggag cgaacgacct acaccgaact gagataccta cagcgtgagc tatgagaaag cgccacgctt cccgaaggga gaaaggcgga caggtatccg gtaagcggca gggtcggaac aggagagcgc acgagggagc ttccaggggg aaacgcctgg tatctttata gtcctgtcgg gtttcgccac ctctgacttg agcgtcgatt tttgtgatgc tcgtcagggg ggcggagcct atggaaaaac gccagcaacg cggccttttt acggttcctg gccttttgct ggccttttgc tcacatgttc tttcctgcgt tatcccctga ttctgtggat aaccgtatta ccgcctttga gtgagctgat accgctcgcc gcagccgaac gaccgagcgc agcgagtcag tgagcgagga agcggaagag cgcctgatgc ggtattttct ccttacgcat ctgtgcggta tttcacaccg catatatggt gcactctcag tacaatctgc tctgatgccg catagttaag ccagtataca ctccgctatc gctacgtgac tgggtcatgg ctgcgccccg acacccgcca acacccgctg acgcgccctg acgggcttgt ctgctcccgg catccgctta cagacaagct gtgaccgtct ccgggagctg catgtgtcag aggttttcac cgtcatcacc gaaacgcgcg aggcagctgc ggtaaagctc atcagcgtgg tcgtgaagcg attcacagat gtctgcctgt tcatccgcgt ccagctcgtt gagtttctcc agaagcgtta atgtctggct tctgataaag cgggccatgt taagggcggt tttttcctgt ttggtcactg atgcctccgt gtaaggggga tttctgttca tgggggtaat gataccgatg aaacgagaga ggatgctcac gatacgggtt actgatgatg aacatgcccg gttactggaa cgttgtgagg gtaaacaact ggcggtatgg atgcggcggg accagagaaa aatcactcag ggtcaatgcc agcgcttcgt taatacagat gtaggtgttc cacagggtag ccagcagcat cctgcgatgc agatccggaa cataatggtg cagggcgctg acttccgcgt ttccagactt tacgaaacac ggaaaccgaa gaccattcat gttgttgctc aggtcgcaga cgttttgcag cagcagtcgc ttcacgttcg ctcgcgtatc ggtgattcat tctgctaacc agtaaggcaa ccccgccagc ctagccgggt cctcaacgac aggagcacga tcatgcgcac ccgtggccag gacccaacgc tgcccgagat gcgccgcgtg cggctgctgg agatggcgga cgcgatggat atgttctgcc aagggttggt ttgcgcattc acagttctcc gcaagaattg attggctcca attcttggag tggtgaatcc gttagcgagg tgccgccggc ttccattcag 
gtcgaggtgg cccggctcca tgcaccgcga cgcaacgcgg ggaggcagac aaggtatagg gcggcgccta caatccatgc caacccgttc catgtgctcg ccgaggcggc ataaatcgcc gtgacgatca gcggtccagt gatcgaagtt aggctggtaa gagccgcgag cgatccttga agctgtccct gatggtcgtc atctacctgc ctggacagca tggcctgcaa cgcgggcatc ccgatgccgc cggaagcgag aagaatcata atggggaagg ccatccagcc tcgcgtcgcg aacgccagca agacgtagcc cagcgcgtcg gccgccatgc cggcgataat ggcctgcttc tcgccgaaac gtttggtggc gggaccagtg acgaaggctt gagcgagggc gtgcaagatt ccgaataccg caagcgacag gccgatcatc gtcgcgctcc agcgaaagcg gtcctcgccg aaaatgaccc agagcgctgc cggcacctgt cctacgagtt gcatgataaa gaagacagtc ataagtgcgg cgacgatagt catgccccgc gcccaccgga aggagctgac tgggttgaag gctctcaagg gcatcggtcg acgctctccc ttatgcgact cctgcattag gaagcagccc agtagtaggt tgaggccgtt gagcaccgcc gccgcaagga atggtgcatg caaggagatg gcgcccaaca gtcccccggc cacggggcct gccaccatac ccacgccgaa acaagcgctc atgagcccga agtggcgagc ccgatcttcc ccatcggtga tgtcggcgat ataggcgcca gcaaccgcac ctgtggcgcc ggtgatgccg gccacgatgc gtccggcgta gaggatcgag atctcgatcc cgcgaaat
Details for NegoA.17981.a.B1.PW37920
PURIFICATION DATE: 6/13/2016
CONCENTRATION: 29 mg/ml
OBSERVED MW: 68 kDa
EXPRESSION LEVEL: Moderate Expression
PROTEIN PURIFICATION BUFFER: 25 mM HEPES pH 7.0, 500 mM NaCl, 5% glycerol, 2 mM DTT, and 0.025% azide
EXPRESSION HOST: data unavailable
VIAL COUNT (approx.): 4
VIAL VOLUME: 200 µl
PERCENT IDENTITY: 97
PERCENT COVERAGE: 98
Protocol Notes
notes unavailable
Validated AA Sequence
MAHHHHHHHM KASQFFISTL KEAPAEAAFA SHKLMIRAGL IKANASGLYT WMPMGLRVLR KVENVVREEM ARAGSVELLM PVVQPAELWQ ESGRWEFYGK ELLRLKDRHE RDFCMGPTCE EVIADIVRKE INSYKQLPKN FYHIQTKFRD EVRPRFGVMR AREFVMKDAY SFHADYASLQ ATYDAMYDAY CRIFTRLGLA FRPVAADTGS IGGTGSHEFQ VLAXXGEDVI AYSDTSDYAA NIELAPTLPL XXERAAAXXV LTKVHTPNVK TIESLVEFLN IPVEQTLKSX XXXXENEGEL VLLLLRGDHE XXDIKAEKLA GVKSPLTMAS PAAIVEQFGA NGGSLGPVGF TGKVYADFAT EKGADWVIGA NEDDYHYTGF NFGRDAAEPE FVDLRNVVEG DESPDGQGRL KLARGIEVGH VFQLRGKYTQ AMNVSFLDNN GKSQIMEMGC YGIGITRVVA AAIEQNNDEK GIIWTKAMAP FEVVIVPMNY KKSDTVREAA DRIYAELLAA GADVLLDDRD ERAGVLLNDS ELLGIPHRIV IGDRALKEGN VEYAERRGNE AQAVAIGEIV
Validated NT Sequence
caacaatttc tccgattgca actgcctgcg cttcgttgcc gcggcgttcg gcgtattcga cattgccttc tttcaaggcg cggtcgccga tgacgatgcg gtgcgggata ccgagaagct cggaatcgtt cagcaatacg cctgcgcgtt cgtcgcggtc gtccagcagc acatccgcgc ctgccgccag caattcggca tagattctgt cggcggcttc acgcacagtg tcggattttt tgtagttcat cggcacgatg acgacttcaa acggcgccat cgctttggtc caaatgatgc ctttttcgtc gttattctgc tcgatggcgg cggcaacgac gcgggtgatg ccgatgccgt agcagcccat ttccatgatt tgcgatttgc cgttattgtc gaggaagctg acgttcatag cttgggtata tttgccgcgc aactggaaga catgtccgac ttcgatgccg cgcgccagtt tcagacggcc ttgcccgtcg gggctttcgt cgccttcgac cacattacgc aaatcgacga attcaggctc ggcggcatcg cggccgaagt tgaagccggt atagtggtag tcgtcttcgt ttgcgccgat aacccagtcc gcgccttttt cggtagcgaa atcggcatag actttgcctg tgaagccgac ggggccgagc gaaccgccgt ttgcgccgaa ttgttcaaca atcgcggcag ggcttgccat agtcagtggc gattttacgc ccgccagttt ttctgccttg atgtcgtnna actcatggtc gccacgcaac agcagtagga cgagttcgcc ttcgttttcn nnnnnnnnnn nnntggattt gagggtttgt tcaaccggaa tattcaggaa ttcaaccaaa gactcaatgg ttttgacgtt tggtgtatgt actttggtca acacagnnng agcggcggca cgttcacnnt taagcggcaa ggtcggtgcc aactcgatat tggcggcgta atcggaagtg tcgctgtatg caatcacatc ttcgccgnnn nnngccaaca cttgaaactc gtgcgaaccc gtaccgccga tgctgccggt gtctgcggcg acggggcgga acgccaaacc cagacgggta aagatgcggc agtaagcgtc atacatggca tcataggtcg cctgaagcga ggcgtagtcg gcgtggaagg aataagcatc tttcatcaca aattcgcgcg cgcgcatcac gccgaaacgc gggcgcactt cgtcgcggaa tttggtttgg atgtggtaaa aatttttcgg cagttgtttg tagctgttga tttctttgcg cacgatgtcg gcgatgactt cctcgcaggt cgggcccatg cagaaatcgc gttcgtggcg gtctttcagg cgcagcagtt ctttaccgta aaactcccag cggccggatt cctgccacag ttcggcaggc tgcaccaccg gcatcagcag ctccacgctg cccgcgcgcg ccatttcctc gcgcacgacg ttttcgactt tgcgtaacac gcgcagcccc atcggcatcc aagtataaag accggacgcg ttggctttaa tcagaccggc gcgaatcatc agcttgtggc tggcaaacgc ggcttcggca ggggcttctt ttaaagtaga gataaagaat tggctggctt tcatatggtg gtggtggtgg tggtgagcca t
Expressed Protein Sequence
MAHHHHHHMK ASQFFISTLK EAPAEAAFAS HKLMIRAGLI KANASGLYTW MPMGLRVLRK VENVVREEMA RAGSVELLMP VVQPAELWQE SGRWEFYGKE LLRLKDRHER DFCMGPTCEE VIADIVRKEI NSYKQLPKNF YHIQTKFRDE VRPRFGVMRA REFVMKDAYS FHADYASLQA TYDAMYDAYC RIFTRLGLAF RPVAADTGSI GGTGSHEFQV LAESGEDVIA YSDTSDYAAN IELAPTLPLK GERAAAQAVL TKVHTPNVKT IESLVEFLNI PVEQTLKSIV VEGENEGELV LLLLRGDHEF NDIKAEKLAG VKSPLTMASP AAIVEQFGAN GGSLGPVGFT GKVYADFATE KGADWVIGAN EDDYHYTGFN FGRDAAEPEF VDLRNVVEGD ESPDGQGRLK LARGIEVGHV FQLRGKYTQA MNVSFLDNNG KSQIMEMGCY GIGITRVVAA AIEQNNDEKG IIWTKAMAPF EVVIVPMNYK KSDTVREAAD RIYAELLAAG ADVLLDDRDE RAGVLLNDSE LLGIPHRIVI GDRALKEGNV EYAERRDNEA QAVAIGEIVA RVTASLNA
Full NT Sequence (Expression Vector + Insert)
TAATACGACT CACTATAGGG AGACCACAAC GGTTTCCCTC TAGAAATAAT TTTGTTTAAC TTTAAGAAGG AGATATACCA TGGCTCACCA CCACCACCAC CATATGAAAG CCAGCCAATT CTTTATCTCT ACTTTAAAAG AAGCCCCTGC CGAAGCCGCG TTTGCCAGCC ACAAGCTGAT GATTCGCGCC GGTCTGATTA AAGCCAACGC GTCCGGTCTT TATACTTGGA TGCCGATGGG GCTGCGCGTG TTACGCAAAG TCGAAAACGT CGTGCGCGAG GAAATGGCGC GCGCGGGCAG CGTGGAGCTG CTGATGCCGG TGGTGCAGCC TGCCGAACTG TGGCAGGAAT CCGGCCGCTG GGAGTTTTAC GGTAAAGAAC TGCTGCGCCT GAAAGACCGC CACGAACGCG ATTTCTGCAT GGGCCCGACC TGCGAGGAAG TCATCGCCGA CATCGTGCGC AAAGAAATCA ACAGCTACAA ACAACTGCCG AAAAATTTTT ACCACATCCA AACCAAATTC CGCGACGAAG TGCGCCCGCG TTTCGGCGTG ATGCGCGCGC GCGAATTTGT GATGAAAGAT GCTTATTCCT TCCACGCCGA CTACGCCTCG CTTCAGGCGA CCTATGATGC CATGTATGAC GCTTACTGCC GCATCTTTAC CCGTCTGGGT TTGGCGTTCC GCCCCGTCGC CGCAGACACC GGCAGCATCG GCGGTACGGG TTCGCACGAG TTTCAAGTGT TGGCGGAAAG CGGCGAAGAT GTGATTGCAT ACAGCGACAC TTCCGATTAC GCCGCCAATA TCGAGTTGGC ACCGACCTTG CCGCTTAAAG GTGAACGTGC CGCCGCTCAG GCTGTGTTGA CCAAAGTACA TACACCAAAC GTCAAAACCA TTGAGTCTTT GGTTGAATTC CTGAATATTC CGGTTGAACA AACCCTCAAA TCCATCGTGG TTGAAGGCGA AAACGAAGGC GAACTCGTCC TACTGCTGTT GCGTGGCGAC CATGAGTTTA ACGACATCAA GGCAGAAAAA CTGGCGGGCG TAAAATCGCC ACTGACTATG GCAAGCCCTG CCGCGATTGT TGAACAATTC GGCGCAAACG GCGGTTCGCT CGGCCCCGTC GGCTTCACAG GCAAAGTCTA TGCCGATTTC GCTACCGAAA AAGGCGCGGA CTGGGTTATC GGCGCAAACG AAGACGACTA CCACTATACC GGCTTCAACT TCGGCCGCGA TGCCGCCGAG CCTGAATTCG TCGATTTGCG TAATGTGGTC GAAGGCGACG AAAGCCCCGA CGGGCAAGGC CGTCTGAAAC TGGCGCGCGG CATCGAAGTC GGACATGTCT TCCAGTTGCG CGGCAAATAT ACCCAAGCTA TGAACGTCAG CTTCCTCGAC AATAACGGCA AATCGCAAAT CATGGAAATG GGCTGCTACG GCATCGGCAT CACCCGCGTC GTTGCCGCCG CCATCGAGCA GAATAACGAC GAAAAAGGCA TCATTTGGAC CAAAGCGATG GCGCCGTTTG AAGTCGTCAT CGTGCCGATG AACTACAAAA AATCCGACAC TGTGCGTGAA GCCGCCGACA GAATCTATGC CGAATTGCTG GCGGCAGGCG CGGATGTGCT GCTGGACGAC CGCGACGAAC GCGCAGGCGT ATTGCTGAAC GATTCCGAGC TTCTCGGTAT CCCGCACCGC ATCGTCATCG GCGACCGCGC CTTGAAAGAA GGCAATGTCG AATACGCCGA ACGCCGCGAC AACGAAGCGC AGGCAGTTGC AATCGGAGAA ATTGTTGCGC GTGTAACAGC TTCATTAAAT 
GCGTGAGTAA GATAGGATCC GGCTGCTAAC AAAGCCCGAA AGGAAGCTGA GTTGGCTGCT GCCACCGCTG AGCAATAACT AGCATAACCC CTTGGGGCCT CTAAACGGGT CTTGAGGGGT TTTTTGCTGA AAGGAGGAAC TATATCCGGA TATCCACAGG ACGGGTGTGG TCGCCATGAT CGCGTAGTCG ATAGTGGCTC CAAGTAGCGA AGCGAGCAGG ACTGGGCGGC GGCCAAAGCG GTCGGACAGT GCTCCGAGAA CGGGTGCGCA TAGAAATTGC ATCAACGCAT ATAGCGCTAG CAGCACGCCA TAGTGACTGG CGATGCTGTC GGAATGGACG ATATCCCGCA AGAGGCCCGG CAGTACCGGC ATAACCAAGC CTATGCCTAC AGCATCCAGG GTGACGGTGC CGAGGATGAC GATGAGCGCA TTGTTAGATT TCATACACGG TGCCTGACTG CGTTAGCAAT TTAACTGTGA TAAACTACCG CATTAAAGCT TATCGATGAT AAGCTGTCAA ACATGAGAAT TCTTGAAGAC GAAAGGGCCT CGTGATACGC CTATTTTTAT AGGTTAATGT CATGATAATA ATGGTTTCTT AGACGTCAGG TGGCACTTTT CGGGGAAATG TGCGCGGAAC CCCTATTTGT TTATTTTTCT AAATACATTC AAATATGTAT CCGCTCATGA GACAATAACC CTGATAAATG CTTCAATAAT ATTGAAAAAG GAAGAGTATG AGTATTCAAC ATTTCCGTGT CGCCCTTATT CCCTTTTTTG CGGCATTTTG CCTTCCTGTT TTTGCTCACC CAGAAACGCT GGTGAAAGTA AAAGATGCTG AAGATCAGTT GGGTGCACGA GTGGGTTACA TCGAACTGGA TCTCAACAGC GGTAAGATCC TTGAGAGTTT TCGCCCCGAA GAACGTTTTC CAATGATGAG CACTTTTAAA GTTCTGCTAT GTGGCGCGGT ATTATCCCGT GTTGACGCCG GGCAAGAGCA ACTCGGTCGC CGCATACACT ATTCTCAGAA TGACTTGGTT GAGTACTCAC CAGTCACAGA AAAGCATCTT ACGGATGGCA TGACAGTAAG AGAATTATGC AGTGCTGCCA TAACCATGAG TGATAACACT GCGGCCAACT TACTTCTGAC AACGATCGGA GGACCGAAGG AGCTAACCGC TTTTTTGCAC AACATGGGGG ATCATGTAAC TCGCCTTGAT CGTTGGGAAC CGGAGCTGAA TGAAGCCATA CCAAACGACG AGCGTGACAC CACGATGCCT GCAGCAATGG CAACAACGTT GCGCAAACTA TTAACTGGCG AACTACTTAC TCTAGCTTCC CGGCAACAAT TAATAGACTG GATGGAGGCG GATAAAGTTG CAGGACCACT TCTGCGCTCG GCCCTTCCGG CTGGCTGGTT TATTGCTGAT AAATCTGGAG CCGGTGAGCG TGGGTCTCGC GGTATCATTG CAGCACTGGG GCCAGATGGT AAGCCCTCCC GTATCGTAGT TATCTACACG ACGGGGAGTC AGGCAACTAT GGATGAACGA AATAGACAGA TCGCTGAGAT AGGTGCCTCA CTGATTAAGC ATTGGTAACT GTCAGACCAA GTTTACTCAT ATATACTTTA GATTGATTTA AAACTTCATT TTTAATTTAA AAGGATCTAG GTGAAGATCC TTTTTGATAA TCTCATGACC AAAATCCCTT AACGTGAGTT TTCGTTCCAC TGAGCGTCAG ACCCCGTAGA AAAGATCAAA GGATCTTCTT GAGATCCTTT TTTTCTGCGC GTAATCTGCT GCTTGCAAAC 
AAAAAAACCA CCGCTACCAG CGGTGGTTTG TTTGCCGGAT CAAGAGCTAC CAACTCTTTT TCCGAAGGTA ACTGGCTTCA GCAGAGCGCA GATACCAAAT ACTGTCCTTC TAGTGTAGCC GTAGTTAGGC CACCACTTCA AGAACTCTGT AGCACCGCCT ACATACCTCG CTCTGCTAAT CCTGTTACCA GTGGCTGCTG CCAGTGGCGA TAAGTCGTGT CTTACCGGGT TGGACTCAAG ACGATAGTTA CCGGATAAGG CGCAGCGGTC GGGCTGAACG GGGGGTTCGT GCACACAGCC CAGCTTGGAG CGAACGACCT ACACCGAACT GAGATACCTA CAGCGTGAGC TATGAGAAAG CGCCACGCTT CCCGAAGGGA GAAAGGCGGA CAGGTATCCG GTAAGCGGCA GGGTCGGAAC AGGAGAGCGC ACGAGGGAGC TTCCAGGGGG AAACGCCTGG TATCTTTATA GTCCTGTCGG GTTTCGCCAC CTCTGACTTG AGCGTCGATT TTTGTGATGC TCGTCAGGGG GGCGGAGCCT ATGGAAAAAC GCCAGCAACG CGGCCTTTTT ACGGTTCCTG GCCTTTTGCT GGCCTTTTGC TCACATGTTC TTTCCTGCGT TATCCCCTGA TTCTGTGGAT AACCGTATTA CCGCCTTTGA GTGAGCTGAT ACCGCTCGCC GCAGCCGAAC GACCGAGCGC AGCGAGTCAG TGAGCGAGGA AGCGGAAGAG CGCCTGATGC GGTATTTTCT CCTTACGCAT CTGTGCGGTA TTTCACACCG CATATATGGT GCACTCTCAG TACAATCTGC TCTGATGCCG CATAGTTAAG CCAGTATACA CTCCGCTATC GCTACGTGAC TGGGTCATGG CTGCGCCCCG ACACCCGCCA ACACCCGCTG ACGCGCCCTG ACGGGCTTGT CTGCTCCCGG CATCCGCTTA CAGACAAGCT GTGACCGTCT CCGGGAGCTG CATGTGTCAG AGGTTTTCAC CGTCATCACC GAAACGCGCG AGGCAGCTGC GGTAAAGCTC ATCAGCGTGG TCGTGAAGCG ATTCACAGAT GTCTGCCTGT TCATCCGCGT CCAGCTCGTT GAGTTTCTCC AGAAGCGTTA ATGTCTGGCT TCTGATAAAG CGGGCCATGT TAAGGGCGGT TTTTTCCTGT TTGGTCACTG ATGCCTCCGT GTAAGGGGGA TTTCTGTTCA TGGGGGTAAT GATACCGATG AAACGAGAGA GGATGCTCAC GATACGGGTT ACTGATGATG AACATGCCCG GTTACTGGAA CGTTGTGAGG GTAAACAACT GGCGGTATGG ATGCGGCGGG ACCAGAGAAA AATCACTCAG GGTCAATGCC AGCGCTTCGT TAATACAGAT GTAGGTGTTC CACAGGGTAG CCAGCAGCAT CCTGCGATGC AGATCCGGAA CATAATGGTG CAGGGCGCTG ACTTCCGCGT TTCCAGACTT TACGAAACAC GGAAACCGAA GACCATTCAT GTTGTTGCTC AGGTCGCAGA CGTTTTGCAG CAGCAGTCGC TTCACGTTCG CTCGCGTATC GGTGATTCAT TCTGCTAACC AGTAAGGCAA CCCCGCCAGC CTAGCCGGGT CCTCAACGAC AGGAGCACGA TCATGCGCAC CCGTGGCCAG GACCCAACGC TGCCCGAGAT GCGCCGCGTG CGGCTGCTGG AGATGGCGGA CGCGATGGAT ATGTTCTGCC AAGGGTTGGT TTGCGCATTC ACAGTTCTCC GCAAGAATTG ATTGGCTCCA ATTCTTGGAG TGGTGAATCC GTTAGCGAGG TGCCGCCGGC TTCCATTCAG 
GTCGAGGTGG CCCGGCTCCA TGCACCGCGA CGCAACGCGG GGAGGCAGAC AAGGTATAGG GCGGCGCCTA CAATCCATGC CAACCCGTTC CATGTGCTCG CCGAGGCGGC ATAAATCGCC GTGACGATCA GCGGTCCAGT GATCGAAGTT AGGCTGGTAA GAGCCGCGAG CGATCCTTGA AGCTGTCCCT GATGGTCGTC ATCTACCTGC CTGGACAGCA TGGCCTGCAA CGCGGGCATC CCGATGCCGC CGGAAGCGAG AAGAATCATA ATGGGGAAGG CCATCCAGCC TCGCGTCGCG AACGCCAGCA AGACGTAGCC CAGCGCGTCG GCCGCCATGC CGGCGATAAT GGCCTGCTTC TCGCCGAAAC GTTTGGTGGC GGGACCAGTG ACGAAGGCTT GAGCGAGGGC GTGCAAGATT CCGAATACCG CAAGCGACAG GCCGATCATC GTCGCGCTCC AGCGAAAGCG GTCCTCGCCG AAAATGACCC AGAGCGCTGC CGGCACCTGT CCTACGAGTT GCATGATAAA GAAGACAGTC ATAAGTGCGG CGACGATAGT CATGCCCCGC GCCCACCGGA AGGAGCTGAC TGGGTTGAAG GCTCTCAAGG GCATCGGTCG ACGCTCTCCC TTATGCGACT CCTGCATTAG GAAGCAGCCC AGTAGTAGGT TGAGGCCGTT GAGCACCGCC GCCGCAAGGA ATGGTGCATG CAAGGAGATG GCGCCCAACA GTCCCCCGGC CACGGGGCCT GCCACCATAC CCACGCCGAA ACAAGCGCTC ATGAGCCCGA AGTGGCGAGC CCGATCTTCC CCATCGGTGA TGTCGGCGAT ATAGGCGCCA GCAACCGCAC CTGTGGCGCC GGTGATGCCG GCCACGATGC GTCCGGCGTA GAGGATCGAG ATCTCGATCC CGCGAAAT