MythA.18528.a

Glycosyl transferase family protein (Rv0050 ortholog)

CENTER ID: MythA.18528.a
ORGANISM: Mycobacterium thermoresistibile ATCC 19527 / NCTC 10409
ASSOCIATED DISEASE: Respiratory Infection
CURRENT STATUS: purified
COMMUNITY REQUEST: True
NIH RISK GROUP: 2
SELECT AGENT: False
NIH PRIORITY PATHOGENS CATEGORY: IIIC

Ordering Clones & Proteins

Any materials available for this target are listed below. To order from SSGCID, use the button in the "order material" column, which adds the material to a virtual cart. You may order multiple materials at once; there is no cost to you because this contract is funded by NIAID. When you are ready to place your order, click the "Place Order" link, which appears in the top right corner of the page after you add your first item to the cart.

Clones*

CENTER REFERENCE ID        DOMAIN/REGION DESCRIPTION   INFO   AA START   AA STOP   ORDER MATERIAL
MythA.18528.a.B2.GE37654   mature protein                     23         737
* SSGCID clones represent un-induced expression constructs which have been verified by sequencing from vector primers. Clones may contain a tag, such as N-term 6xHis. Get sequence information using the button in the "info" column.

Proteins

CENTER REFERENCE ID        DOMAIN/REGION DESCRIPTION   INFO   AA START   AA STOP   ORDER MATERIAL
MythA.18528.a.B2.PS02014   mature protein                     23         737

External Resources

RESOURCE REFERENCE ID
BV-BRC: fig|1078020.3.peg.2074
UniProt: G7CIK2

Orthologues for Rv0050

Species % Identity % Coverage Clones Proteins Structures
marinum 92 98 1 1 0
ulcerans 91 98 1 0 0
fortuitum 79 96 0 0 0
thermoresistibile 78 97 1 1 0

Sequences

The sequences below are the native gene sequences; the sequences of constructs derived from them may differ because of codon optimization or other protocol steps. To find the specific sequence of any material you may have ordered, click the "info" button next to the name of that material.
AA Sequence
MRRGLFAGAV ALLVLPIITF VMAYLIVDVP RPGDIRTNQV STILASDGSE LARIVPPEGN RIDVNINQIP VHVRDAVMAA EDRDFYSNPG FSFTGFARAI GNNIFGGDVQ GGSTITQQYV KNALVGSERV GVRGLIRKAK ELVISTKMSR QWSKDQVMQS YLNIIYFGRG AYGVAAASRA YFDKPVEELN VAEGALLAAL IQRPSTLDPA ADPQGAETRW NWVLDGMVEI GALSPQERAA QVFPPTVPPE LARSQNQTTG PNGLIERQVT RELLELFNIS EQALNTEGLQ ITTTIDPQAQ QAAVDAVETY LEGQDPDMRA AVVSIDPRTG AVKAYYGGSD ANGYDFAQAA VPTGSSFKVF ALVAALEQGM GLGYQVDSSP LTVNGIEITN VEGSSCGRCN IAEALKRSLN TSFYRLMLEL KNGPEDVADA AHRAGIAESI PGIEHTLSED GKGGPPNNGI VLGQYQSRPI DMASAYATLA NSGVYHRPHF VQKVVNSEGQ VLFDADQRDD DGEQRIEKDV ADNVTAAMQP IAAYSNGHAL AGGRPSAAKT GTHQLGDTGE NRDAWMVGYT PSLSTAVWVG TVDGNKPLRT KWGGKIYGSG LPSDIWKATM DGALKGTERE SFPKPGEIGG YAGVPQAPSP QRTSDDDGTT TVTPPSETVI QPSVEVAPGI TIPLGPPTTV PLGPPGGQSP VPGPGAPQQP FSPQQPYPPQ QPGPGAPVVP EPQVPGVPPP PAPQPFP
NT Sequence
gtgcgccgcg gcctgttcgc cggggcggtg gcgctgctgg tgctgcccat catcacgttc gtgatggcct acctgatcgt cgacgtgccg agaccgggtg acatccggac caaccaggtg tcgacgatcc tggccagcga cggcagtgag ctggcgagaa tcgtgccgcc cgaaggcaac cggatcgatg tgaacatcaa ccagataccg gtgcacgtgc gtgacgcggt gatggccgcc gaggaccgtg acttctactc caacccgggt ttctccttca ccgggttcgc ccgcgcgatc ggcaacaaca tcttcggcgg tgacgtccag ggcggttcga ccatcaccca gcagtacgtc aagaacgccc tggtcggttc cgagcgggtg ggcgtgcggg gcctgatccg gaaggccaag gagctggtga tctcgacgaa gatgtcgcgc cagtggtcca aggaccaggt gatgcagtcg tacctgaaca tcatctattt cggtcggggc gcctacgggg tggcggccgc gtcgcgggcg tacttcgaca aaccggtcga ggaactcaac gtcgccgagg gggcgctgct ggccgcgctg atccagcggc cctcgacgct ggacccggcc gccgatccgc agggcgcgga gacgcggtgg aactgggtgc tcgacgggat ggtggagatc ggcgcgctgt cgccgcagga gcgggccgcg caggtgttcc cgccgacggt gccgccggaa ctggcgcggt cgcagaacca gaccaccggc cccaacgggc tgatcgaacg gcaggtcacc cgggagctgc tggagctgtt caacatcagt gagcaggccc tcaacaccga gggtctgcag atcaccacga cgatcgaccc gcaggcacag caggccgcgg tggatgcggt cgagacctac ctggagggcc aggaccccga catgcgggcc gcggtggtgt ccatcgatcc gcgcaccggt gcggtgaagg cctattacgg cggttcggac gccaacggct acgacttcgc gcaggccgcg gtgcccaccg gttcgtcgtt caaggtgttc gcgttggtgg ccgcgctcga gcagggcatg gggctgggct atcaggtgga cagctcgccg ctgacggtca acggcatcga gatcaccaac gtcgagggca gcagctgcgg gcggtgcaat atcgccgagg cgctgaagcg gtcactgaac accagcttct accggttgat gctggaactg aagaacgggc ccgaggacgt cgccgacgcc gcgcaccgcg ccgggatcgc cgagagcatc cccgggatcg agcacaccct gtccgaggac ggcaagggcg gaccgccgaa caatggaatc gtgctggggc agtatcagtc ccgtccgatc gacatggcct cggcgtacgc caccctggcc aactccggtg tgtaccaccg gccgcacttc gtgcagaagg tggtgaactc cgaaggccag gtgctgttcg acgccgatca gcgtgacgac gacggcgagc agcgcatcga gaaggacgtc gccgacaacg tcaccgcggc catgcagccg atcgccgcgt actccaacgg gcacgcgctg gccggcggtc ggccgtcggc cgccaagacg ggcacccatc agctcgggga caccggggag aaccgggacg cctggatggt gggctacacc ccgtcgctgt cgacggcggt gtgggtcggc accgtcgacg gcaacaaacc gctgcgcacc aagtggggcg ggaagatcta cggctccggc ctgccgtcgg 
acatctggaa ggccaccatg gacggcgccc tgaagggtac cgaacgggag tcgttcccca agccgggcga gatcggcggc tacgccgggg tgccgcaggc cccctcgccg cagcgcacct cggatgacga cgggacgacg acggtgacgc cgccgtcgga gacggtgatc cagccgagtg tggaagtggc gcctggcatc acgatcccgc tgggaccgcc gacgacggtg ccgctcggcc cgcccggcgg gcagtccccg gtgccgggtc cgggtgcgcc gcagcagccg ttctcgccac agcagccgta cccaccgcag cagccggggc cgggcgcacc ggtggtgcct gaaccgcagg tgcccggcgt gccgccgccc cccgcaccac agccgttccc gtga
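The sequences on this page are printed in space-separated blocks of ten characters, so they usually need to be cleaned up before being passed to other tools. The following is a minimal Python sketch of that step, not part of the SSGCID record; the function names `normalize_sequence` and `looks_like_orf` are hypothetical, and the open-reading-frame check is only a rough sanity test (length divisible by three and a terminal stop codon).

```python
def normalize_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def looks_like_orf(nt_text: str) -> bool:
    """Rough check: length is a multiple of three and the last codon is a stop.

    This does not validate the start codon or look for internal stops.
    """
    nt = normalize_sequence(nt_text)
    return len(nt) > 0 and len(nt) % 3 == 0 and nt[-3:] in {"TAA", "TAG", "TGA"}


if __name__ == "__main__":
    # First three blocks of the native AA sequence listed above.
    aa_start = "MRRGLFAGAV ALLVLPIITF VMAYLIVDVP"
    cleaned = normalize_sequence(aa_start)
    print(cleaned)
    print(len(cleaned))
```

To check a full record, paste the complete AA or NT sequence into a string and pass it to the same functions.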
Details for MythA.18528.a.B2.GE37654
HARVESTED ON: 1/13/2014
SEQUENCED ON: 1/9/2014
EXPECTED MW: 77kDa
OBSERVED MW: 75kDa
ANTIBIOTIC MARKER: ampicillin
COUNT OF EXPRESSION COLONIES: Too many to count (100+)
TOTAL EXPRESSION LEVEL: High Expression
SOLUBLE EXPRESSION LEVEL: High Expression
EXPRESSION HOST: data unavailable
SEQUENCING RESULT: pass with incomplete coverage
PERCENT IDENTITY: 92
PERCENT COVERAGE: 43
Validated AA Sequence
MAHHHHHHMA YLIVDVPRPG DIRTNQVSTI LASDGSELAR IVPPEGNRID VNINQIPVHV RDAVMAAEDR DFYSNPGFSF TGFARAIGNN IFGGDVQGGS TITQQYVKNA LVGSERVGVR GLIRKAKELV ISTKMSRQWS KDQVMQSYLN IIYFGRGAYG VAAASRAYFD KPVEELNVAE GALLAALIQR PSTLDPAADP QGAETRWNWV LDGMVEIGAL SPQERAAQVF PPTVPPELAR SQNQTTGPNG LIERQVTREL LELFNISEQA LNTEGLQITT TIDPXHSTPR WMRSRPTWRA RTPTCGPRWC PSIRAPVR
Validated NT Sequence
cnctananaa ttttgtttaa ctttaagaag gagatatacc atggctcacc accaccacca ccatatggcc tacctgatcg tcgacgtgcc gagaccgggt gacatccgga ccaaccaggt gtcgacgatc ctggccagcg acggcagtga gctggcgaga atcgtgccgc ccgaaggcaa ccggatcgat gtgaacatca accagatacc ggtgcacgtg cgtgacgcgg tgatggccgc cgaggaccgt gacttctact ccaacccggg tttctccttc accgggttcg cccgcgcgat cggcaacaac atcttcggcg gtgacgtcca gggcggttcg accatcaccc agcagtacgt caagaacgcc ctggtcggtt ccgagcgggt gggcgtgcgg ggcctgatcc ggaaggccaa ggagctggtg atctcgacga agatgtcgcg ccagtggtcc aaggaccagg tgatgcagtc gtacctgaac atcatctatt tcggtcgggg cgcctacggg gtggcggccg cgtcgcgggc gtacttcgac aaaccggtcg aggaactcaa cgtcgccgag ggggcgctgc tggccgcgct gatccagcgg ccctcgacgc tggacccggc cgccgatccg cagggcgcgg agacgcggtg gaactgggtg ctcgacggga tggtggagat cggcgcgctg tcgccgcagg agcgggccgc gcaggtgttc ccgccgacgg tgccgccgga actggcgcgg tcgcagaacc agaccaccgg ccccaacggg ctgatcgaac ggcaggtcac ccgggagctg ctggagctgt tcaacatcag tgagcaggcc ctcaacaccg agggtctgca gatcaccacg acgatcgacc cgcngcacag cacgccgcgg tggatgcggt cgagacctac ctggagggcc aggaccccga catgcgggcc gcggtggtgt ccatcgatcc gcgcaccggt gcggtgaann ctattacggc ggttcggacg ccaacggcta cgacttcgcg cagncgcgng cccaccggtt cgtcgttcan gtgttcgcnn tggnngccgc nnctcgagca ggnatgggnc tggnnatcag nnngnnnnct cnccnctgan gnnacngcat cnagaatcnn cnnntnnngn ancnncngnn gncgnncncn nnnngnnnnn nnnggtnnnc nnaancacnn ntncntncnn nnnnatgtcn nnnaaac
Expected Protein Sequence
MAHHHHHHAY LIVDVPRPGD IRTNQVSTIL ASDGSELARI VPPEGNRIDV NINQIPVHVR DAVMAAEDRD FYSNPGFSFT GFARAIGNNI FGGDVQGGST ITQQYVKNAL VGSERVGVRG LIRKAKELVI STKMSRQWSK DQVMQSYLNI IYFGRGAYGV AAASRAYFDK PVEELNVAEG ALLAALIQRP STLDPAADPQ GAETRWNWVL DGMVEIGALS PQERAAQVFP PTVPPELARS QNQTTGPNGL IERQVTRELL ELFNISEQAL NTEGLQITTT IDPQAQQAAV DAVETYLEGQ DPDMRAAVVS IDPRTGAVKA YYGGSDANGY DFAQAAVPTG SSFKVFALVA ALEQGMGLGY QVDSSPLTVN GIEITNVEGS SCGRCNIAEA LKRSLNTSFY RLMLELKNGP EDVADAAHRA GIAESIPGIE HTLSEDGKGG PPNNGIVLGQ YQSRPIDMAS AYATLANSGV YHRPHFVQKV VNSEGQVLFD ADQRDDDGEQ RIEKDVADNV TAAMQPIAAY SNGHALAGGR PSAAKTGTHQ LGDTGENRDA WMVGYTPSLS TAVWVGTVDG NKPLRTKWGG KIYGSGLPSD IWKATMDGAL KGTERESFPK PGEIGGYAGV PQAPSPQRTS DDDGTTTVTP PSETVIQPSV EVAPGITIPL GPPTTVPLGP PGGQSPVPGP GAPQQPFSPQ QPYPPQQPGP GAPVVPEPQV PGVPPPPAPQ PFP
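The EXPECTED MW reported for this clone can be roughly cross-checked against the expected protein sequence. The sketch below is an illustration, not the method SSGCID uses: it assumes standard average residue masses, an unmodified linear chain, and one water added for the termini. The name `estimate_mw_da` is hypothetical. Sequences containing ambiguous residues such as `X` (as in the validated AA sequence) are rejected rather than guessed at.

```python
# Average residue masses in daltons (amino acid minus one water).
AVERAGE_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.01528  # added once for the free N- and C-termini


def estimate_mw_da(sequence: str) -> float:
    """Estimate the average molecular weight (Da) of an unmodified peptide."""
    seq = "".join(sequence.split()).upper()  # tolerate block-formatted input
    if not seq:
        return 0.0
    unknown = sorted(set(seq) - set(AVERAGE_RESIDUE_MASS))
    if unknown:
        raise ValueError(f"unsupported residue codes: {unknown}")
    return sum(AVERAGE_RESIDUE_MASS[res] for res in seq) + WATER_MASS


if __name__ == "__main__":
    # N-terminal tag portion of the expected protein sequence.
    tag = "MAHHHHHH"
    print(f"{estimate_mw_da(tag) / 1000:.2f} kDa")
```

Passing the full expected protein sequence gives a value in daltons that can be compared with the EXPECTED MW and OBSERVED MW fields.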
Full NT Sequence (Expression Vector + Insert)
taatacgact cactataggg agaccacaac ggtttccctc tagaaataat tttgtttaac tttaagaagg agatatacca tggctcacca ccaccaccac catgcctacc tgatcgtcga cgtgccgaga ccgggtgaca tccggaccaa ccaggtgtcg acgatcctgg ccagcgacgg cagtgagctg gcgagaatcg tgccgcccga aggcaaccgg atcgatgtga acatcaacca gataccggtg cacgtgcgtg acgcggtgat ggccgccgag gaccgtgact tctactccaa cccgggtttc tccttcaccg ggttcgcccg cgcgatcggc aacaacatct tcggcggtga cgtccagggc ggttcgacca tcacccagca gtacgtcaag aacgccctgg tcggttccga gcgggtgggc gtgcggggcc tgatccggaa ggccaaggag ctggtgatct cgacgaagat gtcgcgccag tggtccaagg accaggtgat gcagtcgtac ctgaacatca tctatttcgg tcggggcgcc tacggggtgg cggccgcgtc gcgggcgtac ttcgacaaac cggtcgagga actcaacgtc gccgaggggg cgctgctggc cgcgctgatc cagcggccct cgacgctgga cccggccgcc gatccgcagg gcgcggagac gcggtggaac tgggtgctcg acgggatggt ggagatcggc gcgctgtcgc cgcaggagcg ggccgcgcag gtgttcccgc cgacggtgcc gccggaactg gcgcggtcgc agaaccagac caccggcccc aacgggctga tcgaacggca ggtcacccgg gagctgctgg agctgttcaa catcagtgag caggccctca acaccgaggg tctgcagatc accacgacga tcgacccgca ggcacagcag gccgcggtgg atgcggtcga gacctacctg gagggccagg accccgacat gcgggccgcg gtggtgtcca tcgatccgcg caccggtgcg gtgaaggcct attacggcgg ttcggacgcc aacggctacg acttcgcgca ggccgcggtg cccaccggtt cgtcgttcaa ggtgttcgcg ttggtggccg cgctcgagca gggcatgggg ctgggctatc aggtggacag ctcgccgctg acggtcaacg gcatcgagat caccaacgtc gagggcagca gctgcgggcg gtgcaatatc gccgaggcgc tgaagcggtc actgaacacc agcttctacc ggttgatgct ggaactgaag aacgggcccg aggacgtcgc cgacgccgcg caccgcgccg ggatcgccga gagcatcccc gggatcgagc acaccctgtc cgaggacggc aagggcggac cgccgaacaa tggaatcgtg ctggggcagt atcagtcccg tccgatcgac atggcctcgg cgtacgccac cctggccaac tccggtgtgt accaccggcc gcacttcgtg cagaaggtgg tgaactccga aggccaggtg ctgttcgacg ccgatcagcg tgacgacgac ggcgagcagc gcatcgagaa ggacgtcgcc gacaacgtca ccgcggccat gcagccgatc gccgcgtact ccaacgggca cgcgctggcc ggcggtcggc cgtcggccgc caagacgggc acccatcagc tcggggacac cggggagaac cgggacgcct ggatggtggg ctacaccccg tcgctgtcga cggcggtgtg ggtcggcacc gtcgacggca acaaaccgct gcgcaccaag 
tggggcggga agatctacgg ctccggcctg ccgtcggaca tctggaaggc caccatggac ggcgccctga agggtaccga acgggagtcg ttccccaagc cgggcgagat cggcggctac gccggggtgc cgcaggcccc ctcgccgcag cgcacctcgg atgacgacgg gacgacgacg gtgacgccgc cgtcggagac ggtgatccag ccgagtgtgg aagtggcgcc tggcatcacg atcccgctgg gaccgccgac gacggtgccg ctcggcccgc ccggcgggca gtccccggtg ccgggtccgg gtgcgccgca gcagccgttc tcgccacagc agccgtaccc accgcagcag ccggggccgg gcgcaccggt ggtgcctgaa ccgcaggtgc ccggcgtgcc gccgcccccc gcaccacagc cgttcccgtg agtaagatag gatccggctg ctaacaaagc ccgaaaggaa gctgagttgg ctgctgccac cgctgagcaa taactagcat aaccccttgg ggcctctaaa cgggtcttga ggggtttttt gctgaaagga ggaactatat ccggatatcc acaggacggg tgtggtcgcc atgatcgcgt agtcgatagt ggctccaagt agcgaagcga gcaggactgg gcggcggcca aagcggtcgg acagtgctcc gagaacgggt gcgcatagaa attgcatcaa cgcatatagc gctagcagca cgccatagtg actggcgatg ctgtcggaat ggacgatatc ccgcaagagg cccggcagta ccggcataac caagcctatg cctacagcat ccagggtgac ggtgccgagg atgacgatga gcgcattgtt agatttcata cacggtgcct gactgcgtta gcaatttaac tgtgataaac taccgcatta aagcttatcg atgataagct gtcaaacatg agaattcttg aagacgaaag ggcctcgtga tacgcctatt tttataggtt aatgtcatga taataatggt ttcttagacg tcaggtggca cttttcgggg aaatgtgcgc ggaaccccta tttgtttatt tttctaaata cattcaaata tgtatccgct catgagacaa taaccctgat aaatgcttca ataatattga aaaaggaaga gtatgagtat tcaacatttc cgtgtcgccc ttattccctt ttttgcggca ttttgccttc ctgtttttgc tcacccagaa acgctggtga aagtaaaaga tgctgaagat cagttgggtg cacgagtggg ttacatcgaa ctggatctca acagcggtaa gatccttgag agttttcgcc ccgaagaacg ttttccaatg atgagcactt ttaaagttct gctatgtggc gcggtattat cccgtgttga cgccgggcaa gagcaactcg gtcgccgcat acactattct cagaatgact tggttgagta ctcaccagtc acagaaaagc atcttacgga tggcatgaca gtaagagaat tatgcagtgc tgccataacc atgagtgata acactgcggc caacttactt ctgacaacga tcggaggacc gaaggagcta accgcttttt tgcacaacat gggggatcat gtaactcgcc ttgatcgttg ggaaccggag ctgaatgaag ccataccaaa cgacgagcgt gacaccacga tgcctgcagc aatggcaaca acgttgcgca aactattaac tggcgaacta cttactctag cttcccggca acaattaata gactggatgg aggcggataa 
agttgcagga ccacttctgc gctcggccct tccggctggc tggtttattg ctgataaatc tggagccggt gagcgtgggt ctcgcggtat cattgcagca ctggggccag atggtaagcc ctcccgtatc gtagttatct acacgacggg gagtcaggca actatggatg aacgaaatag acagatcgct gagataggtg cctcactgat taagcattgg taactgtcag accaagttta ctcatatata ctttagattg atttaaaact tcatttttaa tttaaaagga tctaggtgaa gatccttttt gataatctca tgaccaaaat cccttaacgt gagttttcgt tccactgagc gtcagacccc gtagaaaaga tcaaaggatc ttcttgagat cctttttttc tgcgcgtaat ctgctgcttg caaacaaaaa aaccaccgct accagcggtg gtttgtttgc cggatcaaga gctaccaact ctttttccga aggtaactgg cttcagcaga gcgcagatac caaatactgt ccttctagtg tagccgtagt taggccacca cttcaagaac tctgtagcac cgcctacata cctcgctctg ctaatcctgt taccagtggc tgctgccagt ggcgataagt cgtgtcttac cgggttggac tcaagacgat agttaccgga taaggcgcag cggtcgggct gaacgggggg ttcgtgcaca cagcccagct tggagcgaac gacctacacc gaactgagat acctacagcg tgagctatga gaaagcgcca cgcttcccga agggagaaag gcggacaggt atccggtaag cggcagggtc ggaacaggag agcgcacgag ggagcttcca gggggaaacg cctggtatct ttatagtcct gtcgggtttc gccacctctg acttgagcgt cgatttttgt gatgctcgtc aggggggcgg agcctatgga aaaacgccag caacgcggcc tttttacggt tcctggcctt ttgctggcct tttgctcaca tgttctttcc tgcgttatcc cctgattctg tggataaccg tattaccgcc tttgagtgag ctgataccgc tcgccgcagc cgaacgaccg agcgcagcga gtcagtgagc gaggaagcgg aagagcgcct gatgcggtat tttctcctta cgcatctgtg cggtatttca caccgcatat atggtgcact ctcagtacaa tctgctctga tgccgcatag ttaagccagt atacactccg ctatcgctac gtgactgggt catggctgcg ccccgacacc cgccaacacc cgctgacgcg ccctgacggg cttgtctgct cccggcatcc gcttacagac aagctgtgac cgtctccggg agctgcatgt gtcagaggtt ttcaccgtca tcaccgaaac gcgcgaggca gctgcggtaa agctcatcag cgtggtcgtg aagcgattca cagatgtctg cctgttcatc cgcgtccagc tcgttgagtt tctccagaag cgttaatgtc tggcttctga taaagcgggc catgttaagg gcggtttttt cctgtttggt cactgatgcc tccgtgtaag ggggatttct gttcatgggg gtaatgatac cgatgaaacg agagaggatg ctcacgatac gggttactga tgatgaacat gcccggttac tggaacgttg tgagggtaaa caactggcgg tatggatgcg gcgggaccag agaaaaatca ctcagggtca atgccagcgc ttcgttaata cagatgtagg 
tgttccacag ggtagccagc agcatcctgc gatgcagatc cggaacataa tggtgcaggg cgctgacttc cgcgtttcca gactttacga aacacggaaa ccgaagacca ttcatgttgt tgctcaggtc gcagacgttt tgcagcagca gtcgcttcac gttcgctcgc gtatcggtga ttcattctgc taaccagtaa ggcaaccccg ccagcctagc cgggtcctca acgacaggag cacgatcatg cgcacccgtg gccaggaccc aacgctgccc gagatgcgcc gcgtgcggct gctggagatg gcggacgcga tggatatgtt ctgccaaggg ttggtttgcg cattcacagt tctccgcaag aattgattgg ctccaattct tggagtggtg aatccgttag cgaggtgccg ccggcttcca ttcaggtcga ggtggcccgg ctccatgcac cgcgacgcaa cgcggggagg cagacaaggt atagggcggc gcctacaatc catgccaacc cgttccatgt gctcgccgag gcggcataaa tcgccgtgac gatcagcggt ccagtgatcg aagttaggct ggtaagagcc gcgagcgatc cttgaagctg tccctgatgg tcgtcatcta cctgcctgga cagcatggcc tgcaacgcgg gcatcccgat gccgccggaa gcgagaagaa tcataatggg gaaggccatc cagcctcgcg tcgcgaacgc cagcaagacg tagcccagcg cgtcggccgc catgccggcg ataatggcct gcttctcgcc gaaacgtttg gtggcgggac cagtgacgaa ggcttgagcg agggcgtgca agattccgaa taccgcaagc gacaggccga tcatcgtcgc gctccagcga aagcggtcct cgccgaaaat gacccagagc gctgccggca cctgtcctac gagttgcatg ataaagaaga cagtcataag tgcggcgacg atagtcatgc cccgcgccca ccggaaggag ctgactgggt tgaaggctct caagggcatc ggtcgacgct ctcccttatg cgactcctgc attaggaagc agcccagtag taggttgagg ccgttgagca ccgccgccgc aaggaatggt gcatgcaagg agatggcgcc caacagtccc ccggccacgg ggcctgccac catacccacg ccgaaacaag cgctcatgag cccgaagtgg cgagcccgat cttccccatc ggtgatgtcg gcgatatagg cgccagcaac cgcacctgtg gcgccggtga tgccggccac gatgcgtccg gcgtagagga tcgagatctc gatcccgcga aat
Details for MythA.18528.a.B2.PS02014
PURIFICATION DATE: 3/7/2014
CONCENTRATION: 44.9mg/ml
OBSERVED MW: 77kDa
EXPRESSION LEVEL: Low Expression
PROTEIN PURIFICATION BUFFER: 20 mM HEPES, pH 7.0, 300 mM NaCl, 5% glycerol and 1 mM TCEP
EXPRESSION HOST: data unavailable
VIAL COUNT (approx.): 2
VIAL VOLUME: 110µl
PERCENT IDENTITY: 92
PERCENT COVERAGE: 43
Protocol Notes
notes unavailable
Validated AA Sequence
MAHHHHHHMA YLIVDVPRPG DIRTNQVSTI LASDGSELAR IVPPEGNRID VNINQIPVHV RDAVMAAEDR DFYSNPGFSF TGFARAIGNN IFGGDVQGGS TITQQYVKNA LVGSERVGVR GLIRKAKELV ISTKMSRQWS KDQVMQSYLN IIYFGRGAYG VAAASRAYFD KPVEELNVAE GALLAALIQR PSTLDPAADP QGAETRWNWV LDGMVEIGAL SPQERAAQVF PPTVPPELAR SQNQTTGPNG LIERQVTREL LELFNISEQA LNTEGLQITT TIDPXHSTPR WMRSRPTWRA RTPTCGPRWC PSIRAPVR
Validated NT Sequence
cnctananaa ttttgtttaa ctttaagaag gagatatacc atggctcacc accaccacca ccatatggcc tacctgatcg tcgacgtgcc gagaccgggt gacatccgga ccaaccaggt gtcgacgatc ctggccagcg acggcagtga gctggcgaga atcgtgccgc ccgaaggcaa ccggatcgat gtgaacatca accagatacc ggtgcacgtg cgtgacgcgg tgatggccgc cgaggaccgt gacttctact ccaacccggg tttctccttc accgggttcg cccgcgcgat cggcaacaac atcttcggcg gtgacgtcca gggcggttcg accatcaccc agcagtacgt caagaacgcc ctggtcggtt ccgagcgggt gggcgtgcgg ggcctgatcc ggaaggccaa ggagctggtg atctcgacga agatgtcgcg ccagtggtcc aaggaccagg tgatgcagtc gtacctgaac atcatctatt tcggtcgggg cgcctacggg gtggcggccg cgtcgcgggc gtacttcgac aaaccggtcg aggaactcaa cgtcgccgag ggggcgctgc tggccgcgct gatccagcgg ccctcgacgc tggacccggc cgccgatccg cagggcgcgg agacgcggtg gaactgggtg ctcgacggga tggtggagat cggcgcgctg tcgccgcagg agcgggccgc gcaggtgttc ccgccgacgg tgccgccgga actggcgcgg tcgcagaacc agaccaccgg ccccaacggg ctgatcgaac ggcaggtcac ccgggagctg ctggagctgt tcaacatcag tgagcaggcc ctcaacaccg agggtctgca gatcaccacg acgatcgacc cgcngcacag cacgccgcgg tggatgcggt cgagacctac ctggagggcc aggaccccga catgcgggcc gcggtggtgt ccatcgatcc gcgcaccggt gcggtgaann ctattacggc ggttcggacg ccaacggcta cgacttcgcg cagncgcgng cccaccggtt cgtcgttcan gtgttcgcnn tggnngccgc nnctcgagca ggnatgggnc tggnnatcag nnngnnnnct cnccnctgan gnnacngcat cnagaatcnn cnnntnnngn ancnncngnn gncgnncncn nnnngnnnnn nnnggtnnnc nnaancacnn ntncntncnn nnnnatgtcn nnnaaac
Expressed Protein Sequence
MAHHHHHHAY LIVDVPRPGD IRTNQVSTIL ASDGSELARI VPPEGNRIDV NINQIPVHVR DAVMAAEDRD FYSNPGFSFT GFARAIGNNI FGGDVQGGST ITQQYVKNAL VGSERVGVRG LIRKAKELVI STKMSRQWSK DQVMQSYLNI IYFGRGAYGV AAASRAYFDK PVEELNVAEG ALLAALIQRP STLDPAADPQ GAETRWNWVL DGMVEIGALS PQERAAQVFP PTVPPELARS QNQTTGPNGL IERQVTRELL ELFNISEQAL NTEGLQITTT IDPQAQQAAV DAVETYLEGQ DPDMRAAVVS IDPRTGAVKA YYGGSDANGY DFAQAAVPTG SSFKVFALVA ALEQGMGLGY QVDSSPLTVN GIEITNVEGS SCGRCNIAEA LKRSLNTSFY RLMLELKNGP EDVADAAHRA GIAESIPGIE HTLSEDGKGG PPNNGIVLGQ YQSRPIDMAS AYATLANSGV YHRPHFVQKV VNSEGQVLFD ADQRDDDGEQ RIEKDVADNV TAAMQPIAAY SNGHALAGGR PSAAKTGTHQ LGDTGENRDA WMVGYTPSLS TAVWVGTVDG NKPLRTKWGG KIYGSGLPSD IWKATMDGAL KGTERESFPK PGEIGGYAGV PQAPSPQRTS DDDGTTTVTP PSETVIQPSV EVAPGITIPL GPPTTVPLGP PGGQSPVPGP GAPQQPFSPQ QPYPPQQPGP GAPVVPEPQV PGVPPPPAPQ PFP
Full NT Sequence (Expression Vector + Insert)
TAATACGACT CACTATAGGG AGACCACAAC GGTTTCCCTC TAGAAATAAT TTTGTTTAAC TTTAAGAAGG AGATATACCA TGGCTCACCA CCACCACCAC CATGCCTACC TGATCGTCGA CGTGCCGAGA CCGGGTGACA TCCGGACCAA CCAGGTGTCG ACGATCCTGG CCAGCGACGG CAGTGAGCTG GCGAGAATCG TGCCGCCCGA AGGCAACCGG ATCGATGTGA ACATCAACCA GATACCGGTG CACGTGCGTG ACGCGGTGAT GGCCGCCGAG GACCGTGACT TCTACTCCAA CCCGGGTTTC TCCTTCACCG GGTTCGCCCG CGCGATCGGC AACAACATCT TCGGCGGTGA CGTCCAGGGC GGTTCGACCA TCACCCAGCA GTACGTCAAG AACGCCCTGG TCGGTTCCGA GCGGGTGGGC GTGCGGGGCC TGATCCGGAA GGCCAAGGAG CTGGTGATCT CGACGAAGAT GTCGCGCCAG TGGTCCAAGG ACCAGGTGAT GCAGTCGTAC CTGAACATCA TCTATTTCGG TCGGGGCGCC TACGGGGTGG CGGCCGCGTC GCGGGCGTAC TTCGACAAAC CGGTCGAGGA ACTCAACGTC GCCGAGGGGG CGCTGCTGGC CGCGCTGATC CAGCGGCCCT CGACGCTGGA CCCGGCCGCC GATCCGCAGG GCGCGGAGAC GCGGTGGAAC TGGGTGCTCG ACGGGATGGT GGAGATCGGC GCGCTGTCGC CGCAGGAGCG GGCCGCGCAG GTGTTCCCGC CGACGGTGCC GCCGGAACTG GCGCGGTCGC AGAACCAGAC CACCGGCCCC AACGGGCTGA TCGAACGGCA GGTCACCCGG GAGCTGCTGG AGCTGTTCAA CATCAGTGAG CAGGCCCTCA ACACCGAGGG TCTGCAGATC ACCACGACGA TCGACCCGCA GGCACAGCAG GCCGCGGTGG ATGCGGTCGA GACCTACCTG GAGGGCCAGG ACCCCGACAT GCGGGCCGCG GTGGTGTCCA TCGATCCGCG CACCGGTGCG GTGAAGGCCT ATTACGGCGG TTCGGACGCC AACGGCTACG ACTTCGCGCA GGCCGCGGTG CCCACCGGTT CGTCGTTCAA GGTGTTCGCG TTGGTGGCCG CGCTCGAGCA GGGCATGGGG CTGGGCTATC AGGTGGACAG CTCGCCGCTG ACGGTCAACG GCATCGAGAT CACCAACGTC GAGGGCAGCA GCTGCGGGCG GTGCAATATC GCCGAGGCGC TGAAGCGGTC ACTGAACACC AGCTTCTACC GGTTGATGCT GGAACTGAAG AACGGGCCCG AGGACGTCGC CGACGCCGCG CACCGCGCCG GGATCGCCGA GAGCATCCCC GGGATCGAGC ACACCCTGTC CGAGGACGGC AAGGGCGGAC CGCCGAACAA TGGAATCGTG CTGGGGCAGT ATCAGTCCCG TCCGATCGAC ATGGCCTCGG CGTACGCCAC CCTGGCCAAC TCCGGTGTGT ACCACCGGCC GCACTTCGTG CAGAAGGTGG TGAACTCCGA AGGCCAGGTG CTGTTCGACG CCGATCAGCG TGACGACGAC GGCGAGCAGC GCATCGAGAA GGACGTCGCC GACAACGTCA CCGCGGCCAT GCAGCCGATC GCCGCGTACT CCAACGGGCA CGCGCTGGCC GGCGGTCGGC CGTCGGCCGC CAAGACGGGC ACCCATCAGC TCGGGGACAC CGGGGAGAAC CGGGACGCCT GGATGGTGGG CTACACCCCG TCGCTGTCGA CGGCGGTGTG GGTCGGCACC GTCGACGGCA ACAAACCGCT GCGCACCAAG 
TGGGGCGGGA AGATCTACGG CTCCGGCCTG CCGTCGGACA TCTGGAAGGC CACCATGGAC GGCGCCCTGA AGGGTACCGA ACGGGAGTCG TTCCCCAAGC CGGGCGAGAT CGGCGGCTAC GCCGGGGTGC CGCAGGCCCC CTCGCCGCAG CGCACCTCGG ATGACGACGG GACGACGACG GTGACGCCGC CGTCGGAGAC GGTGATCCAG CCGAGTGTGG AAGTGGCGCC TGGCATCACG ATCCCGCTGG GACCGCCGAC GACGGTGCCG CTCGGCCCGC CCGGCGGGCA GTCCCCGGTG CCGGGTCCGG GTGCGCCGCA GCAGCCGTTC TCGCCACAGC AGCCGTACCC ACCGCAGCAG CCGGGGCCGG GCGCACCGGT GGTGCCTGAA CCGCAGGTGC CCGGCGTGCC GCCGCCCCCC GCACCACAGC CGTTCCCGTG AGTAAGATAG GATCCGGCTG CTAACAAAGC CCGAAAGGAA GCTGAGTTGG CTGCTGCCAC CGCTGAGCAA TAACTAGCAT AACCCCTTGG GGCCTCTAAA CGGGTCTTGA GGGGTTTTTT GCTGAAAGGA GGAACTATAT CCGGATATCC ACAGGACGGG TGTGGTCGCC ATGATCGCGT AGTCGATAGT GGCTCCAAGT AGCGAAGCGA GCAGGACTGG GCGGCGGCCA AAGCGGTCGG ACAGTGCTCC GAGAACGGGT GCGCATAGAA ATTGCATCAA CGCATATAGC GCTAGCAGCA CGCCATAGTG ACTGGCGATG CTGTCGGAAT GGACGATATC CCGCAAGAGG CCCGGCAGTA CCGGCATAAC CAAGCCTATG CCTACAGCAT CCAGGGTGAC GGTGCCGAGG ATGACGATGA GCGCATTGTT AGATTTCATA CACGGTGCCT GACTGCGTTA GCAATTTAAC TGTGATAAAC TACCGCATTA AAGCTTATCG ATGATAAGCT GTCAAACATG AGAATTCTTG AAGACGAAAG GGCCTCGTGA TACGCCTATT TTTATAGGTT AATGTCATGA TAATAATGGT TTCTTAGACG TCAGGTGGCA CTTTTCGGGG AAATGTGCGC GGAACCCCTA TTTGTTTATT TTTCTAAATA CATTCAAATA TGTATCCGCT CATGAGACAA TAACCCTGAT AAATGCTTCA ATAATATTGA AAAAGGAAGA GTATGAGTAT TCAACATTTC CGTGTCGCCC TTATTCCCTT TTTTGCGGCA TTTTGCCTTC CTGTTTTTGC TCACCCAGAA ACGCTGGTGA AAGTAAAAGA TGCTGAAGAT CAGTTGGGTG CACGAGTGGG TTACATCGAA CTGGATCTCA ACAGCGGTAA GATCCTTGAG AGTTTTCGCC CCGAAGAACG TTTTCCAATG ATGAGCACTT TTAAAGTTCT GCTATGTGGC GCGGTATTAT CCCGTGTTGA CGCCGGGCAA GAGCAACTCG GTCGCCGCAT ACACTATTCT CAGAATGACT TGGTTGAGTA CTCACCAGTC ACAGAAAAGC ATCTTACGGA TGGCATGACA GTAAGAGAAT TATGCAGTGC TGCCATAACC ATGAGTGATA ACACTGCGGC CAACTTACTT CTGACAACGA TCGGAGGACC GAAGGAGCTA ACCGCTTTTT TGCACAACAT GGGGGATCAT GTAACTCGCC TTGATCGTTG GGAACCGGAG CTGAATGAAG CCATACCAAA CGACGAGCGT GACACCACGA TGCCTGCAGC AATGGCAACA ACGTTGCGCA AACTATTAAC TGGCGAACTA CTTACTCTAG CTTCCCGGCA ACAATTAATA GACTGGATGG AGGCGGATAA 
AGTTGCAGGA CCACTTCTGC GCTCGGCCCT TCCGGCTGGC TGGTTTATTG CTGATAAATC TGGAGCCGGT GAGCGTGGGT CTCGCGGTAT CATTGCAGCA CTGGGGCCAG ATGGTAAGCC CTCCCGTATC GTAGTTATCT ACACGACGGG GAGTCAGGCA ACTATGGATG AACGAAATAG ACAGATCGCT GAGATAGGTG CCTCACTGAT TAAGCATTGG TAACTGTCAG ACCAAGTTTA CTCATATATA CTTTAGATTG ATTTAAAACT TCATTTTTAA TTTAAAAGGA TCTAGGTGAA GATCCTTTTT GATAATCTCA TGACCAAAAT CCCTTAACGT GAGTTTTCGT TCCACTGAGC GTCAGACCCC GTAGAAAAGA TCAAAGGATC TTCTTGAGAT CCTTTTTTTC TGCGCGTAAT CTGCTGCTTG CAAACAAAAA AACCACCGCT ACCAGCGGTG GTTTGTTTGC CGGATCAAGA GCTACCAACT CTTTTTCCGA AGGTAACTGG CTTCAGCAGA GCGCAGATAC CAAATACTGT CCTTCTAGTG TAGCCGTAGT TAGGCCACCA CTTCAAGAAC TCTGTAGCAC CGCCTACATA CCTCGCTCTG CTAATCCTGT TACCAGTGGC TGCTGCCAGT GGCGATAAGT CGTGTCTTAC CGGGTTGGAC TCAAGACGAT AGTTACCGGA TAAGGCGCAG CGGTCGGGCT GAACGGGGGG TTCGTGCACA CAGCCCAGCT TGGAGCGAAC GACCTACACC GAACTGAGAT ACCTACAGCG TGAGCTATGA GAAAGCGCCA CGCTTCCCGA AGGGAGAAAG GCGGACAGGT ATCCGGTAAG CGGCAGGGTC GGAACAGGAG AGCGCACGAG GGAGCTTCCA GGGGGAAACG CCTGGTATCT TTATAGTCCT GTCGGGTTTC GCCACCTCTG ACTTGAGCGT CGATTTTTGT GATGCTCGTC AGGGGGGCGG AGCCTATGGA AAAACGCCAG CAACGCGGCC TTTTTACGGT TCCTGGCCTT TTGCTGGCCT TTTGCTCACA TGTTCTTTCC TGCGTTATCC CCTGATTCTG TGGATAACCG TATTACCGCC TTTGAGTGAG CTGATACCGC TCGCCGCAGC CGAACGACCG AGCGCAGCGA GTCAGTGAGC GAGGAAGCGG AAGAGCGCCT GATGCGGTAT TTTCTCCTTA CGCATCTGTG CGGTATTTCA CACCGCATAT ATGGTGCACT CTCAGTACAA TCTGCTCTGA TGCCGCATAG TTAAGCCAGT ATACACTCCG CTATCGCTAC GTGACTGGGT CATGGCTGCG CCCCGACACC CGCCAACACC CGCTGACGCG CCCTGACGGG CTTGTCTGCT CCCGGCATCC GCTTACAGAC AAGCTGTGAC CGTCTCCGGG AGCTGCATGT GTCAGAGGTT TTCACCGTCA TCACCGAAAC GCGCGAGGCA GCTGCGGTAA AGCTCATCAG CGTGGTCGTG AAGCGATTCA CAGATGTCTG CCTGTTCATC CGCGTCCAGC TCGTTGAGTT TCTCCAGAAG CGTTAATGTC TGGCTTCTGA TAAAGCGGGC CATGTTAAGG GCGGTTTTTT CCTGTTTGGT CACTGATGCC TCCGTGTAAG GGGGATTTCT GTTCATGGGG GTAATGATAC CGATGAAACG AGAGAGGATG CTCACGATAC GGGTTACTGA TGATGAACAT GCCCGGTTAC TGGAACGTTG TGAGGGTAAA CAACTGGCGG TATGGATGCG GCGGGACCAG AGAAAAATCA CTCAGGGTCA ATGCCAGCGC TTCGTTAATA CAGATGTAGG 
TGTTCCACAG GGTAGCCAGC AGCATCCTGC GATGCAGATC CGGAACATAA TGGTGCAGGG CGCTGACTTC CGCGTTTCCA GACTTTACGA AACACGGAAA CCGAAGACCA TTCATGTTGT TGCTCAGGTC GCAGACGTTT TGCAGCAGCA GTCGCTTCAC GTTCGCTCGC GTATCGGTGA TTCATTCTGC TAACCAGTAA GGCAACCCCG CCAGCCTAGC CGGGTCCTCA ACGACAGGAG CACGATCATG CGCACCCGTG GCCAGGACCC AACGCTGCCC GAGATGCGCC GCGTGCGGCT GCTGGAGATG GCGGACGCGA TGGATATGTT CTGCCAAGGG TTGGTTTGCG CATTCACAGT TCTCCGCAAG AATTGATTGG CTCCAATTCT TGGAGTGGTG AATCCGTTAG CGAGGTGCCG CCGGCTTCCA TTCAGGTCGA GGTGGCCCGG CTCCATGCAC CGCGACGCAA CGCGGGGAGG CAGACAAGGT ATAGGGCGGC GCCTACAATC CATGCCAACC CGTTCCATGT GCTCGCCGAG GCGGCATAAA TCGCCGTGAC GATCAGCGGT CCAGTGATCG AAGTTAGGCT GGTAAGAGCC GCGAGCGATC CTTGAAGCTG TCCCTGATGG TCGTCATCTA CCTGCCTGGA CAGCATGGCC TGCAACGCGG GCATCCCGAT GCCGCCGGAA GCGAGAAGAA TCATAATGGG GAAGGCCATC CAGCCTCGCG TCGCGAACGC CAGCAAGACG TAGCCCAGCG CGTCGGCCGC CATGCCGGCG ATAATGGCCT GCTTCTCGCC GAAACGTTTG GTGGCGGGAC CAGTGACGAA GGCTTGAGCG AGGGCGTGCA AGATTCCGAA TACCGCAAGC GACAGGCCGA TCATCGTCGC GCTCCAGCGA AAGCGGTCCT CGCCGAAAAT GACCCAGAGC GCTGCCGGCA CCTGTCCTAC GAGTTGCATG ATAAAGAAGA CAGTCATAAG TGCGGCGACG ATAGTCATGC CCCGCGCCCA CCGGAAGGAG CTGACTGGGT TGAAGGCTCT CAAGGGCATC GGTCGACGCT CTCCCTTATG CGACTCCTGC ATTAGGAAGC AGCCCAGTAG TAGGTTGAGG CCGTTGAGCA CCGCCGCCGC AAGGAATGGT GCATGCAAGG AGATGGCGCC CAACAGTCCC CCGGCCACGG GGCCTGCCAC CATACCCACG CCGAAACAAG CGCTCATGAG CCCGAAGTGG CGAGCCCGAT CTTCCCCATC GGTGATGTCG GCGATATAGG CGCCAGCAAC CGCACCTGTG GCGCCGGTGA TGCCGGCCAC GATGCGTCCG GCGTAGAGGA TCGAGATCTC GATCCCGCGA AAT