MysmA.01365.a

Sorbitol utilization protein SOU2

CENTER ID: MysmA.01365.a
ORGANISM: Mycobacterium smegmatis ATCC 700084 / mc(2)155
ASSOCIATED DISEASE:
CURRENT STATUS: crystallized
COMMUNITY REQUEST: False
NIH RISK GROUP: 2
SELECT AGENT: False
NIH PRIORITY
pathogens category:
IIIC

Ordering Clones & Proteins

If there are materials available for this target, they will be listed below. Materials can be ordered from SSGCID using the button in the "order material" column. Clicking the button will add the material to a virtual cart. You may order multiple materials at a time at no cost to you, as this contract is funded by NIAID. When you are ready to place your order, click the "Place Order" link which will appear in the top right corner of the page after you place your first item in your cart.

Proteins

CENTER
REFERENCE ID
DOMAIN/REGION
DESCRIPTION
INFO AA
START
AA
STOP
ORDER
MATERIAL
MysmA.01365.a.A1.PW28820 Full length( MysmA.01365.a ) 1 255

External Resources

RESOURCE REFERENCE ID
BV-BRC: fig|246196.19.peg.3553
RefSeq: YP_887907.1
UniProt: A0QYB9

Sequences

These sequences are the native gene sequence; sequences of constructs derived from these sequences may differ due to codon optimization or other protocols. To find the specific sequence of any material you may have ordered, click on the "info" button next to the name of that material.
AA Sequence
MDVRSAFDLS GRTALVTGGN QGLGKAFAIA LAQAGARVSF SGRNAERNEK TAAEAAAAGH QLHAITADIT RAEDVERMTA EAIEALGHID ILVNNAGTCH HGESWTVTEE QWDDVFDLNV KALWACSLAV GAHMRERGSG SVVNIGSMSG IIVNRPQMQP AYNASKAAVH HLTKSLAAEW APLGIRVNAL APGYVKTDMA PVDRPEFKRY WIDDTPQLRY AVPEEIAPSV VFLASDAASF ITGSVLVADG GYTAW
NT Sequence
ATGGATGTGC GTTCGGCATT CGATCTGAGC GGGCGCACTG CGCTGGTGAC CGGCGGAAAC CAGGGCCTGG GCAAGGCTTT CGCGATCGCA CTCGCACAGG CCGGTGCCCG TGTGTCCTTC TCGGGCCGCA ACGCCGAACG CAACGAGAAG ACCGCGGCCG AGGCCGCCGC GGCAGGACAC CAACTGCACG CGATCACGGC CGACATCACC AGGGCCGAGG ACGTCGAGCG CATGACGGCC GAGGCCATCG AAGCGCTCGG TCACATCGAC ATCCTGGTCA ACAACGCGGG CACGTGCCAC CACGGTGAGT CCTGGACGGT CACCGAAGAG CAGTGGGACG ACGTGTTCGA CCTCAACGTC AAGGCGCTGT GGGCGTGTTC GCTCGCCGTC GGTGCGCACA TGCGCGAGCG CGGCAGCGGT TCGGTGGTCA ACATCGGCTC GATGTCGGGC ATCATCGTCA ACCGCCCCCA GATGCAGCCC GCGTACAACG CCTCCAAGGC CGCGGTGCAC CACCTCACGA AATCCCTTGC CGCCGAGTGG GCCCCGTTGG GAATCCGGGT CAACGCGCTG GCTCCCGGAT ACGTGAAGAC CGACATGGCC CCGGTTGACC GGCCGGAGTT CAAGCGGTAC TGGATCGACG ACACCCCGCA GCTGCGCTAC GCGGTGCCCG AGGAGATCGC GCCCAGCGTG GTGTTCCTGG CCAGCGACGC GGCCTCCTTC ATCACCGGCT CGGTGCTCGT CGCGGACGGC GGATACACCG CATGG
Details for MysmA.01365.a.A1.PW28820
PURIFICATION DATe: 7/5/2010
CONCENTRATION: 25.3mg/ml
OBSERVED MW: data unavailable
EXPRESSION LEVEL: High Expression
PROTEIN PURIFICATION BUFFER: 25 mM HEPES pH 7.0, 500 mM NaCl, 5% Glycerol , 2 mM DTT, and 0.025% Azide
EXPRESSION HOST: data unavailable
VIAL COUNT (approx.): 6
VIAL VOLUME: 100µl
PERCENT IDENTITY: 100
PERCENT COVERAGE: 92
Protocol Notes
notes unavailable
Validated AA Sequence
MDVRSAFDLS GRTALVTGGN QGLGKAFAIA LAQAGARVSF SGRNAERNEK TAAEAAAAGH QLHAITADIT RAEDVERMTA EAIEALGHID ILVNNAGTCH HGESWTVTEE QWDDVFDLNV KALWACSLAV GAHMRERGSG SVVNIGSMSG IIVNRPQMQP AYNASKAAVH HLTKSLAAEW APLGIRVNAL APGYVKTDMA PVDRPEFKRY WIDDTPQLRY AVPEEIAPSV VFLASDAASF ITGSVLVADG GYTAW
Validated NT Sequence
tncnttcggg ctttgttagc agccggatcc tcgagaagct tggctgcaga acttgttcgt gctgtttatt accatgcggt gtatccgccg tccgcgacga gcaccgagcc ggtgatgaag gaggccgcgt cgctggccag gaacaccacg ctgggcgcga tctcctcggg caccgcgtag cgcagctgcg gggtgtcgtc gatccagtac cgcttgaact ccggccggtc aaccggggcc atgtcggtct tcacgtatcc gggagccagc gcgttgaccc ggattcccaa cggggcccac tcggcggcaa gggatttcgt gaggtggtgc accgcggcct tggaggcgtt gtacgcgggc tgcatctggg ggcggttgac gatgatgccc gacatcgagc cgatgttgac caccgaaccg ctgccgcgct cgcgcatgtg cgcaccgacg gcgagcgaac acgcccacag cgccttgacg ttgaggtcga acacgtcgtc ccactgctct tcggtgaccg tccaggactc accgtggtgg cacgtgcccg cgttgttgac caggatgtcg atgtgaccga gcgcttcgat ggcctcggcc gtcatgcgct cgacgtcctc ggccctggtg atgtcggccg tgatcgcgtg cagttggtgt cctgccgcgg cggcctcggc cgcggtcttc tcgttgcgtt cggcgttgcg gcccgagaag gacacacggg caccggcctg tgcgagtgcg atcgcgaaag ccttgcccag gccctggttt ccgccggtca ccagcgcagt gcgcccgctc agatcgaatg ccgaacgcac atccatcgaa ccaggaccct gggtctgagc ttccnngnac ccatatgatg gtgatggtga tgagccatgg natatctcct tcttaaagtt aaacnnnntt ntagaggnan cntnngncnn nnnnnnnnnn nnnnnnnnnn nnna
Expressed Protein Sequence
MAHHHHHHMG TLEAQTQGPG SMDVRSAFDL SGRTALVTGG NQGLGKAFAI ALAQAGARVS FSGRNAERNE KTAAEAAAAG HQLHAITADI TRAEDVERMT AEAIEALGHI DILVNNAGTC HHGESWTVTE EQWDDVFDLN VKALWACSLA VGAHMRERGS GSVVNIGSMS GIIVNRPQMQ PAYNASKAAV HHLTKSLAAE WAPLGIRVNA LAPGYVKTDM APVDRPEFKR YWIDDTPQLR YAVPEEIAPS VVFLASDAAS FITGSVLVAD GGYTAW
Full NT Sequence (Expression Vector + Insert)
TTCTTGAAGA CGAAAGGGCC TCGTGATACG CCTATTTTTA TAGGTTAATG TCATGATAAT AATGGTTTCT TAGACGTCAG GTGGCACTTT TCGGGGAAAT GTGCGCGGAA CCCCTATTTG TTTATTTTTC TAAATACATT CAAATATGTA TCCGCTCATG AGACAATAAC CCTGATAAAT GCTTCAATAA TATTGAAAAA GGAAGAGTAT GAGTATTCAA CATTTCCGTG TCGCCCTTAT TCCCTTTTTT GCGGCATTTT GCCTTCCTGT TTTTGCTCAC CCAGAAACGC TGGTGAAAGT AAAAGATGCT GAAGATCAGT TGGGTGCACG AGTGGGTTAC ATCGAACTGG ATCTCAACAG CGGTAAGATC CTTGAGAGTT TTCGCCCCGA AGAACGTTTT CCAATGATGA GCACTTTTAA AGTTCTGCTA TGTGGCGCGG TATTATCCCG TGTTGACGCC GGGCAAGAGC AACTCGGTCG CCGCATACAC TATTCTCAGA ATGACTTGGT TGAGTACTCA CCAGTCACAG AAAAGCATCT TACGGATGGC ATGACAGTAA GAGAATTATG CAGTGCTGCC ATAACCATGA GTGATAACAC TGCGGCCAAC TTACTTCTGA CAACGATCGG AGGACCGAAG GAGCTAACCG CTTTTTTGCA CAACATGGGG GATCATGTAA CTCGCCTTGA TCGTTGGGAA CCGGAGCTGA ATGAAGCCAT ACCAAACGAC GAGCGTGACA CCACGATGCC TGCAGCAATG GCAACAACGT TGCGCAAACT ATTAACTGGC GAACTACTTA CTCTAGCTTC CCGGCAACAA TTAATAGACT GGATGGAGGC GGATAAAGTT GCAGGACCAC TTCTGCGCTC GGCCCTTCCG GCTGGCTGGT TTATTGCTGA TAAATCTGGA GCCGGTGAGC GTGGGTCTCG CGGTATCATT GCAGCACTGG GGCCAGATGG TAAGCCCTCC CGTATCGTAG TTATCTACAC GACGGGGAGT CAGGCAACTA TGGATGAACG AAATAGACAG ATCGCTGAGA TAGGTGCCTC ACTGATTAAG CATTGGTAAC TGTCAGACCA AGTTTACTCA TATATACTTT AGATTGATTT AAAACTTCAT TTTTAATTTA AAAGGATCTA GGTGAAGATC CTTTTTGATA ATCTCATGAC CAAAATCCCT TAACGTGAGT TTTCGTTCCA CTGAGCGTCA GACCCCGTAG AAAAGATCAA AGGATCTTCT TGAGATCCTT TTTTTCTGCG CGTAATCTGC TGCTTGCAAA CAAAAAAACC ACCGCTACCA GCGGTGGTTT GTTTGCCGGA TCAAGAGCTA CCAACTCTTT TTCCGAAGGT AACTGGCTTC AGCAGAGCGC AGATACCAAA TACTGTCCTT CTAGTGTAGC CGTAGTTAGG CCACCACTTC AAGAACTCTG TAGCACCGCC TACATACCTC GCTCTGCTAA TCCTGTTACC AGTGGCTGCT GCCAGTGGCG ATAAGTCGTG TCTTACCGGG TTGGACTCAA GACGATAGTT ACCGGATAAG GCGCAGCGGT CGGGCTGAAC GGGGGGTTCG TGCACACAGC CCAGCTTGGA GCGAACGACC TACACCGAAC TGAGATACCT ACAGCGTGAG CTATGAGAAA GCGCCACGCT TCCCGAAGGG AGAAAGGCGG ACAGGTATCC GGTAAGCGGC AGGGTCGGAA CAGGAGAGCG CACGAGGGAG CTTCCAGGGG GAAACGCCTG GTATCTTTAT AGTCCTGTCG GGTTTCGCCA CCTCTGACTT GAGCGTCGAT TTTTGTGATG CTCGTCAGGG GGGCGGAGCC TATGGAAAAA CGCCAGCAAC GCGGCCTTTT TACGGTTCCT GGCCTTTTGC TGGCCTTTTG CTCACATGTT CTTTCCTGCG TTATCCCCTG ATTCTGTGGA TAACCGTATT ACCGCCTTTG AGTGAGCTGA TACCGCTCGC CGCAGCCGAA CGACCGAGCG CAGCGAGTCA GTGAGCGAGG AAGCGGAAGA GCGCCTGATG CGGTATTTTC TCCTTACGCA TCTGTGCGGT ATTTCACACC GCATATATGG TGCACTCTCA GTACAATCTG CTCTGATGCC GCATAGTTAA GCCAGTATAC ACTCCGCTAT CGCTACGTGA CTGGGTCATG GCTGCGCCCC GACACCCGCC AACACCCGCT GACGCGCCCT GACGGGCTTG TCTGCTCCCG GCATCCGCTT ACAGACAAGC TGTGACCGTC TCCGGGAGCT GCATGTGTCA GAGGTTTTCA CCGTCATCAC CGAAACGCGC GAGGCAGCTG CGGTAAAGCT CATCAGCGTG GTCGTGAAGC GATTCACAGA TGTCTGCCTG TTCATCCGCG TCCAGCTCGT TGAGTTTCTC CAGAAGCGTT AATGTCTGGC TTCTGATAAA GCGGGCCATG TTAAGGGCGG TTTTTTCCTG TTTGGTCACT GATGCCTCCG TGTAAGGGGG ATTTCTGTTC ATGGGGGTAA TGATACCGAT GAAACGAGAG AGGATGCTCA CGATACGGGT TACTGATGAT GAACATGCCC GGTTACTGGA ACGTTGTGAG GGTAAACAAC TGGCGGTATG GATGCGGCGG GACCAGAGAA AAATCACTCA GGGTCAATGC CAGCGCTTCG TTAATACAGA TGTAGGTGTT CCACAGGGTA GCCAGCAGCA TCCTGCGATG CAGATCCGGA ACATAATGGT GCAGGGCGCT GACTTCCGCG TTTCCAGACT TTACGAAACA CGGAAACCGA AGACCATTCA TGTTGTTGCT CAGGTCGCAG ACGTTTTGCA GCAGCAGTCG CTTCACGTTC GCTCGCGTAT CGGTGATTCA TTCTGCTAAC CAGTAAGGCA ACCCCGCCAG CCTAGCCGGG TCCTCAACGA CAGGAGCACG ATCATGCGCA CCCGTGGCCA GGACCCAACG CTGCCCGAGA TGCGCCGCGT GCGGCTGCTG GAGATGGCGG ACGCGATGGA TATGTTCTGC CAAGGGTTGG TTTGCGCATT CACAGTTCTC CGCAAGAATT GATTGGCTCC AATTCTTGGA GTGGTGAATC CGTTAGCGAG GTGCCGCCGG CTTCCATTCA GGTCGAGGTG GCCCGGCTCC ATGCACCGCG ACGCAACGCG GGGAGGCAGA CAAGGTATAG GGCGGCGCCT ACAATCCATG CCAACCCGTT CCATGTGCTC GCCGAGGCGG CATAAATCGC CGTGACGATC AGCGGTCCAG TGATCGAAGT TAGGCTGGTA AGAGCCGCGA GCGATCCTTG AAGCTGTCCC TGATGGTCGT CATCTACCTG CCTGGACAGC ATGGCCTGCA ACGCGGGCAT CCCGATGCCG CCGGAAGCGA GAAGAATCAT AATGGGGAAG GCCATCCAGC CTCGCGTCGT GAACGCCAGC AAGACGTAGC CCAGCGCGTC GGCCGTAACA ACACCATTTA AATGGAGTGG TTACAAATGG AGTGGTTAAT TAACAACACC ATTTGTCGAC GCTCTCCCTT ATGCGACTCC TGCATTAGGA AGCAGCCCAG TAGTAGGTTG AGGCCGTTGA GCACCGCCGC CGCAAGGAAT GGTGCATGCA AGGAGATGGC GCCCAACAGT CCCCCGGCCA CGGGGCCTGC CACCATACCC ACGCCGAAAC AAGCGCTCAT GAGCCCGAAG TGGCGAGCCC GATCTTCCCC ATCGGTGATG TCGGCGATAT AGGCGCCAGC AACCGCACCT GTGGCGCCGG TGATGCCGGC CACGATGCGT CCGGCGTAGA GGATCGAGAT CTCGATCCCG CGAAATTAAT ACGACTCACT ATAGGGAGAC CACAACGGTT TCCCTCTAGA AATAATTTTG TTTAACTTTA AGAAGGAGAT ATACCATGGC TCATCACCAT CACCATCATA TGGGTACCCT GGAAGCTCAG ACCCAGGGTC CTGGTTCGAT GGATGTGCGT TCGGCATTCG ATCTGAGCGG GCGCACTGCG CTGGTGACCG GCGGAAACCA GGGCCTGGGC AAGGCTTTCG CGATCGCACT CGCACAGGCC GGTGCCCGTG TGTCCTTCTC GGGCCGCAAC GCCGAACGCA ACGAGAAGAC CGCGGCCGAG GCCGCCGCGG CAGGACACCA ACTGCACGCG ATCACGGCCG ACATCACCAG GGCCGAGGAC GTCGAGCGCA TGACGGCCGA GGCCATCGAA GCGCTCGGTC ACATCGACAT CCTGGTCAAC AACGCGGGCA CGTGCCACCA CGGTGAGTCC TGGACGGTCA CCGAAGAGCA GTGGGACGAC GTGTTCGACC TCAACGTCAA GGCGCTGTGG GCGTGTTCGC TCGCCGTCGG TGCGCACATG CGCGAGCGCG GCAGCGGTTC GGTGGTCAAC ATCGGCTCGA TGTCGGGCAT CATCGTCAAC CGCCCCCAGA TGCAGCCCGC GTACAACGCC TCCAAGGCCG CGGTGCACCA CCTCACGAAA TCCCTTGCCG CCGAGTGGGC CCCGTTGGGA ATCCGGGTCA ACGCGCTGGC TCCCGGATAC GTGAAGACCG ACATGGCCCC GGTTGACCGG CCGGAGTTCA AGCGGTACTG GATCGACGAC ACCCCGCAGC TGCGCTACGC GGTGCCCGAG GAGATCGCGC CCAGCGTGGT GTTCCTGGCC AGCGACGCGG CCTCCTTCAT CACCGGCTCG GTGCTCGTCG CGGACGGCGG ATACACCGCA TGGAAACAGC ACGAACAAGT TCTGCAGCCA AGCTTCTCGA GGATCCGGCT GCTAACAAAG CCCGAAAGGA AGCTGAGTTG GCTGCTGCCA CCGCTGAGCA ATAACTAGCA TAACCCCTTG GGGCCTCTAA ACGGGTCTTG AGGGGTTTTT TGCTGAAAGG AGGAACTATA TCCGGATATC CACAGGACGG GTGTGGTCGC CATGATCGCG TAGTCGATAG TGGCTCCAAG TAGCGAAGCG AGCAGGACTG GGCGGCGGCC AAAGCGGTCG GACAGTGCTC CGAGAACGGG TGCGCATAGA AATTGCATCA ACGCATATAG CGCTAGCAGC ACGCCATAGT GACTGGCGAT GCTGTCGGAA TGGACGATAT CCCGCAAGAG GCCCGGCAGT ACCGGCATAA CCAAGCCTAT GCCTACAGCA TCCAGGGTGA CGGTGCCGAG GATGACGATG AGCGCATTGT TAGATTTCAT ACACGGTGCC TGACTGCGTT AGCAATTTAA CTGTGATAAA CTACCGCATT AAAGCTTATC GATGATAAGC TGTCAAACAT GAGAA