MysmA.01156.b

O-acetylhomoserine sulfhydrylase

CENTER ID: MysmA.01156.b
ORGANISM: Mycobacterium smegmatis ATCC 700084 / mc(2)155
ASSOCIATED DISEASE:
CURRENT STATUS: crystallized
COMMUNITY REQUEST: False
NIH RISK GROUP: 2
SELECT AGENT: False
NIH PRIORITY
pathogens category:
IIIC

Ordering Clones & Proteins

If there are materials available for this target, they will be listed below. Materials can be ordered from SSGCID using the button in the "order material" column. Clicking the button will add the material to a virtual cart. You may order multiple materials at a time at no cost to you, as this contract is funded by NIAID. When you are ready to place your order, click the "Place Order" link which will appear in the top right corner of the page after you place your first item in your cart.

Proteins

CENTER
REFERENCE ID
DOMAIN/REGION
DESCRIPTION
INFO AA
START
AA
STOP
ORDER
MATERIAL
MysmA.01156.b.A1.PS00667 Full length( MysmA.01156.b ) 1 443
MysmA.01156.b.A1.PS00668 Full length( MysmA.01156.b ) 1 443

External Resources

RESOURCE REFERENCE ID
BV-BRC: fig|246196.19.peg.1637
RefSeq: YP_886029.1
UniProt: A0QSZ1

Sequences

These sequences are the native gene sequence; sequences of constructs derived from these sequences may differ due to codon optimization or other protocols. To find the specific sequence of any material you may have ordered, click on the "info" button next to the name of that material.
AA Sequence
MTTPDPTENW SFETKQIHAG QSPDSATHAR ALPIYQTTSY TFDDTSHAAA LFGLEVPGNI YTRIGNPTTD VVEQRIAALE GGVAALFLSS GQAAETFAIL NIAKAGDHIV SSPRLYGGTY NLLHYTLPKL GIETTFVENP DDLESWRAAV RPNTKAFFAE TISNPQIDIL DIPNVAAIAH EAGVPLIVDN TIATPYLIQP IAHGADIVVH SATKYLGGHG SAIAGVIVDG GTFDWTNGKF PGFTEPDPSY HGVVFAELGA PAYALKARVQ LLRDLGSAAA PFNAFLIAQG LETLSLRVER HVANAQKVAE FLENHPDVSS VNYAGLPSSP WYELGRKLAP KGTGAVLAFE LSGGLEAGKA FVNALTLHSH VANIGDVRSL VIHPASTTHQ QLSPEEQLST GVTPGLVRLA VGLEGIDDII ADLEQGFAAA RPFSGAAQTA QTV
NT Sequence
ATGACCACCC CCGATCCCAC CGAGAACTGG TCGTTCGAGA CCAAGCAGAT CCACGCGGGT CAGTCGCCCG ACAGCGCGAC CCACGCCCGT GCGCTGCCGA TCTACCAGAC CACGTCGTAC ACCTTCGACG ACACCAGCCA TGCCGCGGCC CTGTTCGGCC TTGAGGTTCC CGGCAACATC TACACCCGCA TCGGCAACCC CACCACCGAC GTCGTCGAGC AGCGCATCGC CGCACTCGAG GGTGGCGTCG CGGCGCTGTT CCTGTCCTCA GGGCAGGCCG CCGAGACGTT CGCGATCCTC AACATCGCCA AGGCCGGCGA CCACATCGTG TCCAGCCCGC GCCTGTACGG CGGCACCTAC AACCTGCTGC ACTACACGCT GCCCAAGCTG GGCATCGAGA CCACGTTCGT CGAGAACCCC GACGATCTGG AGTCGTGGCG CGCGGCGGTA CGCCCGAACA CCAAGGCGTT CTTCGCCGAG ACGATCTCCA ACCCCCAGAT CGACATCCTC GACATCCCGA ACGTCGCCGC GATCGCGCAC GAGGCGGGCG TCCCGTTGAT CGTCGACAAC ACGATCGCCA CGCCGTACCT GATCCAGCCG ATCGCCCACG GCGCCGACAT CGTCGTGCAC TCGGCCACCA AGTACCTGGG CGGGCACGGA TCGGCGATCG CGGGCGTCAT CGTCGACGGC GGCACGTTCG ACTGGACCAA CGGCAAGTTC CCCGGCTTCA CCGAACCGGA TCCCAGCTAC CACGGTGTGG TGTTCGCCGA GCTCGGTGCG CCGGCCTACG CCCTGAAGGC ACGCGTGCAA CTGCTGCGTG ACCTGGGCTC GGCGGCCGCC CCGTTCAACG CGTTCCTGAT CGCGCAGGGT CTGGAGACCC TGTCGCTGCG CGTCGAGCGC CATGTCGCCA ACGCGCAGAA GGTCGCCGAG TTCCTGGAGA ACCACCCCGA CGTGTCGTCG GTGAACTACG CGGGCCTGCC GTCCTCGCCG TGGTACGAGC TGGGCCGCAA ACTCGCCCCC AAGGGCACCG GCGCGGTGCT CGCGTTCGAG CTGTCGGGCG GCCTGGAGGC CGGTAAGGCC TTCGTGAACG CGCTGACGCT GCACAGTCAC GTCGCCAACA TCGGCGACGT GCGGTCGCTG GTGATCCACC CGGCGTCGAC GACGCACCAG CAGCTGAGCC CCGAGGAGCA GCTGTCGACG GGTGTCACGC CGGGCCTGGT GCGCCTGGCG GTCGGCCTCG AAGGCATCGA CGACATCATC GCCGACCTGG AGCAGGGGTT CGCCGCCGCG CGCCCGTTCA GCGGCGCGGC CCAGACGGCC CAGACGGTG
Details for MysmA.01156.b.A1.PS00667
PURIFICATION DATe: 6/29/2010
CONCENTRATION: 66.6mg/ml
OBSERVED MW: data unavailable
EXPRESSION LEVEL: Moderate Expression
PROTEIN PURIFICATION BUFFER: 20 mM HEPES, pH 7.0, 300 mM NaCl, 5% glycerol and 1 mM TCEP
EXPRESSION HOST: data unavailable
VIAL COUNT (approx.): 7
VIAL VOLUME: 200µl
PERCENT IDENTITY: 87
PERCENT COVERAGE: 80
Protocol Notes
notes unavailable
Validated AA Sequence
MAHHHHHHMG TLEAQTQGPG SMTTPDPTEN WSFETKQIHA GQSPDSATHA RALPIYQTTS YTFDDTSHAA ALFGLEVPGN IYTRIGNPTT DVVEQRIAAL EGGVAALFLS SGQAAETFAI LNIAKAGDHI VSSPRLYGGT YNLLHYTLPK LGIETTFVEN PDDLESWRAA VRPNTKAFFA ETISNPQIDI LDIPNVAAIA HEAGVPLIVD NTIATPYLIQ PIAHGADIVV HSATKYLGGH GSAIAGVIVD GGTFDWTNGK FPGFTEPDPS YHGVVFAELG APAYALKXRV QLLRDLGSAA APFNAFLIAQ GLETXSLXVE RHVXNAXXSX XXXXXPXXRX XYXXXXXXXL XXXXPXGXXX XXXXXXXXXX XXXX
Validated NT Sequence
attttgttta ctttangaag gagatatacc atggctcatc accatcacca tcatatgggt accctggaag ctcagaccca gggtcctggt tcgatgacca cccccgatcc caccgagaac tggtcgttcg agaccaagca gatccacgcg ggtcagtcgc ccgacagcgc gacccacgcc cgtgcgctgc cgatctacca gaccacgtcg tacaccttcg acgacaccag ccatgccgcg gccctgttcg gccttgaggt tcccggcaac atctacaccc gcatcggcaa ccccaccacc gacgtcgtcg agcagcgcat cgccgcactc gagggtggcg tcgcggcgct gttcctgtcc tcagggcagg ccgccgagac gttcgcgatc ctcaacatcg ccaaggccgg cgaccacatc gtgtccagcc cgcgcctgta cggcggcacc tacaacctgc tgcactacac gctgcccaag ctgggcatcg agaccacgtt cgtcgagaac cccgacgatc tggagtcgtg gcgcgcggcg gtacgcccga acaccaaggc gttcttcgcc gagacgatct ccaaccccca gatcgacatc ctcgacatcc cgaacgtcgc cgcgatcgcg cacgaggcgg gcgtcccgtt gatcgtcgac aacacgatcg ccacgccgta cctgatccag ccgatcgccc acggcgccga catcgtcgtg cactcggcca ccaagtacct gggcgggcac ggatcggcga tcgcgggcgt catcgtcgac ggcggcacgt tcgactggac caacggcaag ttccccggct tcaccgaacc ggatcccagc taccacggtg tggtgttcgc cgagctcggt gcgccggcct acgccctgaa ggnacgcgtg caactgctgc gtgacctggg ctcggcggcc gccccgttca acgcgttcct gatcgcgcag ggtctggaga ccntgtcgct gcncgtcgag cgccatgtcn ccaacgcgca nnnntcgnnn annnnctgnn nnnccccnnn nnntcgnnnn nantacncng nnngncnncn tcncnanctg gnnncaantc nncccnnngg nncnnnnnnn nnntnncntn nnnnnnncan nnnnnnnnnn nnnnnnnnnn nnc
Expressed Protein Sequence
MAHHHHHHMG TLEAQTQGPG SMTTPDPTEN WSFETKQIHA GQSPDSATHA RALPIYQTTS YTFDDTSHAA ALFGLEVPGN IYTRIGNPTT DVVEQRIAAL EGGVAALFLS SGQAAETFAI LNIAKAGDHI VSSPRLYGGT YNLLHYTLPK LGIETTFVEN PDDLESWRAA VRPNTKAFFA ETISNPQIDI LDIPNVAAIA HEAGVPLIVD NTIATPYLIQ PIAHGADIVV HSATKYLGGH GSAIAGVIVD GGTFDWTNGK FPGFTEPDPS YHGVVFAELG APAYALKARV QLLRDLGSAA APFNAFLIAQ GLETLSLRVE RHVANAQKVA EFLENHPDVS SVNYAGLPSS PWYELGRKLA PKGTGAVLAF ELSGGLEAGK AFVNALTLHS HVANIGDVRS LVIHPASTTH QQLSPEEQLS TGVTPGLVRL AVGLEGIDDI IADLEQGFAA ARPFSGAAQT AQTV
Full NT Sequence (Expression Vector + Insert)
TTCTTGAAGA CGAAAGGGCC TCGTGATACG CCTATTTTTA TAGGTTAATG TCATGATAAT AATGGTTTCT TAGACGTCAG GTGGCACTTT TCGGGGAAAT GTGCGCGGAA CCCCTATTTG TTTATTTTTC TAAATACATT CAAATATGTA TCCGCTCATG AGACAATAAC CCTGATAAAT GCTTCAATAA TATTGAAAAA GGAAGAGTAT GAGTATTCAA CATTTCCGTG TCGCCCTTAT TCCCTTTTTT GCGGCATTTT GCCTTCCTGT TTTTGCTCAC CCAGAAACGC TGGTGAAAGT AAAAGATGCT GAAGATCAGT TGGGTGCACG AGTGGGTTAC ATCGAACTGG ATCTCAACAG CGGTAAGATC CTTGAGAGTT TTCGCCCCGA AGAACGTTTT CCAATGATGA GCACTTTTAA AGTTCTGCTA TGTGGCGCGG TATTATCCCG TGTTGACGCC GGGCAAGAGC AACTCGGTCG CCGCATACAC TATTCTCAGA ATGACTTGGT TGAGTACTCA CCAGTCACAG AAAAGCATCT TACGGATGGC ATGACAGTAA GAGAATTATG CAGTGCTGCC ATAACCATGA GTGATAACAC TGCGGCCAAC TTACTTCTGA CAACGATCGG AGGACCGAAG GAGCTAACCG CTTTTTTGCA CAACATGGGG GATCATGTAA CTCGCCTTGA TCGTTGGGAA CCGGAGCTGA ATGAAGCCAT ACCAAACGAC GAGCGTGACA CCACGATGCC TGCAGCAATG GCAACAACGT TGCGCAAACT ATTAACTGGC GAACTACTTA CTCTAGCTTC CCGGCAACAA TTAATAGACT GGATGGAGGC GGATAAAGTT GCAGGACCAC TTCTGCGCTC GGCCCTTCCG GCTGGCTGGT TTATTGCTGA TAAATCTGGA GCCGGTGAGC GTGGGTCTCG CGGTATCATT GCAGCACTGG GGCCAGATGG TAAGCCCTCC CGTATCGTAG TTATCTACAC GACGGGGAGT CAGGCAACTA TGGATGAACG AAATAGACAG ATCGCTGAGA TAGGTGCCTC ACTGATTAAG CATTGGTAAC TGTCAGACCA AGTTTACTCA TATATACTTT AGATTGATTT AAAACTTCAT TTTTAATTTA AAAGGATCTA GGTGAAGATC CTTTTTGATA ATCTCATGAC CAAAATCCCT TAACGTGAGT TTTCGTTCCA CTGAGCGTCA GACCCCGTAG AAAAGATCAA AGGATCTTCT TGAGATCCTT TTTTTCTGCG CGTAATCTGC TGCTTGCAAA CAAAAAAACC ACCGCTACCA GCGGTGGTTT GTTTGCCGGA TCAAGAGCTA CCAACTCTTT TTCCGAAGGT AACTGGCTTC AGCAGAGCGC AGATACCAAA TACTGTCCTT CTAGTGTAGC CGTAGTTAGG CCACCACTTC AAGAACTCTG TAGCACCGCC TACATACCTC GCTCTGCTAA TCCTGTTACC AGTGGCTGCT GCCAGTGGCG ATAAGTCGTG TCTTACCGGG TTGGACTCAA GACGATAGTT ACCGGATAAG GCGCAGCGGT CGGGCTGAAC GGGGGGTTCG TGCACACAGC CCAGCTTGGA GCGAACGACC TACACCGAAC TGAGATACCT ACAGCGTGAG CTATGAGAAA GCGCCACGCT TCCCGAAGGG AGAAAGGCGG ACAGGTATCC GGTAAGCGGC AGGGTCGGAA CAGGAGAGCG CACGAGGGAG CTTCCAGGGG GAAACGCCTG GTATCTTTAT AGTCCTGTCG GGTTTCGCCA CCTCTGACTT GAGCGTCGAT TTTTGTGATG CTCGTCAGGG GGGCGGAGCC TATGGAAAAA CGCCAGCAAC GCGGCCTTTT TACGGTTCCT GGCCTTTTGC TGGCCTTTTG CTCACATGTT CTTTCCTGCG TTATCCCCTG ATTCTGTGGA TAACCGTATT ACCGCCTTTG AGTGAGCTGA TACCGCTCGC CGCAGCCGAA CGACCGAGCG CAGCGAGTCA GTGAGCGAGG AAGCGGAAGA GCGCCTGATG CGGTATTTTC TCCTTACGCA TCTGTGCGGT ATTTCACACC GCATATATGG TGCACTCTCA GTACAATCTG CTCTGATGCC GCATAGTTAA GCCAGTATAC ACTCCGCTAT CGCTACGTGA CTGGGTCATG GCTGCGCCCC GACACCCGCC AACACCCGCT GACGCGCCCT GACGGGCTTG TCTGCTCCCG GCATCCGCTT ACAGACAAGC TGTGACCGTC TCCGGGAGCT GCATGTGTCA GAGGTTTTCA CCGTCATCAC CGAAACGCGC GAGGCAGCTG CGGTAAAGCT CATCAGCGTG GTCGTGAAGC GATTCACAGA TGTCTGCCTG TTCATCCGCG TCCAGCTCGT TGAGTTTCTC CAGAAGCGTT AATGTCTGGC TTCTGATAAA GCGGGCCATG TTAAGGGCGG TTTTTTCCTG TTTGGTCACT GATGCCTCCG TGTAAGGGGG ATTTCTGTTC ATGGGGGTAA TGATACCGAT GAAACGAGAG AGGATGCTCA CGATACGGGT TACTGATGAT GAACATGCCC GGTTACTGGA ACGTTGTGAG GGTAAACAAC TGGCGGTATG GATGCGGCGG GACCAGAGAA AAATCACTCA GGGTCAATGC CAGCGCTTCG TTAATACAGA TGTAGGTGTT CCACAGGGTA GCCAGCAGCA TCCTGCGATG CAGATCCGGA ACATAATGGT GCAGGGCGCT GACTTCCGCG TTTCCAGACT TTACGAAACA CGGAAACCGA AGACCATTCA TGTTGTTGCT CAGGTCGCAG ACGTTTTGCA GCAGCAGTCG CTTCACGTTC GCTCGCGTAT CGGTGATTCA TTCTGCTAAC CAGTAAGGCA ACCCCGCCAG CCTAGCCGGG TCCTCAACGA CAGGAGCACG ATCATGCGCA CCCGTGGCCA GGACCCAACG CTGCCCGAGA TGCGCCGCGT GCGGCTGCTG GAGATGGCGG ACGCGATGGA TATGTTCTGC CAAGGGTTGG TTTGCGCATT CACAGTTCTC CGCAAGAATT GATTGGCTCC AATTCTTGGA GTGGTGAATC CGTTAGCGAG GTGCCGCCGG CTTCCATTCA GGTCGAGGTG GCCCGGCTCC ATGCACCGCG ACGCAACGCG GGGAGGCAGA CAAGGTATAG GGCGGCGCCT ACAATCCATG CCAACCCGTT CCATGTGCTC GCCGAGGCGG CATAAATCGC CGTGACGATC AGCGGTCCAG TGATCGAAGT TAGGCTGGTA AGAGCCGCGA GCGATCCTTG AAGCTGTCCC TGATGGTCGT CATCTACCTG CCTGGACAGC ATGGCCTGCA ACGCGGGCAT CCCGATGCCG CCGGAAGCGA GAAGAATCAT AATGGGGAAG GCCATCCAGC CTCGCGTCGT GAACGCCAGC AAGACGTAGC CCAGCGCGTC GGCCGTAACA ACACCATTTA AATGGAGTGG TTACAAATGG AGTGGTTAAT TAACAACACC ATTTGTCGAC GCTCTCCCTT ATGCGACTCC TGCATTAGGA AGCAGCCCAG TAGTAGGTTG AGGCCGTTGA GCACCGCCGC CGCAAGGAAT GGTGCATGCA AGGAGATGGC GCCCAACAGT CCCCCGGCCA CGGGGCCTGC CACCATACCC ACGCCGAAAC AAGCGCTCAT GAGCCCGAAG TGGCGAGCCC GATCTTCCCC ATCGGTGATG TCGGCGATAT AGGCGCCAGC AACCGCACCT GTGGCGCCGG TGATGCCGGC CACGATGCGT CCGGCGTAGA GGATCGAGAT CTCGATCCCG CGAAATTAAT ACGACTCACT ATAGGGAGAC CACAACGGTT TCCCTCTAGA AATAATTTTG TTTAACTTTA AGAAGGAGAT ATACCATGGC TCATCACCAT CACCATCATA TGGGTACCCT GGAAGCTCAG ACCCAGGGTC CTGGTTCGAT GACCACCCCC GATCCCACCG AGAACTGGTC GTTCGAGACC AAGCAGATCC ACGCGGGTCA GTCGCCCGAC AGCGCGACCC ACGCCCGTGC GCTGCCGATC TACCAGACCA CGTCGTACAC CTTCGACGAC ACCAGCCATG CCGCGGCCCT GTTCGGCCTT GAGGTTCCCG GCAACATCTA CACCCGCATC GGCAACCCCA CCACCGACGT CGTCGAGCAG CGCATCGCCG CACTCGAGGG TGGCGTCGCG GCGCTGTTCC TGTCCTCAGG GCAGGCCGCC GAGACGTTCG CGATCCTCAA CATCGCCAAG GCCGGCGACC ACATCGTGTC CAGCCCGCGC CTGTACGGCG GCACCTACAA CCTGCTGCAC TACACGCTGC CCAAGCTGGG CATCGAGACC ACGTTCGTCG AGAACCCCGA CGATCTGGAG TCGTGGCGCG CGGCGGTACG CCCGAACACC AAGGCGTTCT TCGCCGAGAC GATCTCCAAC CCCCAGATCG ACATCCTCGA CATCCCGAAC GTCGCCGCGA TCGCGCACGA GGCGGGCGTC CCGTTGATCG TCGACAACAC GATCGCCACG CCGTACCTGA TCCAGCCGAT CGCCCACGGC GCCGACATCG TCGTGCACTC GGCCACCAAG TACCTGGGCG GGCACGGATC GGCGATCGCG GGCGTCATCG TCGACGGCGG CACGTTCGAC TGGACCAACG GCAAGTTCCC CGGCTTCACC GAACCGGATC CCAGCTACCA CGGTGTGGTG TTCGCCGAGC TCGGTGCGCC GGCCTACGCC CTGAAGGCAC GCGTGCAACT GCTGCGTGAC CTGGGCTCGG CGGCCGCCCC GTTCAACGCG TTCCTGATCG CGCAGGGTCT GGAGACCCTG TCGCTGCGCG TCGAGCGCCA TGTCGCCAAC GCGCAGAAGG TCGCCGAGTT CCTGGAGAAC CACCCCGACG TGTCGTCGGT GAACTACGCG GGCCTGCCGT CCTCGCCGTG GTACGAGCTG GGCCGCAAAC TCGCCCCCAA GGGCACCGGC GCGGTGCTCG CGTTCGAGCT GTCGGGCGGC CTGGAGGCCG GTAAGGCCTT CGTGAACGCG CTGACGCTGC ACAGTCACGT CGCCAACATC GGCGACGTGC GGTCGCTGGT GATCCACCCG GCGTCGACGA CGCACCAGCA GCTGAGCCCC GAGGAGCAGC TGTCGACGGG TGTCACGCCG GGCCTGGTGC GCCTGGCGGT CGGCCTCGAA GGCATCGACG ACATCATCGC CGACCTGGAG CAGGGGTTCG CCGCCGCGCG CCCGTTCAGC GGCGCGGCCC AGACGGCCCA GACGGTGAAA CAGCACGAAC AAGTTCTGCA GCCAAGCTTC TCGAGGATCC GGCTGCTAAC AAAGCCCGAA AGGAAGCTGA GTTGGCTGCT GCCACCGCTG AGCAATAACT AGCATAACCC CTTGGGGCCT CTAAACGGGT CTTGAGGGGT TTTTTGCTGA AAGGAGGAAC TATATCCGGA TATCCACAGG ACGGGTGTGG TCGCCATGAT CGCGTAGTCG ATAGTGGCTC CAAGTAGCGA AGCGAGCAGG ACTGGGCGGC GGCCAAAGCG GTCGGACAGT GCTCCGAGAA CGGGTGCGCA TAGAAATTGC ATCAACGCAT ATAGCGCTAG CAGCACGCCA TAGTGACTGG CGATGCTGTC GGAATGGACG ATATCCCGCA AGAGGCCCGG CAGTACCGGC ATAACCAAGC CTATGCCTAC AGCATCCAGG GTGACGGTGC CGAGGATGAC GATGAGCGCA TTGTTAGATT TCATACACGG TGCCTGACTG CGTTAGCAAT TTAACTGTGA TAAACTACCG CATTAAAGCT TATCGATGAT AAGCTGTCAA ACATGAGAA
Details for MysmA.01156.b.A1.PS00668
PURIFICATION DATe: 6/29/2010
CONCENTRATION: 45.7mg/ml
OBSERVED MW: data unavailable
EXPRESSION LEVEL: Moderate Expression
PROTEIN PURIFICATION BUFFER: 20 mM HEPES, pH 7.0, 300 mM NaCl, 5% glycerol and 1 mM TCEP
EXPRESSION HOST: data unavailable
VIAL COUNT (approx.): 9
VIAL VOLUME: 200µl
PERCENT IDENTITY: 87
PERCENT COVERAGE: 80
Protocol Notes
notes unavailable
Validated AA Sequence
MAHHHHHHMG TLEAQTQGPG SMTTPDPTEN WSFETKQIHA GQSPDSATHA RALPIYQTTS YTFDDTSHAA ALFGLEVPGN IYTRIGNPTT DVVEQRIAAL EGGVAALFLS SGQAAETFAI LNIAKAGDHI VSSPRLYGGT YNLLHYTLPK LGIETTFVEN PDDLESWRAA VRPNTKAFFA ETISNPQIDI LDIPNVAAIA HEAGVPLIVD NTIATPYLIQ PIAHGADIVV HSATKYLGGH GSAIAGVIVD GGTFDWTNGK FPGFTEPDPS YHGVVFAELG APAYALKXRV QLLRDLGSAA APFNAFLIAQ GLETXSLXVE RHVXNAXXSX XXXXXPXXRX XYXXXXXXXL XXXXPXGXXX XXXXXXXXXX XXXX
Validated NT Sequence
attttgttta ctttangaag gagatatacc atggctcatc accatcacca tcatatgggt accctggaag ctcagaccca gggtcctggt tcgatgacca cccccgatcc caccgagaac tggtcgttcg agaccaagca gatccacgcg ggtcagtcgc ccgacagcgc gacccacgcc cgtgcgctgc cgatctacca gaccacgtcg tacaccttcg acgacaccag ccatgccgcg gccctgttcg gccttgaggt tcccggcaac atctacaccc gcatcggcaa ccccaccacc gacgtcgtcg agcagcgcat cgccgcactc gagggtggcg tcgcggcgct gttcctgtcc tcagggcagg ccgccgagac gttcgcgatc ctcaacatcg ccaaggccgg cgaccacatc gtgtccagcc cgcgcctgta cggcggcacc tacaacctgc tgcactacac gctgcccaag ctgggcatcg agaccacgtt cgtcgagaac cccgacgatc tggagtcgtg gcgcgcggcg gtacgcccga acaccaaggc gttcttcgcc gagacgatct ccaaccccca gatcgacatc ctcgacatcc cgaacgtcgc cgcgatcgcg cacgaggcgg gcgtcccgtt gatcgtcgac aacacgatcg ccacgccgta cctgatccag ccgatcgccc acggcgccga catcgtcgtg cactcggcca ccaagtacct gggcgggcac ggatcggcga tcgcgggcgt catcgtcgac ggcggcacgt tcgactggac caacggcaag ttccccggct tcaccgaacc ggatcccagc taccacggtg tggtgttcgc cgagctcggt gcgccggcct acgccctgaa ggnacgcgtg caactgctgc gtgacctggg ctcggcggcc gccccgttca acgcgttcct gatcgcgcag ggtctggaga ccntgtcgct gcncgtcgag cgccatgtcn ccaacgcgca nnnntcgnnn annnnctgnn nnnccccnnn nnntcgnnnn nantacncng nnngncnncn tcncnanctg gnnncaantc nncccnnngg nncnnnnnnn nnntnncntn nnnnnnncan nnnnnnnnnn nnnnnnnnnn nnc
Expressed Protein Sequence
MAHHHHHHMG TLEAQTQGPG SMTTPDPTEN WSFETKQIHA GQSPDSATHA RALPIYQTTS YTFDDTSHAA ALFGLEVPGN IYTRIGNPTT DVVEQRIAAL EGGVAALFLS SGQAAETFAI LNIAKAGDHI VSSPRLYGGT YNLLHYTLPK LGIETTFVEN PDDLESWRAA VRPNTKAFFA ETISNPQIDI LDIPNVAAIA HEAGVPLIVD NTIATPYLIQ PIAHGADIVV HSATKYLGGH GSAIAGVIVD GGTFDWTNGK FPGFTEPDPS YHGVVFAELG APAYALKARV QLLRDLGSAA APFNAFLIAQ GLETLSLRVE RHVANAQKVA EFLENHPDVS SVNYAGLPSS PWYELGRKLA PKGTGAVLAF ELSGGLEAGK AFVNALTLHS HVANIGDVRS LVIHPASTTH QQLSPEEQLS TGVTPGLVRL AVGLEGIDDI IADLEQGFAA ARPFSGAAQT AQTV
Full NT Sequence (Expression Vector + Insert)
TTCTTGAAGA CGAAAGGGCC TCGTGATACG CCTATTTTTA TAGGTTAATG TCATGATAAT AATGGTTTCT TAGACGTCAG GTGGCACTTT TCGGGGAAAT GTGCGCGGAA CCCCTATTTG TTTATTTTTC TAAATACATT CAAATATGTA TCCGCTCATG AGACAATAAC CCTGATAAAT GCTTCAATAA TATTGAAAAA GGAAGAGTAT GAGTATTCAA CATTTCCGTG TCGCCCTTAT TCCCTTTTTT GCGGCATTTT GCCTTCCTGT TTTTGCTCAC CCAGAAACGC TGGTGAAAGT AAAAGATGCT GAAGATCAGT TGGGTGCACG AGTGGGTTAC ATCGAACTGG ATCTCAACAG CGGTAAGATC CTTGAGAGTT TTCGCCCCGA AGAACGTTTT CCAATGATGA GCACTTTTAA AGTTCTGCTA TGTGGCGCGG TATTATCCCG TGTTGACGCC GGGCAAGAGC AACTCGGTCG CCGCATACAC TATTCTCAGA ATGACTTGGT TGAGTACTCA CCAGTCACAG AAAAGCATCT TACGGATGGC ATGACAGTAA GAGAATTATG CAGTGCTGCC ATAACCATGA GTGATAACAC TGCGGCCAAC TTACTTCTGA CAACGATCGG AGGACCGAAG GAGCTAACCG CTTTTTTGCA CAACATGGGG GATCATGTAA CTCGCCTTGA TCGTTGGGAA CCGGAGCTGA ATGAAGCCAT ACCAAACGAC GAGCGTGACA CCACGATGCC TGCAGCAATG GCAACAACGT TGCGCAAACT ATTAACTGGC GAACTACTTA CTCTAGCTTC CCGGCAACAA TTAATAGACT GGATGGAGGC GGATAAAGTT GCAGGACCAC TTCTGCGCTC GGCCCTTCCG GCTGGCTGGT TTATTGCTGA TAAATCTGGA GCCGGTGAGC GTGGGTCTCG CGGTATCATT GCAGCACTGG GGCCAGATGG TAAGCCCTCC CGTATCGTAG TTATCTACAC GACGGGGAGT CAGGCAACTA TGGATGAACG AAATAGACAG ATCGCTGAGA TAGGTGCCTC ACTGATTAAG CATTGGTAAC TGTCAGACCA AGTTTACTCA TATATACTTT AGATTGATTT AAAACTTCAT TTTTAATTTA AAAGGATCTA GGTGAAGATC CTTTTTGATA ATCTCATGAC CAAAATCCCT TAACGTGAGT TTTCGTTCCA CTGAGCGTCA GACCCCGTAG AAAAGATCAA AGGATCTTCT TGAGATCCTT TTTTTCTGCG CGTAATCTGC TGCTTGCAAA CAAAAAAACC ACCGCTACCA GCGGTGGTTT GTTTGCCGGA TCAAGAGCTA CCAACTCTTT TTCCGAAGGT AACTGGCTTC AGCAGAGCGC AGATACCAAA TACTGTCCTT CTAGTGTAGC CGTAGTTAGG CCACCACTTC AAGAACTCTG TAGCACCGCC TACATACCTC GCTCTGCTAA TCCTGTTACC AGTGGCTGCT GCCAGTGGCG ATAAGTCGTG TCTTACCGGG TTGGACTCAA GACGATAGTT ACCGGATAAG GCGCAGCGGT CGGGCTGAAC GGGGGGTTCG TGCACACAGC CCAGCTTGGA GCGAACGACC TACACCGAAC TGAGATACCT ACAGCGTGAG CTATGAGAAA GCGCCACGCT TCCCGAAGGG AGAAAGGCGG ACAGGTATCC GGTAAGCGGC AGGGTCGGAA CAGGAGAGCG CACGAGGGAG CTTCCAGGGG GAAACGCCTG GTATCTTTAT AGTCCTGTCG GGTTTCGCCA CCTCTGACTT GAGCGTCGAT TTTTGTGATG CTCGTCAGGG GGGCGGAGCC TATGGAAAAA CGCCAGCAAC GCGGCCTTTT TACGGTTCCT GGCCTTTTGC TGGCCTTTTG CTCACATGTT CTTTCCTGCG TTATCCCCTG ATTCTGTGGA TAACCGTATT ACCGCCTTTG AGTGAGCTGA TACCGCTCGC CGCAGCCGAA CGACCGAGCG CAGCGAGTCA GTGAGCGAGG AAGCGGAAGA GCGCCTGATG CGGTATTTTC TCCTTACGCA TCTGTGCGGT ATTTCACACC GCATATATGG TGCACTCTCA GTACAATCTG CTCTGATGCC GCATAGTTAA GCCAGTATAC ACTCCGCTAT CGCTACGTGA CTGGGTCATG GCTGCGCCCC GACACCCGCC AACACCCGCT GACGCGCCCT GACGGGCTTG TCTGCTCCCG GCATCCGCTT ACAGACAAGC TGTGACCGTC TCCGGGAGCT GCATGTGTCA GAGGTTTTCA CCGTCATCAC CGAAACGCGC GAGGCAGCTG CGGTAAAGCT CATCAGCGTG GTCGTGAAGC GATTCACAGA TGTCTGCCTG TTCATCCGCG TCCAGCTCGT TGAGTTTCTC CAGAAGCGTT AATGTCTGGC TTCTGATAAA GCGGGCCATG TTAAGGGCGG TTTTTTCCTG TTTGGTCACT GATGCCTCCG TGTAAGGGGG ATTTCTGTTC ATGGGGGTAA TGATACCGAT GAAACGAGAG AGGATGCTCA CGATACGGGT TACTGATGAT GAACATGCCC GGTTACTGGA ACGTTGTGAG GGTAAACAAC TGGCGGTATG GATGCGGCGG GACCAGAGAA AAATCACTCA GGGTCAATGC CAGCGCTTCG TTAATACAGA TGTAGGTGTT CCACAGGGTA GCCAGCAGCA TCCTGCGATG CAGATCCGGA ACATAATGGT GCAGGGCGCT GACTTCCGCG TTTCCAGACT TTACGAAACA CGGAAACCGA AGACCATTCA TGTTGTTGCT CAGGTCGCAG ACGTTTTGCA GCAGCAGTCG CTTCACGTTC GCTCGCGTAT CGGTGATTCA TTCTGCTAAC CAGTAAGGCA ACCCCGCCAG CCTAGCCGGG TCCTCAACGA CAGGAGCACG ATCATGCGCA CCCGTGGCCA GGACCCAACG CTGCCCGAGA TGCGCCGCGT GCGGCTGCTG GAGATGGCGG ACGCGATGGA TATGTTCTGC CAAGGGTTGG TTTGCGCATT CACAGTTCTC CGCAAGAATT GATTGGCTCC AATTCTTGGA GTGGTGAATC CGTTAGCGAG GTGCCGCCGG CTTCCATTCA GGTCGAGGTG GCCCGGCTCC ATGCACCGCG ACGCAACGCG GGGAGGCAGA CAAGGTATAG GGCGGCGCCT ACAATCCATG CCAACCCGTT CCATGTGCTC GCCGAGGCGG CATAAATCGC CGTGACGATC AGCGGTCCAG TGATCGAAGT TAGGCTGGTA AGAGCCGCGA GCGATCCTTG AAGCTGTCCC TGATGGTCGT CATCTACCTG CCTGGACAGC ATGGCCTGCA ACGCGGGCAT CCCGATGCCG CCGGAAGCGA GAAGAATCAT AATGGGGAAG GCCATCCAGC CTCGCGTCGT GAACGCCAGC AAGACGTAGC CCAGCGCGTC GGCCGTAACA ACACCATTTA AATGGAGTGG TTACAAATGG AGTGGTTAAT TAACAACACC ATTTGTCGAC GCTCTCCCTT ATGCGACTCC TGCATTAGGA AGCAGCCCAG TAGTAGGTTG AGGCCGTTGA GCACCGCCGC CGCAAGGAAT GGTGCATGCA AGGAGATGGC GCCCAACAGT CCCCCGGCCA CGGGGCCTGC CACCATACCC ACGCCGAAAC AAGCGCTCAT GAGCCCGAAG TGGCGAGCCC GATCTTCCCC ATCGGTGATG TCGGCGATAT AGGCGCCAGC AACCGCACCT GTGGCGCCGG TGATGCCGGC CACGATGCGT CCGGCGTAGA GGATCGAGAT CTCGATCCCG CGAAATTAAT ACGACTCACT ATAGGGAGAC CACAACGGTT TCCCTCTAGA AATAATTTTG TTTAACTTTA AGAAGGAGAT ATACCATGGC TCATCACCAT CACCATCATA TGGGTACCCT GGAAGCTCAG ACCCAGGGTC CTGGTTCGAT GACCACCCCC GATCCCACCG AGAACTGGTC GTTCGAGACC AAGCAGATCC ACGCGGGTCA GTCGCCCGAC AGCGCGACCC ACGCCCGTGC GCTGCCGATC TACCAGACCA CGTCGTACAC CTTCGACGAC ACCAGCCATG CCGCGGCCCT GTTCGGCCTT GAGGTTCCCG GCAACATCTA CACCCGCATC GGCAACCCCA CCACCGACGT CGTCGAGCAG CGCATCGCCG CACTCGAGGG TGGCGTCGCG GCGCTGTTCC TGTCCTCAGG GCAGGCCGCC GAGACGTTCG CGATCCTCAA CATCGCCAAG GCCGGCGACC ACATCGTGTC CAGCCCGCGC CTGTACGGCG GCACCTACAA CCTGCTGCAC TACACGCTGC CCAAGCTGGG CATCGAGACC ACGTTCGTCG AGAACCCCGA CGATCTGGAG TCGTGGCGCG CGGCGGTACG CCCGAACACC AAGGCGTTCT TCGCCGAGAC GATCTCCAAC CCCCAGATCG ACATCCTCGA CATCCCGAAC GTCGCCGCGA TCGCGCACGA GGCGGGCGTC CCGTTGATCG TCGACAACAC GATCGCCACG CCGTACCTGA TCCAGCCGAT CGCCCACGGC GCCGACATCG TCGTGCACTC GGCCACCAAG TACCTGGGCG GGCACGGATC GGCGATCGCG GGCGTCATCG TCGACGGCGG CACGTTCGAC TGGACCAACG GCAAGTTCCC CGGCTTCACC GAACCGGATC CCAGCTACCA CGGTGTGGTG TTCGCCGAGC TCGGTGCGCC GGCCTACGCC CTGAAGGCAC GCGTGCAACT GCTGCGTGAC CTGGGCTCGG CGGCCGCCCC GTTCAACGCG TTCCTGATCG CGCAGGGTCT GGAGACCCTG TCGCTGCGCG TCGAGCGCCA TGTCGCCAAC GCGCAGAAGG TCGCCGAGTT CCTGGAGAAC CACCCCGACG TGTCGTCGGT GAACTACGCG GGCCTGCCGT CCTCGCCGTG GTACGAGCTG GGCCGCAAAC TCGCCCCCAA GGGCACCGGC GCGGTGCTCG CGTTCGAGCT GTCGGGCGGC CTGGAGGCCG GTAAGGCCTT CGTGAACGCG CTGACGCTGC ACAGTCACGT CGCCAACATC GGCGACGTGC GGTCGCTGGT GATCCACCCG GCGTCGACGA CGCACCAGCA GCTGAGCCCC GAGGAGCAGC TGTCGACGGG TGTCACGCCG GGCCTGGTGC GCCTGGCGGT CGGCCTCGAA GGCATCGACG ACATCATCGC CGACCTGGAG CAGGGGTTCG CCGCCGCGCG CCCGTTCAGC GGCGCGGCCC AGACGGCCCA GACGGTGAAA CAGCACGAAC AAGTTCTGCA GCCAAGCTTC TCGAGGATCC GGCTGCTAAC AAAGCCCGAA AGGAAGCTGA GTTGGCTGCT GCCACCGCTG AGCAATAACT AGCATAACCC CTTGGGGCCT CTAAACGGGT CTTGAGGGGT TTTTTGCTGA AAGGAGGAAC TATATCCGGA TATCCACAGG ACGGGTGTGG TCGCCATGAT CGCGTAGTCG ATAGTGGCTC CAAGTAGCGA AGCGAGCAGG ACTGGGCGGC GGCCAAAGCG GTCGGACAGT GCTCCGAGAA CGGGTGCGCA TAGAAATTGC ATCAACGCAT ATAGCGCTAG CAGCACGCCA TAGTGACTGG CGATGCTGTC GGAATGGACG ATATCCCGCA AGAGGCCCGG CAGTACCGGC ATAACCAAGC CTATGCCTAC AGCATCCAGG GTGACGGTGC CGAGGATGAC GATGAGCGCA TTGTTAGATT TCATACACGG TGCCTGACTG CGTTAGCAAT TTAACTGTGA TAAACTACCG CATTAAAGCT TATCGATGAT AAGCTGTCAA ACATGAGAA