A4793_00100 : CDS information

close this sectionLocation

Organism
StrainNRRL 15009
Entry nameA47934
Contig
Start / Stop / Direction18,518 / 12,951 / - [in whole cluster]
18,518 / 12,951 / - [in contig]
Locationcomplement(12951..18518) [in whole cluster]
complement(12951..18518) [in contig]
TypeCDS
Length5,568 bp (1,855 aa)
Click on the icon to see Genetic map.

close this sectionAnnotation

Category1.2 NRPS
Productnon-ribosomal peptide synthetase
Product (GenBank)StaD
GenestaD
ORF24
Gene (GenBank)
EC number
Keyword
Note
Note (GenBank)
  • peptide synthetase (module 7)
Reference
ACC
PmId
[12060705] Assembling the glycopeptide antibiotic scaffold: The biosynthesis of A47934 from Streptomyces toyocaensis NRRL15009. (Proc Natl Acad Sci U S A. , 2002)
comment
Streptomyces toyocaensis NRRL15009由来A47934生合成gene clusterの同定論文。

34のORFsからなるclusterは、A47934の生合成とその調節に必要だと予測される遺伝子をすべて含む。グリコペプチド耐性を担う酵素をコードするORFsも含む。

StaD: Peptide synthetase (module 7)

配列解析のみ。NRPS genes staA-Dは、heptapeptide骨格の組立を触媒する酵素をコードすると予測される。StaDにはmodule 7(C-A-T-E-Te domain)が含まれる。
Related Reference
ACC
Q70AZ6
NITE
Teicp2_00210
PmId
[25686610] X-domain of peptide synthetases recruits oxygenases crucial for glycopeptide biosynthesis. (Nature. , 2015)
[26549530] Sequential In Vitro Cyclization by Cytochrome P450 Enzymes of Glycopeptide Antibiotic Precursors Bearing the X-Domain from Nonribosomal Peptide Biosynthesis. (Angew Chem Int Ed Engl. , 2015)
[27213615] Regulation of the P450 Oxygenation Cascade Involved in Glycopeptide Antibiotic Biosynthesis. (J Am Chem Soc. , 2016)
comment
BLAST id79%
Actinoplanes teichomyceticus ATCC 31121株_tcp ORF12
Non-ribosomal peptide synthetase

---
[PMID: 25686610](2015)
SUPPLEMENTARY INFORMATION内のリストにこのUniProt entryの記載があるが、同グループの他論文(PMID: 25358800, 25586301)ではDSM43866株を使用してATCC 31121株由来のgene nameを採用している混乱があるため、同様の可能性あり。

X-domainの結晶構造解析や相互作用の実験から、X-domainがcytochrome P450(oxy genesでコードされる)による架橋形成に関与することがわかった。

---
[PMID: 26549530](2015) abstract + supporting information
PMID: 25686610の続報。同様にsupporting informationでこのUniProt entryの記載あり。その他タンパクもATCC 31121株のentriesを採用している。

close this sectionPKS/NRPS Module

A7 L-3,5-dihydroxyphenylglycine(Dpg)
C15..310
A493..889
PCP971..1038
X1053..1352
TE1626..1839

close this sectionSequence

selected fasta
>non-ribosomal peptide synthetase [StaD]
MTVDDTRAKPRSGVEDVWPLSPLQEGMLYHTALDKDGPDTYTVQSVYGIEGPLDSERLQR
SWQALLDRHAALRVCFRYVSGAQMVQVVLRDVKIPWRETDLSGLPDDLADDEVGRLAEEE
LAERFRLETAPLTKLHLIRLGPESHRLVHTLHHVLADGWSMPVIHRELSAIYAAGGDPSG
LAATASYRDYLSWLGRQDKEAARAAWRKELAGLDAPTTVAPADPSRIPDIDTVMTELSPE
LTNDLAQLARGHDLTLSTVVQGVWGVVLSQLAGRTDVVFGATVSGRPAELAGVESMVGML
LGTLPVRVRLDGGRRAVDLLADLQRNQSALMAHQHLGLQEAQAVAGLGALFDTLVVYENF
PRTGLDRPADDGLNMRPVRKGRDSSHYPFTLVTGPGERMPVILNYDRGLFEREVAESVLG
AFVRVVERLVTEPDVLVGRLTLLSEAERAMVVNEWNATAGPVPGESVVELFGRRVDTAPH
AVAITDASGTDLTYAEVDQASNRLAAYLTGRGVRRGALVGVVMERSADLVITFLAIWKAG
AAFVPIDTGNPAERTALILADSGVSTVVCTIATQAAAPENAIVLDAPETRAAVDEQAGTA
PEIRVGADDLAYVMYTSGSTGVPKGVAVTHGGVAGLAGDAGWRIGPDDGVLMHATHVFDA
SLYEMWVPLATGGRILLAEPGVVDADGVRRAVERGATALHLTAGTFRALAEASPECFTGL
TEVGTGGDVVPAFAVENLRRAQPALRVRNTYGPTETTLCATWKPIEPGDGIGRELPIGRP
MANRGIYILDAFLQPVAPGITGELYIVGTGLARGYLGRPDLTAERFVACPFRAGERMYRT
GDLARWNRDGEVVFVGRADDQVKIRGFRVELAEVEAVLAAQPGVTEAVAMAREDQPGERR
LVGYVVTDGGEADVDEMRQRMSLVLPSYMVPVAIVVRPGLPITANGKVDRRALPAPDLAG
RTSEKAPESETEKVLCALYAEILGVERVGVDDAFHDLGGSSALAMRLIARIREEIGADLP
IRQLFSSPTPAGIARALAAKSRPALEAAATRPEEVPLTVRQLRAWLLARPGETTAGMHTS
VALRVRGRLDVPALEAALGDVASRHEILRTTFPGQATTVHQHVHDSATVQLTPIPAAEED
LPGLLAERARQPFDLTRDMPWRCDLFALAEREHVLALTVHRIAADDDSLDVYFRDLGAAY
GARRAGRVPERAPLALQFADYAIWEQRLREGEREQGSLIDDQASFWRDHLAGIDGDTVLP
FDRPRQAIPSRRAGAVALRLDGAPHAQLLAAVDSAGADAHQLVHAALAMLLTRLGAGEDL
VIGTTLPRDEDLIDLEPMIGPFARPLPLRTDVAGDPTFREVVARVQETVRDTRQNLDVPF
ERIADLLELPASLSRHPVFQVSLDVAEEDTGAWDATGLPALRTSVEPGRPEAIELDLAFK
LTEHRDEDDHGDGIDGELRYAVDLFDASTAESLARRLVRVLEQVAEDPDRRVSDLDILLD
DAERERPAEAPAVWTGSVPPAVADLAQDGPLGALVLDDRLLPVAPGAVGDLYVTGPAVDA
VPADRSLACPFGAQGRRMLRTGSLARWSAAGTLTLLGERRRSSVAAKTAAGDFEVLLPLR
PGGDRPPLFCVHASGGLSWNYGPLLRELPSNQPVYGIQARGLARTEALPGGVDEMAADYV
SQIRTVQPTGPYHFLGWSLGGRIAQAMAALLEAEGEQVGVLALLDAYPTYMGKKARGDGR
TQAAVDKLKEQQMELAAGLVRGDGARARLEEVMRNLAEVGPRHEAPSFAGDVLLFVASKD
RPPHMPVDWAIASWQPLTSGTVEHHEIPVDHNEMMQPASLARIGAVVAEKLRPRP
selected fasta
>non-ribosomal peptide synthetase [StaD]
GTGACTGTTGACGACACTCGTGCAAAGCCGCGCTCCGGCGTGGAAGACGTCTGGCCCCTG
TCGCCGCTGCAGGAGGGAATGCTTTACCACACGGCTCTGGACAAGGACGGCCCCGACACC
TACACCGTTCAGTCCGTCTACGGCATCGAGGGACCGCTGGACTCCGAGCGTTTGCAGAGG
TCCTGGCAGGCGCTCCTGGACCGGCACGCCGCGCTGCGGGTCTGTTTCCGGTACGTCAGC
GGCGCGCAGATGGTTCAGGTCGTCTTACGCGACGTCAAGATCCCGTGGCGTGAGACAGAC
CTCTCCGGACTCCCGGACGACCTGGCGGACGACGAGGTGGGGCGGCTGGCGGAGGAGGAG
CTGGCCGAGCGGTTCCGTCTCGAGACGGCCCCGTTGACGAAGCTGCACCTGATCCGGCTC
GGCCCGGAGAGCCACCGGCTCGTGCACACCCTTCATCACGTCCTGGCCGACGGTTGGTCG
ATGCCGGTCATTCACCGCGAGCTCTCCGCGATCTACGCGGCGGGCGGGGACCCGTCCGGG
CTGGCGGCCACGGCCTCCTACCGCGACTATCTCTCCTGGCTGGGCCGCCAGGACAAGGAG
GCGGCCCGGGCCGCCTGGCGGAAGGAGCTCGCCGGCCTGGACGCCCCCACCACGGTCGCC
CCGGCCGATCCGAGCCGAATACCGGACATCGACACGGTGATGACCGAGCTCTCCCCGGAG
CTGACGAACGATCTGGCACAGCTGGCCCGCGGCCACGACCTCACGTTGAGCACCGTCGTC
CAGGGTGTGTGGGGTGTCGTGCTGTCGCAGCTGGCAGGCCGCACCGACGTGGTGTTCGGC
GCGACCGTCTCGGGACGGCCGGCCGAGCTGGCCGGCGTCGAGTCGATGGTCGGCATGTTG
CTCGGCACCCTGCCGGTGCGCGTCCGGCTCGACGGCGGGCGGCGGGCCGTCGATCTGCTG
GCCGATCTGCAGCGGAACCAGTCGGCGCTCATGGCCCACCAGCATCTCGGCCTTCAGGAG
GCACAGGCCGTCGCCGGACTCGGAGCGCTCTTCGACACGCTCGTCGTCTACGAGAACTTC
CCCCGCACCGGACTCGACCGGCCGGCGGACGACGGCCTGAACATGCGCCCCGTGCGAAAG
GGACGCGACTCCTCGCATTACCCGTTCACCCTGGTCACCGGGCCGGGCGAGCGGATGCCG
GTCATTCTCAACTACGACCGGGGCCTGTTCGAGCGGGAGGTCGCCGAATCCGTCCTGGGC
GCGTTCGTCCGGGTGGTGGAACGGCTGGTCACCGAGCCCGACGTCCTGGTCGGCCGGCTG
ACCCTGCTGAGCGAAGCCGAGCGCGCCATGGTGGTGAACGAGTGGAACGCGACCGCTGGC
CCGGTGCCCGGTGAGTCCGTGGTCGAGCTGTTCGGGCGGCGGGTGGACACCGCACCCCAC
GCGGTGGCGATCACCGATGCGAGCGGCACGGACCTGACCTACGCCGAGGTCGACCAGGCT
TCGAACAGGCTGGCCGCATACCTCACCGGTCGCGGCGTCCGGCGCGGCGCCCTGGTCGGT
GTGGTCATGGAGAGGTCCGCCGACCTGGTGATCACGTTCCTGGCGATCTGGAAGGCGGGC
GCCGCGTTCGTCCCGATCGACACGGGGAATCCGGCTGAGCGGACCGCGCTCATCCTCGCC
GACTCCGGGGTCTCGACCGTCGTGTGCACGATCGCCACCCAGGCGGCCGCGCCGGAGAAC
GCGATCGTCCTCGACGCGCCGGAGACCCGCGCGGCCGTCGACGAACAGGCCGGCACCGCT
CCGGAGATCCGGGTCGGCGCGGACGATCTGGCGTACGTGATGTACACCTCCGGTTCCACC
GGCGTCCCGAAGGGCGTGGCCGTCACCCATGGGGGAGTGGCCGGTCTGGCGGGCGACGCG
GGCTGGCGGATCGGTCCCGACGACGGCGTGCTGATGCACGCGACGCACGTCTTCGACGCC
TCGCTCTACGAGATGTGGGTGCCGCTCGCCACGGGCGGCCGGATCCTGCTCGCCGAGCCG
GGAGTGGTGGACGCCGACGGCGTGCGCCGGGCCGTCGAACGGGGCGCGACCGCCCTCCAC
CTCACCGCCGGAACCTTCCGCGCCCTGGCGGAGGCGTCACCGGAATGCTTCACCGGCCTG
ACCGAGGTCGGCACCGGCGGAGACGTCGTTCCCGCCTTCGCGGTGGAGAACCTGCGGCGG
GCCCAGCCCGCTCTCCGGGTGAGGAACACCTACGGGCCGACCGAGACCACCCTGTGCGCG
ACGTGGAAGCCGATCGAGCCCGGTGACGGGATCGGGCGTGAGCTGCCGATCGGCCGCCCG
ATGGCGAACCGCGGGATCTACATCCTCGACGCCTTCCTGCAGCCGGTCGCGCCGGGAATC
ACCGGCGAGCTGTACATCGTCGGTACCGGCCTGGCCCGCGGGTACCTCGGCAGGCCGGAC
CTGACGGCCGAACGGTTCGTCGCATGCCCGTTCCGGGCCGGTGAGCGCATGTACCGCACC
GGAGACCTGGCGCGCTGGAACCGCGACGGGGAGGTGGTGTTCGTCGGGCGGGCCGACGAC
CAGGTGAAGATCCGGGGCTTCCGGGTGGAGCTGGCCGAGGTGGAAGCCGTGCTGGCGGCC
CAGCCGGGAGTGACCGAGGCGGTCGCCATGGCGCGCGAGGACCAGCCCGGCGAGCGGCGT
CTGGTCGGCTACGTCGTCACCGACGGAGGCGAGGCCGACGTCGACGAGATGCGGCAGCGG
ATGAGCCTCGTCCTGCCTTCCTACATGGTCCCCGTGGCGATCGTCGTCCGTCCGGGCCTG
CCCATCACTGCGAACGGCAAGGTGGATCGCCGGGCCCTGCCCGCCCCCGACCTCGCGGGA
CGCACTTCGGAGAAGGCGCCCGAGAGCGAGACCGAGAAGGTGCTGTGCGCGCTGTACGCC
GAGATCCTCGGGGTGGAGCGGGTGGGCGTCGACGACGCCTTCCACGACCTGGGCGGCAGC
TCGGCGCTGGCCATGCGTCTCATCGCGCGGATCCGCGAGGAGATCGGCGCGGATCTGCCC
ATCCGGCAGCTGTTCTCCTCGCCCACGCCCGCGGGCATCGCCCGGGCGCTGGCGGCGAAG
TCACGTCCCGCGCTGGAGGCCGCCGCCACCCGGCCGGAGGAGGTGCCCCTCACCGTCCGA
CAGCTCCGCGCCTGGCTGCTGGCCCGTCCCGGAGAGACGACCGCGGGCATGCACACCTCG
GTCGCGCTGCGCGTGCGCGGCCGACTGGACGTGCCCGCGCTGGAGGCGGCGCTCGGTGAC
GTCGCGTCCCGGCACGAGATCCTCCGGACGACCTTCCCCGGCCAGGCCACGACCGTCCAC
CAGCACGTCCACGACTCCGCGACGGTTCAGCTGACGCCGATTCCGGCCGCCGAGGAGGAC
CTCCCCGGGCTTCTCGCCGAACGGGCCCGGCAGCCCTTCGACCTCACCCGTGACATGCCG
TGGCGCTGCGACCTCTTCGCGCTCGCGGAGCGGGAGCACGTGCTGGCGCTGACGGTGCAC
CGGATCGCCGCCGACGACGACTCGCTGGACGTGTACTTCCGGGACCTGGGGGCCGCGTAC
GGCGCGCGGCGCGCGGGCCGGGTCCCGGAGCGCGCGCCACTGGCCCTCCAGTTCGCCGAC
TACGCCATCTGGGAGCAGCGGCTGCGCGAGGGCGAACGCGAGCAGGGAAGCCTGATCGAC
GACCAGGCGTCCTTCTGGCGGGACCACCTCGCCGGCATCGACGGGGATACGGTCCTGCCC
TTCGACCGTCCGCGTCAGGCCATCCCGTCGCGGCGGGCGGGCGCGGTCGCCCTGCGGCTG
GACGGCGCCCCGCACGCCCAACTGCTGGCGGCCGTGGACTCGGCGGGCGCGGACGCCCAC
CAGTTGGTGCATGCCGCGCTCGCCATGCTGCTGACCCGACTGGGCGCCGGCGAGGACCTC
GTGATCGGCACGACGCTGCCGCGGGACGAGGACCTGATCGACCTCGAGCCGATGATCGGG
CCGTTCGCCCGGCCGCTGCCCCTGCGCACCGACGTCGCGGGCGACCCCACCTTCCGGGAG
GTCGTCGCCCGGGTGCAGGAGACCGTCCGGGACACCCGCCAGAACCTGGACGTCCCGTTC
GAGCGGATCGCCGATCTGCTTGAGCTGCCCGCTTCGCTCTCCCGCCACCCCGTGTTCCAG
GTGTCCCTGGACGTGGCCGAGGAGGACACCGGCGCGTGGGACGCGACGGGACTGCCTGCC
CTGCGCACCAGTGTCGAACCCGGCCGGCCCGAGGCCATCGAGCTGGACCTCGCGTTCAAG
CTCACCGAGCACCGCGACGAGGACGACCACGGCGACGGCATCGACGGAGAACTCCGCTAC
GCCGTCGACCTGTTCGACGCTTCCACGGCCGAATCGCTGGCACGACGGCTGGTCCGCGTC
CTGGAACAGGTGGCGGAGGACCCCGACCGGCGCGTCAGCGATCTGGACATCCTGCTGGAC
GACGCCGAGCGCGAGCGTCCGGCCGAGGCCCCGGCCGTGTGGACCGGGTCCGTGCCCCCG
GCAGTCGCCGACCTGGCCCAGGACGGCCCGCTCGGCGCGCTCGTGCTCGACGACCGGCTG
CTCCCCGTCGCGCCCGGAGCCGTCGGTGATCTCTACGTCACCGGCCCCGCGGTCGACGCC
GTCCCGGCCGACCGGAGCCTGGCCTGCCCGTTCGGGGCGCAGGGGCGGCGCATGCTGCGC
ACCGGCTCGCTCGCCCGCTGGTCCGCTGCCGGAACCCTGACCCTCCTGGGCGAGCGGCGG
CGATCCAGCGTCGCGGCGAAGACCGCCGCGGGCGACTTCGAGGTCCTGCTGCCGCTGCGT
CCCGGCGGCGACCGTCCGCCGCTGTTCTGCGTCCATGCGAGCGGCGGCCTGAGCTGGAAC
TACGGGCCGCTGCTGCGGGAACTCCCGTCGAACCAGCCGGTCTACGGAATCCAGGCGCGC
GGCCTGGCCCGCACCGAGGCCCTGCCCGGCGGAGTCGACGAGATGGCGGCCGACTACGTC
TCGCAGATCCGCACCGTGCAGCCCACCGGGCCCTACCACTTCCTCGGCTGGTCCCTCGGC
GGGCGGATCGCGCAGGCGATGGCCGCGCTGCTCGAGGCCGAGGGCGAGCAGGTCGGCGTG
CTCGCGCTGCTCGACGCCTATCCCACCTACATGGGGAAGAAGGCACGCGGCGACGGGCGT
ACCCAGGCGGCCGTCGACAAGCTGAAGGAACAGCAGATGGAGCTCGCCGCCGGGCTGGTC
AGGGGGGACGGAGCCCGGGCCCGCCTGGAAGAGGTCATGCGGAACCTCGCCGAGGTCGGG
CCGAGGCACGAGGCCCCGAGCTTCGCCGGCGACGTCCTGCTGTTCGTCGCCTCGAAGGAC
CGTCCCCCGCACATGCCCGTCGACTGGGCGATCGCCAGCTGGCAGCCCTTGACCAGCGGG
ACCGTGGAACACCACGAGATCCCCGTCGACCACAACGAGATGATGCAGCCCGCGTCGCTG
GCCCGGATCGGGGCCGTCGTCGCCGAGAAGCTCCGGCCGCGGCCGTAG
[7] C15..310
[7] A493..889
[7] L-3,5-dihydroxyphenylglycine(Dpg)659..759
[7] PCP971..1038
[7] X1053..1352
[7] TE1626..1839
[7] C43..930
[7] A1477..2667
[7] L-3,5-dihydroxyphenylglycine(Dpg)1975..2277
[7] PCP2911..3114
[7] X3157..4056
[7] TE4876..5517

close this sectionFeature

BLASTP
Database:UniProtKB:2011_09
show BLAST table
InterPro
Database:interpro:38.0
IPR000873 AMP-dependent synthetase/ligase (Domain)
 [493-888]  8.29999999999992e-106 PF00501
PF00501   AMP-binding
IPR001031 Thioesterase (Domain)
 [1626-1839]  5.30000000000001e-37 PF00975
PF00975   Thioesterase
IPR001242 Condensation domain (Domain)
 [15-310]  6.90000000000007e-66 PF00668 [1053-1352]  6.90000000000001e-49 PF00668
PF00668   Condensation
IPR009081 Acyl carrier protein-like (Domain)
 [974-1037]  6.3e-12 PF00550
PF00550   PP-binding
 [964-1039]  5.69999708783953e-19 SSF47336
SSF47336   ACP_like
 [971-1038]  PS50075
PS50075   ACP_DOMAIN
 [962-1041]  3.8e-26 G3DSA:1.10.1200.10
G3DSA:1.10.1200.10   ACP_like
IPR010071 Amino acid adenylation domain (Domain)
 [493-889]  1e-127 TIGR01733
TIGR01733   AA-adenyl-dom
IPR020845 AMP-binding, conserved site (Conserved_site)
 [613-624]  PS00455
PS00455   AMP_BINDING
SignalP No significant hit
TMHMM No significant hit
Page top