A4793_00130 : CDS information

close this sectionLocation

Organism
StrainNRRL 15009
Entry nameA47934
Contig
Start / Stop / Direction41,458 / 35,246 / - [in whole cluster]
41,458 / 35,246 / - [in contig]
Locationcomplement(35246..41458) [in whole cluster]
complement(35246..41458) [in contig]
TypeCDS
Length6,213 bp (2,070 aa)
Click on the icon to see Genetic map.

close this sectionAnnotation

Category1.2 NRPS
Productnon-ribosomal peptide synthetase
Product (GenBank)StaA
GenestaA
ORF21
Gene (GenBank)
EC number
Keyword
Note
Note (GenBank)
  • peptide synthetase (modules 1-2)
Reference
ACC
PmId
[12060705] Assembling the glycopeptide antibiotic scaffold: The biosynthesis of A47934 from Streptomyces toyocaensis NRRL15009. (Proc Natl Acad Sci U S A. , 2002)
comment
Streptomyces toyocaensis NRRL15009由来A47934生合成gene clusterの同定論文。

34のORFsからなるclusterは、A47934の生合成とその調節に必要だと予測される遺伝子をすべて含む。グリコペプチド耐性を担う酵素をコードするORFsも含む。

StaA: Peptide synthetase (modules 1-2)

配列解析のみ。NRPS genes staA-Dは、heptapeptide骨格の組立を触媒する酵素をコードすると予測される。StaAにはmodule 1(A-T domain)とmodule 2(C-A-T-E domain)が含まれる。

close this sectionPKS/NRPS Module

A1 L-4-hydroxyphenylglycine(Hpg)
2 tyrosine
A35..425
PCP499..565
C599..890
A1057..1447
PCP1521..1587
E1627..2057

close this sectionSequence

selected fasta
>non-ribosomal peptide synthetase [StaA]
MNSVLSTPTVPELFARQAERTPEAVAVVDGDRFVTYRQLDELAGRLAGRLIGRGVRRGDR
VAVLMERSADLVVTLLAVWKAGAAYVPVDAAHPAPRVAFVVADSGASLMACSAATAGRVP
EGVEPVVVTDEGRGDASAVPVSPGDLAYVMYTSGSTGTPKGVAVPHRSVAELAGNPGWAV
KPGDAILMHAPHAFDASLFEIWVPLVSGARVVIAEPGAVDARRLREAIAAGVTKVHLTAG
SFRALAEESSESFAGLQEVLTGGDVVPAHAVEKVRKAVPQARIRHLYGPTETTLCATWHL
LQPSEALGPVLPIGRPLPGRRAQVLDASLRPLPPGVVGDLYLSGAGLADGYLDRAALTAE
RFVADPSVPGGRMYRTGDLVQWTADGELLFVGRADDQVKIRGFRIEPGEIEAALTAQPDV
HEAVVVAIDGRLIGYAVTDVDPVVLRERLGATLPEYMVPAVVITLDGLPLTRNGKVDRAA
LPAPVFGTNAAGREPATEAERVLSSLFAELLSLDRVGVDESFFALGGDSIVAMRLAARAS
RAGLLVTPSQIFNERTPARLAAVARDAAADTPGTPDGRPLIALTAEEEAELAIAVPGAEE
VWPLAPLQEGLLFQSTLEEEGLDIYQAQWILELNGPLDVARLRAAWEAVFARRAELRVSF
VRLASGKTLQAVAGHVVLPWREVDLTGVSDAEAAVQALARQEQEQRFDLAAGPLFRLLLV
RFGEDRHRLLVVHHHILTDGWSVAAILNEVTEAYEGGGRLPERIGAASYRDYLAWLGRQD
KDAARAAWQAELSGLDEHASIAKTAAGTGYEYRIAYVVPELHTRLTQVARDHGLTLNTLV
QGAWAMVLARLARRTEVVFGTTVACRPAELPGVESMPGLMMNTVPVRVSLDGGQTVADLL
TGLQRRQAALIPHQHLGLPEIQKVAGPGAKFDTLLVFENYPRDFADQFTYLGTVEGTHYP
LTVGIIPGDHLRIQLAYRHGQIEETVAESTLGWFVGALGAVAADPSGLVGRAGMGGGRVS
GPAAAPAARESLPVLVARVVQERPHETAVVDGDGELTFGELWEQASALAAVLRARGVGPE
SRVGLAVGRSAWWVVGMLGVSLAGGAFVPVDPAYPAERVSLLLGDADPVLVVCDGKARDA
VPEEFADRSLVIDEVDLSAVPDAELPRVGPDDVAYVIYTSGSTGTPKGVVVTHAGLGNLA
AAQIDRFAVSPSSRVLQFAALGFDATVSEALMALLSGATLVMAPKQDLPPRVSLAEALER
WDVTHVTVPPSVLATADVLPESLETVVVAGEACPPGLADRWSEGRRLINAYGPTEATVCA
AMSMPLTAGRDVVPIGEPIAGSRCHVLDAFLRPLPPGVTGELYVSGIGLARGYLGRAALT
AERFVADPFVPGERMYRTGDLAHLTSSGELVFAGRADDQVKLRGFRIEPGEIESVLSGHP
QVAQAAVTVRDDRLLAHVSPTEVDPHAVREYLASRLPQHMVPAVVVLEALPTTPNGKIDR
SALPDPDRAAGTVGREPVTELEQVLCRLFAEVLGLERVSVDDGFFERGGDSITSMQLVAR
ARREGLALAAQDVFELRTPEELARLAARSPLPERLRPATADGTGEVPWTPVMRTLGDRIT
GAGFAQWVVVGAPADLGEGILAAALAALVDTHDMLRARVERGRLIVGERGSVDTAGRVRR
VGLDGGSLDEAVEAAVRDAVGRLDPETGVMVQAVWVDAGPERVGRLVMAVHHMAVDGVSW
RILVPDLRAACEAVLAGRTPALEPVAVPFRRWASLLVEWATSAERVGELPAWQAILSQVD
RPVDGRQGSTGSVGSRSWVMAGAEASALLGRVPGLFFCGVHEILLAGLAGAVARCFGGRV
VLVDVESHGRHAVGGLDLSRTVGWFTSVHPVRLDVSGIDLASGDLVKAVKEQSRAVPDDG
LGHGLLRHLNAATGPVLEAMPSPWIGFNYMGRFAVGEQNDVAEWQQAGDIGGSLDPAMAL
PHALQVDVVVRDMPQGPELRLMASWQAGLLEEAEIERFGRMWRDTLSGLAHLADDSSAGG
HTASDFDLLDLDQDEIEGFEAIATDFGGGR
selected fasta
>non-ribosomal peptide synthetase [StaA]
ATGAATTCCGTGCTGTCCACGCCGACGGTGCCGGAGCTGTTCGCCCGTCAGGCGGAGCGG
ACGCCCGAGGCGGTGGCCGTGGTCGACGGGGACCGGTTCGTGACCTACCGGCAGCTGGAC
GAACTCGCGGGCCGGCTGGCCGGGCGGCTGATCGGCCGGGGCGTCCGCCGTGGCGACCGT
GTGGCGGTCCTCATGGAGCGCTCGGCGGACCTGGTGGTGACGCTGCTCGCCGTATGGAAG
GCGGGAGCGGCGTACGTGCCGGTGGACGCTGCCCATCCCGCGCCGCGGGTGGCGTTCGTG
GTCGCGGACTCGGGTGCGTCCTTGATGGCATGTTCGGCCGCGACGGCCGGCCGCGTGCCG
GAGGGGGTCGAGCCGGTCGTCGTCACGGACGAGGGCCGGGGCGACGCGTCGGCGGTCCCG
GTGTCTCCCGGGGACCTGGCGTACGTGATGTACACCTCCGGGTCGACGGGGACACCCAAG
GGAGTGGCCGTCCCGCACCGGAGCGTCGCGGAGCTGGCGGGGAATCCCGGCTGGGCGGTG
AAACCCGGTGACGCGATCCTCATGCACGCGCCCCACGCCTTCGACGCGTCCCTCTTCGAG
ATCTGGGTGCCGCTTGTCTCGGGTGCCCGTGTGGTGATCGCTGAGCCGGGCGCAGTGGAC
GCCCGACGTCTGCGGGAGGCGATCGCGGCCGGCGTCACGAAGGTGCATCTGACGGCGGGC
AGCTTCCGCGCGCTGGCCGAGGAGTCGTCGGAATCCTTCGCCGGGCTCCAGGAGGTGCTG
ACGGGCGGCGACGTGGTGCCCGCACACGCGGTGGAGAAGGTGAGAAAGGCGGTCCCCCAG
GCACGGATCCGGCATCTGTACGGTCCGACGGAGACGACGCTGTGCGCGACGTGGCACCTG
CTCCAGCCGAGCGAGGCGCTGGGTCCCGTGCTGCCGATCGGTCGTCCGCTTCCGGGCCGC
CGGGCCCAGGTGCTCGACGCGTCGCTGCGGCCCCTGCCGCCGGGAGTGGTCGGTGACCTC
TATCTCTCGGGCGCCGGCCTTGCGGACGGCTACCTGGACCGGGCTGCGCTGACGGCGGAG
CGGTTCGTGGCGGATCCGTCCGTACCCGGCGGCCGGATGTACCGGACGGGGGACCTTGTC
CAGTGGACCGCCGACGGCGAGCTGTTGTTCGTGGGCAGAGCCGACGACCAGGTGAAGATC
CGCGGGTTCCGGATCGAGCCCGGTGAGATCGAGGCCGCGCTGACCGCCCAGCCGGATGTC
CACGAGGCCGTCGTGGTGGCGATCGACGGACGCCTGATCGGCTACGCGGTGACGGACGTG
GATCCCGTCGTCCTGCGTGAGCGTCTCGGGGCGACGCTGCCGGAGTACATGGTCCCGGCT
GTCGTGATCACGCTGGACGGGCTTCCGCTGACCCGCAACGGCAAGGTGGACCGGGCGGCG
CTGCCGGCGCCCGTCTTCGGGACGAACGCGGCGGGTCGCGAACCCGCCACCGAGGCCGAG
CGCGTCCTGAGCTCCCTGTTCGCCGAACTGCTCAGCCTGGACCGGGTCGGTGTCGACGAG
AGTTTCTTCGCCCTGGGCGGCGACTCGATCGTCGCCATGCGGCTGGCGGCGCGCGCGTCC
AGGGCCGGTCTGCTGGTGACGCCCTCGCAGATCTTCAACGAGAGGACCCCCGCACGGTTG
GCGGCCGTGGCGCGTGACGCGGCCGCCGACACCCCCGGCACCCCCGACGGCCGTCCCCTG
ATCGCCCTCACCGCGGAGGAGGAGGCGGAGCTGGCGATCGCCGTACCTGGTGCCGAGGAG
GTCTGGCCGCTCGCCCCACTCCAGGAGGGGCTGCTCTTCCAGTCGACCCTCGAGGAGGAG
GGCCTCGACATCTATCAGGCGCAGTGGATCCTGGAGCTGAACGGGCCGTTGGACGTGGCC
CGGCTCCGGGCCGCATGGGAAGCGGTCTTCGCGCGGCGCGCCGAGCTCAGGGTGAGCTTC
GTCCGGCTCGCGTCCGGGAAGACGCTCCAGGCCGTCGCCGGACACGTCGTCCTGCCCTGG
CGGGAGGTCGACCTCACCGGCGTGAGCGACGCCGAGGCGGCCGTCCAGGCCCTCGCCCGG
CAGGAACAGGAACAGCGATTCGACCTGGCCGCGGGCCCCCTGTTCCGGCTGCTGCTGGTC
CGGTTCGGCGAGGACCGGCACCGCCTGCTGGTCGTCCACCACCACATCCTGACCGACGGC
TGGTCGGTGGCGGCCATCCTCAACGAGGTGACCGAGGCGTACGAGGGCGGCGGTCGGCTC
CCGGAGCGGATCGGCGCGGCATCCTACCGGGACTACCTGGCCTGGCTGGGTCGGCAGGAC
AAGGACGCGGCACGCGCTGCCTGGCAGGCGGAGCTGTCCGGCCTCGACGAGCACGCGTCG
ATCGCGAAGACGGCGGCCGGGACGGGATACGAGTACCGCATCGCCTACGTCGTGCCGGAG
CTCCACACCCGGTTGACGCAGGTGGCCCGTGACCACGGGCTGACGCTGAACACGCTGGTG
CAGGGCGCGTGGGCGATGGTGCTGGCCCGGCTCGCGCGGCGCACCGAGGTGGTGTTCGGC
ACCACGGTCGCCTGCCGGCCCGCGGAGCTTCCGGGGGTGGAGTCGATGCCCGGGCTCATG
ATGAACACCGTGCCGGTCCGGGTGTCGCTCGACGGCGGGCAGACGGTCGCCGATCTGCTG
ACCGGTCTGCAGCGGCGGCAGGCGGCCCTGATCCCGCATCAGCACCTGGGACTGCCGGAG
ATCCAGAAGGTGGCCGGGCCGGGCGCGAAGTTCGACACGCTGCTCGTCTTCGAGAACTAC
CCGCGGGACTTCGCCGACCAGTTCACCTATCTGGGCACGGTCGAGGGGACCCACTATCCG
CTGACCGTCGGCATCATCCCGGGGGACCACCTCAGGATCCAGCTCGCCTACCGGCACGGG
CAGATCGAGGAAACCGTCGCCGAGTCCACGCTGGGATGGTTCGTCGGCGCCCTCGGCGCG
GTGGCCGCCGACCCTTCCGGGTTGGTGGGGCGTGCCGGAATGGGCGGGGGCCGGGTGAGC
GGCCCGGCCGCGGCGCCGGCGGCGAGGGAGTCCCTGCCGGTGCTGGTGGCGCGGGTCGTT
CAGGAGCGGCCGCACGAGACGGCGGTGGTGGACGGCGACGGCGAGCTGACCTTCGGGGAG
CTCTGGGAACAGGCGTCTGCGCTGGCCGCCGTGCTGAGGGCCCGCGGGGTCGGACCGGAG
TCCCGGGTGGGCCTCGCCGTGGGGCGGTCGGCGTGGTGGGTGGTCGGGATGCTGGGGGTG
TCGCTGGCCGGAGGCGCGTTCGTTCCGGTGGATCCCGCGTATCCGGCCGAGCGCGTGAGC
CTGCTTCTGGGCGACGCGGATCCCGTGCTGGTCGTGTGCGACGGGAAGGCACGGGACGCG
GTGCCCGAGGAGTTCGCCGACCGGTCGCTGGTGATCGACGAGGTCGATCTCTCGGCGGTC
CCGGATGCGGAGTTGCCGCGGGTGGGGCCCGACGACGTGGCGTATGTGATCTATACGTCG
GGGTCGACGGGAACCCCGAAGGGTGTCGTCGTCACCCACGCGGGCCTGGGCAATCTGGCG
GCGGCGCAGATCGACCGGTTCGCTGTCTCGCCGTCGTCACGGGTCCTGCAGTTCGCGGCG
CTCGGCTTCGACGCCACGGTCTCGGAGGCGCTGATGGCGCTGCTCTCGGGGGCGACGCTG
GTGATGGCACCGAAGCAGGACCTGCCGCCGCGGGTGTCGCTGGCCGAGGCACTGGAGCGG
TGGGACGTCACCCATGTGACGGTTCCGCCGTCGGTGCTCGCCACGGCCGACGTGCTGCCG
GAGAGCCTGGAGACGGTGGTGGTGGCAGGGGAGGCCTGCCCGCCGGGCCTCGCGGACCGC
TGGTCCGAGGGACGACGGCTGATCAACGCCTACGGGCCGACCGAGGCCACGGTGTGTGCC
GCGATGAGCATGCCGTTGACGGCGGGCCGGGACGTGGTCCCGATCGGGGAGCCGATCGCG
GGAAGCCGCTGCCACGTACTGGACGCGTTCCTTCGGCCGTTGCCGCCGGGGGTCACCGGT
GAGCTCTACGTGTCGGGGATCGGGTTGGCCCGTGGCTACCTGGGCCGTGCGGCGCTGACG
GCGGAACGATTCGTCGCCGATCCCTTCGTCCCCGGTGAGCGGATGTACCGGACAGGAGAC
CTGGCCCACCTGACGAGCAGCGGTGAGCTGGTGTTCGCCGGGCGGGCCGATGACCAGGTG
AAGCTCCGTGGGTTCCGGATCGAGCCCGGCGAGATCGAGTCCGTGCTGTCCGGCCACCCG
CAGGTCGCTCAAGCGGCTGTGACCGTCCGGGACGACCGCCTTCTGGCCCATGTGTCGCCG
ACCGAGGTCGATCCGCACGCGGTACGGGAGTACCTCGCCTCCCGGCTGCCCCAGCACATG
GTGCCCGCCGTGGTGGTGCTGGAAGCGCTGCCCACCACCCCGAACGGGAAGATCGACCGC
AGCGCGCTGCCCGACCCCGATCGCGCTGCCGGGACCGTCGGCCGAGAACCGGTCACGGAG
CTCGAACAGGTGCTGTGCCGGCTGTTCGCCGAGGTGCTCGGCCTGGAGCGGGTCAGCGTG
GACGACGGCTTCTTCGAGCGGGGCGGAGACTCGATCACCTCCATGCAACTGGTGGCGCGG
GCGCGGCGGGAAGGACTGGCCCTCGCCGCGCAGGACGTCTTCGAGCTGAGGACGCCGGAG
GAACTCGCACGGCTGGCGGCCCGGTCGCCTCTGCCGGAACGCCTCCGGCCCGCGACGGCC
GACGGTACGGGCGAGGTGCCGTGGACGCCGGTGATGCGAACGCTGGGCGACCGGATCACC
GGTGCGGGATTCGCGCAGTGGGTGGTTGTGGGCGCACCCGCGGATCTGGGTGAGGGGATC
CTGGCCGCCGCGCTGGCCGCCCTGGTCGACACCCACGACATGTTGCGAGCACGGGTGGAG
CGGGGACGCCTGATCGTAGGTGAACGTGGCTCCGTGGACACGGCCGGTCGGGTCCGGAGG
GTGGGGCTCGACGGAGGCTCGCTGGACGAGGCCGTGGAGGCCGCGGTGCGGGACGCGGTG
GGACGGCTGGACCCGGAGACGGGCGTGATGGTGCAGGCGGTGTGGGTGGACGCCGGGCCG
GAACGGGTGGGCCGATTGGTGATGGCGGTGCACCACATGGCCGTCGACGGCGTGTCCTGG
CGGATTCTGGTGCCGGATCTGCGGGCGGCGTGCGAGGCGGTCCTGGCGGGACGGACCCCT
GCGCTGGAGCCGGTGGCGGTGCCGTTCCGGCGGTGGGCGAGCCTGCTGGTGGAGTGGGCG
ACGTCCGCGGAGCGGGTCGGCGAACTGCCGGCGTGGCAGGCGATCCTGAGCCAGGTGGAC
CGGCCGGTCGACGGGCGGCAGGGGAGCACGGGGAGTGTGGGCTCACGGTCGTGGGTCATG
GCGGGAGCCGAGGCGTCGGCACTGCTGGGACGTGTTCCGGGGCTGTTCTTCTGCGGGGTC
CACGAGATACTGCTCGCGGGGTTGGCGGGGGCGGTGGCGCGCTGCTTCGGCGGCCGTGTG
GTGCTGGTGGACGTGGAGAGTCACGGCCGTCACGCGGTCGGCGGGCTGGATCTGTCCCGG
ACGGTCGGCTGGTTCACCAGCGTGCACCCGGTGCGCCTGGACGTCTCGGGGATCGACCTG
GCGTCCGGGGATCTGGTCAAGGCGGTGAAGGAGCAGTCACGGGCGGTGCCTGATGACGGG
CTCGGTCACGGGCTGTTGCGCCATCTGAACGCCGCGACGGGGCCGGTGCTGGAGGCCATG
CCGTCGCCCTGGATCGGATTCAACTACATGGGCCGGTTCGCCGTGGGCGAGCAGAACGAC
GTGGCGGAGTGGCAGCAGGCCGGTGACATCGGCGGTTCCCTGGACCCGGCCATGGCACTG
CCGCATGCGCTGCAGGTCGACGTGGTCGTCCGGGACATGCCGCAGGGCCCGGAACTGAGG
CTCATGGCGAGCTGGCAGGCCGGCCTCCTCGAAGAGGCCGAGATCGAACGGTTCGGTCGG
ATGTGGCGGGACACGCTGTCCGGCCTGGCCCACCTGGCGGACGACTCCTCGGCCGGGGGG
CACACCGCGTCCGACTTCGACCTTCTCGACCTCGATCAGGACGAGATCGAGGGTTTTGAA
GCCATAGCGACGGACTTCGGCGGAGGTCGGTAA
[1] A35..425
[1] L-4-hydroxyphenylglycine(Hpg)195..295
[1] PCP499..565
[2] C599..890
[2] A1057..1447
[2] tyrosine1224..1319
[2] PCP1521..1587
[2] E1627..2057
[1] A103..1275
[1] L-4-hydroxyphenylglycine(Hpg)583..885
[1] PCP1495..1695
[2] C1795..2670
[2] A3169..4341
[2] tyrosine3670..3957
[2] PCP4561..4761
[2] E4879..6171

close this sectionFeature

BLASTP
Database:UniProtKB:2011_09
show BLAST table
InterPro
Database:interpro:38.0
IPR000873 AMP-dependent synthetase/ligase (Domain)
 [35-425]  2.50000000000001e-108 PF00501 [1057-1447]  3.10000000000005e-113 PF00501
PF00501   AMP-binding
IPR001242 Condensation domain (Domain)
 [599-890]  2.19999999999997e-58 PF00668 [1627-1895]  1.2e-28 PF00668
PF00668   Condensation
IPR006162 Phosphopantetheine attachment site (PTM)
 [524-539]  PS00012
PS00012   PHOSPHOPANTETHEINE
IPR009081 Acyl carrier protein-like (Domain)
 [499-565]  PS50075 [1521-1587]  PS50075
PS50075   ACP_DOMAIN
 [490-567]  7.69999999999997e-46 G3DSA:1.10.1200.10 [1514-1588]  7.69999999999997e-46 G3DSA:1.10.1200.10
G3DSA:1.10.1200.10   ACP_like
 [503-562]  4.70000000000001e-10 PF00550 [1524-1585]  2.6e-12 PF00550
PF00550   PP-binding
 [492-566]  8.80000265836278e-16 SSF47336 [1514-1588]  6.69999687625999e-19 SSF47336
SSF47336   ACP_like
IPR010060 Non-ribosomal peptide synthase (Domain)
 [1903-2057]  5.00000000000001e-34 TIGR01720
TIGR01720   NRPS-para261
IPR010071 Amino acid adenylation domain (Domain)
 [35-425]  7.99999999999993e-128 TIGR01733 [1057-1447]  1.5e-127 TIGR01733
TIGR01733   AA-adenyl-dom
IPR020806 Polyketide synthase, phosphopantetheine-binding domain (Domain)
 [497-568]  4.89999951782359e-06 SM00823 [1522-1590]  3.29999802154459e-09 SM00823
SM00823   PKS_PP
IPR020845 AMP-binding, conserved site (Conserved_site)
 [149-160]  PS00455 [1176-1187]  PS00455
PS00455   AMP_BINDING
SignalP No significant hit
TMHMM No significant hit
Page top