Epoth_00060 : CDS information

close this sectionLocation

Organism
StrainSo ce90
Entry nameEpothilone
Contig
Start / Stop / Direction7,610 / 11,875 / + [in whole cluster]
7,610 / 11,875 / + [in contig]
Location7610..11875 [in whole cluster]
7610..11875 [in contig]
TypeCDS
Length4,266 bp (1,421 aa)
Click on the icon to see Genetic map.

close this sectionAnnotation

Category1.1 PKS
Productpolyketide synthase
Product (GenBank)polyketide synthase
Gene
Gene (GenBank)epoA
EC number
Keyword
Note
Note (GenBank)
  • EPOS A
Reference
ACC
PmId
[10662695] The biosynthetic gene cluster for the microtubule-stabilizing agents epothilones A and B from Sorangium cellulosum So ce90. (Chem Biol. , 2000)
[11217802] Studies on the biosynthesis of epothilones: the biosynthetic origin of the carbon skeleton. (J Antibiot (Tokyo). , 2000)
comment
[PMID: 10662695](2000)
生合成cluster報告論文。
EPOS A: PKS(Mod 0; KSQ, ATa, ER, ACP)
KSQとされているが、Gln(Q)への置換ではない。

module 0のKS domain active siteはcysteine→tyrosineに置換されており、inactiveかもしれない。
ER domainはputative ER signature motif GGVGxAAxQxARで高保存のarginine→glutamineに置換されており、機能しないかもしれない。

epoAのinsertional inactivationでepothilone産生はなくなる。

--
[PMID:11217802](2000)UniProt登録外
取り込み実験によりepothilone骨格の由来を調査。
epothilone生合成はacetateとcysteineに由来するthiazole部分の形成でスタートする。
Related Reference
ACC
Q9KJ00
PmId
[10649995] Cloning and heterologous expression of the epothilone gene cluster. (Science. , 2000)
[10831849] Isolation and characterization of the epothilone biosynthetic gene cluster from Sorangium cellulosum. (Gene. , 2000)
[11564558] Epothilone biosynthesis: assembly of the methylthiazolylcarboxy starter unit on the EpoB subunit. (Chem Biol. , 2001)
comment
2nd(Q9KJ00) 98%, 0.0
Sorangium cellulosum (Polyangium cellulosum)_epoA
EpoA

--
[PMID: 10649995, 10831849](2000)
Strain SMP44
Streptomyces coelicolorでepothilone biosynthetic gene clusterを発現してepothilone A and B産生を確認。

--
[PMID: 11564558](2001)
EpoAでのloadingの機構が図示されている(Fig. 9.)。

AT domainがmalonyl-CoAからmalonyl基をACPにロードし、KSy domainがdecarboxylationしてacetyl-S-EpoA(ACP)をもたらすと提唱されている。ER domainはたぶんnon-functionalである。

close this sectionPKS/NRPS Module

0 malonyl-CoA
KS12..388
AT543..821
er956..1271
ACP1313..1382

close this sectionSequence

selected fasta
>polyketide synthase [polyketide synthase]
MADRPIERAAEDPIAIVGASCRLPGGVIDLSGFWTLLEGSRDTVGRVPAERWDAAAWFDP
DPDAPGKTPVTRASFLSDVACFDASFFGISPREALRMDPAHRLLLEVCWEALENAAIAPS
ALVGTETGVFIGIGPSEYEAALPQATASAEIDAHGGLGTMPSVGAGRISYALGLRGPCVA
VDTAYSSSLVAVHLACQSLRSGECSTALAGGVSLMLSPSTLVWLSKTRALARDGRCKAFS
AEADGFGRGEGCAVVVLKRLSGARADGDRILAVIRGSAINHDGASSGLTVPNGSSQEIVL
KRALADAGCAASSVGYVEAHGTGTTLGDPIEIQALNAVYGLGRDVATPLLIGSVKTNLGH
PEYASGITGLLKVVLSLQHGQIPAHLHAQALNPRISWGDLRLTVTRARTPWPDWNTPRRA
GVSSFGMSGTNAHVVLEEAPAATCTPPAPERPAELLVLSARTASALDAQAARLRDHLETY
PSQCLGDVAFSLATTRSAMEHRLAVAATSREGLRAALDAAAQGQTSPGAVRSIADSSRGK
LAFLFTGQGAQTLGMGRGLYDVWSAFREAFDLCVRLFNQELDRPLREVMWAEPASVDAAL
LDQTAFTQPALFTFEYALAALWRSWGVEPELVAGHSIGELVAACVAGVFSLEDAVFLVAA
RGRLMQALPAGGAMVSIEAPEADVAAAVAPHAASVSIAAVNAPDQVVIAGAGQPVHAIAA
AMAARGARTKALHVSHAFHSPLMAPMLEAFGRVAESVSYRRPSIVLVSNLSGKACTDEVS
SPGYWVRHAREVVRFADGVKALHAAGAGTFVEVGPKSTLLGLVPACMPDARPALLASSRA
GRDEPATVLEALGGLWAVGGLVSWAGLFPSGGRRVPLPTYPWQRERYWIDTKADDAARGD
RRAPGAGHDEVEEGGAVRGGDRRSARLDHPPPESGRREKVEAAGDRPFRLEIDEPGVLDH
LVLRVTERRAPGLGEVEIAVDAAGLSFNDVQLALGMVPDDLPGKPNPPLLLGGECAGRIV
AVGEGVNGLVVGQPVIALSAGAFATHVTTSAALVLPRPQALSAIEAAAMPVAYLTAWYAL
DRIARLQPGERVLIHAATGGVGLAAVQWAQHVGAEVHATAGTPEKRAYLESLGVRYVSDS
RSDRFVADVRAWTGGEGVDVVLNSLSGELIDKSFNLLRSHGRFVELGKRDCYADNQLGLR
PFLRNLSFSLVDLRGMMLERPARVRALLEELLGLIAAGVFTPPPIATLPIARVADAFRSM
AQAQHLGKLVLTLGDPEVQIRIPTHAGAGPSTGDRDLLDRLASAAPAARAAALEAFLRTQ
VSQVLRTPEIKVGAEALFTRLGMDSLMAVELRNRIEASLKLKLSTTFLSTSPNIALLAQN
LLDALATALSLERVAAENLRAGVQNDFVSSGADQDWEIIAL
selected fasta
>polyketide synthase [polyketide synthase]
GTGGCGGATCGTCCCATCGAGCGCGCAGCCGAAGATCCGATTGCGATCGTCGGAGCGAGT
TGCCGTCTGCCCGGTGGCGTGATCGATCTGAGCGGGTTCTGGACGCTCCTCGAGGGCTCG
CGCGACACCGTCGGGCGAGTCCCCGCCGAACGCTGGGATGCAGCAGCGTGGTTTGATCCC
GACCCCGATGCCCCGGGGAAGACGCCCGTTACGCGCGCATCTTTCCTGAGCGACGTAGCC
TGCTTCGACGCCTCCTTCTTCGGCATCTCGCCTCGCGAAGCGCTGCGGATGGACCCTGCA
CATCGACTCTTGCTGGAGGTGTGCTGGGAGGCGCTGGAGAACGCCGCGATCGCTCCATCG
GCGCTCGTCGGTACGGAAACGGGAGTGTTCATCGGGATCGGCCCGTCCGAATATGAGGCC
GCGCTGCCGCAAGCGACGGCGTCCGCAGAGATCGACGCTCATGGCGGGCTGGGGACGATG
CCCAGCGTCGGAGCGGGCCGAATCTCGTATGCCCTCGGGCTGCGAGGGCCGTGTGTCGCG
GTGGATACGGCCTATTCGTCCTCGCTGGTGGCCGTTCATCTGGCCTGTCAGAGCTTGCGC
TCCGGGGAATGCTCCACGGCCCTGGCTGGTGGGGTATCGCTGATGTTGTCGCCGAGCACC
CTCGTGTGGCTCTCGAAGACCCGGGCGCTGGCCAGGGACGGTCGCTGCAAGGCATTTTCG
GCGGAGGCCGATGGGTTCGGACGAGGCGAAGGGTGCGCCGTCGTGGTCCTCAAGCGGCTC
AGTGGAGCCCGCGCGGACGGCGATCGGATATTGGCGGTGATTCGAGGATCCGCGATCAAT
CACGACGGTGCGAGCAGCGGTCTGACCGTGCCGAACGGGAGCTCCCAAGAAATCGTGCTG
AAACGGGCCCTGGCGGACGCAGGCTGCGCCGCGTCTTCGGTGGGTTATGTCGAGGCACAC
GGCACGGGCACGACGCTTGGTGACCCCATCGAAATCCAAGCTCTGAATGCGGTATACGGC
CTCGGGCGAGATGTCGCCACGCCGCTGCTGATCGGGTCGGTGAAGACCAACCTTGGCCAT
CCTGAGTATGCGTCGGGGATCACTGGGCTGCTGAAGGTCGTCTTGTCCCTTCAGCACGGG
CAGATTCCTGCGCACCTCCACGCGCAGGCGCTGAACCCCCGGATCTCATGGGGTGATCTT
CGGCTGACCGTCACGCGCGCCCGGACACCGTGGCCGGACTGGAATACGCCGCGACGGGCG
GGGGTGAGCTCGTTCGGCATGAGCGGGACCAACGCGCACGTGGTGCTGGAAGAGGCGCCG
GCGGCGACGTGCACACCGCCGGCGCCGGAGCGACCGGCAGAGCTGCTGGTGCTGTCGGCA
AGGACCGCGTCAGCCCTGGATGCACAGGCGGCGCGGCTGCGCGACCATCTGGAGACCTAC
CCTTCGCAGTGTCTGGGCGATGTGGCGTTCAGTCTGGCGACGACGCGCAGCGCGATGGAG
CACCGGCTCGCGGTGGCGGCGACGTCGAGGGAGGGGCTGCGGGCAGCCCTGGACGCTGCG
GCGCAGGGACAGACGTCGCCCGGTGCGGTGCGCAGTATCGCCGATTCCTCACGCGGCAAG
CTCGCCTTTCTCTTCACCGGACAGGGGGCGCAGACGCTGGGCATGGGCCGTGGGCTGTAC
GATGTATGGTCCGCGTTCCGCGAGGCGTTCGACCTGTGCGTGAGGCTGTTCAACCAGGAG
CTCGACCGGCCGCTCCGCGAGGTGATGTGGGCCGAACCGGCCAGCGTCGACGCCGCGCTG
CTCGACCAGACAGCCTTCACCCAGCCGGCGCTGTTCACCTTCGAATATGCGCTCGCCGCG
CTGTGGCGGTCGTGGGGTGTAGAGCCGGAGTTGGTCGCCGGCCATAGCATCGGTGAGCTG
GTGGCTGCCTGCGTGGCGGGCGTGTTCTCGCTTGAGGACGCGGTGTTCCTGGTGGCTGCG
CGCGGGCGCCTGATGCAGGCGCTGCCGGCCGGCGGGGCGATGGTGTCGATCGAGGCGCCG
GAGGCCGATGTGGCTGCTGCGGTGGCGCCGCACGCAGCGTCGGTGTCGATCGCCGCGGTC
AACGCTCCGGACCAGGTGGTCATCGCGGGCGCCGGGCAACCCGTGCATGCGATCGCGGCG
GCGATGGCCGCGCGCGGGGCGCGAACCAAGGCGCTCCACGTCTCGCATGCGTTCCACTCA
CCGCTCATGGCCCCGATGCTGGAGGCGTTCGGGCGTGTGGCCGAGTCGGTGAGCTACCGG
CGGCCGTCGATCGTCCTGGTCAGCAATCTGAGCGGGAAGGCTTGCACAGACGAGGTGAGC
TCGCCGGGCTATTGGGTGCGCCACGCGCGAGAGGTGGTGCGCTTCGCGGATGGAGTGAAG
GCGCTGCACGCGGCCGGTGCGGGCACCTTCGTCGAGGTCGGTCCGAAATCGACGCTGCTC
GGCCTGGTGCCTGCCTGCATGCCGGACGCCCGGCCGGCGCTGCTCGCATCGTCGCGCGCT
GGGCGTGACGAGCCGGCGACCGTGCTCGAGGCGCTCGGCGGGCTCTGGGCCGTCGGTGGC
CTGGTCTCCTGGGCCGGCCTCTTCCCCTCAGGGGGGCGGCGGGTGCCGCTGCCCACGTAC
CCTTGGCAGCGCGAGCGCTACTGGATCGACACGAAAGCCGACGACGCGGCGCGTGGCGAC
CGCCGTGCTCCGGGAGCGGGTCACGACGAGGTCGAGGAGGGGGGCGCGGTGCGCGGCGGC
GACCGGCGCAGCGCTCGGCTCGACCATCCGCCGCCCGAGAGCGGACGCCGGGAGAAGGTC
GAGGCCGCCGGCGACCGTCCGTTCCGGCTCGAGATCGATGAGCCAGGCGTGCTTGATCAC
CTCGTGCTTCGGGTCACGGAGCGGCGCGCCCCTGGTCTGGGCGAGGTCGAGATCGCCGTC
GACGCGGCGGGGCTCAGCTTCAATGATGTCCAGCTCGCGCTGGGCATGGTGCCCGACGAC
CTGCCGGGAAAGCCCAACCCTCCGCTGCTGCTCGGAGGCGAGTGCGCCGGGCGCATCGTC
GCCGTGGGCGAGGGCGTGAACGGCCTCGTGGTGGGCCAACCGGTCATCGCCCTTTCGGCG
GGAGCGTTTGCTACCCACGTCACCACGTCGGCTGCGCTGGTGCTGCCTCGGCCTCAGGCG
CTCTCGGCGATCGAGGCGGCCGCCATGCCCGTCGCGTACCTGACGGCATGGTACGCGCTC
GACAGAATAGCCCGCCTTCAGCCGGGGGAGCGGGTGCTGATCCATGCGGCGACCGGCGGG
GTCGGTCTCGCCGCGGTGCAGTGGGCGCAGCACGTGGGAGCCGAGGTCCATGCGACGGCC
GGCACGCCCGAGAAACGCGCCTACCTGGAGTCGCTGGGCGTGCGGTATGTGAGCGATTCC
CGCTCGGACCGGTTCGTCGCCGACGTGCGCGCGTGGACGGGCGGCGAGGGAGTAGACGTC
GTGCTCAACTCGCTCTCGGGCGAGCTGATCGACAAGAGTTTCAATCTCCTGCGATCGCAC
GGCCGGTTTGTGGAGCTCGGCAAGCGCGACTGTTACGCGGATAACCAGCTCGGGCTGCGG
CCGTTCCTGCGCAATCTCTCCTTCTCGCTGGTGGATCTCCGGGGGATGATGCTCGAGCGG
CCGGCGCGGGTCCGTGCGCTCTTGGAGGAGCTCCTCGGCCTGATCGCGGCAGGCGTGTTC
ACCCCTCCCCCCATCGCGACGCTCCCGATCGCCCGTGTCGCCGATGCGTTCCGGAGCATG
GCGCAGGCGCAGCATCTTGGGAAGCTCGTACTCACGCTGGGTGACCCGGAGGTCCAGATC
CGTATTCCAACCCACGCAGGCGCCGGCCCGTCCACCGGGGATCGGGACCTGCTCGACAGG
CTCGCGTCAGCTGCGCCGGCCGCGCGCGCGGCGGCGCTGGAGGCGTTCCTCCGTACGCAG
GTCTCGCAGGTGCTGCGCACGCCCGAAATCAAGGTCGGCGCGGAGGCGCTGTTCACCCGC
CTCGGCATGGACTCGCTCATGGCCGTGGAGCTGCGCAATCGTATCGAGGCGAGCCTCAAG
CTGAAGCTGTCGACGACGTTCCTGTCCACGTCCCCCAATATCGCCTTGTTGGCCCAAAAC
CTGTTGGATGCTCTCGCCACAGCTCTCTCCTTGGAGCGGGTGGCGGCGGAGAACCTACGG
GCAGGCGTGCAAAACGACTTCGTCTCATCGGGCGCAGATCAAGACTGGGAAATCATTGCC
CTATGA
[0] KS12..388
[0] AT543..821
[0] malonyl-CoA736..740
[0] er956..1271
[0] ACP1313..1382
[0] KS34..1164
[0] AT1627..2463
[0] malonyl-CoA2206..2220
[0] er2866..3813
[0] ACP3937..4146

close this sectionFeature

BLASTP
Database:UniProtKB:2011_09
show BLAST table
InterPro
Database:interpro:38.0
IPR001227 Acyl transferase domain (Domain)
 [536-668]  2.59999999999995e-75 G3DSA:3.40.366.10 [736-851]  2.59999999999995e-75 G3DSA:3.40.366.10
G3DSA:3.40.366.10   Ac_transferase_reg
IPR006162 Phosphopantetheine attachment site (PTM)
 [1340-1355]  PS00012
PS00012   PHOSPHOPANTETHEINE
IPR009081 Acyl carrier protein-like (Domain)
 [1302-1383]  1.2e-13 G3DSA:1.10.1200.10
G3DSA:1.10.1200.10   ACP_like
 [1313-1382]  PS50075
PS50075   ACP_DOMAIN
 [1317-1379]  4.5e-08 PF00550
PF00550   PP-binding
 [1306-1385]  1.09999909120787e-14 SSF47336
SSF47336   ACP_like
IPR011032 GroES-like (Domain)
 [947-1096]  2.10000267834031e-30 SSF50129
SSF50129   GroES_like
IPR013149 Alcohol dehydrogenase, C-terminal (Domain)
 [1100-1232]  4.5e-27 PF00107
PF00107   ADH_zinc_N
IPR013154 Alcohol dehydrogenase GroES-like (Domain)
 [974-1037]  1.3e-06 PF08240
PF08240   ADH_N
IPR014030 Beta-ketoacyl synthase, N-terminal (Domain)
 [12-262]  7.79999999999999e-90 PF00109
PF00109   ketoacyl-synt
IPR014031 Beta-ketoacyl synthase, C-terminal (Domain)
 [271-388]  1.99999999999999e-41 PF02801
PF02801   Ketoacyl-synt_C
IPR014043 Acyl transferase (Domain)
 [543-821]  1.2e-58 PF00698
PF00698   Acyl_transf_1
IPR016035 Acyl transferase/acyl hydrolase/lysophospholipase (Domain)
 [539-852]  5.50001754266469e-73 SSF52151
SSF52151   Acyl_Trfase/lysoPlipase
IPR016036 Malonyl-CoA ACP transacylase, ACP-binding (Domain)
 [670-735]  1.09999909120787e-14 SSF55048
SSF55048   Malonyl_transacylase_ACP-bd
IPR016038 Thiolase-like, subgroup (Domain)
 [14-275]  5.40000000000002e-91 G3DSA:3.40.47.10 [276-441]  4.9e-61 G3DSA:3.40.47.10
G3DSA:3.40.47.10   Thiolase-like_subgr
IPR016039 Thiolase-like (Domain)
 [4-387]  5.10003242202156e-95 SSF53901
SSF53901   Thiolase-like
IPR016040 NAD(P)-binding domain (Domain)
 [1054-1245]  4.70000000000003e-58 G3DSA:3.40.50.720
G3DSA:3.40.50.720   NAD(P)-bd
IPR020801 Polyketide synthase, acyl transferase domain (Domain)
 [544-842]  4.99996518094122e-107 SM00827
SM00827   PKS_AT
IPR020806 Polyketide synthase, phosphopantetheine-binding domain (Domain)
 [1314-1385]  6.90001478563507e-17 SM00823
SM00823   PKS_PP
IPR020841 Polyketide synthase, beta-ketoacyl synthase domain (Domain)
 [14-441]  SM00825
SM00825   PKS_KS
IPR020843 Polyketide synthase, enoylreductase (Domain)
 [956-1271]  SM00829
SM00829   PKS_ER
SignalP No significant hit
TMHMM No significant hit
Page top