Amph_00050 : CDS information

close this sectionLocation

Organism
StrainATCC 14899 (=NBRC 12895)
Entry nameAmphotericin B
Contig
Start / Stop / Direction33,584 / 50,518 / + [in whole cluster]
33,584 / 50,518 / + [in contig]
Location33584..50518 [in whole cluster]
33584..50518 [in contig]
TypeCDS
Length16,935 bp (5,644 aa)
Click on the icon to see Genetic map.

close this sectionAnnotation

Category1.1 PKS
Productpolyketide synthase
Product (GenBank)AmphJ
Gene
Gene (GenBank)amphJ
EC number
Keyword
Note
Note (GenBank)
  • polyketide synthase multienzyme polypeptide housing extension modules 15, 16, and 17
Reference
ACC
PmId
[11451671] Amphotericin biosynthesis in Streptomyces nodosus: deductions from analysis of polyketide synthase and late genes. (Chem Biol. , 2001)
comment
Amphotericin生合成gene clusterの報告。
amphJ: PKS protein, modules 15-17

polyketide構造からはDH15 and DH17は不要。
でもDH15はactive DH domainのmotif保存あり。
DH17はacitve siteでの置換があり、明らかにinactive.

close this sectionPKS/NRPS Module

15 malonyl-CoA
16 malonyl-CoA
17 malonyl-CoA
KS39..414
AT583..892
dh940..1102
KR1420..1600
ACP1704..1774
KS1798..2169
AT2308..2621
DH2668..2830
ER3159..3462
KR3472..3655
ACP3756..3826
KS3851..4221
AT4365..4676
dh4725..4888
KR5207..5387
ACP5486..5556

close this sectionSequence

selected fasta
>polyketide synthase [AmphJ]
MEQTMNAPAENVVAALRAAVKETERLRRRNRTIVAAAREPIAVVGMGCRFPGGVDSPQAL
WEMVAGGTDVISEFPDDRGWDLEALRTSGIDDRDTSVSQRGGFLDSIADFDPGFFGISPR
EAVTMDPQQRLLLETAWEAIERARIDATRLRGTRTGTFIGTNGQDYAYLLVRSLDDATGD
VGTGIAASAVSGRLSYTFGLEGPAITVDTACSSSLVALHLAVQSLRNGECTLALAGGVNV
MSTPGSLVEFSRQGGLAGDGRCKAFSDSADGTGWSEGAAVLALERLSDAQRNGHPVLAVI
RGSAVNQDGASNGFTAPNGPSQQRVIRQALSNAGLNPADVDVVEAHGTGTPLGDPIEAQS
ILATYGQDREQPLLLGSIKSNIGHTQSAASGVAGIMKMIMAMRNEVLPKTLHVDRPSTHV
DWTAGKVELLTENRPWPTAPDRPRRSGVSSFGVSGTNAHVIVEQAPQTPAHQPEEAPADP
SDEAPAAPRTAGVLPWVLSARSAAALREQAAALLAHLDAPGAPGALDTGYSLATTRASLE
HRLAVVTGADGTAGREALTSWLAGDPAPDAHEGRPVGRTRSAFLFSGQGAQRLGMGRELH
ARFPVFAEALDQVLDLLDEELDASLGDIIWGEEEAPLNETGFTQPALFAVEVALYRLVES
WGVTPDFVAGHSIGEIAAAHVAGVFSLEDACRLVAARAGLMQALPSGGAMVAVEATEDEV
LPLLTEGVAVAAVNGPTSVVVSGEEKATLAVAEQLAAKGRRTSRLRVSHAFHSPLMDPML
DDFRAVAETLSYDEPQLPVVSNLTGTLAADGQLTSPEYWVRHVREAVRFADGVRALAEAG
ADVLLELGPDGVLAALAQQSATALTVPFLRKDRPEEHSAVTALARLHTAGVTVDWAAFHD
GTGARTGELPTYAFQHERYWPKATATAVDATGLGLASADHPLLGAAMSVAGSDELLLTGT
LSLATHPWLADHSVDGMVVFPGTGFLELAVRAADQAGCDRVQELTVATPLVLPATGAVQM
QISVGAAGEDGSRELRFFTRPGEDFDAEWTQHATGRIASGEHVLGFDTTVWPPRDSEAVD
IEELFDRFASDGLEYGPVFRGLRAAWRQDDTVYAEVELPDSVEDAGAFGLHPALLHAALH
GTAFLSDDSGLLPFAWEGVSLHADGASTLRVRIASCGEDTVEIAAVDPAGQPVLSVESLT
LRAADSGAGAASRREEANSLFRVDWTPRTVAAPAAPATWAVLGEDPFGLTAALAGDSEAV
AGVHAPAATLEELAARSGAVPDMVAVTVRGDADGGPEAARELTREVLALVQGWLAEPAFA
SSRLVVVTRDAVADGERGAVDLAAAPVWGLVRSAQSENPGRLLLADVDDTADSLARLPLL
AGLFDAEEPQAVVREGTVRVGRLARLESGDSSARALDPEGTVLVTGGTGGLASALARHLV
AEHDIRHLLLTSRRGPDAPGAADLVQALAELGAEARVAACDVADREALAALLASVPAEHP
LTAVVHTAGVLDDGIFPSLTPDRLDSVMRPKVDAAWHLHDLTRDLDLAAFVLYSSTAGVM
GSPGQANYAAGNTFLDALAAHRQSLGLPATSLAWGAWEQGVGMTGQLSGQDARRISDAGG
MPLLSVERGLALYDAAMLADEPLVVPLGLGGGGPLPAGAGVPAILRGLVRTGGRRARAAT
AAVARAGLAERLAVLPEEQRRPFVVDLVRAEAAAVLGHGSADAVDSRREFRGLGFDSLTA
IELRNRLGKATGLTLPATLVFDYPTPEQLADHLLDELLGADAIEVFAASQSAADVHDDPV
VIVGMGCRFPGGVGSPEDLWDLLASGSDAITGFPADREWESSRLVAGEAGGVSAQGGFLS
DIAGFDADFFGISPREALAMDPQQRILLEVTWEAIERAGVDPTALRGSRTGVFMGVNGQD
YSSLVMGSRDDVAGHATAGLAVSVVSGRLSYALGLEGPALSVDTACSSSLVSLHLAAQAL
RSGECSMALVGGVTVMTTPANFAGFSRMGGLAQDGRCKAFSDSADGTGWSEGAAVLVVER
LSDARRAGHRVLAVVRGSAVNQDGASNGLTAPNGPSQQRVIRQALANAGLRPGDVDAVEA
HGTGTPPGDPIEAQALLATYGSDRDPQQPLLLGSVKSNIGHTQAAAGVAGLVKMVMAMRN
GVLPRTLHITEPSTHVDWSLGAVQVLTEETAWPETGRVRRAGVSSFGISGTNAHVILEGA
PDEPVPAPVADRPVPGAVAWPVSAKSEGALDDQAERLRESADALPALDTAYTLATGRADF
EHRAVLLAADGTLTEVARGVAEPHRSAFLFSGQGAQRLGMGRELHARFPVFAEAFDSVTA
LLESELDTSVREVMWGTDEGALNATAFTQPALFAVEVALYRLVESWGVTPDFVAGHSVGE
IAAAHVAGVFSLEDACRLVAARAGLMQALPSGGAMVAVEATEDEVLPLLTEGVAVAAVNG
PTSVVVSGEEQATLAVAEQLAAQGRRTSRLRVSHAFHSPLMDPMLEDFRAVAETLSYHEP
RIPVVSNLTGEVAAAGVHTHPDYWVRHVREAVRFADGVRGLADRGVTALLEIGPDGVLSA
LAAASLTDTDTVVVPALRKDRDETVSVLSGVARLYVAGVDVDWSAPLSGAGARIADVPTY
AFQHERYWPKAAPAALDATGLGLASADHPLLGAAMSVAGSDELLLTGSLSAATHPWLADH
VVGGMIFFPGTGFLELAVRAADQAGCDRVEELMIAAPLVLPATGAVQVQISVGAADEEGS
RELRFFTRPGEDFDAEWTQHATGRIGSGEQVIDFDATVWPPRDAEAIDIDGMFERYAADG
LEYGPVFRGLRAVWRQDDTVYAEVALPESVEDADAFGLHPALFDAALHSTVFLSAEGDTR
SLLPFAWEGVSLHADGASTLRVRIASCGEDTVQIAAVDPGGQPVVSVESLTLRAAGPGDA
AEPRRDDSNSLLRVDWTARTLGAPAAPATWAVLGEDPFGLTAALAGDSEAVAGVHAPAAT
IEELAARSGAVPDMVAVTVRGDADAGPDDAHELAHEVLALVQGWLAEPAFASSRLVVVTR
NAVADGERGAVDLAAAPVWGLVRSAQSENPGRLLLADVDDTADSLARLPLLAGLFDAEEP
QAVVREGTVRVGRLARLESGTSLVPPAGTPWRLGCRAKGSLDGLALLPYPEAVTPLTGRE
VRIGIRAAGLNFRDVLNALGMYPGEAGLFGSEGAGVVSEVGPDVTGLAPGDRVMGMVFGG
FGPLGVADERLLTRVPDDWSWETASSVPLVFLTAYHALKDLAGLRPGEKILIHAGAGGVG
MAAIQIAHHLGAEVFATASEGKWDVLRSLGVADDHIASSRTLDFETAFTEVAGDKGLDVV
LNALAGEFVDASMRLLGTGGRFLEMGKTDIRDSDAASDGITYRFFDLGMVDPDHIQQMLL
DLVDLFERDVLSPLPVRAWDVRRSREAFRFMSMAKHIGKIVLTMPRAMDPEGTVLVTGGT
GGLASALARHLVAEHGIKHLLLTSRRGPDAPGAADLVQALAELGAEARVAACDVADRDAL
AGLLASVPAERPLTAVVHTAGVLDDGILASLTPDRLDTVMRPKVDAAWHLHDLTRDLDLA
AFVLYSSTSGVFGSPGQANYAAGNTFLDALAAHRQSLGLPATSLAWNAWEQGSGMTSGLS
DEDMRRINDNSGMPLLSVERGLALYDAATLADEPLVVPLGLGGGGSLPPGMSVPAILRGL
VRTGGRRAKAGAAAVARAGLAERLAVLPEEQRLPFVVDLVRAEAATVLGHGSADAVDARR
EFRGLGFDSLTAIELRNRLGKASGLTLTATLVFDYPTPQQLAEHLLDELLGADAAEAFAA
PQTAAAASDDDPVVIVGMGCRFPGGVGSPEELWDLVASGTDAITGFPADREWESSTIGGE
PGDLSGQGGFLSDIADFDADFFGIAPREALAMDPQQRILLEVTWEAVERAGLDPTALRGS
RTGVFMGVSGQDYSGLVMRSRDDIASHATTGLAVSVVSGRLSYTLGLEGPALSVDTACSS
SLVSLHLAAQALRSGECTMALAGGVTVMTTPANFTGFSKMGGLAHDGRCKAFSDSADGTG
WSEGAAVLVLERLSDARRAGHRVLAVVRGSAVNQDGASNGLTAPNGPSQQRVIRQALANA
GLRPGDVDAVEAHGTGTPLGDPIEAQALIATYGSDRDPQQPLLLGSVKSNIGHTQSAAGA
AGLVKMVMAMHQGTLPRTLHVTEPSTHVDWSLGAVRLLTEETAWPETGRVRRAGVSSFGI
SGTNAHVILEGAPEPTADDRSAEDTVTPAVTPWVVSARSEQALDAQLERLRAHAAAHPEL
SGADIGLSLVTSRPSFEHRAVLLAGPDGITEAARAEAGTARTPAFLFSGQGAQRLGMGRE
LHARFPVFAEVFDSVTALLESELGTSVREVMWGTDEAALNSTAFTQPALFAVEVALYRLV
ESWGVTPDFVAGHSVGEIAAAHVAGVFSLEDACRLVAARARLMDALPRGGAMAAVEATED
EVLPLLDDGVAVAAVNGPTSVVVSGPEDGVDQLVALLESDGRRTTRLRVSHAFHSSLMDP
MLEDFRAVAETLSYHEPRIPVVSNLTGEVASAGTHTHPDYWVRHVREAVRFADGVRALAD
RGVTAFLEIGPDGVLSALAAASLPDTGTVVVPALRKDRDETVSVLTALARLHTAGLDTDW
SAHFAGTGARTVELPTYAFQGTRFWPDTTAAPGDAGGLGLDAGGHPLLTAATSVAGSDET
LLSGRLSAAAQPWLTGRTENGTTILPTAVLAELALHAAEACDRTTVENLTVGAPLALTGN
RPQRLQVLVGVPDETGRRTLTVHTRADGDDAPWVERATAMLTDAPAAATPDTVWPPADAT
PVDELPEPTGPSVLRAAWRRGGDVFAEVEITEQSPAEQAFALHPALLDTAVRAAVLLEGD
GGGSGDDTLDAVAWDGLVLHAAHPVLLRVRLTATGDDTWALEATDPQGGPVLSVASVTLG
ATVAAPVTGAPATDDAALLALDWVAPAPAPRSGDSGPWTVLGDALPGLDTALAAVDNVLV
TRADSLAELLDSGAPLPSLMLLPVEGGPTAGHDLPAAVRAATTRVLDLLRRWTSDPRTAD
SRLAIVTRGAVAAGREDVTDLAAAAVWGLVRSAQSENPGCFLLLDLDPADAAEATDAAVL
ASLPALFDAGETQAAVRGGALTVARLTRTEATPAPAADQVRAWDRDGTVLITGGTGGLGA
VLARHLVTGHGIKHLLLAGRRGPDAPGATALSKELSALGAEVTVRACDVSDRSAVDALLA
GLPAEHPLTAVVHTAGVLDDATIGTLTAEQLDTVLRPKADAAWHLHQATRALPLAGFVLY
SSVAGVTGGPGQGNYAAANTFLDALAAHRAAQGLPALSLAWGPWGQGAGMTGTLSDADLE
RMERSGMPPLTERQGLALFDAANGHDEALAVAIRVSRSAAAPDAGEVPAVLRSLVRARRR
AAATAGADGLTRRLAGLGAEQRHETLVGLVRQETAGVLGHSGADAVPADRDFSRLGFDSL
MAVELRTRLSAATGVRLPSTLVFDHPTPAAVARHLADSLTGQDRSGTAASPLAALDRLEA
ELSADGVDEAVRRGVEGRLRRLLAAWDGTGSDGNGPAVEERIEAASAEEIFAFIDNELGR
SSDS
selected fasta
>polyketide synthase [AmphJ]
ATGGAGCAGACGATGAACGCCCCCGCTGAGAACGTAGTTGCGGCACTGCGCGCCGCGGTC
AAGGAGACCGAGCGGCTGCGACGCCGCAACCGGACGATCGTGGCCGCGGCCCGGGAGCCG
ATCGCCGTGGTCGGCATGGGCTGCCGCTTCCCCGGCGGCGTCGACTCCCCGCAGGCCCTG
TGGGAGATGGTCGCCGGCGGCACCGATGTGATCTCCGAGTTCCCGGACGACCGGGGCTGG
GACCTGGAGGCGCTGCGCACCAGCGGCATCGACGACCGGGACACCTCCGTCAGCCAGCGG
GGCGGATTCCTGGACTCCATCGCCGACTTCGACCCCGGCTTCTTCGGGATCTCGCCGCGC
GAGGCCGTCACCATGGACCCGCAGCAGCGGCTGCTCCTGGAGACGGCCTGGGAGGCGATC
GAGCGCGCCCGCATCGACGCCACCCGGCTGCGCGGCACCCGCACCGGCACGTTCATCGGC
ACCAACGGGCAGGACTACGCCTATCTGCTGGTGCGGTCCCTGGACGACGCCACCGGGGAC
GTCGGCACCGGCATCGCCGCCAGTGCCGTCTCCGGCCGGCTGTCGTACACCTTCGGCCTG
GAGGGGCCGGCGATCACCGTCGACACCGCCTGCTCCTCCTCGCTGGTGGCGCTGCACCTC
GCCGTGCAGTCGCTGCGCAACGGCGAGTGCACCCTGGCGCTGGCCGGCGGCGTCAATGTG
ATGTCGACGCCGGGCTCCCTGGTCGAGTTCAGCCGGCAGGGCGGTCTCGCCGGGGACGGT
CGCTGCAAGGCGTTCTCCGACTCCGCGGACGGTACCGGCTGGTCCGAGGGCGCGGCCGTG
CTCGCCCTGGAGCGCCTCTCCGACGCCCAGCGCAACGGCCACCCCGTGCTCGCCGTCATC
CGCGGCTCCGCCGTCAACCAGGACGGTGCCTCCAACGGCTTCACCGCCCCCAACGGCCCC
TCCCAGCAGCGCGTCATCCGGCAGGCCCTCTCCAACGCGGGCCTGAACCCGGCCGATGTC
GACGTGGTCGAGGCGCACGGCACCGGCACTCCGCTCGGCGACCCCATCGAGGCCCAGAGC
ATCCTCGCCACCTACGGCCAGGACCGTGAACAGCCCCTGCTGCTCGGCTCGATCAAGTCC
AACATCGGCCACACCCAGTCCGCCGCGTCCGGTGTCGCCGGGATCATGAAGATGATCATG
GCGATGCGCAACGAGGTGCTGCCGAAGACCCTGCACGTCGACCGGCCCTCCACCCATGTC
GACTGGACGGCGGGCAAGGTCGAGCTGCTCACCGAGAACCGCCCCTGGCCCACCGCGCCC
GACCGCCCCCGCCGCTCCGGCGTCTCCTCCTTCGGCGTCAGCGGCACCAACGCCCATGTC
ATCGTCGAGCAGGCGCCTCAGACTCCCGCCCATCAGCCGGAAGAGGCGCCGGCCGACCCG
TCGGACGAGGCACCGGCCGCGCCCCGCACCGCCGGTGTCCTGCCCTGGGTGCTCTCGGCG
CGCTCCGCCGCCGCCCTGCGCGAGCAGGCCGCCGCCCTCCTCGCCCATCTGGACGCCCCC
GGTGCCCCCGGCGCGCTGGACACCGGGTACTCGCTGGCCACCACCCGCGCGTCCCTGGAG
CACAGACTCGCCGTCGTCACCGGCGCGGACGGCACCGCCGGCCGTGAGGCGCTCACCTCC
TGGCTGGCGGGGGACCCCGCCCCGGACGCCCACGAGGGCCGGCCCGTCGGCCGCACCCGC
AGCGCCTTCCTGTTCTCCGGGCAGGGCGCCCAGCGCCTCGGCATGGGCCGTGAACTGCAC
GCCCGCTTCCCGGTGTTCGCCGAAGCCCTGGACCAGGTCCTGGATCTGCTCGACGAGGAA
CTCGACGCGAGCCTCGGCGACATCATCTGGGGCGAGGAGGAAGCCCCGCTGAACGAGACG
GGCTTCACCCAGCCCGCCCTGTTCGCGGTCGAGGTCGCCCTCTACCGCCTGGTGGAGTCC
TGGGGCGTCACCCCCGACTTCGTCGCCGGCCACTCCATCGGTGAGATCGCGGCGGCGCAT
GTGGCGGGGGTGTTCTCGCTGGAGGACGCCTGCCGTCTGGTGGCCGCACGTGCCGGGCTG
ATGCAGGCGCTGCCGAGTGGTGGTGCCATGGTCGCCGTGGAGGCGACCGAGGACGAGGTG
CTGCCGCTGCTGACCGAGGGCGTGGCTGTCGCCGCGGTCAACGGCCCGACGTCTGTGGTC
GTTTCGGGTGAGGAGAAGGCCACTCTCGCGGTCGCCGAGCAGTTGGCGGCGAAGGGGCGT
CGTACCAGCCGTCTGCGGGTGAGCCATGCCTTCCACTCGCCGCTGATGGACCCCATGCTC
GACGACTTCCGCGCGGTCGCCGAGACACTCTCCTACGACGAGCCACAGCTCCCGGTCGTC
TCCAACCTGACCGGTACCCTCGCCGCCGACGGGCAGTTGACCAGCCCGGAGTACTGGGTG
CGTCATGTCCGCGAGGCCGTCCGGTTCGCCGACGGCGTCCGCGCGCTCGCCGAGGCCGGC
GCCGACGTCCTGCTCGAACTCGGCCCCGACGGTGTGCTCGCCGCCCTGGCCCAGCAGTCC
GCCACCGCCCTCACCGTCCCGTTCCTGCGCAAGGACCGCCCCGAGGAGCACAGCGCCGTC
ACCGCGCTGGCCCGGCTGCACACGGCCGGTGTCACGGTCGACTGGGCCGCGTTCCACGAC
GGCACCGGTGCCCGCACCGGCGAACTGCCCACCTACGCCTTCCAGCACGAGCGTTACTGG
CCCAAGGCGACCGCGACCGCTGTCGACGCCACCGGTCTCGGCCTCGCCTCCGCCGACCAT
CCGCTGCTGGGCGCCGCGATGTCGGTGGCCGGTTCCGACGAACTCCTGCTCACCGGCACC
CTGTCGCTCGCCACCCACCCCTGGCTCGCCGACCACAGCGTCGACGGCATGGTCGTCTTC
CCCGGTACCGGGTTCCTGGAGCTGGCGGTGCGCGCCGCCGACCAGGCCGGCTGCGACCGG
GTCCAGGAACTGACCGTCGCCACCCCGCTCGTGCTGCCCGCCACGGGCGCCGTGCAGATG
CAGATCTCCGTCGGCGCCGCCGGCGAGGACGGCTCCCGTGAGCTGCGCTTCTTCACCCGG
CCCGGTGAGGACTTCGACGCCGAGTGGACCCAGCACGCCACCGGCCGCATCGCCTCCGGC
GAGCACGTCCTCGGCTTCGACACCACGGTGTGGCCGCCGCGCGACTCCGAGGCCGTCGAC
ATCGAGGAACTGTTCGACCGCTTCGCCTCCGACGGTCTGGAGTACGGCCCGGTCTTCCGC
GGACTGCGCGCCGCCTGGCGCCAGGACGACACGGTCTACGCCGAGGTCGAACTGCCCGAC
TCCGTCGAGGACGCCGGTGCCTTCGGTCTGCACCCGGCCCTGCTGCACGCCGCCCTGCAC
GGCACCGCCTTCCTGTCCGACGACAGTGGACTGCTGCCGTTCGCCTGGGAGGGCGTGTCC
CTGCACGCGGACGGGGCGAGCACCCTGCGGGTGCGGATCGCCTCCTGCGGCGAGGACACG
GTGGAGATCGCGGCGGTCGACCCGGCGGGTCAGCCGGTCCTCTCCGTGGAGTCGCTGACG
CTGCGCGCCGCGGACTCGGGAGCCGGCGCCGCGTCGCGCCGTGAGGAGGCCAACTCGCTG
TTCCGCGTCGACTGGACGCCTCGTACGGTGGCGGCGCCGGCTGCGCCCGCCACCTGGGCG
GTGCTGGGCGAGGACCCGTTCGGCCTGACGGCCGCGCTGGCCGGTGACTCCGAGGCGGTG
GCGGGAGTGCATGCTCCGGCCGCGACGCTCGAGGAGCTGGCCGCCCGGTCCGGGGCGGTG
CCCGACATGGTCGCCGTGACGGTGCGCGGTGACGCCGATGGGGGACCCGAAGCCGCTCGC
GAGCTGACGCGTGAGGTGCTGGCCCTGGTGCAGGGGTGGCTGGCCGAGCCCGCCTTCGCC
TCCTCCCGGCTTGTGGTGGTCACCCGGGATGCGGTCGCCGACGGTGAGCGTGGTGCGGTC
GATCTGGCCGCCGCTCCTGTGTGGGGTCTGGTGCGTTCCGCCCAGTCCGAGAACCCGGGC
CGGCTGCTGCTGGCGGATGTCGACGACACCGCCGACTCGCTCGCCCGACTGCCGCTGCTT
GCGGGGCTGTTCGACGCGGAGGAGCCGCAGGCCGTCGTCCGTGAGGGCACGGTCCGGGTC
GGCCGGCTCGCCCGGCTGGAGTCCGGTGACTCTTCCGCCCGCGCCCTGGACCCCGAGGGG
ACCGTCCTGGTCACCGGTGGCACCGGCGGCCTCGCCTCCGCGCTGGCCCGCCACTTGGTC
GCCGAGCACGACATCAGGCATCTGCTGCTGACCAGCCGTCGTGGGCCCGACGCCCCGGGT
GCCGCCGACCTCGTCCAGGCGCTGGCCGAACTGGGTGCCGAGGCCAGGGTCGCCGCATGT
GATGTCGCCGACCGTGAAGCTCTGGCCGCGCTGTTGGCCTCCGTTCCGGCGGAGCACCCG
CTGACCGCGGTGGTGCACACGGCGGGTGTCCTGGACGACGGCATCTTCCCCTCGCTCACC
CCGGACCGTCTCGACTCGGTCATGCGGCCGAAGGTCGATGCCGCCTGGCATCTGCACGAC
CTCACCCGCGACCTCGACCTGGCCGCCTTCGTCCTGTACTCCTCGACGGCCGGTGTCATG
GGCAGCCCCGGTCAGGCCAACTACGCGGCGGGCAACACCTTCCTGGACGCGCTTGCCGCG
CACCGTCAGTCCCTCGGTCTGCCCGCCACCTCCCTCGCCTGGGGAGCCTGGGAACAAGGC
GTCGGCATGACCGGCCAGTTGAGCGGTCAGGACGCACGCCGCATCAGCGACGCCGGTGGG
ATGCCGCTGCTGTCCGTCGAGCGGGGCCTTGCCCTGTACGACGCCGCCATGCTCGCCGAC
GAGCCGCTGGTGGTGCCGCTCGGTCTGGGCGGCGGTGGCCCGTTGCCCGCCGGTGCCGGC
GTCCCCGCGATCCTGCGCGGACTGGTCCGCACCGGTGGACGCCGCGCCCGGGCGGCCACC
GCCGCGGTCGCCCGCGCCGGACTGGCCGAGCGGCTGGCCGTCCTGCCCGAGGAGCAGCGT
CGGCCGTTCGTCGTGGACCTGGTGCGTGCGGAGGCCGCCGCGGTCCTCGGGCACGGTTCC
GCCGACGCCGTGGACTCCCGTCGTGAGTTCCGTGGTCTCGGCTTCGACTCGCTCACCGCG
ATCGAGCTGCGCAACCGGCTCGGCAAGGCGACCGGTCTCACCCTCCCCGCCACGCTCGTC
TTCGACTACCCGACGCCCGAGCAGCTCGCCGACCATCTCCTGGACGAACTCCTCGGCGCC
GACGCCATCGAGGTCTTCGCCGCCTCTCAGAGCGCTGCCGATGTGCACGACGATCCCGTG
GTCATCGTCGGGATGGGCTGCCGCTTCCCGGGTGGCGTCGGATCGCCCGAGGACCTGTGG
GACCTCCTGGCGTCCGGTTCCGACGCCATCACCGGCTTCCCCGCCGATCGCGAGTGGGAA
TCGTCGAGGCTCGTCGCCGGTGAGGCCGGCGGCGTGTCCGCGCAGGGCGGGTTCCTGAGC
GACATCGCCGGCTTCGACGCGGACTTCTTCGGGATCTCGCCGCGCGAGGCCCTGGCCATG
GACCCGCAGCAGCGCATCCTGCTCGAGGTCACCTGGGAGGCCATCGAGCGCGCCGGGGTC
GACCCGACCGCCCTGCGCGGCAGCCGCACCGGTGTCTTCATGGGCGTCAACGGCCAGGAC
TACTCCAGCCTGGTCATGGGGTCCCGTGACGACGTCGCGGGCCACGCCACCGCCGGTCTG
GCCGTCAGTGTGGTCTCCGGCCGACTCTCCTACGCGCTCGGCCTGGAGGGTCCCGCCCTC
TCGGTCGACACGGCCTGTTCCTCCTCCCTGGTGTCGCTGCATCTCGCCGCGCAGGCGCTG
CGGTCGGGGGAGTGCAGCATGGCTCTGGTGGGTGGTGTCACCGTGATGACCACCCCCGCC
AACTTCGCCGGGTTCTCCCGGATGGGCGGTCTCGCCCAGGACGGACGCTGCAAGGCGTTC
TCCGACTCCGCCGACGGCACCGGCTGGTCCGAGGGCGCGGCCGTGCTCGTGGTGGAGCGC
CTCTCCGACGCCCGCCGCGCCGGACACCGGGTGCTCGCCGTGGTGCGTGGCTCCGCGGTG
AACCAGGACGGTGCCTCCAACGGCCTGACCGCCCCCAACGGCCCCTCCCAGCAGCGGGTC
ATCCGGCAGGCACTGGCCAACGCCGGTCTGCGTCCCGGCGATGTCGACGCCGTCGAGGCG
CATGGCACCGGCACACCGCCCGGTGACCCGATCGAGGCCCAGGCCCTGCTGGCCACCTAT
GGCTCCGACCGTGACCCGCAGCAGCCGCTGCTGCTCGGCTCGGTGAAGTCCAACATCGGC
CACACCCAGGCCGCCGCCGGTGTCGCCGGGCTCGTCAAGATGGTCATGGCGATGCGCAAC
GGCGTGCTGCCGCGCACCCTGCACATCACCGAGCCGTCCACGCATGTCGACTGGTCCCTG
GGCGCCGTACAGGTGCTCACCGAGGAGACCGCCTGGCCGGAGACGGGCCGGGTCCGCAGG
GCCGGTGTGTCCTCCTTCGGCATCAGCGGCACCAACGCCCATGTCATCCTCGAGGGCGCC
CCCGACGAGCCGGTGCCCGCGCCTGTCGCCGACCGGCCCGTGCCCGGCGCCGTCGCCTGG
CCCGTCTCGGCGAAGTCCGAGGGCGCGCTCGACGACCAGGCGGAGCGGCTGCGGGAGTCC
GCCGACGCGCTGCCCGCGCTCGACACCGCCTACACCCTGGCCACCGGCCGAGCCGACTTC
GAGCACCGGGCGGTCCTGCTCGCAGCCGACGGCACCCTCACCGAGGTCGCCCGGGGCGTC
GCCGAACCGCACCGCAGTGCCTTCCTGTTCTCCGGTCAGGGCGCCCAGCGCCTCGGCATG
GGGCGTGAACTGCACGCCCGTTTCCCGGTGTTCGCCGAGGCCTTCGACTCCGTCACCGCC
CTGCTGGAGAGCGAACTCGACACCTCCGTCCGTGAGGTGATGTGGGGCACCGACGAAGGC
GCCCTGAACGCCACCGCGTTCACCCAGCCCGCCCTGTTCGCGGTCGAGGTCGCCCTCTAC
CGTCTGGTGGAGTCCTGGGGAGTGACCCCCGACTTCGTGGCCGGTCACTCGGTCGGTGAG
ATCGCGGCGGCGCATGTGGCGGGGGTGTTCTCGCTGGAGGACGCCTGCCGGCTGGTGGCC
GCGCGTGCGGGGCTGATGCAGGCGTTGCCGAGTGGTGGTGCCATGGTCGCGGTGGAGGCG
ACCGAGGACGAGGTGCTGCCGCTGCTGACCGAGGGCGTGGCTGTGGCCGCGGTCAACGGC
CCGACGTCTGTGGTCGTTTCGGGTGAGGAGCAGGCCACCCTGGCGGTCGCCGAGCAGCTT
GCCGCGCAGGGCCGCCGCACCAGCCGTCTGCGGGTGAGTCATGCCTTCCACTCGCCCTTG
ATGGACCCGATGCTGGAGGACTTCCGGGCGGTCGCGGAGACGCTCTCGTACCACGAGCCG
CGGATCCCGGTCGTGTCCAACCTGACCGGTGAGGTCGCCGCTGCGGGCGTACACACCCAC
CCCGACTACTGGGTGCGTCATGTCCGTGAGGCCGTGCGGTTCGCCGACGGTGTCCGCGGG
CTGGCCGACCGGGGCGTGACCGCCTTGCTGGAGATCGGCCCCGACGGAGTGCTCTCCGCG
CTCGCCGCCGCCTCCCTGACCGACACCGACACCGTCGTCGTGCCGGCGCTGCGCAAGGAC
CGCGACGAGACCGTGTCCGTCCTCAGCGGTGTCGCCCGGCTGTACGTCGCCGGGGTCGAC
GTCGACTGGTCGGCCCCGCTGTCCGGCGCCGGCGCCCGCATCGCCGACGTGCCCACCTAC
GCCTTCCAGCACGAGCGTTACTGGCCCAAGGCGGCCCCCGCCGCCCTGGACGCCACCGGC
CTCGGTCTGGCCTCGGCCGACCACCCGCTGCTCGGTGCCGCCATGTCGGTGGCGGGCTCC
GACGAACTCCTGCTGACCGGCTCCCTGTCGGCCGCCACCCATCCCTGGCTGGCCGACCAT
GTCGTCGGCGGCATGATCTTCTTCCCCGGTACCGGGTTCCTGGAGCTGGCGGTGCGCGCC
GCGGACCAGGCCGGCTGTGACCGGGTCGAGGAACTGATGATCGCCGCGCCGCTGGTGCTG
CCCGCTACGGGCGCGGTCCAGGTGCAGATCTCCGTGGGCGCCGCCGACGAGGAGGGCTCT
CGCGAGCTGCGTTTCTTCACCCGGCCCGGTGAGGACTTCGACGCCGAATGGACCCAGCAC
GCCACCGGCCGCATCGGCTCGGGTGAGCAGGTCATCGATTTCGACGCCACGGTGTGGCCG
CCGCGCGACGCCGAGGCGATCGACATCGACGGCATGTTCGAGCGCTATGCCGCGGACGGT
CTGGAGTACGGCCCGGTCTTCCGTGGACTGCGTGCCGTCTGGCGTCAGGACGACACGGTC
TACGCCGAGGTCGCGCTGCCCGAGTCGGTCGAGGACGCCGACGCCTTCGGTCTGCACCCG
GCGCTCTTCGACGCCGCCCTGCACTCCACGGTCTTCCTGTCCGCCGAGGGCGACACCCGC
AGTCTGCTGCCGTTCGCCTGGGAGGGCGTGTCCCTGCACGCGGACGGGGCGAGCACCCTG
CGGGTGCGGATCGCCTCCTGCGGCGAGGACACGGTCCAGATCGCCGCGGTCGACCCGGGC
GGTCAGCCCGTGGTCTCCGTGGAGTCGCTGACGCTGCGTGCCGCGGGCCCGGGCGACGCT
GCCGAGCCGCGCCGTGACGACTCCAACTCCCTGCTCCGGGTCGACTGGACCGCCCGTACG
CTCGGGGCGCCTGCCGCCCCTGCCACCTGGGCGGTGCTGGGCGAGGACCCGTTCGGCCTG
ACGGCCGCGCTGGCCGGTGACTCCGAGGCGGTGGCGGGAGTGCATGCTCCGGCCGCGACG
ATCGAGGAGCTGGCCGCCCGGTCGGGGGCCGTGCCCGACATGGTCGCCGTGACGGTGCGG
GGCGACGCCGACGCGGGACCCGACGACGCTCATGAGCTGGCCCACGAGGTGCTGGCCCTG
GTGCAGGGGTGGCTGGCCGAGCCCGCCTTCGCCTCCTCCCGGCTTGTGGTGGTCACCCGG
AACGCGGTTGCCGACGGTGAGCGTGGTGCGGTCGATCTGGCGGCCGCTCCTGTGTGGGGT
CTGGTGCGTTCCGCCCAGTCCGAGAACCCGGGCCGGCTGCTGCTGGCGGATGTCGACGAC
ACCGCCGACTCGCTCGCCCGACTGCCGCTGCTTGCGGGGCTGTTCGACGCGGAGGAGCCG
CAGGCCGTCGTCCGTGAGGGCACGGTCCGGGTCGGCCGGCTCGCCCGGCTGGAGTCCGGC
ACCTCCCTCGTTCCGCCGGCCGGGACGCCGTGGCGGTTGGGCTGCCGCGCCAAGGGCAGC
CTCGACGGTCTCGCCCTGCTGCCGTACCCGGAGGCGGTGACACCGCTCACCGGCCGCGAG
GTGCGCATCGGTATCCGTGCCGCGGGCCTCAACTTCCGCGATGTCCTCAACGCGCTCGGC
ATGTATCCCGGTGAGGCGGGCCTGTTCGGCTCCGAGGGCGCGGGCGTAGTCAGCGAGGTC
GGCCCGGATGTCACCGGGCTCGCCCCCGGCGACCGGGTCATGGGCATGGTGTTCGGCGGT
TTCGGCCCGCTGGGCGTCGCCGACGAGCGTCTGCTGACCCGGGTTCCCGACGACTGGTCC
TGGGAGACCGCGTCGTCCGTGCCGCTGGTGTTCCTCACCGCTTACCACGCGCTGAAGGAC
CTCGCCGGGCTGCGGCCGGGGGAGAAGATACTGATCCACGCCGGCGCCGGTGGTGTCGGC
ATGGCGGCCATCCAGATCGCCCACCATCTCGGTGCGGAGGTCTTCGCGACGGCCAGCGAG
GGCAAGTGGGACGTACTGCGCTCCCTCGGGGTCGCCGACGACCACATCGCCTCCTCGCGC
ACCCTCGACTTTGAGACGGCGTTCACCGAGGTCGCGGGGGACAAGGGCCTCGATGTCGTC
CTGAACGCGCTGGCCGGGGAGTTCGTCGACGCGTCGATGCGGCTGCTCGGCACGGGCGGC
CGGTTCCTGGAGATGGGCAAGACCGACATCCGGGACTCCGACGCCGCGTCGGACGGCATC
ACCTACCGGTTCTTCGACCTGGGCATGGTCGACCCCGACCACATCCAGCAGATGCTGCTC
GACCTGGTCGACCTCTTCGAGCGGGACGTACTGTCCCCGCTGCCGGTCCGTGCCTGGGAC
GTCCGCCGATCCCGTGAGGCCTTCCGCTTCATGAGCATGGCCAAGCACATCGGCAAGATC
GTGCTGACCATGCCCCGGGCCATGGACCCCGAGGGCACCGTCCTGGTCACCGGCGGTACC
GGTGGCCTCGCCTCCGCGCTGGCCCGCCACTTGGTCGCCGAGCACGGCATCAAGCACCTG
CTGCTGACCAGCCGCCGCGGACCCGATGCCCCCGGCGCCGCCGACCTCGTCCAGGCACTG
GCCGAACTGGGTGCCGAGGCCAGGGTCGCCGCGTGTGATGTCGCCGACCGTGATGCCCTG
GCCGGGCTGCTCGCCTCCGTTCCGGCCGAGCGCCCGCTGACCGCGGTGGTGCACACGGCG
GGTGTCCTGGACGACGGCATCCTCGCCTCCCTCACCCCCGATCGCCTCGACACCGTCATG
CGGCCGAAGGTCGATGCCGCCTGGCATCTGCACGACCTCACCCGCGATCTCGACCTGGCC
GCCTTCGTCCTCTACTCCTCGACCTCCGGCGTCTTCGGCAGCCCGGGTCAGGCCAACTAC
GCGGCGGGCAACACCTTCCTGGACGCCCTCGCCGCGCACCGTCAGTCGCTCGGCCTGCCC
GCCACCTCCCTCGCCTGGAACGCCTGGGAGCAGGGCAGCGGCATGACGAGCGGACTGAGC
GACGAGGACATGCGCCGCATCAACGACAACAGCGGCATGCCGCTGCTGTCCGTCGAGCGG
GGCCTTGCCCTGTACGACGCGGCCACCCTCGCCGACGAACCGCTGGTGGTGCCGCTGGGC
CTCGGCGGTGGCGGCTCGCTGCCGCCCGGCATGAGCGTCCCCGCGATCCTGCGCGGACTG
GTCCGCACCGGCGGACGCCGGGCCAAGGCCGGCGCCGCCGCGGTCGCCCGCGCCGGACTC
GCCGAGCGGCTGGCCGTCCTGCCCGAGGAGCAGCGCCTGCCGTTCGTGGTCGACCTGGTA
CGGGCCGAGGCCGCCACGGTGCTCGGGCACGGTTCCGCCGACGCCGTCGACGCCCGCCGT
GAGTTCCGTGGCCTGGGCTTCGACTCGCTCACCGCGATCGAGCTGCGCAACCGGCTCGGC
AAGGCCTCAGGACTCACCCTGACCGCCACCCTGGTCTTCGACTACCCGACGCCCCAGCAG
CTCGCCGAGCACCTCCTGGACGAACTGCTCGGCGCCGACGCCGCGGAGGCCTTCGCCGCC
CCCCAGACCGCCGCCGCCGCGTCCGACGACGACCCCGTGGTCATCGTCGGGATGGGCTGC
CGCTTCCCGGGCGGCGTGGGATCGCCCGAGGAACTGTGGGACCTGGTGGCCTCCGGCACC
GACGCCATCACCGGCTTCCCCGCCGACCGGGAGTGGGAGTCGTCGACCATCGGCGGCGAG
CCGGGCGACCTGTCCGGGCAGGGCGGATTCCTCAGCGACATCGCTGACTTCGACGCCGAC
TTCTTCGGCATCGCACCGCGCGAGGCCCTCGCCATGGACCCGCAGCAGCGCATCCTGCTC
GAAGTCACCTGGGAGGCCGTGGAGCGCGCCGGACTGGACCCGACCGCCCTGCGCGGCAGC
CGTACCGGTGTCTTCATGGGCGTCAGCGGCCAGGACTACTCCGGTCTGGTCATGCGCTCC
CGCGACGACATCGCCAGCCACGCCACCACCGGCCTCGCCGTGAGCGTGGTCTCCGGCCGC
CTCTCCTACACGCTCGGCCTGGAGGGCCCCGCCCTCTCGGTCGACACCGCCTGCTCCTCC
TCCCTGGTCTCGCTGCACCTCGCCGCGCAGGCACTGCGATCGGGGGAGTGCACCATGGCG
CTCGCCGGTGGTGTCACCGTGATGACCACCCCGGCCAACTTCACCGGGTTCTCCAAGATG
GGCGGTCTCGCCCACGACGGACGCTGCAAGGCGTTCTCCGACTCCGCCGACGGCACCGGC
TGGTCCGAGGGCGCGGCGGTACTCGTCCTGGAGCGCCTCTCCGACGCCCGCCGCGCCGGA
CACCGGGTGCTCGCCGTGGTGCGTGGCTCCGCGGTGAACCAGGACGGTGCCTCCAACGGC
CTGACCGCCCCCAACGGCCCCTCCCAGCAGCGGGTCATCCGGCAGGCACTGGCCAACGCC
GGTCTGCGTCCCGGCGATGTCGACGCCGTCGAGGCACATGGCACCGGCACCCCGCTCGGT
GACCCGATCGAGGCCCAGGCCCTGATCGCCACCTACGGCTCCGACCGTGACCCGCAGCAG
CCGCTGCTGCTCGGCTCGGTGAAGTCCAACATCGGCCACACCCAGTCCGCGGCCGGCGCC
GCCGGACTCGTCAAGATGGTCATGGCGATGCACCAGGGCACGCTCCCGCGCACCCTGCAC
GTCACCGAGCCGTCCACGCACGTCGACTGGTCCCTGGGCGCCGTCCGGCTGCTCACCGAG
GAGACCGCCTGGCCGGAGACCGGCCGGGTCCGCAGGGCCGGTGTGTCCTCCTTCGGCATC
AGCGGCACCAACGCCCATGTCATCCTCGAGGGCGCCCCCGAGCCCACCGCGGACGACCGC
TCCGCCGAGGACACCGTGACGCCCGCCGTCACCCCGTGGGTCGTCTCGGCCCGCTCCGAG
CAGGCGCTCGACGCCCAGCTGGAGCGGCTGCGCGCCCATGCCGCCGCGCACCCGGAGCTG
TCCGGCGCCGACATCGGTCTGTCCCTGGTGACGAGCCGGCCCTCCTTCGAGCACCGGGCC
GTCCTGCTGGCGGGCCCCGACGGCATCACCGAGGCCGCGCGCGCCGAGGCCGGCACCGCC
CGCACACCGGCGTTCCTGTTCTCCGGTCAGGGCGCCCAGCGCCTCGGCATGGGCCGTGAA
CTGCACGCCCGTTTCCCGGTGTTCGCCGAGGTCTTCGACTCCGTCACCGCCCTGCTGGAG
AGCGAACTCGGCACCTCCGTCCGTGAGGTGATGTGGGGCACCGACGAAGCCGCCCTGAAC
TCCACCGCGTTCACCCAGCCCGCCCTCTTCGCGGTCGAGGTCGCCCTCTACCGCCTGGTG
GAGTCCTGGGGAGTGACCCCCGACTTCGTCGCCGGTCACTCGGTCGGTGAGATCGCGGCC
GCGCATGTGGCGGGGGTGTTCTCGCTGGAGGACGCCTGCCGGCTGGTCGCCGCCCGCGCC
CGGCTCATGGACGCGCTGCCCCGGGGCGGCGCCATGGCCGCCGTCGAGGCCACCGAGGAC
GAGGTGCTGCCGCTGCTCGACGACGGGGTCGCCGTCGCCGCCGTCAACGGCCCCACCTCC
GTCGTCGTCTCCGGCCCCGAGGACGGCGTCGACCAGCTGGTCGCCCTCCTGGAGTCGGAC
GGCCGCCGCACCACCCGGCTGCGCGTCAGCCACGCGTTCCACTCGTCGCTGATGGACCCG
ATGCTGGAGGACTTCCGGGCGGTCGCGGAGACGCTCTCGTACCACGAGCCGCGGATCCCG
GTCGTGTCCAATCTGACCGGTGAGGTCGCCTCCGCGGGCACCCACACCCACCCCGACTAC
TGGGTGCGCCACGTCCGTGAGGCGGTGCGCTTCGCCGACGGTGTCCGCGCGCTCGCCGAC
CGGGGTGTGACCGCCTTCCTGGAGATCGGCCCCGACGGAGTGCTCTCCGCGCTCGCCGCC
GCCTCCCTGCCCGACACCGGCACCGTCGTCGTGCCGGCGCTGCGCAAGGACCGCGACGAG
ACCGTGTCCGTCCTCACCGCCCTGGCCCGGCTGCACACCGCGGGCCTGGACACCGACTGG
AGCGCCCACTTCGCCGGCACCGGCGCCCGCACCGTCGAACTGCCCACCTACGCCTTCCAG
GGCACCCGCTTCTGGCCCGACACCACCGCCGCTCCCGGCGACGCGGGCGGACTGGGCCTG
GACGCCGGAGGGCACCCGCTGCTCACCGCCGCCACATCGGTCGCCGGCTCGGACGAGACG
CTGCTCAGCGGGCGCCTGTCCGCCGCCGCACAGCCCTGGCTCACCGGCCGCACGGAGAAC
GGCACCACCATCCTGCCCACCGCGGTCCTCGCCGAACTCGCGCTGCACGCCGCCGAGGCC
TGCGACCGCACCACCGTCGAGAACCTCACCGTCGGCGCACCCCTGGCGCTCACCGGGAAC
CGCCCCCAGCGCCTCCAGGTACTGGTGGGCGTCCCCGACGAGACCGGCCGGCGCACCCTC
ACCGTGCACACCCGCGCCGACGGCGACGACGCCCCCTGGGTCGAGCGGGCCACCGCCATG
CTCACCGACGCGCCCGCCGCCGCCACGCCCGACACCGTATGGCCGCCCGCCGACGCCACC
CCGGTCGACGAGCTGCCCGAACCCACCGGCCCGTCCGTGCTGCGCGCCGCCTGGCGGCGC
GGCGGGGACGTCTTCGCCGAGGTCGAGATCACCGAACAGAGCCCGGCCGAGCAGGCGTTC
GCCCTGCACCCGGCGCTCCTGGACACCGCGGTCCGCGCCGCCGTCCTCCTCGAAGGAGAC
GGGGGCGGATCCGGAGACGACACCCTCGACGCCGTCGCCTGGGACGGGCTCGTCCTGCAC
GCCGCGCACCCCGTCCTCCTGCGGGTACGGCTCACCGCCACCGGCGACGACACCTGGGCG
CTGGAGGCGACCGACCCACAGGGCGGCCCGGTGCTCTCGGTGGCCTCGGTCACCCTGGGC
GCCACCGTCGCCGCACCCGTCACCGGCGCCCCCGCCACGGACGACGCCGCGCTGCTCGCC
CTGGACTGGGTCGCGCCCGCCCCGGCCCCGCGCTCCGGCGACAGCGGCCCCTGGACGGTG
CTCGGCGACGCGCTGCCCGGCCTGGACACCGCGCTCGCCGCGGTGGACAACGTCCTCGTC
ACCCGCGCGGACTCGCTCGCCGAACTGCTGGACAGCGGGGCACCCCTGCCGTCGCTGATG
CTGCTGCCGGTCGAGGGCGGCCCCACCGCGGGGCACGACCTGCCCGCCGCGGTCCGCGCC
GCCACCACGAGGGTGCTGGACCTGCTGCGCCGCTGGACCTCCGACCCCCGCACCGCGGAC
TCCCGCCTCGCGATCGTCACCCGTGGCGCCGTCGCGGCCGGCCGGGAGGACGTGACCGAT
CTGGCGGCCGCAGCCGTCTGGGGCCTGGTCCGCTCCGCCCAGTCCGAGAACCCGGGGTGC
TTCCTGCTCCTCGACCTCGACCCGGCCGACGCGGCCGAGGCGACCGACGCCGCCGTCCTG
GCCTCGCTGCCCGCCCTGTTCGACGCGGGCGAGACCCAGGCCGCCGTCCGCGGCGGCGCG
CTCACCGTCGCCCGCCTCACCCGCACCGAGGCCACGCCCGCCCCGGCCGCCGACCAGGTC
CGTGCCTGGGACCGCGACGGCACCGTACTGATCACCGGCGGCACCGGAGGCCTCGGCGCC
GTACTCGCCCGCCACCTGGTCACCGGCCACGGCATCAAGCACCTGCTGCTGGCCGGCCGC
CGCGGACCCGACGCACCCGGGGCCACCGCCCTGAGCAAGGAACTCTCCGCGCTCGGCGCC
GAGGTCACCGTACGGGCCTGCGACGTCTCCGACCGGTCCGCCGTCGACGCGCTGCTCGCC
GGGCTGCCGGCCGAGCACCCGCTGACCGCGGTCGTGCACACGGCGGGCGTCCTCGACGAC
GCCACGATCGGCACCCTCACCGCCGAACAGCTCGACACCGTGCTGCGGCCCAAGGCGGAC
GCCGCCTGGCATCTGCACCAGGCGACCCGGGCCCTGCCGCTCGCCGGATTCGTCCTGTAC
TCCTCCGTGGCCGGCGTCACCGGCGGCCCCGGACAGGGCAACTACGCTGCCGCCAACACG
TTCCTCGACGCGCTCGCCGCGCACCGTGCGGCCCAGGGCCTGCCCGCCCTGTCGCTCGCC
TGGGGACCGTGGGGCCAGGGCGCCGGCATGACCGGCACGCTCAGCGACGCCGACCTCGAG
CGCATGGAACGCTCCGGGATGCCGCCGCTCACCGAACGACAGGGCCTCGCCCTGTTCGAC
GCGGCGAACGGCCACGACGAGGCGCTGGCCGTGGCGATCCGGGTCTCCCGGTCCGCCGCG
GCCCCCGACGCCGGCGAGGTGCCCGCGGTGCTGCGCTCCCTGGTCCGCGCCCGGCGTCGC
GCGGCGGCCACGGCCGGCGCCGACGGGCTCACCCGCCGACTGGCCGGCCTCGGCGCCGAG
CAGCGGCACGAAACCCTCGTCGGCCTGGTCCGCCAGGAGACGGCCGGGGTGCTCGGTCAC
TCCGGGGCCGACGCGGTCCCCGCCGACCGGGACTTCAGCCGGCTGGGCTTCGACTCGCTG
ATGGCGGTCGAACTGCGCACCCGGCTCTCCGCGGCCACCGGTGTGCGGCTGCCGTCCACG
CTGGTCTTCGACCACCCGACGCCCGCCGCCGTCGCCCGGCACCTGGCCGACTCCCTGACG
GGCCAGGACCGCAGCGGCACCGCCGCATCCCCGCTGGCGGCGCTCGACCGGCTGGAGGCG
GAACTGTCCGCCGACGGCGTGGACGAGGCCGTCCGCCGCGGGGTCGAGGGCCGGCTGCGG
CGGCTGCTGGCCGCCTGGGACGGCACCGGCTCGGACGGGAACGGACCGGCCGTCGAGGAG
CGGATCGAGGCGGCGAGCGCCGAGGAGATCTTCGCCTTCATCGACAACGAACTCGGCCGG
TCGTCGGATTCCTGA
[15] KS39..414
[15] AT583..892
[15] malonyl-CoA769..773
[15] dh940..1102
[15] KR1420..1600
[15] ACP1704..1774
[16] KS1798..2169
[16] AT2308..2621
[16] malonyl-CoA2494..2498
[16] DH2668..2830
[16] ER3159..3462
[16] KR3472..3655
[16] ACP3756..3826
[17] KS3851..4221
[17] AT4365..4676
[17] malonyl-CoA4551..4555
[17] dh4725..4888
[17] KR5207..5387
[17] ACP5486..5556
[15] KS115..1242
[15] AT1747..2676
[15] malonyl-CoA2305..2319
[15] dh2818..3306
[15] KR4258..4800
[15] ACP5110..5322
[16] KS5392..6507
[16] AT6922..7863
[16] malonyl-CoA7480..7494
[16] DH8002..8490
[16] ER9475..10386
[16] KR10414..10965
[16] ACP11266..11478
[17] KS11551..12663
[17] AT13093..14028
[17] malonyl-CoA13651..13665
[17] dh14173..14664
[17] KR15619..16161
[17] ACP16456..16668

close this sectionFeature

BLASTP
Database:UniProtKB:2011_09
show BLAST table
InterPro
Database:interpro:38.0
IPR001227 Acyl transferase domain (Domain)
 [581-701]  G3DSA:3.40.366.10 [768-873]  G3DSA:3.40.366.10 [2306-2426]  G3DSA:3.40.366.10 [2493-2603]  G3DSA:3.40.366.10 [4364-4483]  G3DSA:3.40.366.10 [4550-4660]  G3DSA:3.40.366.10
G3DSA:3.40.366.10   Ac_transferase_reg
IPR002198 Short-chain dehydrogenase/reductase SDR (Family)
 [1420-1587]  5.10000000000004e-64 PF00106
PF00106   adh_short
IPR006162 Phosphopantetheine attachment site (PTM)
 [1732-1747]  PS00012 [3784-3799]  PS00012 [5514-5529]  PS00012
PS00012   PHOSPHOPANTETHEINE
IPR009081 Acyl carrier protein-like (Domain)
 [1704-1774]  PS50075 [3756-3826]  PS50075 [5486-5556]  PS50075
PS50075   ACP_DOMAIN
 [1703-1777]  3.70000000000004e-67 G3DSA:1.10.1200.10 [3756-3829]  3.70000000000004e-67 G3DSA:1.10.1200.10 [5484-5559]  3.70000000000004e-67 G3DSA:1.10.1200.10
G3DSA:1.10.1200.10   ACP_like
 [1707-1773]  7.7e-12 PF00550 [3759-3825]  3.90000000000001e-11 PF00550 [5490-5555]  2.4e-12 PF00550
PF00550   PP-binding
 [1697-1814]  3.49999466863949e-28 SSF47336 [3749-3867]  9.19998414420358e-28 SSF47336 [5477-5563]  7.10001443313147e-23 SSF47336
SSF47336   ACP_like
IPR011032 GroES-like (Domain)
 [3155-3305]  7.10001443313147e-32 SSF50129
SSF50129   GroES_like
IPR013149 Alcohol dehydrogenase, C-terminal (Domain)
 [3298-3389]  2.5e-17 PF00107
PF00107   ADH_zinc_N
IPR013154 Alcohol dehydrogenase GroES-like (Domain)
 [3179-3235]  7.19999999999999e-07 PF08240
PF08240   ADH_N
IPR013968 Polyketide synthase, KR (Domain)
 [3472-3651]  2.80000000000001e-64 PF08659 [5207-5386]  2.59999999999998e-64 PF08659
PF08659   KR
IPR014030 Beta-ketoacyl synthase, N-terminal (Domain)
 [39-289]  3.19999999999996e-91 PF00109 [1798-2044]  8.1e-90 PF00109 [3851-4096]  3.39999999999996e-90 PF00109
PF00109   ketoacyl-synt
IPR014031 Beta-ketoacyl synthase, C-terminal (Domain)
 [297-414]  6.80000000000006e-47 PF02801 [2052-2169]  4.9e-49 PF02801 [4104-4221]  8.50000000000004e-49 PF02801
PF02801   Ketoacyl-synt_C
IPR014043 Acyl transferase (Domain)
 [583-892]  2.79999999999994e-68 PF00698 [2308-2621]  1.39999999999997e-66 PF00698 [4365-4676]  2.50000000000001e-72 PF00698
PF00698   Acyl_transf_1
IPR016035 Acyl transferase/acyl hydrolase/lysophospholipase (Domain)
 [579-888]  4.89996723796399e-74 SSF52151 [2305-2604]  5.60004908290974e-77 SSF52151 [4362-4661]  9.69997533354407e-79 SSF52151
SSF52151   Acyl_Trfase/lysoPlipase
IPR016036 Malonyl-CoA ACP transacylase, ACP-binding (Domain)
 [706-768]  1.09999909120787e-16 SSF55048 [2431-2493]  9.10000425645573e-17 SSF55048 [4488-4550]  6.79998707392904e-19 SSF55048
SSF55048   Malonyl_transacylase_ACP-bd
IPR016038 Thiolase-like, subgroup (Domain)
 [41-301]  G3DSA:3.40.47.10 [302-468]  G3DSA:3.40.47.10 [1798-2055]  G3DSA:3.40.47.10 [2056-2222]  G3DSA:3.40.47.10 [3851-4107]  G3DSA:3.40.47.10 [4108-4275]  G3DSA:3.40.47.10
G3DSA:3.40.47.10   Thiolase-like_subgr
IPR016039 Thiolase-like (Domain)
 [31-403]  2.39999798157263e-102 SSF53901 [1790-2222]  4.30000170645869e-103 SSF53901 [3843-4220]  2.7000136937899e-104 SSF53901
SSF53901   Thiolase-like
IPR016040 NAD(P)-binding domain (Domain)
 [1421-1606]  6.7999999999999e-116 G3DSA:3.40.50.720 [3295-3389]  9.60000000000003e-21 G3DSA:3.40.50.720 [3473-3659]  6.7999999999999e-116 G3DSA:3.40.50.720 [5206-5387]  6.7999999999999e-116 G3DSA:3.40.50.720
G3DSA:3.40.50.720   NAD(P)-bd
IPR018201 Beta-ketoacyl synthase, active site (Active_site)
 [202-218]  PS00606 [1957-1973]  PS00606 [4009-4025]  PS00606
PS00606   B_KETOACYL_SYNTHASE
IPR020801 Polyketide synthase, acyl transferase domain (Domain)
 [584-873]  1.10001116880365e-129 SM00827 [2309-2601]  6.29998179434741e-127 SM00827 [4366-4658]  SM00827
SM00827   PKS_AT
IPR020806 Polyketide synthase, phosphopantetheine-binding domain (Domain)
 [1705-1777]  8.1000073432326e-37 SM00823 [3757-3829]  5.8999880940569e-33 SM00823 [5487-5559]  3.29999077503323e-33 SM00823
SM00823   PKS_PP
IPR020807 Polyketide synthase, dehydratase domain (Domain)
 [940-1102]  4.10001399699315e-82 SM00826 [2668-2830]  2.89997807423186e-83 SM00826 [4725-4888]  1.90000694315261e-41 SM00826
SM00826   PKS_DH
IPR020841 Polyketide synthase, beta-ketoacyl synthase domain (Domain)
 [41-467]  SM00825 [1800-2222]  SM00825 [3853-4274]  SM00825
SM00825   PKS_KS
IPR020842 Polyketide synthase/Fatty acid synthase, KR (Domain)
 [1420-1600]  9.49996855930963e-67 SM00822 [3472-3655]  3.30001976117415e-67 SM00822 [5207-5387]  1.70000295590053e-61 SM00822
SM00822   PKS_KR
IPR020843 Polyketide synthase, enoylreductase (Domain)
 [3159-3462]  SM00829
SM00829   PKS_ER
SignalP No significant hit
TMHMM No significant hit
Page top