Spino_00210 : CDS information

close this sectionLocation

Organism
StrainNRRL 18538
Entry nameSpinosyn
Contig
Start / Stop / Direction59,803 / 76,569 / + [in whole cluster]
59,803 / 76,569 / + [in contig]
Location59803..76569 [in whole cluster]
59803..76569 [in contig]
TypeCDS
Length16,767 bp (5,588 aa)
Click on the icon to see Genetic map.

close this sectionAnnotation

Category1.1 PKS
Productpolyketide synthase
Product (GenBank)polyketide synthase extender modules 8-10
Gene
Gene (GenBank)spnE
EC number
Keyword
Note
Note (GenBank)
  • involved in spinosyn aglycone biosynthesis
Reference
ACC
PmId
[11358695] Cloning and analysis of the spinosad biosynthetic gene cluster of Saccharopolyspora spinosa. (Chem Biol. , 2001)
[11386361] A cluster of genes for the biosynthesis of spinosyns, novel macrolide insect control agents produced by Saccharopolyspora spinosa. (Antonie Van Leeuwenhoek. , 2000)
comment
[PMID:11358695](2001)
spinosad biosynthetic gene clusterの報告。
spnE(5589aa): PKS

---
[PMID:11386361](2000)
同じ著者らの先行論文。
spnE: Polyketide synthesis

module 8のAT domainはpropionate-specific motifあり。
しかしpropionateを取り込んでspinosyn Dを生成するのは15%だけ。
85%はacetateを取り込んでspinosyn Aを作る。
よってこのdomainの選択性は他の因子の影響による。

close this sectionPKS/NRPS Module

8 methylmalonyl-CoA
9 malonyl-CoA
10 malonyl-CoA
KS34..410
AT563..878
DH928..1093
KR1404..1584
ACP1684..1758
KS1782..2159
AT2320..2638
DH2685..2874
KR3193..3373
ACP3462..3532
KS3559..3934
AT4093..4411
DH4458..4619
KR4933..5113
ACP5217..5287
TE5364..5577

close this sectionSequence

selected fasta
>polyketide synthase [polyketide synthase extender modules 8-10]
MANEEKLREYLKRVVVELEEAHERLHELERQEHDPIAIVSMGCRYPGGVSTPEELWRLVV
DGGDAIANFPEDRGWNLDELFDPDPGRAGTSYVREGGFLRGVADFDAGLFGISPREAQAM
DPQQRLLLEISWEVFERAGIDPFSLRGTKTGVFAGLIYHDYASRFRKTPAEFEGYFATGN
AGSVASGRVAYTFGLEGPAVTVDTACSSSLVALHLACQSLRLGECDLALAGGISVMATPG
AFVEFSRQRALASDGRCKPFADAADGTGWGEGAGMLLLERLSDARRNGHPVLAAVVGSAI
NQDGTSNGLTAPSGPAQQRVIRQALANAGLSPAEVDVVEAHGTGTALGDPIEAQALIATY
GANRSADHPLLLGSLKSNIGHTQAAAGVAGVIKSVLAIRHREMPRSLHIDQPSQHVDWSA
GAVRLLTDSVDWPDLGRPRRAGVSSFGMSGTNAHLIVEEVSDEPVSGSTEPTGAFPWPLS
GKTETALREQAAELLSVVTEHPEPGLGDVGYSLATGRAAMEHRAVVVADDRDSFVAGLTA
LAAGVPAANVVQGAADCKGKVAFVFPGQGSHWQGMARELSESSPVFRRKLAECAAATAPY
VDWSLLGVLRGDPDAPALDRDDVIQLALFAMMVSLAELWRSCGVEPAAVVGHSQGEIAAA
HVAGALSLTDAVRIIAARCDAVSALTGKGGMLAIALPESAVVKRIAGLPELTVAAVNGPG
STVVSGEPSALERLQTELTAENVQTRRVGIDYASHSPQIAQVQGRLLDRLGEVGSEPAEI
AFYSTVTGERTDTGRLDADYWYQNLRQPVRFQQTVARMADQGYRFFVEVSPHPLLTAGIQ
ETLEAADAGGVVVGSLRRGEGGSRRWLTSLAECQVRGLPVNWEQVFLNTGARRVPLPTYP
FQRQRYWLESAEYDAGDLGSVGLLSAEHPLLGAAVTLADAGGFLLTGKLSVKTQPWLADH
VVGGAILLPGTAFVEMLIRAADQVGCDLIEELSLTTPLVLPATGAVQVQIAVGGPDEAGR
RSVRVHSCRDDAVPQDSWTCHATGTLTSSDHQDAGQGPDGIWPPNDAVAVPLDSFYARAA
ERGFDFGPAFQGLQAAWKRGDEIFAEVGLPTAHREDAGRFGIHPALLDAALQALGAAEED
PDEGWLPFAWQGVSLKATGALSLRVHLVPAGANAVSVFTTDTTGQAVLSIDSLVLRQISD
KQLAAARAMEHESLFRVDWKRISPGAAKPVSWAVIGNDELARACGSALGTELHPDLTGLA
DPPPDVVVVPCGASRQDLDVASEARAATQRMLDLIQDWLAAARFAGSRLVVVTCGAASTG
PAEGVSDLVHAASWGLLRSAQSENPDRFVLVDVDGTAESWRALAAAVRSGEPQLALRAGE
VRVPRLARCVAAEDSRIPVPGADGTVLISGGTGLLGGLVARHLVAERGVRRLVLAGRRGW
SAPGVTDLVDELVGLGAAVEVASCDVGDRAQLDRLLTTISAEFPLRGVVHAAGALADGVV
ESLTPEHVAKVFGPKAAGAWHLHELTLDLDLSFFVLFSSFSGVAGAAGQGNYAAANAFLD
GLAQHRRTAGLPAVSLAWGLWEQPSGMTGALDAAGRSRIARTNPPMSAPDGLRLFEMAFR
VPGESLLVPVHVDLNALRADAADGGVPALLRDLVPAPVRRSAVNESADVNGLVGRLRRLP
DLDQETQLLGLVREHVSAVLGHSGAVEVGADRAFRDLGFDSLSGVEFRNRLGGVLGVRLP
ATAVFDYPTPRALVRFLLDKLIGGVEAPTPAPAAVAAVTADDPVVIVGMGCRYPGGVSSP
EELWRLVAGGLDAVAEFPDDRGWDQAGLFDPDPDRLGTSYVCEGGFLRDAAEFDAGFFGI
SPREALAMDPQQRLLLEVAWETVERAGIDPLSLRGSRTGVFAGLMHHDYGARFITRAPEG
FEGYLGNGSAGGVFSGRVAYSFGFEGPAVTVDTACSSSLVALHLAGQALRSGECDLALAG
GVTVMATPGMFVEFSRQRGLAADGRCKSFAAAADGTGWGEGAGLVLLERLSDARRNGHAV
LAVVRGSAVNQDGASNGLTAPNGPSQQRVITQALASAGLSVSDVDAVEAHGTGTRLGDPI
EAQALIATYGQGRDSDRPLWLGSVKSNIGHTQAAAGVAGVIKMVMAMRHGQLPATLHVDE
PTSEVDWSAGDVQLLTENTPWPGNSHPRRVGVSSFGISGTNAHVILEQASKTPDETADKS
GPDSESTVDLPAVPLIVSGRTPAALSAQASALLSYLGERGDISTLDAAFSLASSRAALEE
RAVVLGADRETLLSGLEALASGREASGVVSGSPVSGGVGFVFAGQGGQWLGMGRGLYSVF
PVFADAFDEACAGLDAHLGQDVGVRDVVFGSDGSLLDRTLWAQSGLFALQVGLLSLLGSW
GVRPGVVLGHSVGEFAAAVAAGVLSLPDAARMVAGRARLMQALPSGGAMLAVAAGEEQLR
PLLADRVDGAGIAAVNAPESVVLSGDREVLDDIAGALDGQGIRWRRLRVSHAFHSYRMDP
MLQEFAEIARSVDYRRGDLPVVSTLTGELDTAGVMATPEYWVRQVREPVRFADGVRVLAQ
QGVATIFELGPDATLSALIPDCHSWADQAMPIPMLRKDRTETETVVAAVARAHTRGVPVE
WSAYFAGTGARRVELPTYAFQRQRYWLETSDYGDVTGIGLAAAEHPLLGAVVALADGDGM
VLTGRLSVGTHPWLAQHRVLGEVVVPGTAILEMALHAGARLGCDRVEELTLETPLVVPER
AAGAGSRGPAGGTTVSIETAEERVRTNDAIEIQLLVNAPDEGGRRRVSLYSRPAGGSRGG
GWTRHATGELVVGTTGGRAVPDWSAEGAESIALDEFYVALAGNGFEYGPLFQGLQAAWRR
GDEVLAEIAPPAEADAMASGYLLDPALLDAALQASALGDRPEQGGAWLPFSFTGVELSAP
AGTISRVRLETRRPDAISVAVMDESGRLLASIDSLRLRSVSSGQLANRDAVRDALFEVTW
EPVATQSTEPGRWALLGDTACGKDDLIKLATDSADRCADLAALAEKLDSSALVPDVVVYC
AGEQADPGTGAAALAETQQTLALLQAWLAEPRLAEARLVVVTCAAVTTAPSDGASELAHA
PLWGLLRAAQVENPGQFVLADVDGTAESWRALPSALGSMEPQLALRKGAVRAPRLASVAG
QIDVPAVVADPDRTVLISGGTGLLGGAVARHLVTERGVRRLVLTGRRGWDAPGITELVGE
LNGLGAVVDVVACDVADRADLESLLAAVPAEFPLCGVVHAAGALADGVIESLSPDDVGAV
FGPKAAGAWNLHELTRDTDLSFFALFSSLSGVAGAPGQGNYAAANAFLDALAHYRRSQGL
PAVSLAWGLWEQPSGMTETLSEVDRSRIARANPPLSTKEGLRLFDAGLALDRAAVVPAKL
DRTFLAEQARSGSLPALLTALVPPIRRNRRASGTELADEGTLLGVVREHAAAVLGYSSAA
DVGVERAFRDLGFDSLSGVELRNRLAGVLGVRLPATAVFDYPTPRALARFLHQELADEIA
TTPAPVTTTRAPVAEDDLVAIVGMGCRFPGQVSSPEELWRLVAGGVDAVADFPADRGWDL
AGLFDPDPERAGKTYVREGAFLTDADRFDAGFFGISPREALAMDPQQRLLLELSWEAIER
AGIDPGSLRGSRTGVFAGLMYHDYGARFASRAPEGFEGYLGNGSAGSVASGRIAYSFGFE
GPAVTVDTACSSSLVALHLAGQSLRSGECDLALAGGVTVMSTPGTFVEFSRQRGLAPDGR
CKSFAESADGTGWGEGAGLVLLERLSDARRNGHRVLAVVRGSAVNQDGASNGLTAPNGPS
QQRVIQQALASAGLSVSDVDAVEAHGTGTRLGDPIEAQALIATYGRDRDPGRPLWLGSVK
SNIGHTQAAAGVAGVIKMVMAMRHGQLPRTLHVDAPSSQVDWSAGRVQLLTENTPWPDSG
RPCRVGVSSFGISGTNAHVILEQSTGQMDQAAEPDSSPVLDVPVVPWVVSGKTPEALSAQ
AATLATYLDQNVDVSPLDVGISLAVTRSALDERAVVLGSDRDTLLSGLNALAAGHEAAGV
VTGPVGIGGRTGFVFAGQGGQWLGMGRRLYSEFPAFAGAFDEACAELDANLGREVGVRDV
VFGSDESLLDRTLWAQSGLFALQVGLWELLGTWGVRPSVVLGHSVGELAAAFAAGVLSMA
EAARLVAGRARLMQALPSGGAMLAVSATEARVGPLLDGVRDRVGVAAVNAPGSVVLSGDR
DVLDGIAGRLDGQGIRSRWLRVSHAFHSHRMDPMLAEFAELARSVDYRSPRLPIVSTLTG
NLDDVGVMATPEYWVRQVREPVRFADGVQALVDQGVDTIVELGPDGALSSLVQECVAESG
RATGIPLVRRDRDEVRTVLDALAQTHTRGGAVDWGSFFAGTRATQVDLPTYAFQRQRYWL
EPSDSGDVTGVGLTGAEHPLLGAVVPVAGGDEVLLTGRLSVGTHPWLAEHRVLGEVVVPG
TALLEMAWRAGSQVGCERVEELTLEAPLVLPERGAAAVQLAVGAPDEAGRRSLQLYSRGA
DEDGDWRRIASGLLAQANAVPPADSTAWPPDGAGQVDLAEFYERLAERGLTYGPVFQGLR
AAWRHGDDIFAELAGSPDASGFGIHPALLDAALHAMALGASPDSEARLPFSWRGAQLYRA
EGAALRVRLSPLGSGAVSLTLVDATGRRVAAVESLSTRPVSTDQIGAGRGDQERLLHVEW
VRSAESAGMSLTSCAVVGLGEPEWHAALKTTGVQVESHADLASLATEVAKRGSAPGAVIV
PCPRPRAMQELPTAARRATQQAMAMLQQWLADDRFVSTRLILLTHRAVSAVAGEDVLDLV
HAPLWGLVRSAQAEHPDRFALIDMDDERASQTALAEALTAGEAQLAVRSGVVLAPRLGQV
KVSGGEAFRWDEGTVLVTGGTGGLGALLARHLVSAHGVRHLLLASRRGLAAPGADELVAE
LEQAGADVAVVACDSADRDSLARLVASVPAENPLRVVVHAAGVLDDGVLMSMSPERLDAV
LRPKVDAAWYLHELTRELGLSAFVLFSSVAGLFGGAGQSNYAAGNAFLDALAHCRQAQGL
PALSLASGLWASIDGMAGDLAAADVERLSRAGIGPLSAPGGLALFDAAVGSDEPLLAPVR
LDVEALRVQARSVQTRIPEMLHGMAMGPSRRTPFTSRVEPLHERLAGLSEGERRQQVLQR
VRADIAVVLGHGRSSDVDIEKPLAELGFDSLTAIELRNRLATATGLRLPATLAFDHGTAA
ALAQHVCAQLGTATAPAPRRTDDNDATEPVRSLFQQAYAAGRILDGMDLVKVAAQLRPVF
GSPGELESLPKPVQLSRGPEELALVCMPALIGMPPAQQYARIAAGFRDVRDVSVIPMPGF
IAGEPLPSAIEVAVRTQAEAVLQEFAGGSFVLVGHSSGGWLAHEVAGELERRGVVPAGVV
LLDTYIPGEITPRFSVAMAHRTYEKLATFTDMQDVGITAMGGYFRMFTEWTPTPIGAPTL
FVRTEDCVADPEGRPWTDDSWRPGWTLADATVQVPGDHFSMMDEHAGSTAQAVASWLDKL
NQRTARQR
selected fasta
>polyketide synthase [polyketide synthase extender modules 8-10]
ATGGCCAATGAAGAAAAGCTCCGCGAGTACCTCAAGCGTGTCGTCGTCGAACTGGAAGAG
GCGCACGAACGCCTGCACGAGTTGGAGCGCCAGGAGCACGACCCCATCGCGATCGTGTCG
ATGGGATGTCGTTATCCCGGTGGCGTCTCCACTCCGGAGGAGCTGTGGCGACTGGTCGTC
GACGGAGGAGACGCGATCGCGAACTTCCCCGAAGACCGTGGCTGGAATCTGGACGAGCTG
TTCGATCCTGATCCGGGCCGAGCCGGGACCTCCTACGTCCGCGAGGGTGGTTTCCTGCGC
GGGGTCGCGGACTTCGATGCCGGGCTCTTCGGGATCAGTCCGCGCGAGGCACAGGCGATG
GACCCGCAACAGCGGTTGCTGCTGGAGATCTCGTGGGAGGTGTTCGAGCGCGCCGGCATT
GACCCGTTTTCTTTGCGGGGTACCAAGACCGGTGTGTTCGCGGGCCTGATCTACCACGAC
TACGCGTCGCGGTTTCGCAAGACCCCCGCGGAGTTCGAGGGTTACTTCGCCACCGGCAAC
GCGGGCAGCGTCGCATCCGGCCGGGTGGCTTACACCTTCGGGTTAGAGGGCCCGGCGGTC
ACCGTGGACACCGCCTGCTCGTCGTCCCTGGTGGCGCTGCACCTGGCCTGCCAGTCCCTG
CGGCTGGGCGAATGCGACCTGGCCCTGGCCGGTGGCATTTCGGTGATGGCCACGCCGGGA
GCCTTCGTCGAGTTCAGCCGGCAACGCGCACTCGCCTCGGATGGCCGGTGCAAGCCCTTC
GCGGATGCCGCCGACGGCACCGGCTGGGGCGAGGGCGCCGGAATGCTGCTGCTGGAACGG
CTGTCGGACGCACGACGAAACGGCCACCCGGTGCTGGCGGCGGTGGTCGGTTCCGCGATC
AACCAGGACGGGACGTCCAACGGCCTGACCGCGCCCAGCGGTCCCGCACAGCAGCGAGTG
ATCCGCCAAGCCCTGGCGAACGCCGGGTTGTCGCCCGCCGAGGTCGATGTGGTCGAGGCG
CACGGCACGGGCACGGCCTTGGGCGACCCGATCGAGGCGCAGGCCCTGATCGCCACCTAC
GGGGCGAACCGGTCGGCGGATCATCCGCTGCTGCTGGGTTCCCTCAAGTCGAACATCGGC
CACACCCAGGCTGCCGCCGGTGTGGCCGGGGTGATCAAGTCGGTCCTGGCCATCAGGCAC
CGGGAGATGCCCCGCAGCCTGCACATCGACCAGCCATCGCAGCACGTGGACTGGTCGGCG
GGCGCGGTGCGGCTGCTCACGGACAGCGTTGACTGGCCGGATCTCGGCAGGCCGCGCCGA
GCAGGGGTGTCCTCGTTCGGCATGAGCGGTACCAACGCACACCTGATCGTCGAGGAAGTA
TCCGACGAGCCGGTCTCGGGCAGTACCGAGCCGACCGGGGCATTTCCCTGGCCGCTGTCC
GGCAAGACGGAGACGGCATTGCGCGAGCAGGCTGCCGAGTTGCTCTCCGTAGTGACCGAG
CACCCGGAGCCGGGACTGGGGGACGTCGGGTACTCGCTGGCCACCGGTCGCGCTGCGATG
GAGCACCGGGCTGTCGTGGTTGCCGACGATCGGGACTCTTTCGTCGCCGGACTGACGGCG
TTGGCTGCGGGCGTTCCGGCAGCCAACGTGGTGCAGGGCGCGGCCGACTGCAAGGGAAAG
GTCGCGTTCGTGTTCCCCGGCCAGGGCTCGCATTGGCAGGGGATGGCGAGGGAACTGTCC
GAATCCTCGCCGGTGTTCCGGCGGAAGCTGGCGGAATGCGCGGCGGCTACGGCCCCTTAC
GTGGACTGGTCGCTGCTCGGCGTCCTTCGCGGTGATCCCGATGCACCCGCGCTGGATCGC
GACGACGTGATTCAGCTCGCGCTGTTCGCCATGATGGTGTCGCTGGCCGAACTGTGGCGT
TCGTGCGGAGTGGAGCCCGCCGCGGTGGTCGGTCATTCCCAGGGCGAGATCGCCGCCGCC
CATGTGGCAGGCGCTTTGTCCTTGACTGATGCGGTGCGCATCATCGCTGCCCGCTGCGAT
GCGGTGTCGGCGCTGACCGGGAAGGGAGGCATGCTCGCGATTGCCTTGCCGGAAAGCGCG
GTGGTGAAGCGAATCGCAGGCCTGCCGGAGCTGACCGTTGCGGCGGTCAACGGACCCGGC
TCCACTGTCGTTTCCGGCGAACCGTCGGCTCTGGAGCGTCTGCAGACCGAACTGACCGCG
GAAAACGTGCAGACCCGGCGGGTGGGAATTGATTACGCCTCGCATTCGCCGCAGATCGCG
CAGGTCCAGGGCCGGCTTCTGGACCGGCTGGGCGAAGTCGGGTCCGAACCTGCTGAGATC
GCTTTCTACTCGACGGTCACCGGCGAGCGGACGGACACCGGCCGACTCGACGCCGACTAC
TGGTACCAGAACCTTCGGCAGCCCGTCCGCTTCCAGCAGACCGTCGCCCGGATGGCAGAT
CAGGGCTATCGGTTCTTCGTCGAGGTGAGCCCGCACCCGCTGCTCACCGCCGGAATCCAG
GAAACGCTGGAAGCCGCGGACGCGGGCGGGGTGGTGGTCGGTTCGCTGCGGCGTGGCGAG
GGCGGCTCCCGGCGCTGGCTGACTTCGCTGGCCGAGTGCCAGGTGCGCGGACTGCCGGTG
AATTGGGAACAGGTATTCCTCAACACCGGAGCCCGACGCGTGCCGCTGCCGACCTACCCG
TTCCAGCGGCAGCGGTACTGGTTGGAGTCCGCCGAGTACGACGCGGGCGATCTCGGTTCG
GTGGGCTTGCTCTCCGCCGAGCATCCCCTGCTCGGGGCTGCGGTGACGCTGGCCGATGCG
GGCGGGTTCCTGCTGACCGGCAAGCTGTCGGTCAAGACCCAGCCCTGGTTGGCCGACCAC
GTGGTCGGCGGGGCGATCCTGCTGCCCGGCACCGCGTTCGTGGAAATGCTGATACGCGCC
GCGGACCAGGTCGGGTGCGATCTGATCGAGGAGTTGTCCCTGACGACTCCGCTGGTTTTG
CCCGCGACCGGTGCGGTGCAGGTGCAGATCGCGGTTGGCGGTCCGGACGAGGCCGGGCGC
CGCTCGGTCCGCGTGCATTCCTGTCGAGACGACGCCGTGCCGCAGGACTCGTGGACCTGC
CACGCGACCGGCACGTTGACCTCCAGCGATCACCAGGACGCCGGCCAGGGCCCCGATGGG
ATTTGGCCGCCCAACGATGCTGTCGCGGTTCCGCTGGACAGCTTCTACGCCCGCGCAGCT
GAGCGGGGCTTCGATTTCGGCCCGGCGTTCCAGGGGTTGCAGGCGGCTTGGAAGCGCGGA
GACGAGATCTTCGCCGAGGTCGGCCTGCCCACCGCACACCGCGAAGACGCCGGCAGGTTC
GGAATCCACCCTGCTCTGCTGGATGCGGCACTGCAGGCGCTGGGCGCAGCCGAAGAGGAT
CCGGACGAGGGATGGCTCCCGTTCGCGTGGCAAGGTGTGTCCCTCAAAGCGACGGGCGCA
CTTTCCCTTCGGGTGCACCTCGTTCCGGCGGGCGCGAATGCGGTGTCGGTGTTCACGACC
GACACGACTGGCCAAGCCGTGCTCTCCATCGATTCGCTGGTGCTGCGCCAGATTTCGGAC
AAGCAGTTGGCAGCGGCCCGTGCGATGGAACACGAGTCCCTGTTCCGGGTCGACTGGAAG
CGAATCTCGCCCGGCGCTGCCAAGCCGGTCTCCTGGGCAGTGATCGGCAATGACGAACTC
GCCCGAGCCTGCGGCTCGGCACTTGGCACGGAACTCCACCCCGACCTGACCGGGTTGGCT
GACCCGCCCCCGGACGTCGTGGTGGTGCCATGCGGTGCGTCTCGCCAGGACTTGGACGTT
GCTTCCGAGGCACGTGCCGCGACACAACGCATGCTTGACCTGATCCAGGATTGGTTGGCG
GCGGCGCGATTCGCCGGATCTCGCCTGGTGGTTGTGACGTGTGGTGCGGCGTCGACAGGT
CCCGCCGAGGGTGTTTCCGACCTGGTGCATGCTGCGTCGTGGGGTTTGTTGCGTTCGGCG
CAGTCGGAGAACCCGGACCGATTCGTGTTGGTCGATGTGGACGGAACCGCCGAATCATGG
CGTGCGCTCGCGGCGGCCGTGCGTTCCGGAGAACCGCAGCTGGCGTTGCGCGCCGGTGAA
GTCCGGGTGCCTCGCCTGGCGCGATGTGTTGCCGCCGAGGACAGCCGGATCCCAGTGCCC
GGTGCGGATGGGACGGTGTTGATTTCCGGCGGTACGGGCCTGCTGGGCGGGTTGGTTGCC
CGGCATTTGGTGGCGGAGCGCGGTGTCCGCCGCCTGGTGCTCGCGGGGCGACGCGGCTGG
AGCGCCCCCGGGGTCACCGACCTGGTGGATGAGTTGGTGGGCCTGGGAGCTGCGGTCGAG
GTGGCGAGCTGCGATGTCGGGGATCGGGCCCAGTTGGACCGGCTGCTGACGACGATCTCG
GCAGAGTTCCCGCTGCGCGGAGTGGTGCATGCGGCCGGGGCACTTGCCGACGGGGTCGTC
GAGTCGCTGACACCAGAGCACGTGGCAAAGGTGTTCGGCCCGAAGGCCGCCGGTGCGTGG
CACCTGCACGAGTTGACTCTTGATCTGGATCTCTCGTTCTTCGTGCTCTTCTCCTCGTTC
TCCGGCGTGGCGGGGGCTGCGGGTCAGGGAAACTACGCGGCGGCGAACGCGTTCCTGGAC
GGCCTGGCTCAGCACCGGCGGACGGCGGGGCTGCCTGCGGTGTCGCTGGCTTGGGGCTTG
TGGGAGCAGCCCAGCGGGATGACCGGAGCGCTCGATGCGGCGGGCCGTAGCCGCATTGCG
CGCACCAATCCGCCGATGTCCGCGCCGGACGGGTTGCGGCTGTTCGAGATGGCGTTTCGC
GTTCCGGGCGAATCGCTTCTGGTTCCGGTCCACGTCGACCTGAACGCCCTGCGCGCTGAT
GCGGCCGACGGCGGTGTGCCTGCGTTGTTGCGCGACCTGGTGCCAGCGCCCGTGCGGCGG
AGCGCGGTCAACGAGTCGGCGGACGTCAACGGTCTGGTTGGTCGGCTGCGGAGGCTGCCG
GACCTGGATCAGGAAACCCAGCTGTTGGGTTTGGTGCGCGAGCATGTTTCGGCGGTGCTG
GGGCATTCGGGTGCGGTCGAGGTCGGGGCCGATCGTGCTTTCCGGGATTTGGGTTTTGAT
TCGTTGTCCGGTGTGGAGTTTCGGAACCGGCTTGGCGGGGTGCTGGGCGTTCGGTTGCCG
GCTACTGCGGTGTTCGACTATCCGACACCGCGGGCGTTGGTTCGGTTCTTGCTCGACAAA
CTGATTGGTGGCGTGGAGGCTCCGACTCCCGCACCGGCGGCTGTGGCGGCGGTGACTGCT
GACGATCCCGTTGTGATCGTGGGGATGGGCTGTCGTTATCCGGGTGGGGTGTCCTCGCCG
GAGGAGCTTTGGCGTTTGGTGGCCGGGGGCTTGGATGCGGTGGCGGAGTTCCCGGACGAT
CGTGGCTGGGATCAGGCGGGGTTGTTCGATCCGGATCCCGATCGTCTTGGGACCTCGTAT
GTGTGTGAGGGTGGCTTCCTGCGAGATGCGGCAGAGTTCGATGCCGGTTTCTTCGGGATT
TCCCCGCGTGAGGCGTTGGCGATGGATCCGCAGCAGCGGTTGCTGCTGGAAGTCGCTTGG
GAAACCGTGGAGCGGGCGGGGATTGATCCGCTTTCGTTGCGGGGGAGCCGGACCGGCGTG
TTCGCGGGGCTGATGCACCACGACTACGGCGCGCGGTTCATCACGAGGGCGCCGGAGGGT
TTCGAGGGTTATCTAGGTAATGGCAGCGCGGGAGGCGTGTTTTCGGGTCGGGTTGCGTAT
TCGTTTGGTTTCGAGGGTCCTGCGGTGACGGTGGATACGGCGTGTTCGTCGTCGTTGGTG
GCGCTGCACCTGGCGGGTCAAGCACTGCGGTCTGGTGAGTGTGATCTGGCTCTTGCGGGT
GGTGTGACGGTGATGGCCACGCCGGGGATGTTCGTGGAGTTTTCGCGTCAACGGGGCTTG
GCGGCGGATGGGCGGTGCAAGTCGTTTGCGGCGGCTGCGGATGGCACCGGTTGGGGAGAA
GGCGCGGGCTTGGTGTTGTTGGAGCGGCTGTCGGATGCCCGGCGCAACGGGCACGCGGTT
CTGGCGGTCGTGCGGGGTAGCGCGGTGAATCAGGATGGTGCGTCGAATGGTTTGACGGCG
CCGAATGGGCCCTCGCAGCAGCGGGTGATCACGCAGGCGTTGGCGAGTGCTGGTTTGTCG
GTGTCTGATGTGGACGCCGTGGAGGCGCATGGGACTGGAACCAGGCTTGGTGATCCGATT
GAGGCGCAGGCTCTGATTGCCACTTACGGGCAGGGGCGGGATAGCGATCGGCCGTTGTGG
TTGGGGTCGGTGAAGTCGAATATTGGTCATACGCAGGCGGCGGCGGGTGTCGCTGGTGTG
ATCAAGATGGTGATGGCGATGCGGCACGGGCAGCTGCCCGCGACGTTGCATGTGGATGAA
CCTACGTCGGAAGTGGATTGGTCGGCGGGGGATGTCCAGCTCCTCACGGAGAACACCCCC
TGGCCCGGCAACAGCCATCCTCGGCGGGTGGGCGTGTCGTCGTTCGGGATCAGCGGCACC
AACGCACACGTCATCCTCGAACAAGCCTCGAAAACACCAGACGAGACTGCGGACAAGAGC
GGTCCCGATTCGGAATCGACCGTGGACCTTCCAGCGGTCCCGTTGATCGTGTCGGGGAGA
ACACCGGCAGCGCTCAGCGCTCAGGCGAGCGCATTGTTGTCCTATTTGGGTGAGCGTGGC
GATATTTCCACGCTGGATGCGGCGTTTTCGTTGGCTTCCTCCCGGGCCGCGTTGGAGGAG
CGGGCGGTGGTGCTGGGAGCGGACCGCGAAACGTTGTTGTCCGGGTTGGAAGCGCTGGCT
TCCGGTCGCGAGGCTTCTGGGGTGGTGTCGGGATCCCCGGTCTCTGGCGGGGTTGGGTTC
GTGTTCGCCGGTCAGGGCGGACAGTGGTTGGGGATGGGCCGGGGGCTCTACTCGGTTTTT
CCGGTGTTCGCTGACGCGTTTGACGAAGCATGTGCCGGACTGGACGCGCATCTGGGGCAG
GACGTGGGGGTCCGGGATGTGGTGTTTGGTTCCGACGGGTCCTTGTTGGATCGGACGCTG
TGGGCCCAGTCGGGTTTGTTCGCGTTGCAGGTTGGTTTGCTGAGCCTGCTGGGTTCGTGG
GGTGTCCGGCCGGGTGTGGTGCTGGGCCATTCGGTCGGCGAGTTCGCGGCGGCGGTTGCG
GCGGGAGTGTTGTCGTTGCCGGATGCGGCTCGGATGGTGGCGGGTCGTGCCCGGTTGATG
CAGGCGTTGCCTTCTGGCGGTGCCATGTTGGCGGTGGCTGCTGGTGAGGAGCAGCTGCGG
CCGTTGTTGGCCGATCGGGTTGATGGTGCGGGTATCGCCGCGGTCAACGCTCCTGAGTCG
GTGGTGCTCTCCGGCGATCGGGAGGTGCTTGACGACATCGCCGGCGCGCTGGATGGGCAA
GGGATTCGGTGGCGGCGGTTGCGGGTTTCGCATGCGTTTCATTCGTATCGGATGGACCCG
ATGTTGCAGGAGTTCGCCGAAATCGCACGCAGCGTGGACTACCGGCGTGGCGACCTACCG
GTCGTGTCGACGTTGACGGGTGAGCTCGACACCGCAGGTGTGATGGCTACGCCGGAGTAT
TGGGTGCGTCAGGTTCGAGAGCCCGTCCGCTTCGCCGACGGCGTCCGGGTGCTCGCGCAG
CAAGGGGTCGCCACGATCTTCGAACTCGGCCCTGATGCGACGCTGTCGGCCCTGATTCCC
GATTGTCATTCGTGGGCTGATCAGGCCATGCCGATTCCGATGCTGCGTAAAGACCGTACG
GAAACCGAAACTGTGGTCGCCGCGGTGGCGCGGGCGCACACGCGTGGTGTTCCGGTCGAA
TGGTCGGCGTATTTCGCCGGCACCGGGGCACGGCGGGTCGAGTTGCCGACGTATGCCTTC
CAGCGGCAGCGGTACTGGCTGGAAACATCGGATTACGGCGATGTGACGGGTATCGGCCTG
GCTGCGGCGGAGCATCCGTTGCTGGGGGCCGTGGTTGCGCTGGCCGATGGTGATGGGATG
GTGCTGACCGGCCGGTTGTCGGTGGGGACGCATCCGTGGCTGGCCCAGCATCGCGTGCTG
GGCGAGGTCGTCGTCCCCGGCACCGCCATCCTGGAGATGGCCCTGCACGCAGGGGCGCGT
CTCGGCTGTGACCGGGTGGAAGAGCTCACCCTGGAAACACCGCTGGTGGTCCCCGAACGC
GCGGCGGGTGCCGGTAGTCGTGGCCCTGCGGGAGGGACCACAGTTTCAATTGAAACTGCG
GAAGAACGTGTGCGGACGAACGACGCCATCGAAATCCAGCTGCTGGTGAACGCACCCGAC
GAAGGCGGTCGGCGAAGGGTGTCGCTGTATTCCCGCCCGGCCGGTGGGTCGAGAGGTGGG
GGTTGGACGCGCCACGCCACCGGCGAACTCGTCGTCGGCACCACCGGTGGTAGGGCGGTT
CCTGATTGGTCGGCTGAGGGTGCCGAGTCGATTGCTCTCGATGAGTTCTACGTCGCTCTG
GCCGGAAACGGGTTCGAGTACGGGCCGTTGTTCCAGGGGCTTCAGGCGGCATGGCGTCGT
GGTGACGAGGTTCTCGCCGAAATCGCCCCGCCGGCCGAGGCCGATGCGATGGCGTCGGGA
TACCTGCTCGACCCAGCGTTGCTGGATGCCGCGCTGCAGGCGTCCGCGCTCGGCGACCGC
CCGGAGCAAGGCGGCGCGTGGCTGCCGTTCTCATTCACCGGCGTCGAACTTTCCGCTCCG
GCAGGGACGATCAGCAGGGTGCGGCTGGAGACCAGGCGACCCGACGCGATATCGGTGGCC
GTGATGGATGAGAGTGGGCGGTTGCTCGCCTCGATCGATTCTCTCAGGCTACGAAGCGTG
TCGTCGGGACAGCTGGCGAATCGGGACGCTGTCCGCGACGCGCTGTTCGAGGTGACCTGG
GAGCCGGTGGCGACGCAGTCGACGGAACCGGGTCGCTGGGCCCTGCTTGGTGATACTGCC
TGCGGTAAAGACGATCTCATCAAACTCGCAACGGATTCCGCCGACCGCTGCGCGGATCTG
GCGGCGCTAGCCGAGAAACTTGATTCCAGCGCGCTGGTTCCTGATGTCGTGGTCTACTGC
GCCGGAGAACAGGCGGATCCCGGCACCGGCGCAGCCGCACTTGCGGAGACCCAGCAGACG
TTGGCTCTGCTCCAAGCGTGGTTGGCTGAGCCGCGGTTGGCCGAGGCACGTCTGGTGGTG
GTGACGTGTGCAGCGGTGACGACGGCTCCGAGTGACGGTGCATCAGAGCTGGCACATGCG
CCGTTGTGGGGGTTGTTGCGTGCCGCGCAGGTGGAGAACCCGGGGCAGTTTGTGCTGGCG
GACGTCGACGGAACCGCCGAATCGTGGCGTGCGTTGCCGAGTGCGTTGGGCTCGATGGAA
CCGCAGTTGGCCCTGCGGAAGGGCGCGGTGCGAGCGCCCCGCTTGGCTTCGGTCGCCGGG
CAGATCGACGTGCCCGCGGTTGTGGCGGATCCCGACCGAACCGTGCTGATTTCGGGCGGC
ACGGGCCTGTTGGGGGGCGCGGTTGCCCGCCACCTGGTGACCGAACGCGGTGTCCGCCGA
TTGGTGTTGACGGGCCGTCGTGGCTGGGATGCTCCTGGAATCACCGAGTTGGTGGGTGAG
CTGAACGGCCTCGGTGCCGTGGTCGACGTGGTGGCGTGCGACGTCGCGGATCGTGCTGAT
CTGGAGTCGTTGCTGGCGGCGGTCCCGGCGGAATTTCCGTTGTGCGGCGTGGTGCATGCC
GCGGGGGCGCTGGCCGACGGGGTGATCGAGTCGTTGTCACCGGACGACGTGGGAGCGGTG
TTCGGCCCGAAGGCGGCGGGGGCGTGGAATCTGCACGAGCTGACTCGTGATACGGACCTG
TCGTTCTTCGCGTTGTTCTCCTCGCTTTCCGGTGTTGCCGGCGCTCCTGGTCAGGGCAAT
TATGCGGCGGCGAACGCGTTCCTGGACGCATTGGCGCATTACCGGCGGTCACAGGGACTG
CCTGCGGTGTCGCTGGCCTGGGGCCTGTGGGAGCAGCCGAGCGGGATGACGGAGACGCTC
AGCGAGGTCGACCGGAGCAGGATCGCGCGCGCCAACCCGCCGTTGTCCACCAAGGAGGGA
TTGCGGCTGTTCGATGCCGGGCTGGCGCTGGACCGGGCAGCGGTAGTTCCGGCGAAGTTG
GACAGGACTTTCCTGGCCGAGCAGGCGCGGTCGGGCTCGCTGCCCGCATTGTTGACGGCA
CTGGTACCCCCCATCCGTCGTAATAGGCGGGCTAGCGGAACCGAGCTCGCGGACGAGGGC
ACCCTGCTCGGGGTGGTGCGGGAGCATGCCGCGGCCGTGCTGGGGTATTCGAGCGCGGCT
GACGTCGGGGTCGAGCGCGCTTTCCGGGATCTGGGTTTTGATTCGTTGTCTGGTGTGGAG
TTGCGGAACCGCCTTGCCGGGGTGCTGGGGGTGCGGTTGCCGGCGACTGCGGTGTTCGAC
TATCCGACGCCGAGGGCGCTGGCCCGGTTCCTGCACCAGGAACTGGCAGACGAGATCGCT
ACGACGCCAGCGCCGGTGACGACGACCAGGGCACCGGTCGCCGAAGACGATCTCGTCGCG
ATAGTCGGGATGGGATGCCGTTTTCCCGGTCAGGTGTCCTCGCCGGAGGAGCTCTGGCGT
TTGGTGGCCGGGGGCGTGGATGCGGTCGCGGACTTCCCAGCCGATCGCGGCTGGGATCTG
GCAGGCTTGTTCGATCCGGACCCGGAACGGGCTGGGAAGACCTACGTGCGGGAAGGGGCC
TTCCTCACCGACGCCGATCGGTTCGATGCGGGTTTCTTCGGGATTTCCCCGCGTGAGGCG
TTGGCGATGGATCCGCAGCAACGGCTGTTGCTGGAGCTGTCCTGGGAGGCCATTGAACGG
GCAGGGATCGATCCGGGTTCGCTGAGGGGGAGTCGGACCGGTGTGTTCGCGGGGCTGATG
TACCACGACTATGGCGCCCGGTTCGCCAGCCGAGCCCCGGAAGGTTTCGAGGGGTATCTC
GGCAATGGCAGTGCTGGGAGTGTCGCGTCGGGCCGGATTGCGTACTCGTTTGGTTTCGAG
GGTCCTGCGGTGACGGTGGATACTGCGTGTTCGTCGTCGTTGGTGGCGTTGCATTTGGCG
GGTCAGTCGTTGCGTTCCGGCGAATGCGATCTCGCCCTTGCCGGTGGTGTGACGGTGATG
TCGACGCCCGGGACGTTTGTGGAATTCTCCCGTCAGCGGGGCCTGGCACCGGACGGGCGG
TGCAAGTCGTTCGCGGAGAGCGCGGACGGTACCGGTTGGGGTGAGGGTGCTGGTTTGGTG
TTGTTGGAGCGGTTGTCGGATGCTCGGCGGAATGGGCATCGGGTGTTGGCGGTGGTTCGT
GGGTCGGCGGTGAATCAGGATGGTGCGTCGAATGGCTTGACCGCGCCGAATGGTCCCTCG
CAGCAGCGGGTCATCCAGCAGGCGTTGGCGAGTGCGGGTCTGTCGGTGTCCGATGTGGAT
GCCGTGGAGGCGCATGGGACCGGGACCAGGTTGGGTGATCCGATTGAGGCGCAGGCTCTG
ATTGCTACGTATGGGCGCGATCGTGATCCCGGTCGGCCGTTGTGGTTGGGGTCGGTGAAG
TCCAACATCGGTCATACGCAGGCGGCGGCGGGTGTTGCCGGTGTGATCAAGATGGTGATG
GCGATGCGGCACGGGCAACTTCCGCGCACGCTGCACGTGGATGCACCCTCCTCGCAGGTG
GATTGGTCGGCGGGGAGGGTCCAGCTCCTGACGGAGAACACGCCCTGGCCCGACAGTGGT
CGCCCCTGTCGGGTGGGGGTGTCGTCGTTCGGGATCAGCGGCACCAACGCGCACGTCATC
CTGGAACAGTCCACGGGGCAGATGGATCAGGCAGCGGAGCCGGATTCGAGTCCTGTTCTG
GATGTTCCGGTGGTGCCGTGGGTGGTGTCGGGCAAAACACCCGAAGCGCTATCCGCCCAG
GCGGCAACGTTGGCGACCTATTTGGACCAAAATGTTGATGTCTCCCCTCTGGACGTTGGG
ATTTCGCTTGCGGTGACCCGTTCGGCGCTGGATGAGCGGGCGGTGGTGCTGGGGTCGGAT
CGTGACACGTTGTTGTCTGGCCTGAATGCGCTGGCTGCCGGTCATGAGGCTGCTGGCGTG
GTTACGGGACCTGTCGGGATTGGTGGCCGGACCGGGTTTGTGTTCGCCGGTCAAGGCGGT
CAGTGGTTGGGGATGGGCCGCCGGTTGTACTCGGAGTTTCCGGCGTTCGCCGGTGCTTTC
GACGAAGCATGCGCCGAGCTCGATGCGAACCTGGGGAGGGAAGTCGGGGTTCGGGATGTG
GTGTTCGGCTCCGACGAGTCCTTGCTGGATCGGACTTTGTGGGCGCAGTCGGGTTTGTTC
GCGTTGCAGGTCGGTCTCTGGGAATTGTTGGGTACGTGGGGTGTTCGGCCCAGCGTAGTG
CTGGGGCATTCGGTCGGGGAGCTAGCCGCGGCGTTCGCCGCAGGTGTGCTGTCGATGGCG
GAGGCGGCTCGGCTGGTGGCGGGTCGTGCGCGGTTGATGCAGGCGTTGCCTTCTGGCGGT
GCCATGCTGGCGGTGTCCGCGACCGAGGCCCGAGTCGGCCCGCTGCTCGATGGGGTGCGG
GATCGTGTTGGTGTCGCAGCGGTTAACGCTCCGGGGTCGGTGGTGCTTTCCGGTGACCGG
GATGTGCTCGATGGCATTGCCGGTCGGCTGGACGGGCAAGGTATCCGGTCGAGGTGGTTG
CGGGTTTCGCACGCGTTTCATTCGCATCGGATGGATCCGATGCTGGCGGAGTTCGCCGAG
CTCGCACGGAGCGTGGACTACCGGTCTCCACGGCTGCCGATTGTCTCGACGCTGACCGGA
AACCTCGATGACGTGGGCGTGATGGCTACGCCGGAGTATTGGGTGCGCCAGGTGCGAGAG
CCCGTCCGCTTCGCCGACGGTGTCCAGGCGCTTGTGGACCAAGGCGTCGACACGATTGTG
GAACTCGGTCCGGACGGGGCGTTGTCGAGCTTGGTTCAAGAGTGTGTGGCGGAGTCCGGG
CGGGCGACGGGGATTCCGTTGGTGCGGAGAGACCGTGATGAGGTCCGAACGGTGCTGGAC
GCTTTGGCGCAGACCCACACTCGTGGTGGCGCGGTGGACTGGGGGTCATTTTTCGCTGGT
ACGAGGGCAACGCAAGTCGACCTTCCCACGTATGCCTTCCAACGACAGCGGTACTGGCTG
GAGCCATCGGATTCCGGTGATGTGACCGGTGTTGGCCTGACCGGGGCGGAGCATCCGCTG
TTGGGTGCCGTGGTGCCGGTCGCGGGCGGCGATGAGGTGCTGCTGACCGGCAGGCTGTCG
GTGGGGACGCATCCGTGGCTGGCGGAACACCGCGTGCTGGGCGAAGTCGTCGTCCCCGGC
ACCGCGTTGCTGGAGATGGCGTGGCGGGCCGGTAGCCAGGTCGGTTGTGAACGTGTGGAG
GAGCTCACCTTGGAGGCACCGCTGGTCCTGCCGGAGCGGGGCGCTGCGGCGGTGCAGTTG
GCGGTGGGGGCTCCGGATGAGGCCGGCCGGCGCAGTTTGCAGCTCTATTCCCGAGGCGCT
GATGAAGACGGCGACTGGCGGCGGATTGCCTCCGGGCTGTTGGCCCAGGCCAATGCGGTG
CCGCCGGCGGATTCGACGGCATGGCCGCCGGACGGCGCCGGGCAGGTCGATCTGGCGGAG
TTCTACGAGCGCCTCGCCGAGCGCGGCTTGACCTACGGTCCGGTATTCCAAGGGCTCCGC
GCCGCATGGCGGCACGGCGACGATATCTTCGCCGAATTGGCCGGGTCACCAGACGCCTCG
GGTTTCGGCATCCACCCGGCGCTGCTGGACGCTGCACTGCACGCGATGGCGCTTGGTGCT
TCGCCCGACTCGGAAGCGCGTCTGCCGTTTTCCTGGCGTGGCGCCCAGCTGTACCGCGCT
GAAGGAGCAGCGCTTCGGGTACGGCTCTCGCCGCTGGGCTCCGGTGCAGTCTCATTGACG
TTGGTGGATGCCACAGGGCGACGAGTCGCTGCGGTGGAATCGCTTTCGACGCGACCGGTC
TCCACCGACCAGATCGGTGCCGGTCGCGGCGATCAAGAGCGGCTGCTGCACGTCGAGTGG
GTAAGGTCGGCTGAATCTGCGGGGATGTCTCTGACCTCCTGCGCGGTGGTCGGTTTGGGC
GAACCGGAGTGGCACGCTGCGCTGAAGACCACTGGTGTCCAAGTCGAGTCCCATGCGGAC
CTTGCTTCGTTGGCCACCGAGGTTGCCAAGCGGGGTTCAGCTCCTGGTGCGGTCATCGTC
CCGTGCCCGCGACCCCGAGCGATGCAGGAGCTGCCGACCGCCGCGCGAAGGGCGACGCAA
CAGGCGATGGCGATGCTGCAGCAATGGCTTGCCGATGACCGGTTCGTCAGTACGCGCCTG
ATCCTGCTGACGCATCGGGCGGTCTCCGCAGTTGCTGGAGAAGACGTGCTCGACCTGGTA
CACGCGCCGCTGTGGGGCTTGGTCCGCAGCGCGCAAGCGGAGCACCCGGACCGATTCGCC
TTGATCGATATGGACGACGAGCGAGCATCGCAGACGGCACTCGCCGAAGCGCTGACTGCG
GGAGAAGCGCAGCTCGCGGTGCGGTCGGGAGTTGTGCTGGCGCCCCGCCTCGGCCAGGTG
AAGGTGAGTGGAGGTGAAGCGTTCAGGTGGGATGAAGGCACCGTGCTGGTCACCGGCGGA
ACCGGCGGGCTCGGGGCCCTGCTCGCACGCCATCTGGTCAGCGCCCACGGTGTGCGGCAC
CTGTTGCTCGCAAGTCGCCGTGGTCTGGCGGCGCCCGGAGCGGATGAGCTGGTGGCCGAG
CTGGAGCAGGCCGGCGCCGACGTCGCGGTCGTCGCGTGCGACTCGGCAGATCGGGACTCG
CTTGCGCGGCTGGTGGCGTCGGTGCCTGCGGAAAACCCGTTGCGGGTGGTGGTGCACGCC
GCCGGTGTGCTGGATGACGGTGTGCTGATGTCGATGTCGCCGGAGCGCTTGGACGCGGTG
TTGCGGCCCAAAGTGGATGCCGCGTGGTACCTGCACGAGCTGACTCGGGAACTCGGTCTG
TCGGCGTTCGTGTTGTTCTCCTCGGTCGCGGGCCTGTTCGGCGGTGCGGGGCAGAGCAAT
TACGCTGCCGGCAACGCTTTCCTGGATGCCTTGGCGCATTGCCGGCAGGCCCAGGGGCTG
CCCGCGCTGTCGCTGGCCTCCGGGCTGTGGGCGAGTATCGATGGAATGGCGGGCGACCTC
GCTGCGGCAGATGTGGAGCGGCTGTCGCGGGCAGGCATTGGCCCGCTTTCGGCACCGGGA
GGGCTGGCCTTGTTCGACGCTGCCGTTGGCTCGGACGAACCGTTGCTGGCACCGGTGCGA
CTGGATGTCGAAGCACTGCGTGTGCAGGCCCGATCCGTGCAGACCCGGATTCCGGAAATG
CTGCATGGCATGGCAATGGGGCCAAGCCGCCGCACTCCGTTCACTTCCAGGGTTGAGCCG
TTGCACGAACGGCTGGCCGGATTGTCGGAGGGCGAACGTCGGCAGCAAGTGCTCCAGCGC
GTCCGCGCCGATATCGCGGTGGTACTGGGGCACGGCAGGTCGAGCGATGTGGACATCGAG
AAGCCTTTGGCCGAGCTGGGTTTCGACTCGCTGACGGCCATCGAACTCCGCAACCGTCTC
GCTACCGCCACCGGACTGCGGCTTCCCGCGACGCTGGCCTTCGACCACGGCACTGCGGCG
GCACTCGCCCAGCACGTGTGCGCGCAGCTAGGCACCGCGACCGCGCCGGCACCGAGGCGA
ACCGACGACAACGACGCCACGGAGCCCGTGAGGTCGCTCTTCCAACAGGCGTATGCGGCT
GGCCGGATACTTGACGGGATGGATTTGGTGAAGGTCGCTGCCCAGTTGCGACCGGTGTTC
GGTTCGCCTGGCGAGCTGGAATCCCTGCCGAAACCCGTCCAGCTTTCCCGTGGTCCCGAA
GAGCTTGCCTTGGTGTGCATGCCGGCGCTGATCGGGATGCCGCCCGCACAGCAGTACGCG
CGGATCGCCGCCGGGTTCCGCGATGTGCGGGACGTTTCGGTGATCCCGATGCCTGGATTC
ATTGCGGGAGAACCGCTGCCGTCCGCCATCGAGGTGGCGGTTCGGACGCAGGCGGAGGCG
GTGCTGCAGGAATTCGCCGGGGGCTCGTTCGTACTGGTCGGGCATTCCTCCGGGGGCTGG
CTGGCGCACGAGGTAGCCGGTGAGCTGGAGCGTCGCGGGGTCGTCCCGGCCGGGGTCGTA
CTGCTGGACACCTACATCCCCGGTGAGATCACGCCGAGGTTCTCCGTGGCGATGGCCCAC
CGGACGTATGAGAAGCTCGCGACTTTCACGGACATGCAGGATGTCGGTATCACCGCGATG
GGCGGGTACTTCCGGATGTTCACCGAGTGGACTCCGACGCCGATCGGTGCTCCGACGCTG
TTCGTGCGGACCGAAGATTGCGTCGCAGACCCTGAAGGGCGGCCGTGGACAGATGACTCC
TGGCGGCCAGGGTGGACTCTCGCGGATGCCACGGTCCAGGTGCCGGGCGACCACTTCTCG
ATGATGGACGAGCACGCCGGGTCCACCGCACAGGCAGTCGCGAGTTGGCTTGACAAACTC
AACCAGCGCACCGCTCGGCAACGCTGA
[8] KS34..410
[8] AT563..878
[8] methylmalonyl-CoA752..756
[8] DH928..1093
[8] KR1404..1584
[8] ACP1684..1758
[9] KS1782..2159
[9] AT2320..2638
[9] malonyl-CoA2511..2515
[9] DH2685..2874
[9] KR3193..3373
[9] ACP3462..3532
[10] KS3559..3934
[10] AT4093..4411
[10] malonyl-CoA4284..4288
[10] DH4458..4619
[10] KR4933..5113
[10] ACP5217..5287
[10] TE5364..5577
[8] KS100..1230
[8] AT1687..2634
[8] methylmalonyl-CoA2254..2268
[8] DH2782..3279
[8] KR4210..4752
[8] ACP5050..5274
[9] KS5344..6477
[9] AT6958..7914
[9] malonyl-CoA7531..7545
[9] DH8053..8622
[9] KR9577..10119
[9] ACP10384..10596
[10] KS10675..11802
[10] AT12277..13233
[10] malonyl-CoA12850..12864
[10] DH13372..13857
[10] KR14797..15339
[10] ACP15649..15861
[10] TE16090..16731

close this sectionFeature

BLASTP
Database:UniProtKB:2011_09
show BLAST table
InterPro
Database:interpro:38.0
IPR001031 Thioesterase (Domain)
 [5364-5577]  1.9e-24 PF00975
PF00975   Thioesterase
IPR001227 Acyl transferase domain (Domain)
 [553-685]  G3DSA:3.40.366.10 [752-869]  G3DSA:3.40.366.10 [2316-2443]  G3DSA:3.40.366.10 [2511-2627]  G3DSA:3.40.366.10 [4090-4216]  G3DSA:3.40.366.10 [4284-4401]  G3DSA:3.40.366.10
G3DSA:3.40.366.10   Ac_transferase_reg
IPR002198 Short-chain dehydrogenase/reductase SDR (Family)
 [1404-1571]  3.7e-58 PF00106 [3194-3360]  4.10000000000004e-59 PF00106 [4933-5100]  1.70000000000001e-63 PF00106
PF00106   adh_short
IPR006162 Phosphopantetheine attachment site (PTM)
 [3490-3505]  PS00012 [5245-5260]  PS00012
PS00012   PHOSPHOPANTETHEINE
IPR009081 Acyl carrier protein-like (Domain)
 [1684-1758]  PS50075 [3462-3532]  PS50075 [5217-5287]  PS50075
PS50075   ACP_DOMAIN
 [1684-1798]  4.90001027782405e-28 SSF47336 [3455-3573]  2.19999900980708e-28 SSF47336 [5210-5324]  8.1000073432326e-22 SSF47336
SSF47336   ACP_like
 [1687-1761]  4.49999999999996e-66 G3DSA:1.10.1200.10 [3462-3537]  4.49999999999996e-66 G3DSA:1.10.1200.10 [5214-5291]  4.49999999999996e-66 G3DSA:1.10.1200.10
G3DSA:1.10.1200.10   ACP_like
 [1692-1757]  1.5e-11 PF00550 [3466-3531]  8.00000000000001e-12 PF00550 [5219-5286]  1.4e-10 PF00550
PF00550   PP-binding
IPR014030 Beta-ketoacyl synthase, N-terminal (Domain)
 [34-284]  2.59999999999995e-98 PF00109 [1782-2033]  3.19999999999996e-95 PF00109 [3559-3808]  2.30000000000004e-98 PF00109
PF00109   ketoacyl-synt
IPR014031 Beta-ketoacyl synthase, C-terminal (Domain)
 [292-410]  3.80000000000003e-45 PF02801 [2041-2159]  5.00000000000001e-49 PF02801 [3816-3934]  6.6999999999999e-49 PF02801
PF02801   Ketoacyl-synt_C
IPR014043 Acyl transferase (Domain)
 [563-878]  1.10000000000001e-99 PF00698 [2320-2638]  1.2e-61 PF00698 [4093-4411]  2.80000000000001e-64 PF00698
PF00698   Acyl_transf_1
IPR015083 Polyketide synthase, docking (Domain)
 [4-34]  1.10000029896148e-06 SSF101173
SSF101173   Polyketide_synth_docking
 [1-27]  4.9e-11 PF08990
PF08990   Docking
IPR016035 Acyl transferase/acyl hydrolase/lysophospholipase (Domain)
 [560-871]  7.40000780398351e-66 SSF52151 [2317-2603]  2.1000026783403e-65 SSF52151 [4090-4383]  8.10000734323267e-70 SSF52151
SSF52151   Acyl_Trfase/lysoPlipase
IPR016036 Malonyl-CoA ACP transacylase, ACP-binding (Domain)
 [687-751]  1.80000149287196e-15 SSF55048 [2445-2510]  3.30000526807186e-17 SSF55048 [4218-4283]  1.39999892049878e-17 SSF55048
SSF55048   Malonyl_transacylase_ACP-bd
IPR016038 Thiolase-like, subgroup (Domain)
 [34-294]  G3DSA:3.40.47.10 [297-461]  G3DSA:3.40.47.10 [1784-2045]  G3DSA:3.40.47.10 [2046-2211]  G3DSA:3.40.47.10 [3559-3820]  G3DSA:3.40.47.10 [3821-3984]  G3DSA:3.40.47.10
G3DSA:3.40.47.10   Thiolase-like_subgr
IPR016039 Thiolase-like (Domain)
 [26-408]  8.50003024809048e-104 SSF53901 [1774-2211]  8.99992502689852e-106 SSF53901 [3549-3932]  2.99998750445706e-107 SSF53901
SSF53901   Thiolase-like
IPR016040 NAD(P)-binding domain (Domain)
 [1403-1612]  8.29999999999993e-112 G3DSA:3.40.50.720 [3192-3379]  4.2e-112 G3DSA:3.40.50.720 [4932-5110]  8.29999999999993e-112 G3DSA:3.40.50.720
G3DSA:3.40.50.720   NAD(P)-bd
IPR018201 Beta-ketoacyl synthase, active site (Active_site)
 [197-213]  PS00606 [1946-1962]  PS00606 [3721-3737]  PS00606
PS00606   B_KETOACYL_SYNTHASE
IPR020801 Polyketide synthase, acyl transferase domain (Domain)
 [564-860]  9.09992432541739e-114 SM00827 [2321-2619]  8.79996401030984e-111 SM00827 [4094-4392]  9.59995698566587e-116 SM00827
SM00827   PKS_AT
IPR020802 Polyketide synthase, thioesterase domain (Domain)
 [5365-5577]  4.99996518094122e-118 SM00824
SM00824   PKS_TE
IPR020806 Polyketide synthase, phosphopantetheine-binding domain (Domain)
 [1689-1761]  3.1999989904635e-32 SM00823 [3463-3535]  7.90003079443025e-35 SM00823 [5218-5290]  1.50000306971104e-31 SM00823
SM00823   PKS_PP
IPR020807 Polyketide synthase, dehydratase domain (Domain)
 [928-1093]  1.90000694315261e-86 SM00826 [2685-2874]  1.09999184471405e-73 SM00826 [4458-4619]  6.69996745106047e-85 SM00826
SM00826   PKS_DH
IPR020841 Polyketide synthase, beta-ketoacyl synthase domain (Domain)
 [36-462]  SM00825 [1784-2211]  SM00825 [3559-3986]  SM00825
SM00825   PKS_KS
IPR020842 Polyketide synthase/Fatty acid synthase, KR (Domain)
 [1404-1584]  8.39998014024981e-60 SM00822 [3193-3373]  4.50001022902825e-61 SM00822 [4933-5113]  1.79999754022375e-63 SM00822
SM00822   PKS_KR
SignalP No significant hit
TMHMM No significant hit
Page top