Meili_00240 : CDS information

Location

Organism:
Strain: NS3226
Entry name: Meilingmycin
Contig:
Start / Stop / Direction: 39,452 / 57,415 / + [in whole cluster]; 39,452 / 57,415 / + [in contig]
Location: 39452..57415 [in whole cluster]; 39452..57415 [in contig]
Type: CDS
Length: 17,964 bp (5,987 aa)
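The stated length can be cross-checked against the start and stop coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon (the nucleotide sequence in this record ends in TAG); the variable names are illustrative only.

```python
# Coordinates copied from the Location block of this record (1-based, inclusive).
start, stop = 39_452, 57_415

# Inclusive span on the + strand.
length_bp = stop - start + 1

# Split into codons; a non-zero remainder would indicate a frame problem.
codons, remainder = divmod(length_bp, 3)

# Assumption: the last codon is a stop codon and is not counted as a residue.
length_aa = codons - 1

print(length_bp, remainder, length_aa)
```

Under these assumptions the span and the residue count agree with the "17,964 bp (5,987 aa)" figure given in the record.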

Annotation

Category: 1.1 PKS
Product: polyketide synthase
Product (GenBank): modular polyketide synthase
Gene:
Gene (GenBank): meiA2
EC number:
Keyword:
Note:
Note (GenBank):
  • similar to AVES 2
Reference
ACC:
PmId: [20348291] Cloning of separate meilingmycin biosynthesis gene clusters by use of acyltransferase-ketoreductase didomain PCR amplification. (Appl Environ Microbiol., 2010)
Comment:
Reference that identified the meilingmycin biosynthesis gene clusters.

Table 1.
MeiA2 (5,987 a.a.): PKS: module 3, 4, 5, 6

No deletion or complementation experiments were performed for meiA2; its role is predicted from the structure of meilingmycin.

PKS/NRPS Module

Module 3: malonyl-CoA, not conserved HAFHS(H->G)
  KS 33..409
  AT 564..879
  ACP 949..1019
Module 4: malonyl-CoA, not conserved HAFHS(H->G)
  KS 1042..1417
  AT 1576..1891
  KR 2204..2384
  ACP 2487..2557
Module 5: malonyl-CoA, not conserved HAFHS(H->G)
  KS 2581..2956
  AT 3111..3426
  KR 3693..3874
  ACP 3978..4048
Module 6: methylmalonyl-CoA, not conserved YASHS(S->C)
  KS 4078..4454
  AT 4625..4943
  DH 4994..5174
  KR 5523..5703
  ACP 5815..5885
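The "not conserved" annotations can be checked against the protein sequence given in this record, whose feature list places the module 3 motif at residues 758..762 and the module 6 motif at residues 4817..4821. The following is a minimal sketch, assuming 60 residues per FASTA line (so the 13th line starts at residue 721 and the 81st at residue 4801); the helper names are hypothetical and not part of the database.

```python
# Two 60-residue lines copied from the protein FASTA in this record.
# Assumption: every FASTA line holds 60 residues, so these lines start at
# residues 721 and 4801 respectively (1-based).
FRAGMENTS = {
    3: (721, "AVNAPGSVVISGDRHDVDATAENFRTMGRKTTPLKVSGAFHSHHIDPLLDELRATAETLT"),
    6: (4801, "NHYRDQDIDAKRIPVDYASHCPHIQAVQHELSDLLQDITPRAATTPFYSTTDNQWTDTTT"),
}

# Reference motif and listed coordinates (1-based, inclusive) per module.
MOTIFS = {
    3: ("HAFHS", 758, 762),
    6: ("YASHS", 4817, 4821),
}

def observed_motif(module):
    """Return the residues found at the listed motif coordinates."""
    start, seq = FRAGMENTS[module]
    _, first, last = MOTIFS[module]
    return seq[first - start : last - start + 1]

def substitutions(module):
    """List (position, reference, observed) for every mismatching residue."""
    reference, first, _ = MOTIFS[module]
    observed = observed_motif(module)
    return [
        (first + i, ref, obs)
        for i, (ref, obs) in enumerate(zip(reference, observed))
        if ref != obs
    ]

for module in MOTIFS:
    print(module, observed_motif(module), substitutions(module))
```

The same approach should apply to modules 4 and 5 once the matching sequence lines are supplied.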

Sequence

>polyketide synthase [modular polyketide synthase]
MHEDELLSYLKRVTADLDRTRRRLYEVTEREQEPIAIVGMACRFPGEVRSAEDFWQLIMA
ERDAIGDFPTDRGWDVERLYDPDPDRSGTCYTRHGGFLYDAAGFDAEFFETSPREALAMD
PQQRLLLETSWEAFEHAGIDPTSVRGSRTAVFTGINPPDYPVGHASRPPESAEGFILTGS
AGSIASGRIAYTLGLEGPAVTVDTACSSSLVALHLACQALRAEECSMALAGGVAVMSTPR
IFLEFARQRGLSVDGRCKAFGVGADGTGWAEGVGMLLVERLSDARRLGHRVLAVVRGSAV
NQDGASNGLTAPNGPSQQRVIRQALASARVGGADVDVVEGHGTGTRLGDPIEAQALLATY
GQERSGDEPLWLGSVKSNIGHAQAAAGVAGVIKMVMAMRCGVLPRTLHVQEPSPHVDWSS
GGVRLLTEAVPWPETGRARRAGVSSFGVSGTNAHIILEQAPPEEHDDPADVSSGSFPWMV
SAKSEQALQAQAAQLRAYLAARPGVGLADVGYALAAGRTAFDHRAVLLGPDREAFLEGLG
ALGAGEEHAGLVRGVATGAGKLAFVCSGQGTQRPRMGHELYRAFPLFAAAMDEACAYLDP
HLDRPLRDVVFAEPDSGTARLLQQTRYAQPALFALQVALHRLVTEHYDLTPHYYAGHSLG
EITAAHLAGILTLCDAARLVTTRARLMQSLPATGAMTTLQADPDELHEHLTRCEGRVSLA
AVNAPGSVVISGDRHDVDATAENFRTMGRKTTPLKVSGAFHSHHIDPLLDELRATAETLT
YHPPHTPLITTDLTDQDPTTPGYWVRQTRETVHYAHTTQQLHTHGVTAYLELGPDTTLTT
LTHHNLPHHTPLAIPLLHPDQPETHTTHTALAHLHTHGHPTTWHHHHTPTHHHPNLPTYP
FQHHHYWLHVSEPQGEQHTPRPISSEPPVTATLRQRLSHQPPDAQQQTLLDAVRTHIAGV
LGHGSPENIHAERAFKELGFTSLTAVQFRNRLCEATGLTLAPTIVFDYPSPAMLVQHLCQ
QLLGTRGETVSHVPHTAIGTGEPIAIVGMACRYPGDVQSPEQLWDLLVTEQDAISGFPTD
RGWDLDNLYDPDPERFGTSYTREGGFIHQAGEFDAEFFGISPREALAMDPQQRILLEVSW
EAFERAGIDPTSVRGTQTGIFAGLAYHDYAQRFPIAPEGFEGYLVHGSAGSIASGRVAYT
FGLEGPAVTVDTACSSSLVALHLACQALRSGECSMALAGGVTVMSTPAAFIEFSRQRGLS
PDGRCKAFSATADGTGWGEGAGMVLVERLSDAQRLGHPILAVVRGTAVNQDGASNGLTAP
NGLSQQRVIRQALANAELTPAAIDAVEGHGTGTTLGDPIEAQALLATYGQDRSADQPLLL
GSMKSNIGHAQAAAGVGGIVKMVLAMRHGVLPRTLHVQEPSPHVDWASGGVRLLTEAVPW
PETGRARRAGVSSFGVSGTNAHVILEQAPPTKAPANDEVTSAPTPSLEPWLVSAKSEDAL
QAQARRLRQYLALAPKLDLADVGYALATGRTAFDHRAVLLGPDREAFLEGLGALGAGEEH
AGLVRGVATGAGKLAFVCSGQGTQRPRMGHELYRAFPLFAAAMDEACTHLDPHLDHPLRD
VMFAEPDTDTAQLLHQTRYAQPALFALQVALHRLVTEHYDLTPHYYAGHSLGEITAAHLA
GILTLPDAARLVTTRARLMQSLPATGAMTTLQADPDELHEHLTRCEGRVSLAAVNAPGSV
VISGDRHDVDATAENFRAMGRKTTPLKVSGAFHSHHIDPLLDELRTTAETLTYHPPHTPL
ITTNPTDHDPTTPGYWIRQTRETVHYTHTTQQLHTHGVTAYLELGPDTTLTTLTHHNLPH
HTPLAIPLLHPDQPETHTTHTALAHLHTHGHPTTWHHHHTPTHHHPNLPTYPFQHHHYWL
NTTTPTPNTTSIRFLESLESEDVDAVASALAIDRDSSLETVLPALSTWRKRHDGQAVIDS
WGYRETWKPVTLSQSGTRSPGTWLIVVPTLQSRTAPIDVIVDALHRLGARTITLTLDSSC
ADHETLARRLAEHSDLVNIDGVVSLLAFDEEPHPEHPHLPTGTALTLTLIQTLTTHPNTT
APLWCLTQTATTTHPTDTLNHPTQAHIWGLGRTTALEHPNHWGGLIDLPTTPTPQHLHHL
TTALTTQHNERELALRPTGLHARRLVRTTARGALDDPQPWKPHGTILITGGTGAIGTHIA
TWIATHHPHCHLLLTNRQGPNTPHNTQLHHHLTQLGATTTITTCDTTNPNQLTNLLNTIP
PTHPLTTIIHTAGVLDDATLATQTHDRLSTVLGPKAHAAHHLHNLTRHLDLDAFVLISST
AATFGSPGQANYAAANAYLDALAQHRRAQGLPATSIAWGLWEQGGLANAEITGHLARRGL
LPLPTEPALTALSQAIASPGQAHHIIADIDWGSFAVNLSPSGESVPLTQDIPEARTQRAR
QAMDQDSGTALHRQLADRSPSEQQEVLLQLVRAQVASVLGHSDIDAVPSDRPFKELGLDS
LAAVETRNRLTALAGLRLPTSLIFDHPTPTRLAQYLQTELVGEESGLASAVAFSPAAKTD
DSIAIIGMACRFPGGVSTPEEFWDLILMEHDAIADFPTNRGWDLAGIFHPDPAHKGTCYT
QQGGFLYDAAEFDPVFFDISPREALAMDPQQRLLLETSWEALERAHIDPRSLQGSPVGVF
TGINAQDYAIHLERSPESVEGYVLTGSSSSIASGRIAYTLGLEGPAVTVDTACSSSLVAL
HLASQSLRSGECSMALAGGVTVMSTPTTFVEFARQRGLSTDGRCRAFSSTADGTGWGEGV
GILLVERLSDARRLGHQVLAVVRGSAVNQDGASNGLTAPNGPSQQRVIRQALAGAGLTVP
EIDAVEGHGTGTTLGDPIEAQALLATYGQERPDDRPVWLGSVKSNIGHAQAAAGVAGVIK
MVMAMRHGVLPRTLHVQEPSPHVDWSSGGVRLLTEAVPWPETGRARRAGVSSFGVSGTNA
HIILEQAPPTEDREPGAGSPSSSPWMVSAKSEQALQAQAAQLRAYLAAHPEVGLADVGYA
LAAGRTAFDHRAVLLGPDREAFLEALRALEAGEEHAGLVRGVATGTGKLAFVCSGQGTQR
PRMGHELYHAFPLFAAAMDEACTHLDPHLDHPLRDVMFAEPDTDTAQLLHQTRYAQPALF
ALQISLHRLVTEHYGLTPHYYAGHSLGEITAAHLAGILTLPDAARLVTTRARLMQSLPAT
GAMTTLQADPDELHEHLAGLEGRVSLAAVNAPASVVISGDRHDVDATAENFRAMGRKTTP
LKVSGAFHSHHIDPLLDELRTTAETLTYHQPHTPLITTDLTDQDPTTPGYWVRQTRETVH
YAHTTQQLHTHGVTAYLELGPDTTLTTLTHHNLPHHTPLAIPLLHPDQPETHTTHTALAH
LHTHGHPTTWHHHHTPTHHHPNLPTYPFQHHHYWLNTTTATPNTTDAWRYDEVWQPLDLA
DAEPMPPGDWLVVIPARQAGHPHVDAILSGLREHGGIRMTELVLDPADIDPLVLRQHLAD
AIEQPDDLSISGVLSLLAFDEEPHPEHPHLPTGTALTLTLIQTLTTHPNTTAPLWCLTQT
ATTTHPTDTLNHPTQAHIWGLGRTTALEHPNHWGGLIDLPTTPTPQHLHHLTTALTTQHN
DDQLAIRDTGLQTRRLTRTATTPSNPQPWKPHGTILITGGTGAIGTHIATWIATHHPHCH
LLLTNRQGPNTPHNTQLHHHLTQLGATTTITTCDTTNPNQLTNLLNTIPPTHPLTTIIHT
AGINLKSTIADLSAADLAETAGAKATGAAILHELLREHDTIERFVLFSSIAATWGSANQA
GYAAANAYLDALAQHRRAQGLPATSIAWGPWDGAGMAADGDTRAHLRRRGLRAMSPDLAL
AALDRVLGHGPDSAASTVVADVDWEDFATTFTARRPAPLIDDIPEVRQVLRGDAAPSSAD
SLREQLAQRSPKEQQQALLDVVRTHAAAVLGHSSPESIDAQQAFSALGFDSLTAVEFRNR
VAAAIGLALPTTLVFDHPSPTECATHLRTALLGGTDDDLREGMSGTSAERTQAAAILDEP
IAIVGMACRYPGGVRSAEDLWRLVASGTDAITEFPTDRGWDVERIYHPDPDHEGTCCTQH
GGFLYDAGEFDPAFFGISPREALAMDPQQRLLLEASWEAFEHAGIDPETLRGSQTGVFVG
INVQDYAAHVRQVPQAVAGYALTGSSGSVASGRIAYVFGLEGPTVSVDTACSSSLVALHM
AGQALRTTECSLALVGGVMVMSTPATFIEFSRQRGLSPDGRCKAFSATADGTGWAEGVGM
LLVERLSDARRNGHRVLAVVRGSAINQDGASNGLTAPNGPAQQRVIRQALAGAGLSPSEV
DAVEGHGTGTVLGDPVEAQALLATYGQDRAEDHPLWLGSVKSNIGHAQAAAGVGGVIKMV
MALQHDTLPRTLHADEPSPHVDWSAGAVRLLTDAVPWVRNGRPRRAGVSSFGVSGTNAHL
ILEEAPSDEPDGPGVAGPTAAPAVEASAVSLPWLLSAKSADALRAQARQLREFVSAAPEA
GSGPGLADIGYSLATHRSAFEHRAVVIGSDRADFLGGLDALAADEAHSAVVTGIARKAGD
LGKVVFVFPGQGGQWAGMGLRLLKTSPVFAQSIQACEQALAPHTDWTLTDILHRPHTDPL
WQRADVIQPALFALMTSLTTLWQSHGLNPDAVIGHSQGEITAAHISGALSLEDAAKIVAL
RSQTLQTLQGSGGMASVPLPADQVTALLHTMWPDQLWVAAINAPTTTVISGDTQALTQAL
NHYRDQDIDAKRIPVDYASHCPHIQAVQHELSDLLQDITPRAATTPFYSTTDNQWTDTTT
LNAHYWYRNLRQPVHLTNAITNLTHQGHHTYIEISPHPTLTPAIQETTHTTHTPTTVIST
LRRNHNDTHQLLHALAHAHTTGHPINWHPTHQHHTPTPQHTDLPTYPFQHQRYWLNTPTQ
TGDAAAIGLDPAHHPLLGAAIPLADSDGHLFTGRLSLRTHPWLADHAFAGVALVPGTAFL
DIALQAGERVGCRHLEELSLHAPLLLPQRGGVVLQISVGAPDCEGRREFAAYARSHDDVS
GTENALGTEGLEAATQSWTRHATGTLTAATPPAAAAGPKAGAWPPGKADAVDLDGLYERL
TGAELAYGPAFHGLRAAWREGSDIFAEVRLPEPQARDAGRFGIHPALLDAALHTLGLDPA
LSEEPADASEGQRAARLPFVWRGVTLHRRGGEVLRVRLSPGPGNGVVAIEATDESGRPVA
SVEALVLRPVSAGEVRAAAHSEHHESLFGLEWPTATLPTADHSPSDPSSFAVVGADPSGL
PYRRHDDWAALLDAVEADGAPELIVVPCGGDEDGDVAGGAAVRTAVRNVLHLLQSWLGDD
LFADSRLVVLTRGAVATRREDDVTDLPGAAVWGLVRSAQSENPGRITLVDWDGHGSLAQV
LPAALAGGEPQLAVRDGEVCVPRLVRMPRQDRPAPTDGAADGSADGPTDASAGASPWALD
PAGTVLITGGTGVLGGLIARHLVAAHGVRQLLLVGRRGAQAEGVRALAAELEAVGATVTV
AACDAADREALATLLGRVPEQHPLTAVVHASGVLDDGTIPSLTPERIDTVFRAKVVPALL
LHELTRDADLAAFVMFSSAASVLGSPGQGNYAAANAVLDALARHRRAQGRPATSLAWGLW
AQGSGMTRHLDGTDHARISRGGMAPLATEEALALFDASSAAGEPFLVPARFELGSLRSRA
TGAGVPALLRGLVPASARRGGAADRGEDGEDRTDVGVSLRERLARCGGKEQQGILTRLVR
SHAAAVIGHAGIEEVAERRAFRELGFDSLTAVELRNRLTTATGLRLPATVAFDFPTPTAL
AEHVRALLLRANGNGTGADGTSASEAGEEELRAAVASIPLGRLREAGLLSALLELAEAPG
GLGGGLGSVGVAAVPAARSAEGPGSIDEMDIDSLIGLAHGDQPGSDS
>polyketide synthase [modular polyketide synthase]
GTGCACGAAGACGAACTCCTCAGCTACCTGAAGCGAGTGACCGCGGACCTGGACCGGACT
CGCCGTCGTCTTTACGAGGTCACCGAGCGGGAGCAAGAACCCATCGCCATTGTCGGCATG
GCCTGCCGCTTCCCCGGCGAGGTCCGGTCCGCGGAGGACTTCTGGCAGCTGATCATGGCG
GAGCGGGACGCGATCGGGGATTTCCCCACCGACCGTGGCTGGGACGTCGAGCGGCTGTAC
GACCCGGACCCGGACCGATCCGGCACCTGCTACACGCGACACGGTGGTTTCCTCTACGAC
GCCGCCGGATTCGACGCCGAATTCTTCGAGACCTCACCACGTGAGGCGCTTGCGATGGAT
CCGCAGCAGCGGCTGCTGCTGGAGACCTCGTGGGAGGCGTTCGAACACGCCGGCATCGAC
CCGACCTCCGTACGCGGCAGCCGGACTGCCGTGTTCACCGGCATCAACCCACCGGACTAC
CCCGTCGGACACGCCTCAAGGCCTCCGGAGTCCGCGGAGGGCTTCATCCTCACCGGCAGC
GCGGGGAGCATCGCCTCCGGCCGAATCGCGTACACGCTGGGCTTGGAAGGGCCTGCGGTC
ACCGTGGACACGGCGTGTTCGTCGTCGCTGGTCGCCCTGCATCTGGCCTGCCAGGCACTG
CGGGCCGAGGAGTGCTCCATGGCGCTGGCTGGTGGAGTGGCCGTCATGTCGACCCCACGC
ATTTTCCTGGAGTTTGCGCGGCAGCGGGGGTTGTCGGTGGATGGGCGGTGCAAGGCGTTT
GGGGTGGGTGCGGATGGTACGGGGTGGGCGGAGGGGGTGGGGATGCTGTTGGTGGAGCGG
TTGTCTGATGCGCGGCGGTTGGGGCATCGGGTGTTGGCGGTGGTGCGGGGTTCTGCGGTG
AATCAGGACGGGGCGAGCAATGGTTTGACGGCGCCGAACGGTCCGTCGCAGCAGCGGGTG
ATCCGGCAGGCGTTGGCCAGTGCGCGGGTTGGTGGGGCGGATGTGGATGTGGTGGAGGGG
CACGGTACGGGGACGCGGCTGGGTGATCCGATCGAGGCGCAGGCGTTGCTGGCGACCTAC
GGTCAGGAGCGGTCGGGGGATGAACCGTTGTGGTTGGGGTCGGTGAAGTCGAATATCGGG
CATGCGCAGGCTGCGGCGGGTGTTGCGGGTGTCATCAAGATGGTGATGGCGATGCGGTGT
GGGGTGTTGCCGCGGACGTTGCATGTGCAGGAGCCGTCGCCGCATGTGGACTGGTCCTCG
GGTGGGGTGCGGCTGCTGACGGAGGCGGTGCCGTGGCCGGAGACGGGTCGTGCGCGGCGT
GCGGGGGTGTCGTCGTTCGGGGTCAGCGGCACCAACGCGCACATCATCCTCGAACAGGCA
CCGCCGGAGGAGCACGACGATCCGGCGGACGTTTCGTCCGGGTCGTTTCCGTGGATGGTG
TCGGCCAAGTCCGAACAGGCACTACAGGCACAGGCAGCGCAGCTGCGCGCGTATCTGGCG
GCACGTCCCGGGGTGGGGCTGGCTGATGTCGGGTATGCGCTGGCCGCCGGCCGTACCGCC
TTCGACCACCGTGCCGTGCTCCTGGGCCCGGACCGCGAAGCCTTCCTCGAAGGGCTGGGG
GCTCTGGGGGCCGGTGAGGAACACGCCGGGCTCGTACGGGGCGTGGCGACGGGTGCGGGG
AAGCTGGCGTTCGTGTGTTCCGGGCAGGGCACGCAGCGCCCTCGTATGGGGCACGAGCTG
TACCGCGCCTTCCCGCTGTTCGCCGCAGCCATGGACGAAGCCTGCGCATACCTGGACCCG
CATCTCGACCGGCCTCTGCGGGATGTCGTGTTCGCCGAGCCGGACTCCGGTACGGCCCGG
CTGCTGCAGCAGACGCGCTATGCCCAGCCCGCGCTGTTCGCCCTCCAAGTCGCCCTGCAT
CGCCTGGTCACCGAACACTACGACCTCACGCCCCACTACTACGCGGGCCATTCCCTGGGG
GAGATCACCGCGGCCCACCTCGCCGGGATCCTCACCCTCTGCGACGCGGCGCGTCTGGTC
ACCACCCGCGCCCGCCTGATGCAGTCTCTCCCCGCCACCGGCGCGATGACCACCCTCCAA
GCAGACCCCGACGAACTCCACGAACACCTCACACGATGCGAGGGACGGGTGTCGCTCGCG
GCCGTGAACGCGCCCGGGTCCGTGGTCATCAGCGGTGACCGCCACGACGTAGACGCCACG
GCCGAAAACTTCCGCACCATGGGGCGCAAGACCACCCCGTTGAAGGTCAGCGGCGCCTTC
CACTCACACCACATCGACCCACTCCTCGACGAACTCCGCGCCACCGCCGAAACCCTCACC
TACCACCCACCCCACACCCCCCTCATCACGACCGACCTGACCGACCAGGACCCCACCACA
CCTGGCTATTGGGTCCGGCAAACACGCGAGACCGTCCACTACGCCCACACCACCCAACAA
CTCCACACCCACGGCGTCACCGCCTACCTCGAACTCGGCCCCGACACCACACTCACCACC
CTCACCCACCACAACCTCCCCCACCACACCCCCCTAGCCATCCCCCTCCTCCACCCCGAC
CAACCCGAAACCCACACCACCCACACCGCCCTCGCCCACCTCCACACCCACGGCCACCCC
ACCACCTGGCACCACCACCACACCCCCACCCACCACCACCCAAACCTCCCCACCTACCCC
TTCCAACACCACCACTACTGGCTTCATGTCTCCGAACCGCAGGGCGAGCAGCACACCCCC
CGCCCCATCTCGTCCGAACCGCCCGTCACAGCTACTCTTCGTCAGCGGCTCTCTCACCAA
CCACCAGACGCGCAGCAGCAGACTCTCCTCGACGCCGTTCGCACACACATCGCCGGCGTC
CTCGGCCACGGGAGTCCCGAGAACATCCACGCCGAGCGAGCATTCAAGGAACTGGGCTTC
ACCTCGCTGACTGCCGTCCAGTTCCGCAATCGCCTGTGCGAGGCCACCGGGTTGACCCTC
GCTCCGACCATCGTGTTCGACTATCCGAGCCCCGCCATGCTCGTACAGCATTTGTGCCAG
CAGTTGCTGGGGACGCGAGGCGAAACCGTCAGCCACGTCCCGCACACAGCCATTGGAACA
GGCGAACCCATCGCAATCGTCGGCATGGCATGCCGCTACCCGGGAGACGTACAGTCCCCG
GAACAGCTGTGGGATCTGCTCGTGACCGAGCAGGACGCGATCTCGGGATTCCCGACCGAT
CGGGGCTGGGACCTCGACAACCTCTACGACCCGGATCCCGAGCGGTTCGGAACGTCGTAC
ACGAGGGAGGGGGGTTTTATTCACCAAGCGGGCGAATTCGACGCCGAATTTTTCGGCATC
AGCCCCCGCGAGGCCCTGGCCATGGACCCTCAGCAGCGAATTCTTCTCGAGGTCTCCTGG
GAAGCGTTCGAGCGAGCAGGCATCGACCCCACCTCGGTACGCGGCACCCAAACCGGGATC
TTCGCCGGCCTGGCCTACCACGACTACGCGCAGCGCTTCCCGATCGCCCCCGAGGGGTTC
GAGGGATACCTCGTTCATGGAAGCGCGGGCAGTATCGCCTCCGGACGGGTCGCCTACACC
TTCGGCCTGGAGGGGCCAGCGGTCACCGTGGACACGGCATGTTCGTCATCGCTGGTCGCC
CTGCATCTGGCCTGCCAGGCGCTGCGGTCAGGGGAATGCTCGATGGCCCTGGCCGGCGGG
GTCACCGTTATGTCCACCCCGGCGGCCTTCATCGAGTTCTCCCGGCAGCGTGGGCTGTCC
CCGGACGGTCGGTGCAAGGCATTCTCCGCCACAGCCGATGGCACCGGGTGGGGCGAGGGT
GCGGGCATGGTGCTGGTGGAACGGCTCTCGGACGCGCAGCGGCTCGGGCATCCCATCCTG
GCCGTAGTACGAGGAACGGCGGTCAACCAGGACGGCGCCAGCAACGGGCTCACCGCGCCC
AACGGCCTCTCACAACAACGGGTCATCCGCCAAGCCCTCGCCAACGCCGAGCTGACTCCA
GCTGCCATTGACGCAGTGGAGGGGCACGGTACGGGGACGACGCTGGGCGACCCGATCGAA
GCCCAGGCCCTTCTGGCGACATACGGTCAGGACCGCTCCGCGGACCAGCCACTTTTGCTC
GGCTCGATGAAGTCGAACATCGGACACGCACAGGCGGCGGCCGGGGTGGGCGGGATCGTC
AAGATGGTACTGGCGATGCGGCACGGGGTGTTGCCGCGGACGTTGCACGTCCAGGAGCCG
TCGCCACATGTGGACTGGGCCTCGGGAGGGGTGCGGCTGCTGACGGAGGCGGTGCCGTGG
CCGGAGACGGGTCGTGCGCGGCGTGCGGGCGTATCTTCCTTCGGCGTCAGCGGCACCAAC
GCGCACGTGATCCTCGAACAGGCACCACCGACCAAGGCCCCGGCAAACGATGAAGTCACC
TCAGCGCCAACACCGTCGCTTGAACCATGGCTTGTGTCCGCGAAATCCGAGGACGCGCTG
CAAGCGCAGGCGCGACGGCTGCGCCAGTACCTCGCTTTGGCCCCGAAACTCGACTTGGCC
GATGTTGGGTACGCCCTGGCCACCGGCCGTACCGCCTTCGACCACCGTGCCGTACTCCTG
GGCCCGGACCGCGAAGCCTTCCTCGAAGGGCTGGGGGCTCTGGGGGCCGGTGAGGAACAC
GCCGGGCTCGTACGGGGCGTGGCGACGGGTGCGGGGAAGCTGGCGTTCGTGTGTTCCGGG
CAGGGCACGCAGCGCCCTCGTATGGGGCACGAGCTGTACCGCGCCTTCCCGCTGTTCGCC
GCAGCCATGGACGAAGCCTGCACACACCTCGACCCACACCTCGACCATCCCCTGCGGGAC
GTCATGTTCGCCGAGCCGGACACCGACACCGCCCAGCTGCTCCACCAGACCCGCTACGCC
CAGCCCGCCCTGTTCGCCCTCCAAGTCGCCCTGCATCGCCTGGTCACCGAACACTACGAC
CTCACGCCCCACTACTACGCGGGTCACTCTCTGGGGGAGATCACCGCGGCCCACCTCGCC
GGGATCCTCACCCTCCCCGACGCGGCGCGCCTGGTCACCACCCGCGCCCGACTCATGCAA
TCTCTCCCCGCCACCGGCGCGATGACCACCCTCCAAGCAGACCCCGACGAACTCCACGAA
CACCTCACACGATGCGAAGGACGGGTGTCGCTCGCGGCGGTCAACGCGCCCGGGTCCGTG
GTCATCAGCGGTGACCGCCACGACGTAGACGCCACGGCCGAAAACTTCCGCGCCATGGGG
CGCAAGACCACCCCGTTGAAGGTCAGCGGCGCCTTCCACTCACACCACATCGACCCACTC
CTCGACGAACTCCGCACCACCGCAGAAACCCTCACCTACCACCCACCCCACACCCCCCTC
ATCACCACCAACCCCACCGACCACGACCCCACCACACCTGGCTATTGGATCCGGCAAACA
CGCGAGACCGTCCACTACACCCACACCACCCAACAACTCCACACCCACGGCGTCACCGCC
TACCTCGAACTCGGCCCCGACACCACACTCACCACCCTCACCCACCACAACCTCCCCCAC
CACACCCCCCTAGCCATCCCCCTCCTCCACCCCGACCAACCCGAAACCCACACCACCCAC
ACCGCCCTCGCCCACCTCCACACCCACGGCCACCCCACCACCTGGCACCACCACCACACC
CCCACCCACCACCACCCAAACCTCCCCACCTACCCCTTCCAACACCACCACTACTGGCTC
AACACCACCACCCCAACCCCCAACACGACCAGCATCCGCTTCTTGGAATCGTTGGAGAGC
GAAGACGTCGACGCAGTGGCCTCCGCCCTGGCCATCGACCGTGACTCTTCCCTCGAAACA
GTCCTGCCGGCACTCTCCACCTGGCGTAAGCGCCACGATGGCCAGGCCGTCATCGACTCC
TGGGGCTACCGCGAGACGTGGAAACCGGTCACCCTCTCCCAGAGCGGCACCAGGTCACCC
GGCACTTGGCTCATCGTCGTCCCCACTCTTCAGAGCAGGACCGCCCCCATCGACGTGATC
GTCGACGCCCTGCATCGCCTTGGAGCGCGGACCATCACGCTCACCCTCGACTCCTCCTGT
GCCGACCACGAAACCCTTGCTCGGCGGCTCGCCGAACACAGTGATCTGGTGAACATCGAC
GGCGTGGTCTCGCTGCTCGCCTTCGACGAGGAACCCCACCCCGAACACCCCCATCTACCC
ACCGGCACCGCACTCACCCTCACCCTCATCCAAACCCTCACCACCCACCCCAACACCACA
GCACCCCTGTGGTGCCTCACCCAAACCGCCACCACCACCCACCCAACCGACACCCTCAAC
CACCCCACCCAAGCCCACATCTGGGGACTCGGACGCACCACCGCACTCGAACACCCCAAC
CACTGGGGCGGACTCATCGACCTCCCCACCACCCCCACCCCACAACACCTCCACCACCTC
ACCACCGCCCTCACCACCCAACACAACGAACGCGAACTCGCCCTACGCCCCACCGGCCTG
CACGCCCGCCGTCTGGTCCGAACCACCGCCCGCGGCGCACTCGACGACCCCCAGCCCTGG
AAGCCCCACGGCACCATCCTCATCACCGGCGGCACCGGAGCCATCGGCACCCACATCGCC
ACCTGGATCGCCACCCACCACCCCCACTGCCACCTCCTCCTCACCAACCGACAAGGACCC
AACACACCCCACAACACCCAACTCCACCACCACCTCACCCAACTCGGCGCCACCACCACC
ATCACCACCTGCGACACCACCAACCCCAACCAACTCACCAACCTCCTCAACACCATCCCC
CCAACCCACCCCCTCACCACCATCATCCACACCGCAGGTGTCCTCGACGACGCCACGTTG
GCCACCCAAACTCATGATCGCCTATCGACCGTCCTCGGCCCCAAGGCACACGCCGCCCAC
CATCTGCACAACCTCACTCGGCATCTGGACCTCGATGCCTTTGTCCTGATCTCCTCCACG
GCGGCCACCTTCGGCAGCCCTGGGCAGGCCAACTACGCCGCCGCCAACGCCTACCTCGAC
GCCCTGGCCCAGCACCGTCGCGCCCAAGGACTCCCCGCCACCTCAATCGCCTGGGGACTC
TGGGAGCAGGGCGGCCTGGCCAACGCCGAAATCACTGGGCACCTGGCACGCCGAGGTCTG
CTCCCCCTGCCCACGGAACCGGCGCTCACCGCGCTTTCGCAGGCCATCGCCAGCCCGGGG
CAAGCGCACCACATCATCGCTGACATCGACTGGGGCTCCTTCGCCGTCAATCTGAGCCCC
AGCGGCGAATCCGTTCCGCTCACCCAGGACATTCCCGAAGCCCGGACGCAGCGTGCCCGG
CAGGCCATGGATCAGGATTCCGGGACGGCCCTGCACCGGCAACTGGCCGACCGAAGCCCG
AGCGAGCAACAGGAGGTGCTTCTCCAACTCGTACGGGCGCAGGTGGCCTCGGTCCTCGGA
CACAGCGACATCGACGCGGTGCCCTCCGACCGGCCCTTCAAGGAACTGGGACTCGATTCC
CTCGCCGCCGTCGAGACCCGCAACCGGCTTACCGCCCTAGCCGGACTCCGTCTCCCCACC
AGCCTCATCTTCGATCACCCGACGCCGACCAGGCTCGCCCAGTATCTGCAAACCGAATTG
GTGGGTGAGGAATCCGGTCTAGCGAGCGCAGTGGCATTCAGCCCCGCCGCCAAGACCGAC
GACTCCATCGCGATCATCGGCATGGCATGCCGGTTCCCGGGCGGCGTGAGTACGCCCGAG
GAGTTCTGGGATCTCATCCTCATGGAACACGACGCGATCGCGGACTTCCCGACAAACCGC
GGCTGGGATCTTGCGGGAATCTTCCACCCTGACCCCGCGCACAAGGGCACCTGTTACACC
CAGCAGGGCGGCTTCTTGTACGACGCGGCCGAGTTCGACCCCGTCTTCTTCGACATCAGT
CCCCGCGAAGCACTGGCCATGGACCCCCAGCAGCGGCTCTTGCTGGAGACCAGCTGGGAG
GCCCTGGAGCGAGCGCATATCGACCCACGGTCGCTGCAAGGCAGTCCTGTCGGTGTCTTC
ACCGGCATCAATGCCCAGGACTACGCGATCCATCTAGAGCGTTCACCGGAGTCCGTCGAG
GGCTACGTCCTCACCGGCAGCTCGAGCAGCATCGCATCGGGCCGGATCGCCTACACCCTG
GGCTTGGAAGGCCCCGCCGTCACCGTGGACACGGCATGTTCGTCGTCCCTGGTCGCCCTG
CATCTGGCCAGCCAGTCGCTGCGGTCGGGGGAATGCTCGATGGCCCTGGCCGGCGGGGTC
ACCGTCATGTCCACCCCCACCACTTTCGTCGAGTTCGCGCGACAGCGGGGACTTTCCACA
GACGGTCGGTGCCGTGCCTTCTCCTCCACCGCCGACGGAACCGGCTGGGGCGAGGGCGTG
GGCATCTTGTTGGTGGAGCGGTTGTCTGATGCGCGGCGGTTGGGGCATCAGGTGTTGGCG
GTGGTGCGGGGTTCCGCGGTCAACCAGGACGGTGCGTCGAATGGTTTGACGGCGCCGAAT
GGTCCGTCGCAGCAGCGGGTGATTCGGCAGGCGTTGGCGGGTGCGGGGCTGACGGTCCCG
GAGATCGACGCGGTGGAGGGACACGGCACAGGAACGACTCTTGGGGATCCGATCGAGGCG
CAGGCGTTGCTGGCGACCTACGGTCAGGAACGCCCTGATGATCGACCCGTCTGGTTGGGG
TCGGTGAAGTCGAATATCGGGCATGCGCAGGCCGCTGCGGGTGTTGCGGGTGTCATCAAG
ATGGTGATGGCGATGCGGCATGGGGTGTTGCCGCGGACGTTGCACGTCCAGGAGCCGTCG
CCGCACGTGGACTGGTCCTCAGGTGGAGTGCGGCTGCTGACGGAGGCGGTGCCGTGGCCG
GAGACGGGTCGTGCCCGGCGTGCGGGGGTGTCGTCCTTCGGGGTCAGCGGCACCAACGCG
CACATCATCCTCGAACAGGCACCACCCACAGAAGACAGGGAACCCGGCGCCGGCTCTCCT
TCTTCCTCTCCGTGGATGGTGTCGGCCAAGTCCGAACAGGCACTCCAAGCACAGGCAGCG
CAGCTGCGCGCGTATCTGGCGGCACATCCCGAGGTGGGGCTGGCTGATGTCGGGTATGCG
CTGGCCGCCGGCCGTACCGCCTTCGACCACCGTGCCGTACTCCTGGGCCCGGACCGCGAA
GCCTTCCTCGAAGCGCTGAGGGCTCTGGAGGCCGGTGAGGAACACGCCGGGCTCGTACGG
GGCGTGGCGACGGGCACGGGGAAGCTGGCGTTCGTATGTTCCGGGCAGGGCACGCAACGC
CCCCGTATGGGACACGAGCTCTACCACGCCTTCCCGCTGTTCGCCGCAGCCATGGACGAA
GCCTGCACACACCTCGACCCACACCTCGACCATCCCCTGCGGGACGTCATGTTCGCCGAG
CCGGACACCGACACCGCCCAGCTGCTCCACCAGACCCGCTACGCCCAGCCCGCCCTGTTC
GCCCTCCAGATCTCCCTGCACCGCCTGGTCACCGAACACTACGGCCTCACGCCCCACTAC
TACGCGGGCCATTCCCTGGGGGAGATCACCGCGGCCCACCTCGCCGGGATCCTCACCCTC
CCCGACGCGGCGCGTCTGGTCACCACCCGCGCCCGCCTGATGCAGTCTCTCCCCGCCACC
GGCGCGATGACCACCCTCCAAGCAGACCCCGACGAACTCCACGAACACCTCGCTGGCCTT
GAGGGGCGGGTGTCGCTCGCGGCGGTCAACGCGCCCGCGTCCGTGGTCATCAGCGGTGAC
CGCCACGACGTAGACGCCACGGCCGAAAACTTCCGCGCCATGGGGCGCAAGACCACCCCG
TTGAAGGTCAGCGGCGCCTTCCACTCACACCACATCGACCCACTCCTCGACGAACTCCGC
ACCACCGCAGAAACCCTCACCTACCACCAGCCCCACACCCCCCTCATCACCACCGACCTG
ACCGACCAGGACCCCACCACACCTGGCTATTGGGTCCGGCAAACACGCGAGACCGTCCAC
TACGCCCACACCACCCAACAACTCCACACCCACGGCGTCACCGCCTACCTCGAACTCGGC
CCCGACACCACACTCACCACCCTCACCCACCACAACCTCCCCCACCACACCCCCCTAGCC
ATCCCCCTCCTCCACCCCGACCAACCCGAAACCCACACCACCCACACCGCCCTCGCCCAC
CTCCACACCCACGGCCACCCCACCACCTGGCACCACCACCACACCCCCACCCACCACCAC
CCAAACCTCCCCACCTACCCCTTCCAACACCACCACTACTGGCTCAACACCACCACTGCA
ACCCCCAACACGACCGACGCATGGCGCTACGACGAGGTCTGGCAGCCGCTCGACCTGGCT
GACGCCGAGCCGATGCCGCCCGGGGACTGGCTGGTCGTCATTCCCGCGCGGCAGGCGGGG
CATCCTCACGTCGACGCCATCCTCAGCGGGCTACGAGAGCACGGCGGCATCCGCATGACC
GAGCTCGTACTCGACCCGGCGGATATCGATCCGCTGGTCCTTCGTCAGCACCTCGCCGAC
GCGATCGAGCAACCCGATGACCTGAGTATCAGCGGCGTGCTCTCGCTACTCGCCTTCGAT
GAGGAACCCCACCCCGAACACCCCCATCTACCCACCGGCACCGCACTCACCCTCACCCTC
ATCCAAACCCTCACCACCCACCCCAACACCACAGCACCCCTGTGGTGCCTCACCCAAACC
GCCACCACCACCCACCCAACCGACACCCTCAACCACCCCACCCAAGCCCACATCTGGGGA
CTCGGACGCACCACCGCACTCGAACACCCCAACCACTGGGGCGGACTCATCGACCTCCCC
ACCACCCCCACCCCACAACACCTCCACCACCTCACCACCGCCCTCACCACCCAACACAAC
GACGATCAACTGGCCATCCGTGACACGGGGCTACAGACCCGACGCCTGACCCGCACCGCC
ACCACGCCCAGCAACCCCCAGCCATGGAAACCCCACGGCACCATCCTCATCACCGGCGGC
ACCGGAGCCATCGGCACCCACATCGCCACCTGGATCGCCACCCACCACCCCCACTGCCAC
CTCCTCCTCACCAACCGACAAGGACCCAACACACCCCACAACACCCAACTCCACCACCAC
CTCACCCAACTCGGCGCCACCACCACCATCACCACCTGCGACACCACCAACCCCAACCAA
CTCACCAACCTCCTCAACACCATCCCCCCAACCCACCCCCTCACCACCATCATCCACACC
GCAGGAATCAACCTGAAGTCGACCATCGCCGATCTCAGTGCGGCGGATCTCGCCGAGACA
GCCGGGGCGAAGGCGACCGGGGCGGCGATTCTGCACGAGTTGCTGCGCGAGCACGACACG
ATCGAGCGCTTTGTTCTCTTCTCCTCCATTGCCGCCACCTGGGGCAGCGCGAACCAGGCC
GGATACGCCGCCGCCAATGCGTATCTCGATGCCCTGGCCCAGCACCGCCGCGCCCAAGGA
CTCCCCGCCACCTCCATCGCCTGGGGGCCCTGGGATGGCGCGGGGATGGCCGCGGACGGA
GACACACGCGCTCACCTGCGCCGCCGCGGCCTCCGGGCCATGTCCCCTGACCTCGCCCTT
GCCGCACTCGATCGCGTCCTCGGCCACGGCCCCGACAGCGCCGCCTCCACTGTCGTCGCG
GATGTCGACTGGGAGGACTTCGCCACCACCTTCACCGCCCGGCGGCCGGCACCGCTCATC
GACGACATCCCCGAGGTCCGCCAGGTACTACGCGGCGATGCCGCCCCGTCCTCCGCCGAC
TCTCTTCGCGAGCAACTGGCACAGCGTTCGCCCAAGGAACAACAGCAGGCCCTTCTCGAT
GTCGTCCGCACACACGCGGCCGCCGTGCTCGGTCATTCCAGCCCCGAATCCATCGACGCT
CAGCAAGCCTTCAGCGCGCTTGGGTTCGACTCCCTCACCGCTGTCGAGTTCCGAAACCGC
GTCGCGGCCGCCATCGGTCTGGCTCTCCCCACCACCCTTGTCTTCGACCATCCAAGCCCC
ACGGAGTGCGCCACGCATCTGCGTACGGCGCTGCTGGGCGGGACCGACGACGACCTGCGC
GAGGGCATGTCAGGCACATCGGCCGAGCGCACTCAAGCAGCAGCAATCCTCGACGAGCCC
ATCGCAATCGTCGGGATGGCCTGCCGCTATCCGGGCGGGGTGCGGTCGGCTGAGGATCTA
TGGCGGTTGGTCGCGTCGGGCACTGACGCCATCACCGAGTTCCCGACAGACCGCGGCTGG
GACGTCGAGCGGATCTATCACCCGGACCCCGACCACGAGGGCACCTGCTGCACTCAGCAT
GGCGGGTTCCTCTATGACGCCGGTGAGTTCGACCCGGCGTTCTTCGGCATCAGCCCGCGC
GAAGCCCTCGCCATGGACCCGCAGCAACGGCTGCTGCTTGAAGCCTCGTGGGAAGCGTTC
GAACACGCAGGCATCGACCCGGAGACCCTGCGCGGCAGTCAGACCGGAGTCTTCGTCGGA
ATCAACGTGCAGGACTACGCCGCTCATGTGCGACAGGTGCCGCAGGCCGTGGCCGGCTAT
GCGCTGACCGGCAGTTCGGGGAGCGTTGCCTCCGGCCGTATCGCCTATGTCTTCGGCCTT
GAGGGGCCGACGGTGTCGGTGGACACGGCCTGCTCGTCCTCCTTGGTCGCGCTGCACATG
GCCGGTCAGGCGCTGCGCACGACGGAGTGCTCGCTCGCGCTCGTCGGTGGAGTGATGGTG
ATGTCCACCCCGGCGACCTTCATCGAGTTCTCCCGCCAGCGGGGCCTGTCCCCCGACGGG
CGGTGCAAGGCGTTCTCGGCCACCGCCGACGGCACGGGCTGGGCGGAGGGAGTCGGCATG
CTCCTCGTGGAGCGGCTGTCCGACGCCCGGCGCAACGGCCACCGCGTTCTGGCCGTGGTG
CGCGGCAGCGCGATCAATCAGGACGGCGCCAGCAACGGCCTCACCGCCCCCAACGGCCCC
GCGCAACAGCGCGTCATCCGCCAGGCCCTGGCCGGCGCCGGGCTCTCACCTTCCGAGGTC
GACGCGGTGGAGGGGCATGGCACCGGAACGGTCCTCGGCGACCCCGTCGAGGCCCAGGCG
CTGCTGGCCACGTACGGGCAGGACCGGGCCGAGGACCATCCGCTATGGCTGGGGTCGGTG
AAGTCCAACATCGGGCACGCCCAAGCGGCGGCCGGTGTCGGCGGGGTGATCAAGATGGTG
ATGGCCCTCCAGCACGACACACTGCCCCGTACGTTGCATGCCGACGAGCCATCGCCGCAT
GTGGACTGGTCAGCCGGGGCGGTACGGCTGCTGACCGACGCCGTGCCGTGGGTACGGAAC
GGGCGTCCACGGCGTGCCGGTGTCTCCTCCTTCGGCGTCAGCGGCACCAACGCGCACCTC
ATTCTGGAAGAGGCGCCCTCCGACGAGCCCGACGGTCCGGGGGTGGCCGGGCCGACTGCC
GCGCCCGCCGTGGAGGCGTCTGCTGTATCCCTGCCATGGCTGCTGTCCGCGAAGTCGGCC
GACGCTCTCCGCGCCCAGGCCCGCCAGTTGCGGGAGTTCGTATCTGCGGCTCCCGAAGCC
GGTTCCGGTCCGGGGCTCGCGGACATCGGATACTCACTGGCCACGCACCGGTCGGCTTTC
GAGCATCGTGCGGTGGTTATCGGCTCCGACCGAGCCGACTTTCTGGGTGGTCTGGATGCT
CTGGCGGCAGATGAGGCCCACTCTGCTGTGGTCACGGGTATCGCGAGGAAGGCCGGTGAC
CTGGGGAAGGTGGTGTTCGTCTTCCCCGGGCAGGGTGGTCAGTGGGCCGGGATGGGACTG
CGGCTGCTCAAGACCTCGCCCGTCTTCGCGCAATCCATCCAGGCCTGCGAACAAGCCCTC
GCCCCCCACACCGACTGGACCCTGACCGACATCCTGCACCGCCCCCACACCGACCCCCTG
TGGCAGCGCGCCGACGTCATCCAGCCCGCCCTCTTCGCCCTCATGACCTCCCTCACCACC
CTCTGGCAATCCCACGGCCTCAACCCCGACGCCGTCATCGGCCACTCCCAAGGCGAAATC
ACCGCCGCCCACATCAGCGGAGCACTGAGCCTGGAAGACGCCGCGAAAATCGTCGCCCTC
CGCAGCCAGACCCTGCAAACCCTCCAAGGCTCAGGCGGCATGGCCTCCGTACCACTGCCC
GCAGACCAGGTCACCGCACTGCTGCACACCATGTGGCCCGACCAGCTATGGGTCGCCGCC
ATCAACGCCCCCACCACCACAGTCATCTCCGGCGACACACAAGCCCTCACACAAGCGCTG
AACCACTACCGGGACCAAGACATCGACGCGAAACGCATCCCGGTCGACTACGCCTCCCAC
TGCCCCCACATCCAGGCCGTCCAACACGAACTCTCAGACCTGTTGCAGGACATCACCCCA
CGGGCCGCGACCACCCCCTTCTACTCCACCACCGACAACCAATGGACCGACACCACCACC
CTCAACGCCCACTACTGGTACCGAAACCTCCGCCAACCCGTCCACCTCACCAACGCCATC
ACCAACCTCACCCACCAAGGCCACCACACCTACATCGAAATCAGCCCCCACCCCACCCTC
ACCCCCGCCATCCAGGAAACCACCCACACCACCCACACCCCCACCACCGTCATCAGCACA
CTCCGCCGCAACCACAACGACACCCACCAACTCCTCCACGCCCTCGCCCACGCCCACACC
ACCGGCCACCCCATCAACTGGCACCCCACCCACCAACACCACACCCCAACCCCCCAACAC
ACCGACCTCCCCACCTACCCCTTCCAACACCAACGCTACTGGCTCAACACCCCCACCCAA
ACAGGAGACGCAGCAGCCATCGGCCTGGACCCGGCACATCACCCGCTGCTCGGCGCCGCG
ATTCCGCTCGCGGACAGCGACGGACATCTCTTCACCGGCCGGCTGTCCCTGCGTACGCAC
CCTTGGCTGGCCGACCATGCCTTCGCCGGTGTCGCGCTTGTGCCCGGTACGGCGTTCCTG
GACATCGCCTTGCAGGCCGGGGAGCGCGTCGGGTGCCGGCACCTCGAAGAACTCTCCTTG
CATGCGCCCCTACTTCTCCCGCAGCGTGGCGGGGTCGTTCTGCAAATCAGCGTCGGGGCC
CCGGACTGCGAGGGGCGGCGGGAGTTCGCCGCGTACGCGCGGAGCCATGACGACGTCTCG
GGTACAGAGAACGCGCTGGGCACCGAGGGTTTGGAGGCCGCGACCCAGTCGTGGACGCGG
CATGCCACAGGCACGCTGACCGCCGCCACACCCCCCGCCGCGGCAGCCGGACCGAAGGCC
GGAGCCTGGCCTCCGGGCAAGGCCGACGCTGTGGACCTTGACGGGCTGTATGAGCGGCTG
ACGGGGGCTGAGTTGGCCTACGGGCCCGCGTTCCATGGGCTGCGTGCCGCCTGGCGCGAG
GGCTCGGACATCTTCGCTGAGGTGCGGCTGCCCGAACCGCAGGCCCGCGACGCCGGCCGG
TTCGGCATCCACCCCGCCCTCCTGGACGCGGCGCTGCACACCCTCGGCCTCGATCCGGCG
CTCAGTGAGGAGCCCGCCGATGCATCGGAGGGGCAGCGGGCGGCTCGGTTGCCGTTCGTC
TGGCGTGGGGTGACGCTGCACCGCAGGGGCGGTGAGGTGCTACGCGTGCGGCTCTCGCCG
GGGCCGGGCAATGGAGTGGTGGCCATCGAGGCGACCGACGAGTCGGGGCGGCCAGTGGCC
TCCGTCGAGGCACTCGTGCTGCGGCCGGTGTCCGCGGGTGAGGTGCGGGCCGCCGCCCAC
TCGGAGCATCACGAGTCGCTGTTCGGCCTGGAATGGCCCACCGCGACATTGCCCACCGCA
GACCACTCTCCTTCCGATCCCTCGTCCTTCGCCGTCGTCGGCGCCGATCCGTCCGGGCTT
CCCTACCGGCGCCATGACGACTGGGCCGCGCTGCTCGACGCCGTGGAGGCCGATGGCGCT
CCGGAGTTGATCGTCGTCCCGTGTGGCGGGGACGAGGACGGTGATGTTGCGGGCGGCGCG
GCCGTCCGCACGGCCGTGCGTAACGTGCTGCACCTGCTGCAGTCCTGGCTCGGCGACGAC
CTGTTCGCGGACAGTCGGCTGGTCGTACTGACGCGTGGGGCGGTCGCCACCCGCCGGGAG
GACGACGTCACGGATCTGCCCGGCGCGGCGGTGTGGGGCCTGGTCCGCTCGGCGCAGTCC
GAGAACCCGGGCCGGATCACCCTGGTGGACTGGGATGGACACGGCTCCTTGGCCCAGGTG
CTTCCCGCGGCACTGGCCGGTGGTGAGCCTCAGCTGGCCGTCCGCGACGGAGAGGTGTGC
GTACCGCGGTTGGTGCGGATGCCACGGCAGGATCGGCCCGCACCGACCGACGGGGCGGCC
GATGGGTCGGCCGATGGGCCCACCGACGCGTCAGCCGGCGCGTCGCCATGGGCGCTCGAT
CCGGCGGGCACCGTGCTGATCACGGGTGGCACCGGTGTGCTGGGTGGGCTGATCGCCCGC
CACCTCGTCGCGGCGCACGGTGTACGGCAGTTGCTGCTGGTCGGCCGTCGCGGTGCGCAG
GCCGAGGGGGTACGGGCGCTCGCGGCCGAGCTCGAGGCTGTCGGCGCGACGGTGACCGTC
GCCGCCTGCGATGCCGCCGACCGCGAGGCGCTGGCCACTTTGCTGGGCCGCGTCCCCGAA
CAGCATCCGCTGACCGCCGTGGTCCATGCCTCCGGGGTGCTCGACGACGGCACGATCCCC
TCGCTGACACCCGAGCGGATCGACACCGTCTTCCGCGCCAAGGTGGTCCCGGCGCTGCTT
CTGCATGAGCTCACACGCGACGCGGATCTGGCCGCGTTTGTGATGTTCTCCTCCGCGGCA
TCCGTGCTCGGTTCGCCGGGCCAGGGCAACTACGCGGCGGCCAACGCTGTACTGGACGCG
CTGGCCCGGCACCGCCGGGCCCAGGGGCGGCCCGCGACGTCGTTGGCCTGGGGGCTTTGG
GCGCAGGGCAGCGGGATGACACGCCATCTGGATGGCACCGACCACGCCCGGATCAGCCGT
GGCGGGATGGCGCCACTGGCCACCGAGGAGGCCCTTGCCCTCTTCGACGCGTCGTCGGCG
GCCGGGGAGCCGTTCCTCGTCCCGGCCCGGTTCGAGCTCGGCTCGCTGCGCTCGCGGGCG
ACCGGCGCCGGGGTGCCGGCGCTGCTGCGTGGGCTCGTCCCGGCCTCCGCCCGCCGCGGC
GGCGCGGCCGACCGGGGCGAGGACGGCGAGGACAGGACGGACGTGGGCGTGTCGCTCCGC
GAGCGGCTGGCCCGCTGCGGCGGCAAGGAGCAGCAGGGCATCCTCACTCGTCTCGTACGG
TCCCACGCCGCCGCGGTGATCGGCCATGCGGGGATCGAGGAGGTGGCGGAGCGGCGTGCC
TTCCGGGAACTGGGCTTTGACTCGCTGACCGCCGTCGAGCTGCGCAATAGGCTCACCACC
GCCACCGGTCTGCGGTTGCCCGCGACGGTCGCGTTCGACTTTCCGACCCCGACCGCGCTC
GCGGAGCATGTGCGTGCCCTGCTGCTGCGGGCGAACGGGAACGGCACCGGGGCGGACGGG
ACCTCGGCCTCGGAGGCAGGCGAGGAGGAGCTGCGGGCCGCCGTGGCGTCCATTCCGCTC
GGCAGGCTCCGGGAGGCCGGTCTGCTGTCTGCGCTGCTGGAACTCGCCGAGGCCCCTGGC
GGCCTCGGAGGCGGCCTCGGGAGCGTCGGGGTCGCTGCCGTCCCGGCCGCCCGATCCGCG
GAAGGCCCCGGCTCGATCGACGAGATGGACATCGACAGTCTCATCGGCCTGGCGCACGGC
GACCAGCCCGGCTCGGACTCGTAG
Domain coordinates in the protein sequence (aa):
[3] KS 33..409
[3] AT 564..879
[3] malonyl-CoA not conserved HAFHS(H->G) 758..762
[3] ACP 949..1019
[4] KS 1042..1417
[4] AT 1576..1891
[4] malonyl-CoA not conserved HAFHS(H->G) 1770..1774
[4] KR 2204..2384
[4] ACP 2487..2557
[5] KS 2581..2956
[5] AT 3111..3426
[5] malonyl-CoA not conserved HAFHS(H->G) 3305..3309
[5] KR 3693..3874
[5] ACP 3978..4048
[6] KS 4078..4454
[6] AT 4625..4943
[6] methylmalonyl-CoA not conserved YASHS(S->C) 4817..4821
[6] DH 4994..5174
[6] KR 5523..5703
[6] ACP 5815..5885
Domain coordinates in the nucleotide sequence (nt):
[3] KS 97..1227
[3] AT 1690..2637
[3] malonyl-CoA not conserved HAFHS(H->G) 2272..2286
[3] ACP 2845..3057
[4] KS 3124..4251
[4] AT 4726..5673
[4] malonyl-CoA not conserved HAFHS(H->G) 5308..5322
[4] KR 6610..7152
[4] ACP 7459..7671
[5] KS 7741..8868
[5] AT 9331..10278
[5] malonyl-CoA not conserved HAFHS(H->G) 9913..9927
[5] KR 11077..11622
[5] ACP 11932..12144
[6] KS 12232..13362
[6] AT 13873..14829
[6] methylmalonyl-CoA not conserved YASHS(S->C) 14449..14463
[6] DH 14980..15522
[6] KR 16567..17109
[6] ACP 17443..17655
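The two coordinate lists above describe the same features, once in residues and once in nucleotides. The following is a minimal sketch of the conversion, assuming 1-based inclusive coordinates and a reading frame that starts at nucleotide 1 of the CDS; the regular expression and function names are illustrative, and the longer motif lines are deliberately not parsed.

```python
import re

# A few entries copied from the amino-acid feature list of this record.
aa_features = """\
[3] KS33..409
[3] AT564..879
[3] ACP949..1019
[6] DH4994..5174
"""

# Assumed line shape: "[module] LABEL first..last".
# Whitespace between the label and the range is optional.
FEATURE = re.compile(r"\[(\d+)\]\s*([A-Z]+)\s*(\d+)\.\.(\d+)")

def aa_to_nt(first_aa, last_aa):
    """Map an inclusive residue range to the inclusive nucleotide range encoding it."""
    return 3 * (first_aa - 1) + 1, 3 * last_aa

def parse(text):
    """Yield (module, label, first_aa, last_aa) for each line that matches FEATURE."""
    for line in text.splitlines():
        m = FEATURE.match(line)
        if m:
            module, label, first, last = m.groups()
            yield int(module), label, int(first), int(last)

for module, label, first, last in parse(aa_features):
    nt_first, nt_last = aa_to_nt(first, last)
    print(f"[{module}] {label} aa {first}..{last} -> nt {nt_first}..{nt_last}")
```

For the entries shown, the computed nucleotide ranges can be compared directly with the corresponding lines of the nucleotide list.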

Feature

BLASTP
Database:UniProtKB:2011_09
InterPro
Database:interpro:38.0
IPR001227 Acyl transferase domain (Domain)
 [555-690]  G3DSA:3.40.366.10 [758-870]  G3DSA:3.40.366.10 [1567-1702]  G3DSA:3.40.366.10 [1770-1882]  G3DSA:3.40.366.10 [3102-3237]  G3DSA:3.40.366.10 [3305-3417]  G3DSA:3.40.366.10 [4621-4748]  G3DSA:3.40.366.10 [4817-4934]  G3DSA:3.40.366.10
G3DSA:3.40.366.10   Ac_transferase_reg
IPR006162 Phosphopantetheine attachment site (PTM)
 [2515-2530]  PS00012 [5843-5858]  PS00012
PS00012   PHOSPHOPANTETHEINE
IPR009081 Acyl carrier protein-like (Domain)
 [942-1058]  2.19999900980708e-26 SSF47336 [2480-2597]  2.30000440726534e-28 SSF47336 [3971-4095]  4.80000830876748e-26 SSF47336 [5807-5902]  1.50000306971104e-21 SSF47336
SSF47336   ACP_like
 [946-1023]  1.39999999999997e-84 G3DSA:1.10.1200.10 [2484-2560]  1.39999999999997e-84 G3DSA:1.10.1200.10 [3975-4051]  1.39999999999997e-84 G3DSA:1.10.1200.10 [5815-5888]  1.39999999999997e-84 G3DSA:1.10.1200.10
G3DSA:1.10.1200.10   ACP_like
 [949-1019]  PS50075 [2487-2557]  PS50075 [3978-4048]  PS50075 [5815-5885]  PS50075
PS50075   ACP_DOMAIN
 [951-1018]  1.1e-10 PF00550 [2490-2556]  5.20000000000001e-10 PF00550 [3981-4047]  4.3e-10 PF00550 [5819-5884]  2e-10 PF00550
PF00550   PP-binding
IPR013968 Polyketide synthase, KR (Domain)
 [2204-2383]  1e-61 PF08659 [3693-3873]  7.00000000000005e-56 PF08659 [5523-5702]  1.40000000000001e-60 PF08659
PF08659   KR
IPR014030 Beta-ketoacyl synthase, N-terminal (Domain)
 [33-284]  4.40000000000003e-95 PF00109 [1042-1292]  1.39999999999997e-98 PF00109 [2581-2831]  4.59999999999996e-97 PF00109 [4078-4329]  5.30000000000001e-99 PF00109
PF00109   ketoacyl-synt
IPR014031 Beta-ketoacyl synthase, C-terminal (Domain)
 [292-409]  3.5e-47 PF02801 [1300-1417]  1.9e-45 PF02801 [2839-2956]  1.2e-47 PF02801 [4337-4454]  3.29999999999997e-46 PF02801
PF02801   Ketoacyl-synt_C
IPR014043 Acyl transferase (Domain)
 [564-879]  7.00000000000005e-56 PF00698 [1576-1891]  2.59999999999998e-55 PF00698 [3111-3426]  5.69999999999998e-57 PF00698 [4625-4943]  1.20000000000001e-102 PF00698
PF00698   Acyl_transf_1
IPR015083 Polyketide synthase, docking (Domain)
 [1-33]  1.19999985703101e-06 SSF101173
SSF101173   Polyketide_synth_docking
 [2-26]  1e-07 PF08990
PF08990   Docking
IPR016035 Acyl transferase/acyl hydrolase/lysophospholipase (Domain)
 [561-875]  1.99999636034521e-60 SSF52151 [1573-1887]  4.39999001237725e-61 SSF52151 [3108-3422]  1.99999636034521e-60 SSF52151 [4623-4928]  3.89999861795218e-64 SSF52151
SSF52151   Acyl_Trfase/lysoPlipase
IPR016036 Malonyl-CoA ACP transacylase, ACP-binding (Domain)
 [692-757]  1.80000149287196e-16 SSF55048 [1704-1769]  7.49999605851445e-17 SSF55048 [3239-3304]  8.30001403791589e-17 SSF55048 [4750-4816]  9.69999663390112e-16 SSF55048
SSF55048   Malonyl_transacylase_ACP-bd
IPR016038 Thiolase-like, subgroup (Domain)
 [35-296]  G3DSA:3.40.47.10 [297-463]  G3DSA:3.40.47.10 [1044-1304]  G3DSA:3.40.47.10 [1305-1471]  G3DSA:3.40.47.10 [2583-2843]  G3DSA:3.40.47.10 [2844-3010]  G3DSA:3.40.47.10 [4078-4340]  G3DSA:3.40.47.10 [4341-4507]  G3DSA:3.40.47.10
G3DSA:3.40.47.10   Thiolase-like_subgr
IPR016039 Thiolase-like (Domain)
 [25-408]  3.99998544139406e-99 SSF53901 [1034-1416]  3.60002015031011e-103 SSF53901 [2573-2955]  1.90000694315261e-103 SSF53901 [4071-4453]  3.49999466863949e-101 SSF53901
SSF53901   Thiolase-like
IPR016040 NAD(P)-binding domain (Domain)
 [2205-2390]  4.30000000000003e-103 G3DSA:3.40.50.720 [3683-3882]  2.70000000000002e-99 G3DSA:3.40.50.720 [5523-5704]  1.5e-94 G3DSA:3.40.50.720
G3DSA:3.40.50.720   NAD(P)-bd
IPR018201 Beta-ketoacyl synthase, active site (Active_site)
 [197-213]  PS00606 [1205-1221]  PS00606 [2744-2760]  PS00606 [4242-4258]  PS00606
PS00606   B_KETOACYL_SYNTHASE
IPR020801 Polyketide synthase, acyl transferase domain (Domain)
 [565-937]  8.39998014024973e-103 SM00827 [1577-1873]  1.39999277195148e-104 SM00827 [3112-3408]  3.99998544139406e-107 SM00827 [4627-4925]  2.99998750445706e-110 SM00827
SM00827   PKS_AT
IPR020806 Polyketide synthase, phosphopantetheine-binding domain (Domain)
 [950-1022]  5.10001002358244e-29 SM00823 [2488-2560]  1.89999859865865e-32 SM00823 [3979-4051]  3.69999438558106e-32 SM00823 [5816-5888]  1.99999636034521e-31 SM00823
SM00823   PKS_PP
IPR020807 Polyketide synthase, dehydratase domain (Domain)
 [4994-5174]  1.39999277195148e-77 SM00826
SM00826   PKS_DH
IPR020841 Polyketide synthase, beta-ketoacyl synthase domain (Domain)
 [35-462]  SM00825 [1044-1470]  SM00825 [2583-3009]  SM00825 [4081-4507]  SM00825
SM00825   PKS_KS
IPR020842 Polyketide synthase/Fatty acid synthase, KR (Domain)
 [2204-2384]  7.6999848053369e-56 SM00822 [3693-3874]  5.6999970878395e-46 SM00822 [5523-5703]  2.1000026783403e-59 SM00822
SM00822   PKS_KR
SignalP
 [1-15]  0.071 Signal
Eukaryota   
TMHMM No significant hit