Soraph_00070 : CDS information

close this sectionLocation

Organism
StrainSo ce26
Entry nameSoraphen
Contig
Start / Stop / Direction14,325 / 33,272 / + [in whole cluster]
14,325 / 33,272 / + [in contig]
Location14325..33272 [in whole cluster]
14325..33272 [in contig]
TypeCDS
Length18,948 bp (6,315 aa)
Click on the icon to see Genetic map.

close this sectionAnnotation

Category1.1 PKS
Productpolyketide synthase
Product (GenBank)soraphen polyketide synthase A
Gene
Gene (GenBank)sorA
EC number
Keyword
Note
Note (GenBank)
Reference
ACC
PmId
[12039053] Characterization of the biosynthetic gene cluster for the antifungal polyketide soraphen A from Sorangium cellulosum So ce26. (Gene. , 2002)
[15289572] Heterologous production of the antifungal polyketide antibiotic soraphen A of Sorangium cellulosum So ce26 in Streptomyces lividans. (Microbiology. , 2004)
comment
[PMID: 12039053](2002)
Soraphen A biosynthetic gene clusterの報告。
SorA(6315aa): PKS

Mod1: ACP-KS-ATcb-ATa-KR-ACP
Mod2: KS-ATa-DH-ER-KR-ACP
Mod3: KS-ATm-DH-ER-KR-ACP

ATa= incorporate acetate
ATm= incorporate methoxymalonate
ATcb= incorporate carboxybenzoate starting unit

module1において、AT1a(ATcb)がcarboxybenzoyl-CoAを認識してひとつめのACP1aにロード。もうひとつのAT1bはmalonyl-CoAを認識して2つめのACP1bにロードすると考えられている。

Soraphenを合成しないMutantはSorAとSorBに変異箇所があることが確認された。

---
[PMID: 15289572](2004)
S. lividansでのsor genes異種性発現。

SorABRCDFE株 + cinnamate
SorABRCDFE株 + badA(benzoate-CoA ligase) + benzoate or cinnamate
の組み合わせでSoraphen A産生。

SorABRCDFE株で発現されているのは、the sorA, and the sorB4M, sorRCDFE operons.

close this sectionPKS/NRPS Module

0
1 malonyl-CoA
2 malonyl-CoA
3 methoxymalonyl-ACP
ACP14..87
KS110..486
AT632..949
AT1066..1387
KR1709..1888
ACP1992..2062
KS2092..2466
AT2628..2949
DH2999..3162
ER3487..3793
KR3803..3983
ACP4085..4155
KS4177..4547
AT4707..5025
DH5073..5236
ER5561..5867
KR5877..6057
ACP6159..6229

close this sectionSequence

selected fasta
>polyketide synthase [soraphen polyketide synthase A]
MTKEYTRPQSAPLTEGDLLTLIVAHLAERLRMDARFIDVHEPFSRHGLDSRGAVDLVVDL
RTALGRPLSPVVVWQHPTPDALARHLAGGADAREGQARADSAYERPGAPNEPIAIVGMAC
RFPGAPDVDSYWRLLSGGVDAVTEVPAGRWDMDAFYDRDPRSLGDVSTLRGGFIDDVDRF
DAMFFGISPREAVSMDPQQRLMLELAWEALEDAGIVAERLKESLTGVFFGCIWDDYVTLI
HQRGRGAIAQHTVTGNHRSIIANRVSYTLDLRGPSMTVDSACSSALVTIHMACESLRSGE
STLALAGGVNLNIAPESTIGVHKFGGLSPDGRCFTFDARANGYVRGEGGGVVVLKRLSSA
IADGDPIICVIRGSAVNNDGASNGLTGPNPLAQEAVLRTAYERAGVNPADVQYVELHGTG
TQLGDPVEASALGAVLGKRRPAERPLLVGSAKTNVGHLEGAAGIVGLLKAALCLKHKQLA
PNLNFETPNPHIPFAELNLKVQGALGPWPDMDRPLVCGVSSFGLGGTNAHVVLSEWASLE
AELHPLAAESPEALREEVQRRLLTMTSLVGRAPLSFLCGRSAAQRSAKEHRLAVTARSFE
ELKQRLLGFLEHEKHVSVSAGRVDLGAAPKVVFVFAGQGAQWFGMGRALLQREPVFRTTI
EQCSSFIQQNLGWSLLDELMTDRESSRLDEIDVSLPAIISIEIALAAQWRAWGVEPAFVV
GHSTGEIAAAHVAGVLSIEDAMRTICAYGRIIRKLRGKGGMGLVALSWEDAGKELTGYEG
RLFRAIEHSADSTVLAGEPDALDALLQALERKNVFCRRVAMDVAPHCPQVDCLRDELFDA
LREVRPNKAQIPIVSEVTGTALDGERFDASHWVRNFGDPALFSTAIDHLLQEGFDIFLEL
TPHPLALPAIESNLRRSGRRGVVLPSLRRNEDERGVMLDTLGVLYVRGAPVRWDNVYPAA
FESMPLPSTAGGGKPLPPMPLLISARTDAALAAQAARLRAHLDSHLDLELVDVAYSLAAT
RTHFERRAVVVARDRAGILDGLDALAHGGSAALLGRSAAHGKLAILFTGQGSQRPTMGRA
LYDAFPVFRGALDAAAAHLDRDLDRPLRDVLFAPDGSEQAARLDQTAFTQPALFALEVAL
FELLQSFGLKPALLLGHSIGELVAAHVAGVLSLQDACTLVAARAKLMQALPQGGAMVTLQ
ASEQEARDLLQAAEGRVSLAAVNGHLSTVVAGDEDAVLKIARQVEALGRKATRLRVSHAF
HSPHMDGMLDDFRRVAQGLTFHPARIPIISNVTGARATDQELASPETWVRHVRDTVRFLD
GVRTLHAEGARAFLELGPHPVLSALAQDALGHDEGPSPCAFLPTLRKGRDDAEAFTAALG
ALHAAGLTPDWNAFFAPFAPCKVPLPTYTFQRERFWLDASTAHAASATPAAALEGRFWQA
VESGDIDTLSSELHVDGDEQRAALALVLPTLSSFRHKRQEQSTVDAWRYRVTWKPLTTAA
TPADLAGTWLLVVPSALGDDALLATLTEALTRRGARVLALRVSDIHIGRSALVEHLREAL
AETAPLRGVLSLLALDEHRLADRSALPAGLALSLALVQGLDDLAIEAPLWLFTRGAVSIG
HSDPITHPTQAMIWGLGRVVGLEHPERWGGLVDVSAGVDESAVGRLLPALAQRHDEDQLA
LRPAGLYARRIVRAPLGDAPPAREFRPRGTILITGGTGALGAHVARWLARQGAEHLILIS
RRGAEAPGASELHAELNALGVRTTLAACDVADRSALQALLDSIPSDCPLTAVFHTAGARD
DGLIGDMTPERIERVLAPKLDSALHLHELTKNSALDAFVLYASLSGVLGNPGQANYAAAN
AFLDALAEHRRSLGLTATSVAWGGWGGGGMATERVAAQLQQRGLLQMAPSLALAALAQAL
QQDETTITVADIDWSRFAPAFSVARQRPLLRDLPEAQRALQASEGASSEHGPATGLLDEL
RSRSESEQLDLLATLVRGETATVLGHAEASHVDPDKGFMDLGLDSLMTVELRRRLQKATG
VKLPPTLAFDHPSPHRVAFFLRDSLARAFGTRLSAERDGAALPAPGATSDSDEPIAIVGM
ALRLPGGIGDVDALWDFLHQGRDAVEPIPPTRWDAGALYDPDPDAKAKSYVRHAAMLDQV
DLFDPGFFGISPREAKHIDPQHRLLLEAAWQALEEAGIVPSTLKDSPTGVFVGIGASEYA
PREPGAEDSEAYIVQGTYASFAAGRLAFTLGLQGPALSVDTACSSSLVALHLACQALRRD
ECNLALAAGASVMVSPETFVLLSRLRALAPDGRSKTFSASADGYGRGEGVIVLALERLRD
ALAQGRRVLAVVRGTAVNHDGASSGITAPNGTSQKKVLRAALHDARIAPADVDVVECHGT
GTSLGDPIEVQALAAVYGEGRSAEKPLFLGAVKTNVGHLEAAAGLAGVAKIVASLLHNAL
PPTLHTTPRNPLIAWDALAVAVVDATRPWVRHADGRPRRAGVSAFGLSGTNAHVILEEAP
AIARVEPAASQPASEPLPAAWPVLLSAKSEAAVRAQAKRLRDHLLAKSELALADVAYSLA
TTRAHFEQRAALLVKGRDELLSALDSLAQGHSAAVLGRSGAPGKLAVLFTGQGSQRPTMG
RALYDAFPVFRDALDTVAAHLDRDLDRPLRDVLFAPDGSEQAARLDQTAFTQPALFALEV
ALFQLLQSFGLKPALLLGHSIGELVAAHVAGVLSLQDACTLVAARAKLMQALPQGGAMVT
LRASEEEVRDLLQPYDGRASLAALNGPLSTVVAGDEDAVVEIARQAEALGRKTTRLRVSH
AFHSPHMDGMLDDFRRVAQSLTYHPARIPIISNVTGARATDHELASPDYWVRHVRHTVRF
LDGVRALHAEGARVFLELGPHAVLSALAQDALGQDEGTSPCAFLPTLRKGRDDAEAFTAA
LGALHAAGLTPDWSAFFAPFAPRKVSLPTYAFQRERFWLDASKAHAADVASAGLTSTDHP
LLGAGVPLADRDGFLFTGRLSLSEHPWLADHVVFGTPILPGTAFLELALFVAGRVGLDTV
EELTLETPLALPSEGALLVQVSVGPLDDAGRRPLSLHSRPQGAPQDAPWTRHASGSLAPA
TPSPSFDLHDWPPSGATQVDTQGLYATLESAGLAYGPQFQGLRSVWRRGDELFAEAQLPD
AAKKDAARFALHPALLDSALHALALDDERAPGVALPFSWGGVSLRAVGATTLRVRFHRPK
GETAGSLVLADAAGGPIASVQALATRITSAEQLRTPGASHHDALFRVDWSELPSPTSPSG
APSAVLLGIGGLDLAPEVPLARVADLAALQSALDQGASPPGLVVVPFMARTADDLIQSAH
SITARALALLQAWLADERLASSRLVLLTRRAIAARADEDVKDLAHAPLWGLARSAQSEHP
ELPLFLVDLDLSEASQHTLLAALETGERHSRLRNGKPFIPRLANARSKDELIAPDASNWR
LHIPTKGNFDALTLVDAPLARAPLAHGQVRVAVHAAAFNFRDVLDTLGLYPGDAGPLGGE
GAGIVTEVGPGVSRYTVGDRVMGIFGAACGPTAIADARMICPIPHAWSFAQAASVPIIYL
TAYYGLVDLGHLKPNQRVLIHAAAGGVGTAAVQLARHLGAEVFATASAGKWSALRALGFD
DAHLASSRDLDFEQHFLRSTHGRGVDVVLDCLAREFVDASLRLMPSGGRFVEMGKTDIRE
PDAVGVAYPGVVYRAFDLIEAGPDRIEQMLAELLSLFERGALRPPPITSWDIRHAPQAFR
ALAQARHVGKFVLTIPRPIDPEGTVLITGGTGTLGALVARHLVARHGAKHLLLTSRQGAH
APGAEASRTELEALGASVTLRACDAADPRALQALLDSIPSAHPLTAVVHAAGALDDGLLG
AMSPERIDRVFAPKLDAAWHLHELTQDKPLAAFVLFSSAAGVLGSPGQSNYAAANAFLDA
LAHHRRAHGLPASSLAWGYWAERSRMTEHLSAADVSRMRRAGVRPLATDEALSLFDAALL
RPEPALVPARFDVNALGANADEVPPLFQRLVRARVARKAASNTALASSLSQRLSSLPPAE
SERFLLDLVRTEAATVLGLASFESLDPHRPLQELGLDSLIALELRGRLAAATGLRLQPTL
LFDYPTPAALSRFFTTQFFGETTDRPAAPLTPAGSEDPIAIVSMSCRFPGDVRTPEDLWK
LLLDGKDAISSFPQNRGWSLDALDAPGRFPVREGGFVYDADAFDPAFFGISPREALAIDP
QQRLLLEISWEALERAGIDPASLQGSQSGVFVGIIHNDYGAWLMNGTDEHKGFAATGSTA
SVASGRIAYTFGFQGPAISVDTACSSSLVAVHLACQALRHGECSLALAGGVTVLATPAVF
VAFDSESAGAPDGRCKAFSAEANGAGWAEGAGMLLLERLSDAVRNGHPVLAVLRGSAVNQ
DGRSQGLTAPNGPAQERVIRQALDSARLTPKDIDAVEAHGTGTTLGDPIEAQAILATYGE
SHSQDSPLWLGSLKSNMGHTQAAAGVGSVIKMVLALQHGLLPKTLHAKNPSPHIDWSPGT
VKLLDEPVVWKTNGHPRRAGVSSFGFSGTNAHVILEEAPAIARAESAAAQPASEPLPAAW
PVLLSAKSEAALRAQAARLRDHLQAHPDLELADVAYSLATTRAHFERRAVVVAKDRDEAT
FALDAFEQGSPAHHVAHGEARVAGKLVFVFPGQGSQWPGMAQQLLTTSDAFRAQVEACAR
AFAPHLGWSLLAVLRGDEGAPSLERIEVVQPALFTVMVSLAALWRSRGIEPDAVVGHSQG
ELAAAYVAGALSLDDAAKVVARRSRLLSTLSGQGAMAAVERPPAALEPYLARFGRRLSIA
AINSPSATTVSGEPDAIDHLLRLLKAEQIFALKLRVDVASHGAQIEGMREQLLEELREIE
PRESRIPFYSTVRGEKLAGTELGAAYWYDNLLRPVRFADATQLLLDDAHRFFVEVSPHPV
LMLPLEETLEASGLPTAVLGSLWQDEGDLSRFLASLGELYARGYAVDWRAFFEPLRPRRV
ALPTYAFQRERFWLDAPTAHADVASAGLTSADHPLLGAAVRLADTDAFLFTGRLSLQSHP
WLAEHAAFGIPILPGTAFLELALLAADRVGLDTVEEVTLEAPLALPSQGTILIQISVGPM
DEAGRRSLSLHGRTEDAPQDAPWTRHASGSLAKAAPSLSFDLHEWAPPGGTPVDTQGSYA
GLESGGLAYGPQFQGLRSVWKRGDELFAEAKLPDAGAKDAARFALHPALFDSALHALVLE
DERTPGVALPFSWRGVSLRSVGATTLRVRFHRPNGKSSVSLLLGDAAGEPLASVQALATR
ITSQEQLRTQGASLHDALFRVVWRDLPSPTSLSEAPKGVLLETGGLDLALQASLARYDGL
AALRSALDQGASPPGLVVVPFIDSPSGDLIESAHNSTARALALLQAWLDDERLASSRLVL
LTRQAIATHPDEDVLDLPHAPLWGLVRTAQSEHPELPLFLVDLDLGQASERALLGALDTG
ERQLALRHGKCLVPRLVNARSTEALIAPNVSTWSLHIPTKGTFDSLALVDAPLARAPLAQ
GQVRVAVHAAGLNFRDVLNTLGMLPDNAGPLGGEGAGIVTEVGPGVSRYTVGDRVMGIFR
GGFGPTVVADARMICPIPDAWSFVQAASVPVVFLTAYYGLVDVGHLKPNQRVLIHAAAGG
VGTAAVQLARHLGAEVFATASPGKWDALRALGFDDAHLASSRDLEFEQHFLRSTRGRGMD
VVLNALAREFVDASLRLLPSGGSFVEMGKTDIREPDAVGLAYPGVVYRAFDLLEAGPDRI
QEMLAELLDLFERGVLRPPPITSWDIRHAPQAFRALAQARHIGKFVLTVPRPIDPEGTIL
VTGGTGTLGALIARHLVANRGAKHLLLTSRKGASAPGAEALRSELEALGAAVTLARCDAA
DPRALQALLDSIPSAHPLTAVVHAAGALDDGLISAMSPERIDRVFAPKLDAAWHLHQLTQ
DKPLAAFVLFSSASGVLGGMGQSNYAAANAFLDALAHHRRVHGLPASSLAWGHWAERSGM
TRHLSGVDTARMRRAGLRSIASDEGLALFDMALGRPEPALVPARFDMNALGAKADGLPSM
FQGLVRARVARKVASNNALAASLTQRLASLPPTDRERMLLDLVRAEAAIVLGLASFESLD
PRRPLQELGLDSLMAIELRNRLAAATGLRLQATLLFDHPTPAALATLLLGKLLQHEAADP
RPLAAELDRLEATLSAIAVDAQARPKIILRLQSWLSKWSDAQAADAGPILGKDFKSATKE
ELFAAFDEAFGGLGK
selected fasta
>polyketide synthase [soraphen polyketide synthase A]
ATGACAAAGGAGTACACGCGTCCGCAGTCGGCGCCGTTGACTGAGGGCGACCTCCTCACC
TTGATCGTCGCCCATCTCGCGGAGCGGCTGCGCATGGACGCGCGGTTCATCGATGTCCAC
GAGCCCTTCAGCCGCCATGGGCTCGACTCACGCGGCGCGGTGGACCTGGTCGTGGATCTG
AGGACGGCGCTTGGGCGCCCGCTGTCGCCCGTCGTGGTTTGGCAACACCCGACTCCCGAT
GCCCTGGCCCGCCACCTGGCCGGTGGGGCGGACGCGCGCGAGGGCCAAGCGCGCGCGGAC
TCTGCCTACGAGCGTCCTGGAGCACCAAATGAGCCCATCGCGATCGTCGGGATGGCCTGC
CGGTTCCCGGGGGCGCCGGACGTGGACAGCTACTGGAGACTCCTATCCGGCGGCGTGGAT
GCGGTCACCGAGGTGCCCGCTGGCCGGTGGGACATGGACGCGTTCTACGATCGCGATCCT
CGCTCTCTCGGCGACGTGAGCACCCTCCGGGGCGGCTTCATCGATGACGTCGATCGCTTC
GACGCGATGTTCTTCGGCATCTCTCCGCGAGAGGCCGTCTCCATGGATCCACAGCAACGG
CTCATGCTGGAGCTCGCGTGGGAGGCCCTGGAAGACGCGGGGATCGTCGCGGAGAGGCTG
AAGGAAAGTCTGACCGGAGTCTTCTTCGGGTGCATCTGGGACGACTACGTCACGCTGATC
CATCAGAGGGGGCGAGGAGCCATCGCCCAGCATACGGTGACGGGGAATCACCGCAGCATC
ATCGCGAACCGCGTATCGTACACGCTCGACCTGCGCGGTCCCAGCATGACGGTCGATTCG
GCCTGCTCGTCCGCGCTCGTTACCATACATATGGCCTGCGAGAGCCTGCGCAGCGGCGAG
TCCACATTGGCCCTCGCAGGCGGCGTCAACCTGAACATAGCTCCCGAGAGCACGATCGGT
GTCCACAAGTTCGGCGGCCTGTCCCCCGACGGCCGCTGCTTCACCTTCGATGCGCGCGCG
AACGGCTATGTGCGCGGCGAGGGGGGCGGCGTGGTGGTGCTCAAACGCCTGTCCTCGGCC
ATCGCGGACGGCGATCCCATCATTTGTGTCATCCGCGGCTCCGCGGTCAACAACGATGGT
GCCAGCAACGGGTTGACCGGCCCCAATCCCCTGGCACAGGAAGCCGTCTTGCGGACCGCG
TACGAACGGGCAGGCGTGAACCCGGCCGATGTTCAGTATGTCGAGCTGCACGGAACTGGC
ACCCAACTGGGGGATCCCGTCGAGGCAAGCGCGCTCGGTGCAGTGCTCGGAAAGAGAAGG
CCCGCCGAACGCCCGCTGCTCGTGGGATCCGCCAAGACCAACGTCGGGCACCTGGAAGGT
GCCGCCGGCATCGTAGGGCTGCTCAAGGCAGCGCTCTGCCTCAAACACAAGCAGCTCGCG
CCCAACCTCAACTTCGAGACCCCGAATCCGCACATTCCATTCGCCGAGCTGAATCTGAAG
GTGCAGGGCGCTCTGGGGCCTTGGCCGGACATGGATCGTCCGCTCGTTTGCGGCGTGAGT
TCGTTCGGTCTGGGAGGGACGAACGCGCACGTCGTGCTGTCGGAGTGGGCATCGCTCGAG
GCCGAGCTCCACCCTCTCGCCGCAGAAAGCCCGGAGGCGCTGCGCGAAGAGGTGCAGCGG
CGGCTCTTGACCATGACCTCGCTCGTCGGGCGAGCACCCCTGTCGTTCCTGTGCGGTCGC
TCGGCGGCACAGCGCTCTGCGAAGGAGCATCGCCTCGCGGTCACCGCGCGCTCGTTCGAG
GAGCTGAAGCAGCGTCTGCTAGGCTTTCTCGAGCATGAGAAGCACGTCTCCGTGTCGGCG
GGGCGAGTGGATCTGGGCGCGGCGCCCAAGGTGGTCTTCGTCTTCGCCGGGCAGGGGGCG
CAGTGGTTCGGCATGGGTCGAGCGCTGCTGCAACGCGAGCCCGTATTCCGGACGACGATC
GAGCAGTGCAGCTCCTTCATCCAGCAGAACCTGGGCTGGTCGTTGCTCGATGAGCTGATG
ACAGATCGGGAGAGCTCGCGGCTCGATGAGATCGACGTCAGCCTCCCGGCCATCATATCC
ATCGAGATCGCCCTGGCGGCGCAATGGCGTGCTTGGGGCGTCGAGCCGGCGTTCGTAGTG
GGCCATAGCACAGGCGAGATCGCGGCGGCTCATGTCGCCGGCGTCCTGAGCATCGAGGAC
GCGATGCGGACCATCTGCGCGTACGGGCGCATCATCCGCAAGCTCCGAGGCAAGGGGGGC
ATGGGGCTCGTGGCGCTGTCGTGGGAAGACGCTGGCAAGGAGCTGACCGGCTACGAGGGG
CGCCTCTTCCGCGCGATAGAGCACAGCGCGGATTCAACGGTGCTGGCGGGCGAGCCGGAC
GCGCTCGACGCGCTGCTCCAGGCACTGGAGCGGAAGAACGTCTTTTGTCGTCGAGTGGCG
ATGGACGTTGCCCCCCATTGCCCCCAGGTCGACTGCCTTCGCGACGAGTTGTTCGATGCG
CTCCGTGAGGTGCGGCCCAACAAAGCGCAGATCCCCATCGTCTCCGAAGTGACGGGTACC
GCGCTCGACGGCGAGCGCTTCGACGCTTCCCACTGGGTCCGAAATTTCGGCGATCCTGCG
CTCTTCTCCACGGCCATCGATCATCTTTTGCAGGAAGGATTCGACATCTTCCTGGAGCTC
ACGCCACATCCCCTCGCGCTACCTGCGATCGAGTCCAACCTGCGCCGGTCCGGCCGGCGT
GGCGTCGTGCTCCCGTCGCTCCGCCGTAACGAGGACGAGCGTGGGGTGATGCTGGACACG
TTGGGCGTCCTCTATGTGCGAGGCGCGCCGGTGCGGTGGGACAATGTCTATCCGGCAGCC
TTCGAGAGCATGCCTTTGCCCTCGACGGCCGGTGGCGGGAAGCCGCTGCCACCCATGCCC
CTGCTCATATCGGCCAGAACGGACGCAGCCCTCGCTGCGCAAGCCGCGCGGCTGCGGGCG
CACCTCGATTCTCATCTCGACCTCGAGCTCGTGGACGTCGCCTATTCCCTCGCCGCCACG
CGGACGCACTTCGAGCGGCGCGCGGTGGTGGTCGCGCGCGATCGCGCGGGCATCCTCGAT
GGGCTGGACGCGCTCGCCCACGGCGGCTCCGCCGCCCTCCTCGGACGGAGCGCCGCGCAC
GGAAAGCTCGCCATTCTCTTTACGGGACAAGGAAGCCAGCGGCCCACCATGGGCCGAGCG
CTCTACGATGCTTTCCCCGTCTTCCGAGGTGCCCTCGACGCCGCCGCGGCTCACCTCGAC
CGCGACCTCGACCGCCCCCTGCGCGACGTCCTCTTCGCTCCCGACGGCTCCGAGCAGGCC
GCGCGCCTCGACCAGACCGCCTTCACCCAGCCGGCCCTGTTTGCCCTCGAAGTCGCCCTT
TTTGAGCTTCTTCAATCCTTCGGCCTAAAGCCCGCTCTCCTCCTCGGGCACTCCATCGGG
GAGCTCGTCGCCGCCCATGTCGCCGGCGTCCTTTCTCTCCAGGACGCCTGCACTCTCGTC
GCCGCCCGCGCGAAGCTCATGCAAGCGCTCCCACAAGGCGGCGCCATGGTCACCCTCCAG
GCCTCCGAGCAAGAAGCTCGCGACCTGCTTCAGGCCGCGGAAGGACGCGTCAGCCTCGCC
GCCGTCAACGGACATCTCTCCACCGTCGTCGCCGGCGACGAAGACGCAGTGCTCAAGATC
GCCCGGCAGGTCGAAGCCCTCGGACGAAAGGCCACACGCCTGCGCGTCAGCCACGCCTTC
CACTCCCCTCACATGGACGGCATGCTCGACGACTTCCGCCGCGTCGCCCAGGGCCTCACC
TTCCATCCTGCGCGCATCCCCATCATCTCCAACGTCACCGGCGCGCGCGCCACAGACCAG
GAGCTGGCGTCGCCCGAAACTTGGGTCCGCCACGTCCGCGACACCGTCCGCTTCCTCGAC
GGCGTCCGTACCCTCCACGCCGAAGGAGCACGCGCTTTCCTCGAGCTCGGGCCTCACCCT
GTACTCTCCGCCCTTGCGCAAGACGCCCTCGGACACGACGAAGGCCCGTCGCCATGCGCC
TTCCTTCCCACCCTCCGCAAGGGACGCGACGACGCCGAGGCGTTCACCGCCGCGCTCGGC
GCTCTCCACGCTGCAGGGCTCACCCCCGACTGGAACGCTTTCTTCGCGCCCTTCGCTCCA
TGCAAAGTCCCACTCCCCACCTATACCTTCCAGCGTGAGCGCTTCTGGCTCGACGCCTCT
ACAGCACACGCCGCCAGCGCCACTCCCGCTGCGGCGCTCGAGGGGCGGTTCTGGCAAGCC
GTCGAGAGCGGCGACATCGACACACTCAGCAGCGAGCTCCACGTGGACGGCGATGAGCAG
CGCGCCGCCCTTGCCCTCGTCCTTCCCACCCTCTCGAGCTTTCGCCACAAGCGGCAAGAG
CAGAGCACGGTCGATGCCTGGCGCTACCGCGTCACCTGGAAGCCTCTGACCACCGCCGCC
ACGCCCGCCGACCTCGCCGGCACCTGGCTCCTCGTCGTGCCGTCCGCGCTGGGCGACGAC
GCGCTCCTCGCCACGCTCACCGAGGCACTCACCCGGCGCGGAGCGCGCGTCCTCGCGCTG
CGCGTGAGCGATATCCACATAGGCCGCAGCGCTCTCGTCGAGCACCTGCGCGAGGCTCTG
GCGGAGACCGCCCCGCTGCGCGGCGTGCTCTCGCTCCTCGCCCTCGATGAGCATCGCCTC
GCGGACCGTTCTGCTCTGCCCGCGGGTCTGGCCCTGTCGCTCGCCCTCGTCCAAGGCCTC
GACGACCTCGCCATCGAGGCTCCCTTGTGGCTCTTCACCCGCGGCGCCGTCTCCATCGGA
CACTCCGACCCCATCACTCATCCCACCCAGGCCATGATCTGGGGCCTTGGCCGCGTCGTC
GGCCTCGAGCACCCCGAGCGATGGGGCGGGCTCGTCGACGTCAGCGCTGGGGTCGACGAG
AGCGCCGTGGGCCGCTTGCTCCCGGCCCTCGCCCAGCGCCACGACGAAGACCAGCTCGCT
CTCCGCCCGGCCGGACTCTACGCTCGCCGCATCGTCCGTGCCCCGCTCGGCGATGCGCCT
CCCGCGCGGGAGTTTAGACCCCGAGGCACCATCCTCATCACCGGAGGCACCGGCGCCCTC
GGCGCTCACGTCGCCCGATGGCTCGCTCGCCAGGGCGCAGAGCACCTCATCCTCATCAGC
CGCCGAGGCGCCGAGGCCCCTGGCGCCTCGGAGCTCCACGCCGAGCTCAATGCCCTCGGC
GTCCGCACCACCCTCGCCGCGTGCGATGTCGCCGATAGAAGCGCTCTCCAAGCTCTCCTC
GACAGCATTCCGTCGGACTGCCCGCTCACGGCGGTGTTTCACACGGCAGGAGCTCGCGAC
GATGGCCTGATCGGCGACATGACGCCCGAGCGCATCGAGCGGGTCCTTGCGCCCAAGCTC
GATTCGGCGTTGCACTTGCACGAGCTCACGAAAAATAGCGCTCTCGACGCCTTCGTCCTC
TACGCTTCACTCTCGGGTGTCCTCGGCAATCCCGGTCAGGCCAATTACGCCGCTGCAAAC
GCTTTCCTCGATGCCCTGGCCGAGCATCGGCGTAGCCTTGGACTGACGGCGACGTCCGTG
GCGTGGGGCGGGTGGGGCGGCGGTGGCATGGCCACCGAGCGCGTGGCAGCCCAGCTCCAG
CAACGCGGGCTGTTGCAGATGGCCCCCTCGCTTGCCCTGGCGGCGCTCGCGCAAGCCCTG
CAGCAAGACGAGACCACCATCACTGTCGCCGATATCGACTGGTCGCGCTTTGCGCCTGCG
TTCAGCGTCGCTCGCCAGAGGCCGCTGCTGCGCGATCTGCCAGAAGCGCAGCGGGCTCTC
CAAGCCAGCGAAGGCGCGTCCTCCGAGCACGGCCCGGCCACGGGCCTGCTCGACGAGCTC
CGAAGCCGCTCGGAAAGCGAGCAGCTCGATCTGCTCGCAACGCTTGTGCGCGGCGAGACG
GCCACTGTCCTCGGCCACGCCGAGGCCTCCCATGTCGACCCCGACAAGGGCTTCATGGAC
CTCGGTCTCGATTCGCTCATGACCGTCGAGCTCCGCCGGCGCTTGCAAAAGGCCACCGGC
GTCAAGCTCCCGCCCACGCTCGCGTTCGATCACCCCTCTCCTCATCGCGTCGCGTTTTTC
TTGCGCGACTCGCTCGCCCGAGCCTTCGGCACGAGGCTCTCCGCCGAACGCGACGGCGCC
GCGCTCCCGGCTCCTGGCGCCACCAGCGACAGCGACGAGCCGATTGCCATCGTCGGCATG
GCCCTCCGTCTGCCGGGCGGCATTGGCGATGTCGACGCTCTTTGGGATTTCCTCCACCAA
GGACGCGACGCGGTCGAGCCCATCCCACCTACCCGATGGGATGCCGGTGCCCTCTACGAC
CCTGATCCCGACGCCAAGGCCAAGAGCTACGTCCGGCATGCTGCCATGCTCGACCAGGTC
GATCTCTTCGACCCTGGATTCTTTGGCATCAGCCCTCGTGAGGCCAAACACATCGACCCC
CAGCACCGCCTACTCCTCGAAGCTGCCTGGCAGGCCCTCGAAGAGGCCGGTATCGTCCCC
TCCACCCTCAAGGATTCTCCCACCGGCGTGTTCGTCGGCATCGGCGCCAGCGAGTACGCG
CCGCGGGAACCGGGCGCGGAGGATTCCGAAGCTTACATCGTCCAAGGCACTTACGCGTCC
TTTGCCGCGGGGCGCTTGGCCTTCACGCTCGGGCTGCAAGGGCCAGCGCTCTCGGTCGAC
ACCGCTTGCTCCTCCTCGCTCGTCGCCCTCCACCTCGCCTGCCAGGCCCTTCGCCGCGAT
GAGTGCAACCTCGCCCTCGCCGCAGGGGCCTCTGTCATGGTCTCTCCCGAGACCTTCGTC
CTCCTTTCCCGCCTGCGCGCTTTGGCCCCCGACGGCCGCTCCAAGACCTTCTCGGCCAGC
GCCGACGGCTACGGTCGCGGTGAAGGCGTCATCGTCCTTGCCCTCGAGCGGCTCCGCGAC
GCCCTTGCCCAAGGACGCCGCGTCCTCGCCGTCGTGCGCGGCACCGCCGTCAACCACGAC
GGCGCATCGAGCGGCATCACCGCCCCCAATGGCACCTCCCAGAAGAAGGTCCTCCGCGCC
GCGCTCCACGACGCCCGCATCGCTCCCGCCGATGTCGACGTCGTCGAGTGCCATGGCACC
GGCACCTCCTTGGGCGATCCCATCGAGGTCCAAGCCCTGGCTGCTGTCTACGGTGAAGGC
AGATCCGCGGAAAAGCCTCTTTTTCTGGGTGCGGTCAAGACCAACGTTGGCCACCTCGAG
GCCGCCGCCGGCCTCGCGGGCGTCGCCAAGATCGTCGCTTCCCTCCTGCACAACGCCCTG
CCCCCCACCCTCCACACCACCCCACGCAATCCCCTGATCGCGTGGGATGCGCTCGCCGTC
GCCGTCGTCGATGCCACGAGGCCTTGGGTCCGCCACGCGGATGGGCGTCCCCGCCGCGCC
GGCGTCTCCGCCTTCGGACTCTCCGGCACCAACGCTCACGTCATCCTCGAAGAGGCCCCC
GCCATCGCCCGGGTCGAGCCCGCAGCGTCACAGCCGGCGTCCGAGCCGCTTCCCGCAGCG
TGGCCCGTGCTCCTGTCGGCCAAGAGCGAGGCGGCCGTGCGCGCCCAGGCAAAGCGGCTC
CGCGACCACCTCCTCGCCAAAAGCGAGCTCGCCCTCGCCGACGTGGCCTATTCGCTCGCG
ACCACGCGCGCCCACTTCGAGCAGCGCGCCGCTCTCCTCGTCAAAGGCCGCGACGAGCTC
CTCTCCGCCCTCGATTCGCTGGCCCAAGGACATTCCGCCGCCGTGCTCGGACGAAGCGGC
GCCCCAGGAAAGCTCGCCGTCCTCTTCACGGGGCAAGGAAGCCAGCGGCCCACCATGGGC
CGCGCCCTCTACGACGCTTTCCCCGTCTTCCGGGACGCCCTCGACACCGTCGCCGCCCAC
CTCGACCGCGACCTCGACCGCCCCCTGCGCGACGTCCTCTTCGCTCCCGACGGCTCCGAG
CAGGCCGCGCGCCTCGACCAAACCGCCTTCACCCAGCCGGCCCTGTTTGCCCTCGAAGTC
GCCCTCTTTCAGCTTCTCCAATCCTTCGGTCTGAAGCCCGCTCTCCTCCTCGGACACTCC
ATTGGCGAGCTCGTCGCCGCCCACGTCGCCGGCGTCCTTTCTCTCCAGGACGCCTGCACC
CTCGTCGCCGCCCGCGCAAAGCTCATGCAAGCGCTCCCACAAGGCGGCGCCATGGTCACC
CTCCGAGCCTCCGAGGAGGAAGTCCGCGACCTTCTCCAGCCCTACGATGGACGAGCTAGC
CTCGCCGCCCTCAATGGGCCTCTCTCCACCGTCGTCGCTGGCGATGAAGACGCGGTGGTG
GAGATCGCCCGCCAGGCCGAAGCCCTCGGACGAAAGACCACACGCCTGCGCGTCAGCCAC
GCCTTCCACTCTCCGCACATGGACGGAATGCTCGACGACTTCCGCCGCGTCGCCCAGAGC
CTCACCTACCATCCCGCACGCATCCCCATCATCTCCAACGTCACCGGCGCGCGCGCCACG
GACCACGAGCTCGCCTCGCCCGACTACTGGGTCCGCCACGTTCGCCACACCGTCCGCTTC
CTCGACGGCGTACGTGCCCTTCACGCCGAAGGGGCACGCGTCTTTCTCGAGCTCGGGCCT
CACGCTGTCCTCTCCGCCCTTGCGCAAGACGCCCTCGGACAGGACGAAGGCACGTCGCCA
TGCGCCTTCCTTCCCACCCTCCGCAAGGGACGCGACGACGCCGAGGCGTTCACCGCCGCG
CTCGGCGCTCTCCACGCTGCAGGGCTCACACCCGACTGGAGCGCTTTCTTCGCCCCCTTC
GCTCCACGCAAGGTCTCCCTCCCCACCTATGCCTTCCAGCGCGAGCGCTTCTGGCTCGAT
GCCTCCAAGGCACACGCTGCCGACGTCGCCTCCGCAGGCCTGACCTCGACCGATCACCCG
CTGCTCGGCGCCGGCGTCCCCCTCGCCGACCGCGATGGCTTCCTCTTCACAGGACGACTC
TCACTCTCAGAGCATCCGTGGCTCGCCGATCACGTCGTCTTCGGTACACCCATCCTTCCG
GGCACTGCCTTTCTCGAGCTTGCCCTGTTCGTCGCCGGTCGCGTCGGCCTCGACACCGTC
GAAGAGCTCACCCTCGAAACCCCCCTCGCTCTCCCGTCTGAAGGCGCCCTCCTCGTCCAG
GTGTCGGTCGGGCCTTTGGACGACGCAGGACGAAGGCCACTCTCTCTTCACAGCCGACCC
CAAGGCGCTCCTCAGGACGCCCCTTGGACTCGCCACGCGAGCGGCTCGCTCGCTCCAGCT
ACCCCGTCCCCTTCCTTCGATCTCCACGACTGGCCTCCCTCGGGCGCCACCCAGGTAGAC
ACCCAAGGCCTCTACGCAACCCTCGAAAGCGCTGGGCTTGCCTACGGCCCTCAGTTCCAG
GGCCTCCGCTCGGTCTGGAGGCGCGGCGACGAGCTCTTTGCGGAAGCTCAGCTCCCGGAC
GCCGCCAAAAAGGATGCCGCTCGGTTTGCCCTCCACCCCGCCCTGCTCGACAGCGCCCTG
CACGCGCTTGCCCTTGACGACGAGCGGGCACCGGGCGTCGCGCTGCCCTTCTCGTGGGGC
GGAGTCTCTCTGCGCGCTGTCGGTGCCACCACGCTGCGCGTGCGCTTCCACCGTCCGAAA
GGCGAAACCGCCGGCTCGCTCGTCCTCGCCGACGCCGCAGGCGGACCCATCGCCTCGGTG
CAAGCGCTCGCCACGCGCATCACGTCCGCCGAGCAGCTCCGCACCCCAGGAGCTTCCCAC
CACGATGCCCTCTTCCGCGTCGACTGGAGCGAGCTGCCGAGCCCCACCTCACCGTCTGGA
GCCCCAAGCGCCGTCCTTCTCGGCATCGGCGGCCTCGACCTCGCCCCCGAGGTGCCTCTC
GCCCGCGTCGCCGACCTCGCTGCCCTCCAGAGCGCGCTCGACCAAGGCGCTTCGCCTCCA
GGCCTCGTCGTCGTCCCCTTCATGGCTAGAACCGCCGACGACCTCATCCAGAGCGCCCAC
TCCATCACCGCGCGCGCCCTCGCCCTGCTGCAAGCCTGGCTGGCCGACGAACGCCTCGCC
TCCTCGCGCCTCGTCCTGCTCACCCGACGCGCCATCGCTGCCCGCGCCGATGAAGACGTC
AAGGACCTCGCTCACGCCCCTCTCTGGGGGCTCGCACGCTCCGCGCAAAGCGAACACCCA
GAACTCCCGCTCTTTCTCGTCGACCTGGACCTCAGTGAGGCCTCCCAGCACACCCTGCTC
GCCGCGCTCGAAACAGGAGAGCGTCACTCGCGTCTCCGCAACGGAAAACCCTTCATCCCG
AGATTGGCGAATGCACGCTCGAAGGATGAGCTCATCGCCCCGGACGCGTCCAACTGGCGC
CTCCATATTCCGACCAAAGGCAACTTCGACGCGCTCACCCTCGTCGACGCCCCTCTAGCC
CGTGCGCCCCTCGCACACGGCCAAGTCCGCGTCGCCGTGCACGCCGCAGCCTTCAATTTC
CGCGATGTCCTCGACACCCTTGGTCTGTATCCGGGCGACGCGGGACCGCTCGGCGGCGAA
GGCGCAGGCATCGTTACTGAAGTCGGTCCAGGTGTTTCGCGGTACACCGTAGGCGACCGG
GTGATGGGGATCTTCGGCGCAGCTTGCGGTCCCACGGCCATCGCCGACGCCCGCATGATC
TGCCCCATCCCCCACGCCTGGTCCTTCGCCCAAGCCGCCAGCGTCCCCATCATCTATCTC
ACCGCCTACTACGGACTCGTCGATCTCGGGCATCTGAAACCCAATCAACGTGTCCTCATC
CATGCGGCCGCCGGCGGCGTCGGGACGGCCGCCGTTCAGCTCGCACGCCACCTCGGCGCC
GAGGTCTTTGCCACCGCCAGCGCAGGGAAGTGGAGCGCTCTCCGCGCGCTCGGCTTCGAC
GACGCGCACCTCGCGTCCTCACGTGACCTGGACTTCGAGCAGCACTTCCTGCGCTCCACG
CATGGGCGCGGCGTGGATGTCGTCCTCGACTGCTTGGCACGCGAGTTCGTCGACGCTTCG
CTGCGCCTCATGCCGAGCGGTGGACGCTTCGTCGAGATGGGCAAGACGGACATCCGTGAG
CCCGACGCGGTCGGCGTCGCCTACCCTGGTGTCGTTTACCGCGCCTTCGACCTCATAGAG
GCCGGACCGGATCGAATAGAGCAGATGCTCGCAGAGCTGCTCAGCCTCTTCGAGCGCGGT
GCGCTTCGTCCGCCGCCCATCACATCTTGGGACATCCGTCATGCCCCCCAGGCCTTTCGC
GCGCTCGCTCAGGCGCGGCATGTTGGGAAGTTCGTCCTCACCATACCCCGTCCCATAGAC
CCCGAAGGCACCGTCCTCATCACGGGAGGCACCGGCACGCTAGGAGCCCTGGTCGCACGC
CATCTCGTCGCAAGACACGGCGCCAAGCACCTGCTTCTCACGTCGAGGCAGGGCGCGCAC
GCTCCGGGCGCCGAGGCCTCGCGAACCGAGCTCGAAGCGCTGGGGGCCTCTGTCACACTT
CGCGCGTGCGACGCGGCCGACCCACGCGCCCTCCAAGCCCTCTTGGACTCCATCCCGAGC
GCTCACCCGCTCACCGCCGTCGTCCACGCCGCGGGCGCCCTCGACGACGGCCTGCTCGGC
GCCATGAGCCCCGAGCGCATCGACCGCGTCTTTGCCCCCAAGCTCGATGCTGCTTGGCAC
TTGCATGAGCTCACCCAAGACAAGCCCCTCGCCGCCTTCGTCCTCTTCTCGTCCGCTGCT
GGCGTCCTTGGTAGTCCAGGTCAGTCGAACTACGCCGCGGCCAATGCCTTCCTCGATGCG
CTCGCGCATCACCGGCGTGCCCACGGGCTCCCGGCCTCCTCGCTCGCATGGGGCTATTGG
GCCGAGCGCAGTCGAATGACCGAGCACCTCAGCGCCGCCGATGTTTCTCGCATGAGGCGC
GCCGGCGTCCGGCCCCTCGCCACAGACGAGGCGCTCTCCCTCTTCGATGCGGCTCTCTTG
CGGCCCGAGCCCGCCCTGGTCCCCGCACGCTTCGACGTGAACGCGCTCGGCGCGAATGCC
GACGAGGTGCCCCCGCTGTTCCAGCGTCTCGTCCGCGCTCGCGTCGCACGCAAGGCCGCC
AGCAATACCGCCCTGGCCTCTTCACTCTCTCAGCGCCTCTCCTCCCTCCCGCCCGCAGAA
AGCGAGCGCTTCCTCCTCGATCTCGTCCGCACCGAAGCCGCCACCGTCCTCGGCCTCGCC
TCATTCGAATCGCTCGATCCCCATCGCCCTCTCCAAGAGCTCGGCCTCGATTCTCTTATC
GCTCTCGAGCTCCGAGGTCGACTCGCCGCGGCCACCGGGCTGCGACTCCAACCTACTCTC
CTCTTCGACTATCCAACCCCGGCTGCACTCTCACGCTTTTTCACGACGCAGTTCTTCGGG
GAAACCACCGACCGTCCCGCAGCGCCGCTCACCCCGGCGGGAAGCGAAGACCCTATCGCC
ATCGTGTCGATGAGCTGCCGCTTCCCTGGCGACGTGCGCACGCCCGAGGATCTCTGGAAG
CTCTTGCTCGATGGGAAAGATGCCATCTCCAGCTTTCCCCAGAATCGCGGTTGGAGTCTC
GATGCGCTCGACGCTCCCGGTCGCTTCCCAGTCCGAGAGGGAGGCTTCGTCTACGACGCA
GACGCCTTCGATCCGGCCTTCTTCGGGATCAGTCCACGCGAAGCGCTCGCCATCGATCCC
CAACAGCGGCTCCTCCTCGAGATCAGCTGGGAAGCGTTGGAGCGTGCAGGCATCGACCCG
GCCTCGCTCCAAGGGAGCCAAAGCGGCGTTTTCGTCGGCATTATACACAACGACTACGGC
GCATGGCTGATGAACGGGACTGACGAACACAAGGGATTCGCTGCCACGGGTAGCACGGCG
AGCGTCGCCTCCGGCCGGATCGCCTATACGTTCGGCTTCCAAGGGCCCGCCATCAGCGTT
GACACGGCGTGCAGCTCCTCGCTCGTCGCGGTTCACCTCGCCTGCCAGGCCCTCCGGCAC
GGCGAATGCTCCCTGGCGCTCGCTGGCGGCGTGACCGTCCTGGCCACGCCAGCAGTCTTC
GTCGCGTTCGACTCCGAGAGCGCGGGTGCCCCCGATGGTCGCTGCAAGGCCTTCTCGGCG
GAAGCGAACGGTGCGGGCTGGGCCGAGGGCGCCGGGATGCTCTTGCTCGAGCGCCTCTCC
GATGCGGTCCGAAACGGTCATCCCGTCCTCGCCGTCCTTCGAGGCTCCGCCGTCAACCAA
GACGGCCGAAGCCAGGGCCTCACCGCGCCCAACGGCCCTGCCCAGGAGCGGGTCATCCGG
CAGGCGCTCGACAGCGCGCGGCTCACGCCAAAGGACATCGACGCCGTCGAGGCTCACGGC
ACGGGGACGACCCTCGGAGACCCCATCGAGGCTCAAGCCATTCTTGCCACCTATGGAGAG
TCCCATTCCCAAGACAGCCCCCTCTGGCTTGGAAGTCTCAAGTCCAACATGGGACATACT
CAGGCCGCGGCCGGCGTAGGAAGCGTCATCAAGATGGTGCTCGCGTTGCAGCACGGTCTC
TTGCCCAAGACCCTCCATGCCAAAAACCCTTCCCCCCACATCGACTGGTCTCCGGGCACG
GTAAAGCTCCTGGACGAGCCCGTCGTCTGGAAGACCAATGGGCATCCACGCCGCGCTGGC
GTCTCCTCGTTCGGGTTCTCCGGCACCAATGCCCACGTCATCCTCGAAGAGGCCCCCGCC
ATCGCCCGGGCCGAGTCCGCCGCCGCACAGCCTGCGTCCGAGCCGCTTCCCGCAGCGTGG
CCCGTGCTCCTTTCAGCCAAGAGCGAGGCGGCCCTGCGCGCGCAGGCCGCGCGGTTGCGG
GACCACCTCCAGGCACACCCCGACCTCGAGCTCGCGGACGTCGCCTATTCACTCGCCACG
ACGCGGGCGCACTTCGAGCGGCGCGCGGTGGTCGTCGCAAAGGACCGCGACGAGGCCACC
TTCGCCCTCGATGCCTTCGAGCAAGGCAGCCCGGCCCACCACGTCGCGCACGGCGAAGCC
AGGGTCGCGGGCAAGCTCGTCTTCGTCTTTCCAGGCCAGGGATCCCAGTGGCCCGGAATG
GCGCAGCAACTGCTCACGACATCCGATGCGTTCCGCGCGCAAGTCGAAGCGTGCGCGCGC
GCGTTCGCACCTCACCTCGGCTGGTCGCTCTTGGCCGTGCTCCGCGGCGACGAGGGGGCC
CCGTCGCTGGAGCGGATCGAGGTCGTGCAACCAGCGCTCTTCACCGTCATGGTCTCCTTG
GCTGCCCTCTGGCGCTCCAGGGGTATCGAGCCCGATGCCGTCGTTGGACACAGCCAAGGC
GAGCTCGCCGCCGCCTACGTGGCCGGCGCGCTGTCGCTCGACGACGCCGCCAAGGTGGTG
GCACGGCGCAGCCGCCTGCTGAGCACGCTCTCCGGTCAGGGCGCGATGGCCGCCGTGGAG
CGGCCGCCCGCGGCGCTCGAGCCCTACCTCGCGCGCTTCGGTCGGCGCCTCTCCATCGCC
GCCATCAACAGCCCGAGCGCCACCACGGTCTCCGGCGAGCCCGACGCCATTGACCATCTG
CTCCGGCTGCTCAAAGCCGAGCAGATCTTCGCGCTCAAGCTGCGCGTCGACGTGGCGTCC
CACGGCGCGCAGATCGAAGGCATGCGCGAGCAGCTGCTCGAGGAGCTCCGCGAGATCGAG
CCGCGGGAAAGCCGAATTCCGTTCTACTCCACGGTTCGAGGCGAGAAGCTCGCCGGTACC
GAGCTCGGCGCCGCCTACTGGTACGACAACCTGCTGCGGCCCGTCCGCTTCGCCGACGCC
ACCCAGCTCCTGCTCGACGACGCGCACCGCTTCTTCGTCGAGGTGAGCCCCCATCCGGTG
CTGATGCTGCCGCTTGAGGAGACCCTCGAAGCCTCCGGTCTCCCCACGGCGGTCCTTGGC
TCGCTCTGGCAGGACGAGGGGGACCTCTCGCGCTTTCTCGCTTCGCTCGGCGAGCTCTAC
GCGCGCGGATACGCCGTCGATTGGCGCGCTTTCTTCGAGCCGCTGCGGCCGCGTCGCGTC
GCTCTGCCCACGTATGCCTTCCAGCGCGAGCGCTTCTGGCTCGACGCCCCCACAGCACAC
GCCGACGTCGCCTCCGCAGGCCTGACCTCGGCCGACCACCCGCTGCTCGGCGCCGCCGTC
CGCCTCGCCGACACCGATGCCTTCCTCTTCACCGGCCGCCTCTCGCTGCAGAGCCATCCC
TGGCTCGCCGAGCACGCCGCCTTCGGCATACCCATCCTGCCGGGCACCGCCTTTCTCGAG
CTTGCCCTGCTCGCCGCCGATCGCGTCGGCCTCGACACCGTCGAAGAGGTCACGCTCGAA
GCTCCCCTCGCTCTCCCCTCTCAAGGCACCATTCTCATCCAGATCTCCGTCGGACCCATG
GACGAGGCGGGACGAAGGTCGCTCTCCCTCCATGGCCGGACCGAGGACGCTCCTCAGGAC
GCCCCTTGGACGCGCCACGCGAGCGGGTCGCTCGCTAAAGCTGCCCCCTCCCTCTCCTTC
GATCTTCACGAATGGGCTCCTCCGGGGGGCACGCCGGTGGACACCCAAGGCTCTTACGCA
GGCCTCGAAAGCGGGGGGCTCGCCTATGGGCCTCAGTTCCAGGGACTTCGCTCCGTCTGG
AAGCGCGGCGACGAGCTCTTCGCCGAGGCCAAGCTCCCGGACGCAGGCGCCAAGGATGCC
GCTCGGTTCGCCCTCCACCCCGCCCTGTTCGACAGCGCCCTGCACGCGCTTGTCCTTGAA
GACGAGCGGACGCCGGGCGTCGCTCTGCCCTTCTCGTGGAGAGGAGTCTCGCTGCGCTCC
GTCGGCGCCACCACCCTGCGCGTGCGCTTCCATCGTCCGAATGGCAAGTCCTCCGTGTCG
CTCCTCCTCGGCGACGCCGCAGGCGAGCCCCTCGCCTCGGTCCAAGCGCTCGCCACGCGC
ATCACGTCCCAGGAGCAGCTCCGCACCCAGGGAGCTTCCCTCCACGATGCTCTCTTCCGG
GTTGTCTGGAGAGATCTGCCCAGCCCTACGTCGCTCTCTGAGGCCCCGAAGGGTGTCCTC
CTAGAGACAGGGGGTCTCGACCTCGCGCTGCAGGCGTCTCTCGCCCGCTACGACGGTCTC
GCTGCCCTCCGGAGCGCGCTCGACCAAGGCGCTTCGCCTCCGGGCCTCGTCGTCGTCCCC
TTCATCGATTCGCCCTCTGGCGACCTCATAGAGAGCGCTCACAACTCCACCGCGCGCGCC
CTCGCCTTGCTGCAAGCGTGGCTTGACGACGAACGCCTCGCCTCCTCGCGCCTCGTCCTG
CTCACCCGACAGGCCATCGCAACCCACCCCGACGAGGACGTCCTCGACCTCCCTCACGCT
CCTCTCTGGGGCCTTGTGCGCACCGCGCAAAGCGAACACCCGGAGCTCCCTCTCTTCCTC
GTCGACCTGGACCTCGGTCAGGCCTCGGAGCGCGCCCTGCTCGGCGCGCTCGACACAGGA
GAGCGTCAGCTCGCTCTCCGCCATGGAAAATGCCTCGTCCCGAGGTTGGTGAATGCACGC
TCGACAGAGGCGCTCATCGCGCCGAACGTATCCACGTGGAGCCTTCATATCCCGACCAAA
GGCACCTTCGACTCGCTCGCCCTCGTCGACGCTCCTCTAGCCCGTGCGCCCCTCGCACAA
GGCCAAGTCCGCGTCGCCGTGCACGCGGCAGGTCTCAACTTCCGCGATGTCCTCAACACC
CTTGGCATGCTTCCGGACAACGCGGGGCCGCTCGGCGGCGAAGGCGCGGGCATTGTCACC
GAAGTCGGCCCAGGTGTTTCCCGATACACTGTAGGCGACCGGGTGATGGGCATCTTCCGC
GGAGGCTTTGGCCCCACGGTCGTCGCCGACGCCCGCATGATCTGCCCCATCCCCGATGCC
TGGTCCTTCGTCCAAGCCGCCAGCGTCCCCGTCGTCTTTCTCACCGCCTACTATGGACTC
GTCGATGTCGGGCATCTCAAGCCCAATCAACGTGTCCTCATCCATGCGGCCGCAGGCGGC
GTCGGTACTGCCGCCGTCCAGCTCGCGCGCCACCTCGGCGCCGAAGTCTTCGCCACCGCC
AGTCCAGGGAAGTGGGACGCTCTGCGCGCGCTCGGCTTCGACGATGCGCACCTCGCGTCC
TCACGTGACCTGGAATTCGAGCAGCATTTCCTGCGCTCCACACGAGGGCGCGGCATGGAT
GTCGTCCTCAACGCCTTGGCGCGCGAGTTCGTCGACGCTTCGCTGCGTCTCCTGCCGAGC
GGTGGAAGCTTTGTCGAGATGGGCAAGACGGATATCCGCGAGCCCGACGCCGTAGGCCTC
GCCTACCCCGGCGTCGTTTACCGCGCCTTCGATCTCTTGGAGGCTGGACCGGATCGAATT
CAAGAGATGCTCGCAGAGCTGCTCGACCTGTTCGAGCGCGGCGTGCTTCGTCCGCCGCCC
ATCACGTCCTGGGACATCCGGCATGCCCCCCAGGCGTTCCGCGCGCTCGCTCAGGCGCGG
CATATTGGAAAGTTCGTCCTCACCGTTCCCCGTCCCATCGATCCCGAAGGCACCATCCTC
GTCACGGGAGGCACCGGCACGCTCGGCGCGCTCATCGCGCGCCACCTCGTCGCCAATCGC
GGCGCCAAGCACCTGCTCCTCACCTCGCGAAAGGGTGCGAGCGCTCCGGGGGCCGAGGCA
TTGCGGAGCGAGCTCGAAGCTCTGGGGGCTGCGGTCACGCTCGCCCGGTGCGACGCGGCC
GATCCACGCGCGCTCCAAGCCCTCTTGGACAGCATCCCGAGCGCTCACCCGCTCACGGCC
GTCGTGCACGCCGCCGGCGCCCTTGACGATGGGCTGATCAGCGCCATGAGCCCCGAGCGC
ATCGACCGCGTCTTTGCTCCCAAGCTCGACGCCGCTTGGCACTTGCATCAGCTCACCCAG
GACAAGCCGCTCGCGGCCTTCGTCCTCTTCTCGTCCGCCTCCGGCGTCCTCGGCGGTATG
GGTCAATCCAACTACGCGGCGGCCAATGCGTTCCTTGACGCGCTCGCGCATCACCGACGC
GTCCATGGGCTCCCAGCCTCCTCGCTCGCATGGGGCCATTGGGCCGAGCGCAGCGGAATG
ACCCGACACCTCAGCGGCGTCGATACCGCTCGCATGAGGCGCGCCGGTCTCCGATCCATC
GCCTCGGACGAGGGTCTCGCCCTCTTCGATATGGCGCTCGGGCGCCCGGAGCCCGCGCTG
GTCCCCGCCCGCTTCGACATGAACGCGCTCGGCGCGAAGGCCGACGGGCTACCCTCGATG
TTCCAGGGTCTCGTCCGCGCTCGCGTCGCGCGCAAGGTCGCCAGCAATAATGCCCTGGCC
GCGTCGCTCACCCAGCGCCTCGCCTCCCTCCCGCCCACCGACCGCGAGCGCATGCTGCTC
GATCTCGTCCGCGCCGAAGCCGCCATCGTCCTCGGCCTCGCCTCGTTCGAATCGCTCGAT
CCCCGTCGCCCTCTTCAAGAGCTCGGTCTCGATTCCCTCATGGCCATCGAGCTCCGAAAT
CGACTCGCCGCCGCCACAGGCTTGCGACTCCAAGCCACCCTCCTCTTCGACCACCCGACG
CCCGCCGCGCTCGCGACCCTGCTGCTCGGGAAGCTCCTCCAGCATGAAGCTGCCGATCCT
CGCCCCTTGGCCGCAGAGCTCGACAGGCTAGAGGCCACTCTCTCCGCGATAGCCGTGGAC
GCTCAAGCACGCCCGAAGATCATATTACGCCTGCAATCCTGGTTGTCGAAGTGGAGCGAC
GCTCAGGCTGCCGACGCTGGACCGATTCTCGGCAAGGATTTCAAGTCTGCTACGAAGGAA
GAGCTCTTCGCTGCTTTTGACGAAGCGTTCGGAGGCCTGGGTAAATGA
[0] ACP14..87
[1] KS110..486
[1] AT632..949
[1] AT1066..1387
[1] malonyl-CoA1258..1262
[1] KR1709..1888
[1] ACP1992..2062
[2] KS2092..2466
[2] AT2628..2949
[2] malonyl-CoA2820..2824
[2] DH2999..3162
[2] ER3487..3793
[2] KR3803..3983
[2] ACP4085..4155
[3] KS4177..4547
[3] AT4707..5025
[3] methoxymalonyl-ACP4898..4902
[3] DH5073..5236
[3] ER5561..5867
[3] KR5877..6057
[3] ACP6159..6229
[0] ACP40..261
[1] KS328..1458
[1] AT1894..2847
[1] AT3196..4161
[1] malonyl-CoA3772..3786
[1] KR5125..5664
[1] ACP5974..6186
[2] KS6274..7398
[2] AT7882..8847
[2] malonyl-CoA8458..8472
[2] DH8995..9486
[2] ER10459..11379
[2] KR11407..11949
[2] ACP12253..12465
[3] KS12529..13641
[3] AT14119..15075
[3] methoxymalonyl-ACP14692..14706
[3] DH15217..15708
[3] ER16681..17601
[3] KR17629..18171
[3] ACP18475..18687

close this sectionFeature

BLASTP
Database:UniProtKB:2011_09
show BLAST table
InterPro
Database:interpro:38.0
IPR001227 Acyl transferase domain (Domain)
 [627-755]  G3DSA:3.40.366.10 [823-939]  G3DSA:3.40.366.10 [1057-1190]  G3DSA:3.40.366.10 [1258-1377]  G3DSA:3.40.366.10 [2621-2752]  G3DSA:3.40.366.10 [2820-2939]  G3DSA:3.40.366.10 [4700-4830]  G3DSA:3.40.366.10 [4898-5014]  G3DSA:3.40.366.10
G3DSA:3.40.366.10   Ac_transferase_reg
IPR002198 Short-chain dehydrogenase/reductase SDR (Family)
 [1709-1875]  1.5e-62 PF00106
PF00106   adh_short
IPR002364 Quinone oxidoreductase/zeta-crystallin, conserved site (Conserved_site)
 [3615-3636]  PS01162 [5689-5710]  PS01162
PS01162   QOR_ZETA_CRYSTAL
IPR006162 Phosphopantetheine attachment site (PTM)
 [2020-2035]  PS00012 [4113-4128]  PS00012 [6187-6202]  PS00012
PS00012   PHOSPHOPANTETHEINE
IPR009081 Acyl carrier protein-like (Domain)
 [22-86]  4.1e-08 PF00550 [1997-2057]  1e-12 PF00550 [4088-4153]  2e-11 PF00550 [6162-6227]  1.4e-11 PF00550
PF00550   PP-binding
 [14-87]  PS50075 [1992-2062]  PS50075 [4085-4155]  PS50075 [6159-6229]  PS50075
PS50075   ACP_DOMAIN
 [11-127]  8.30001403791589e-22 SSF47336 [1985-2109]  2.49999811956465e-25 SSF47336 [4078-4193]  2.70000183580794e-27 SSF47336 [6150-6236]  2.90000354677981e-21 SSF47336
SSF47336   ACP_like
 [17-87]  1.79999999999997e-72 G3DSA:1.10.1200.10 [1992-2065]  1.79999999999997e-72 G3DSA:1.10.1200.10 [4084-4156]  1.79999999999997e-72 G3DSA:1.10.1200.10 [6158-6232]  1.79999999999997e-72 G3DSA:1.10.1200.10
G3DSA:1.10.1200.10   ACP_like
IPR011032 GroES-like (Domain)
 [3478-3622]  4.70000326671387e-29 SSF50129 [5552-5696]  2.90000354677981e-32 SSF50129
SSF50129   GroES_like
IPR013149 Alcohol dehydrogenase, C-terminal (Domain)
 [3626-3718]  2.3e-18 PF00107 [5700-5826]  2.4e-16 PF00107
PF00107   ADH_zinc_N
IPR013154 Alcohol dehydrogenase GroES-like (Domain)
 [3507-3573]  1.2e-06 PF08240 [5581-5638]  3.8e-07 PF08240
PF08240   ADH_N
IPR013968 Polyketide synthase, KR (Domain)
 [3803-3982]  1.20000000000001e-66 PF08659 [5877-6056]  7.59999999999998e-65 PF08659
PF08659   KR
IPR014030 Beta-ketoacyl synthase, N-terminal (Domain)
 [110-360]  5.09999999999993e-93 PF00109 [2092-2341]  1.79999999999997e-86 PF00109 [4177-4422]  1.79999999999997e-90 PF00109
PF00109   ketoacyl-synt
IPR014031 Beta-ketoacyl synthase, C-terminal (Domain)
 [369-486]  1.99999999999999e-41 PF02801 [2349-2466]  8.89999999999994e-43 PF02801 [4430-4547]  2.89999999999999e-46 PF02801
PF02801   Ketoacyl-synt_C
IPR014043 Acyl transferase (Domain)
 [632-949]  2.59999999999995e-81 PF00698 [1066-1387]  6.10000000000004e-69 PF00698 [2628-2949]  4.89999999999996e-70 PF00698 [4707-5025]  1.29999999999998e-101 PF00698
PF00698   Acyl_transf_1
IPR016035 Acyl transferase/acyl hydrolase/lysophospholipase (Domain)
 [630-932]  7.79998304125571e-61 SSF52151 [1062-1352]  2.50000909916183e-71 SSF52151 [2624-2919]  3.99998544139406e-72 SSF52151 [4705-5017]  5.50001754266469e-66 SSF52151
SSF52151   Acyl_Trfase/lysoPlipase
IPR016036 Malonyl-CoA ACP transacylase, ACP-binding (Domain)
 [757-822]  4.99999811956429e-12 SSF55048 [1192-1257]  9.00000407957274e-16 SSF55048 [2754-2819]  7.30000590915208e-17 SSF55048 [4832-4897]  1.29999924468179e-17 SSF55048
SSF55048   Malonyl_transacylase_ACP-bd
IPR016038 Thiolase-like, subgroup (Domain)
 [110-371]  G3DSA:3.40.47.10 [372-537]  G3DSA:3.40.47.10 [2095-2353]  G3DSA:3.40.47.10 [2354-2521]  G3DSA:3.40.47.10 [4176-4432]  G3DSA:3.40.47.10 [4434-4600]  G3DSA:3.40.47.10
G3DSA:3.40.47.10   Thiolase-like_subgr
IPR016039 Thiolase-like (Domain)
 [103-538]  4.30000170645869e-101 SSF53901 [2085-2465]  2.99998750445706e-98 SSF53901 [4169-4546]  1.3000049540733e-98 SSF53901
SSF53901   Thiolase-like
IPR016040 NAD(P)-binding domain (Domain)
 [1709-1906]  2.99999999999998e-113 G3DSA:3.40.50.720 [3578-3767]  3.30000000000002e-113 G3DSA:3.40.50.720 [3804-3991]  8.70000000000003e-129 G3DSA:3.40.50.720 [5697-5791]  1.99999999999999e-45 G3DSA:3.40.50.720 [5877-6060]  1.60000000000002e-116 G3DSA:3.40.50.720
G3DSA:3.40.50.720   NAD(P)-bd
IPR018201 Beta-ketoacyl synthase, active site (Active_site)
 [273-289]  PS00606 [2254-2270]  PS00606 [4335-4351]  PS00606
PS00606   B_KETOACYL_SYNTHASE
IPR020801 Polyketide synthase, acyl transferase domain (Domain)
 [634-931]  1.59998835313644e-95 SM00827 [1066-1369]  9.59995698566587e-128 SM00827 [2628-2931]  SM00827 [4709-5006]  2.1000026783403e-118 SM00827
SM00827   PKS_AT
IPR020806 Polyketide synthase, phosphopantetheine-binding domain (Domain)
 [19-90]  2.2999993566538e-15 SM00823 [1993-2065]  1.10000150671642e-31 SM00823 [4086-4158]  1.79999754022375e-26 SM00823 [6160-6232]  2.39999798157265e-30 SM00823
SM00823   PKS_PP
IPR020807 Polyketide synthase, dehydratase domain (Domain)
 [2999-3162]  1.09999184471405e-81 SM00826 [5073-5236]  4.30000170645869e-74 SM00826
SM00826   PKS_DH
IPR020841 Polyketide synthase, beta-ketoacyl synthase domain (Domain)
 [113-538]  SM00825 [2095-2521]  SM00825 [4179-4600]  SM00825
SM00825   PKS_KS
IPR020842 Polyketide synthase/Fatty acid synthase, KR (Domain)
 [1709-1888]  1.40000506907309e-62 SM00822 [3803-3983]  6.10002656149838e-71 SM00822 [5877-6057]  9.09992432541739e-66 SM00822
SM00822   PKS_KR
IPR020843 Polyketide synthase, enoylreductase (Domain)
 [3487-3793]  SM00829 [5561-5867]  SM00829
SM00829   PKS_ER
SignalP No significant hit
TMHMM No significant hit
Page top