Soraph_00080 : CDS information

Location

Organism:
Strain: So ce26
Entry name: Soraphen
Contig:
Start / Stop / Direction: 33,269 / 59,722 / + [in whole cluster]
                          33,269 / 59,722 / + [in contig]
Location: 33269..59722 [in whole cluster]
          33269..59722 [in contig]
Type: CDS
Length: 26,454 bp (8,817 aa)
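
The length fields can be cross-checked against the coordinates: an inclusive range start..stop spans stop - start + 1 bases, and a CDS that ends in a stop codon encodes one residue fewer than it has codons. The sketch below is only an illustration. The function names are hypothetical (not part of the database), and it assumes the annotated range includes the stop codon.

```python
def cds_length_bp(start: int, stop: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range."""
    return stop - start + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count, assuming the final codon of the CDS is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Coordinates taken from the Location field of this entry.
length_bp = cds_length_bp(33269, 59722)
print(length_bp, protein_length_aa(length_bp))  # 26454 8817
```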

Annotation

Category: 1.1 PKS
Product: polyketide synthase
Product (GenBank): soraphen polyketide synthase B
Gene:
Gene (GenBank): sorB
EC number:
Keyword:
Note:
Note (GenBank):
  • similar to product encoded by GenBank Accession Number M63676
Reference:
ACC:
PmId:
[12039053] Characterization of the biosynthetic gene cluster for the antifungal polyketide soraphen A from Sorangium cellulosum So ce26. (Gene, 2002)
[15289572] Heterologous production of the antifungal polyketide antibiotic soraphen A of Sorangium cellulosum So ce26 in Streptomyces lividans. (Microbiology, 2004)
Comment:
[PMID: 12039053] (2002)
Report of the soraphen A biosynthetic gene cluster.
SorB (8815 aa): PKS

Mod4: KS-ATa-KR-ACP
Mod5: KS-ATp-DH-ER-KR-ACP
Mod6: KS-ATp-KR-ACP
Mod7: KS-ATm-KR-ACP
Mod8: KS-ATp-dh-KR-ACP-TE

ATa = AT that incorporates acetate
ATp = AT that incorporates propionate
ATm = AT that incorporates methoxymalonate
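
The AT suffix legend above can be turned into a per-module extender-unit lookup. The following is a minimal sketch, not the curators' method. The mapping from suffix to extender unit is read from pairing this comment with the substrate list in the PKS/NRPS Module section of this entry, and the names `EXTENDER_BY_AT_SUFFIX`, `MODULES`, and `extender_unit` are hypothetical.

```python
# Suffix -> extender unit, as paired in this entry
# (a = acetate, p = propionate, m = methoxymalonate).
EXTENDER_BY_AT_SUFFIX = {
    "a": "malonyl-CoA",
    "p": "methylmalonyl-CoA",
    "m": "methoxymalonyl-ACP",
}

# Module organization copied from the comment above.
MODULES = {
    4: "KS-ATa-KR-ACP",
    5: "KS-ATp-DH-ER-KR-ACP",
    6: "KS-ATp-KR-ACP",
    7: "KS-ATm-KR-ACP",
    8: "KS-ATp-dh-KR-ACP-TE",
}


def extender_unit(domain_string: str) -> str:
    """Return the extender unit implied by the suffixed AT domain."""
    for domain in domain_string.split("-"):
        if domain.startswith("AT") and len(domain) == 3:
            return EXTENDER_BY_AT_SUFFIX[domain[2]]
    raise ValueError(f"no suffixed AT domain in {domain_string!r}")


predicted = {number: extender_unit(s) for number, s in MODULES.items()}
for number, unit in predicted.items():
    print(number, unit)
```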

Mutants that do not synthesize soraphen were confirmed to carry mutations in SorA and SorB.

---
[PMID: 15289572] (2004)
Heterologous expression of the sor genes in S. lividans.

Soraphen A was produced with either of the following combinations:
SorABRCDFE strain + cinnamate
SorABRCDFE strain + badA (benzoate-CoA ligase) + benzoate or cinnamate

The SorABRCDFE strain expresses sorA and the sorB4M and sorRCDFE operons.

PKS/NRPS Module

Module  Substrate           Domains (aa positions)
4       malonyl-CoA         KS 34..404, AT 564..886, KR 1207..1387, ACP 1491..1561
5       methylmalonyl-CoA   KS 1589..1963, AT 2125..2442, DH 2493..2656, ER 2981..3287, KR 3297..3477, ACP 3579..3649
6       methylmalonyl-CoA   KS 3671..4041, AT 4201..4512, KR 4839..5019, ACP 5123..5193
7       methoxymalonyl-ACP  KS 5221..5595, AT 5757..6074, KR 6394..6574, ACP 6675..6749
8       methylmalonyl-CoA   KS 6775..7149, AT 7311..7628, DH 7678..7842, KR 8161..8341, ACP 8442..8512, TE 8596..8807
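
The domain coordinates above can be grouped into modules by starting a new module at each KS domain, which reproduces the domain order given in the comment. This is a minimal sketch under that assumption. The helper names are hypothetical, and numbering from module 4 follows this entry.

```python
import re

# Domain annotations copied from this entry (amino-acid coordinates).
TOKENS = (
    "KS34..404 AT564..886 KR1207..1387 ACP1491..1561 "
    "KS1589..1963 AT2125..2442 DH2493..2656 ER2981..3287 "
    "KR3297..3477 ACP3579..3649 "
    "KS3671..4041 AT4201..4512 KR4839..5019 ACP5123..5193 "
    "KS5221..5595 AT5757..6074 KR6394..6574 ACP6675..6749 "
    "KS6775..7149 AT7311..7628 DH7678..7842 KR8161..8341 "
    "ACP8442..8512 TE8596..8807"
).split()

FIRST_MODULE = 4  # SorB begins with module 4 in this entry


def parse_domain(token):
    """Split a token such as 'KS34..404' into (name, start, end)."""
    match = re.fullmatch(r"([A-Z]+)(\d+)\.\.(\d+)", token)
    if match is None:
        raise ValueError(f"unrecognized domain token: {token!r}")
    name, start, end = match.groups()
    return name, int(start), int(end)


def group_into_modules(tokens, first_module):
    """Assign domains to modules, opening a new module at every KS."""
    modules = {}
    number = first_module - 1
    for token in tokens:
        domain = parse_domain(token)
        if domain[0] == "KS":
            number += 1
            modules[number] = []
        if number not in modules:
            raise ValueError("annotation list does not start with a KS domain")
        modules[number].append(domain)
    return modules


modules = group_into_modules(TOKENS, FIRST_MODULE)
domain_order = {
    number: "-".join(name for name, _, _ in domains)
    for number, domains in modules.items()
}
for number, order in domain_order.items():
    print(f"Mod{number}: {order}")
```

Note that the coordinate list names the module 8 dehydratase as DH, whereas the comment writes it in lowercase (dh); the sketch reports the name exactly as it appears in the coordinate list.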

Sequence

>polyketide synthase [soraphen polyketide synthase B]
MNNDEKLVSYLQQAMNELQRAHQRLRAVEEKEHEPIAIVAMSCRFPGDVRTPEDLWKLLL
DGKDAISDLPPNRGWKLDALDVHGRSPVREGGFFYDADAFDPAFFGISPREALAIDPQQR
LLLEISWEAFERAGIDPASLQGSQSGVFVGVIHNDYDALLENAAGEHKGFVATGSTASVA
SGRIAYTFGFQGPAISVDTACSSSLVAVHLACQALRRGECSLALAGGVTVMATPAVFVAF
DSESAGAPDGRCKSFSVEANGSGWAEGAGMLLLERLSDAVQNGHPVLAVLRGSAVNQDGR
SQGLTAPNGPAQERVIRQALDSARLTPKDVDVVEAHGTGTTLGDPIEAQAILATYGEAHS
QDRPLWLGSLKSNLGHAQAAAGVGSVIKMVLALQQGLLPKTLHAQNPSPHIDWSPGTVKL
LNEPVVWTTNGHPRHAGVSAFGISGTNAHVILEEAPAIARVEPAASQPASEPLPAAWPVL
LSAKSEAAVRAQAKRLRDHLLAKSELALADVAYSLATTRAHFEQRAALLVKGRDELLSAL
DALAQGHSAAVLGRSGAPGKLAVLFTGQGSQRPTMGRGLYDVFPVFRDALDTVGAHLDRE
LDRPLRDVLFAPDGSEQAARLEQTAFTQPALFALEVALFQLLQSFGLKPALLLGHSIGEL
VAAHVAGVLSLQDGCTLVAARAKLMQALPQGGAMVTLRASEEEVRDLLQPYEGRASLAAL
NGPLSTVVAGDEDAVVEIARQAEALGRKTTRLRVSHAFHSPHMDGMLDDFRRVAQSLTYH
PARIPIISNVTGARATDHELASPDYWVRHVRHTVRFLDGVRALHAEGARVFLELGPHAVL
SALAQDALGQDEGTSPCAFLPTLRKGRDDAEAFTAALGALHSAGITPDWSAFFAPFAPRK
VSLPTYAFQRERFWPDASKAPGADVSHLAPLEGGLWQAIERGDLDALSGQLHVDGDERRA
ALALLLPTLSSFRHERQEQSTVDAWRYRITWKPLTTAETPADLAGTWLVVVPAALDDDAL
PSALTEALTRRGARVLALRLSQAHLDREALAEHLRQACAETAPIRGVLSLLALDERPLAD
RPALPAGLALSLSLAQALGDLDLEAPLWFFTRGAVSIGHSDPLAHPAQAMTWGLGRVIGL
EHPDRWGGLVDVCAGVDESAVGRLLPALAERHDEDQLALRPAGLYARRIVRAPLGDAPPA
RDFTPGGTILITGGTGAIGAHVARWLARRGAQHLVLISRRGAEAPGASELHDELSALGAR
TTLAACDVADRNAVATLLEQLDAEGSQVRAVFHASGIEHHAPLDATSFRDLAEVVSGKVE
GAKHLHDLLGSRPLDAFVLFSSGAAVWGGGQQGGYAAANAFLDALAEHRRSAGLTATSVA
WGAWGGGGMATDQAAAHLQQRGLSRMAPSLALAALALALEHDETTVTVADIDWARFAPSF
SAARPRPLLRDLPEAQRALETSEGASSEHGPAPDLLDKLRSRSESEQLRLLVSLVRHETA
LVLGHEGASHVDPDKGFLDLGLDSLMAVELRRRLQQATGIKLPATLAFDHPSPHRVALFL
RDSLAHALGTRLSVEPDAAALPALRAASDEPIAIVGMALRLPGGVGDVDALWEFLAQGRD
GVEPIPKARWDAAALYDPDPDAKTKSYVRHAAMLDQVDLFDPAFFGISPREAKHLDPQHR
LLLESAWQALEDAGIVPPTLKDSPTGVFVGIGASEYALREASTEDSDAYALQGTAGSFAA
GRLAYTLGLQGPALSVDTACSSSLVALHLACQALRQGECNLALAAGVSVMASPEGFVLLS
RLRALAPDGRSKTFSANADGYGRGEGVIVLALERLGDALARGHRVLALVRGTAINHDGAS
SGITAPNGTSQQKVLRAALHDARITPADVDVVECHGTGTSLGDPIEVQALAAVYADGRPA
EKPLLLGALKTNIGHLEAASGLAGVAKIVASLRHDALPPTLHTGPRNPLIDWDTLAIDVV
DTPRSWARHEDSSPRRAGVSAFGLSGTNAHVILEEAPAALSGEPATSQTASRPLPAACAV
LLSARSEAAVRAQAKRLRDHLLAHDDLALIDVAYSQATTRAHFEHRAALLARDRDELLSA
LDSLAQDKPAPSTVLGRSGSHGKVVFVFPGQGSQWEGMALSLLDSSPVFRAQLEACERAL
APHVEWSLLAVLRRDEGAPSLDRVDVVQPALFAVMVSLAALWRSLGVEPAAVVGHSQGEI
AAAFVAGALSLEDAARIAALRRKALTTVGGNGGMAAVELGASDLQTYLAPWGDRLSTAAV
NSPRATLVSGEPAAVDALLDVLTATKVFARKIRVDYASHSAQMDAVQDELAAGLANIAPR
TCELPLYSTVTGTRLDGSELDGAYWYRNLRQTVLFSSATERLLDDGHRFSVEVSPHPVLT
LALRETCERSPLDPVVVGSIRREEGHLARLLLSWAELSTRGLALDWKDFFAPYAPRKVSL
PTYPFQRERFWLDVSTDERFRRRLRRPDLGRPIPLLGAAVAFADRGGFLFTGRLSLAEHP
WLEGHAVFGTPILPGTGFLELALHVAHRVGLDTVEELTLEAPLALPSQDTVLLQISVGPV
DDAGRRALSFHSRQEDALQDGPWTRHASGSLSPATPSLSADLHEWPPSSAIPVDLEGLYA
TLANLGLAYGPEFQGLRSVYKRGDELFAEAKLPEAAEKDAARFALHPALLDSALHALAFE
DEQRGTVALPFSWSGVSLRSVGATTLRVRFHRPKGESSVSIVLADAAGDPLASVQALAMR
TTSAAQLRTPAASHHDALFRVDWSELQSPTSPPAAPSGVLLGTGGHDLALDAPLARYADL
AALRSALDQGASPPGLVVAPFIDRPAGDLVPSAHEATALALALLQAWLADERLASSRLVL
VTRRAVATHTEDDVKDLAHAPLWGLARSAQSEHPDLPLFLVDIDLSEASQQALLGALDTG
ERQLALRNGKPLIPRLAQPRSTDALIPPQAPTWRLHIPTKGTFDALALVDAPEAQAPLAH
GQVRIAVHAAGLNFRDVVDTLGMYPGDAPPLGGEGAGIVTEVGPGVSRYTVGDRVMGVFG
AAFGPTAIADARMICPIPHAWSFAQAASVPIIYLTAYYGLVDLGHLKPNQRVLIHAAAGG
VGTAAVQLARHLGAEVFATASPGKWSALRALGFDDAHLASSRDLGFEQHFLRSTHGRGMD
VVLDCLAREFVDASLRLMPSGGRFIEMGKTDIREPDAIGLAYPGVVYRAFDVTEAGPDRI
GQMLAELLSLFERGVLRLPPITSWDIRHAPQAFRALAQARHVGKFVLTIPRPIDPEGTVL
ITGGTGTLGVLVARHLVAKHSAKHLLLTSRKGARAPGAEALRSELEALGASVTLVACDVA
DPRALRTLLDSIPRDHPITAVVHAAGALDDGPLGSMSAERIARVFDPKLDAAWYLHELTQ
DEPVAAFVLFSAASGVLGGPGQSNYAAANAFLDALAHHRRAQGLPAASLAWGYWAERSGM
TRHLSAADAARMRRAGVRPLDTDEALSLFDVALLRPEPALVPAPFDYNVLSTSADGVPPL
FQRLVRARIARKAASNTALASSLAEHLSSLPPAERERVLLDLVRTEAASVLGLASFESLD
PHRPLQELGLDSLMALELRNRLAAAAGLRLQATLLFDYPTPTALSRFFTTHLFGGTTHRP
GVPLTPGGSEDPIAIVAMSCRFPGDVRTPEDLWKLLLDGQDAISGFPQNRGWSLDALDAP
GRFPVREGGFVYDADAFDPAFFGISPREALAVDPQQRILLEITWEAFERAGIDPASLQGS
QSGVFVGVWQSDYHASLVNATGEYKGLVATGSAASVASGRIAYTFGLQGPAISVDTACSS
SLVAVHLACQALRHGECSLALAGGVTIMATPGIFIAFDSESAGAPDGRCKAFSAEADGSG
WAEGAGMLLLERLSDAVQNGHPVLAVLRGSAVNQDGRSQGLTAPNGPAQERVIRQALDSA
RLTPKDVDVVEAHGTGTTLGDPIEAQAVFATYGEAHSQDRPLWLGSLKSNLGHTQAAAGV
GGIIKMVLALQHGLLPKTLHAQNPSPHIDWSPGIVKLLNEAVAWTTSGHPRRAGVSSFGV
SGTNAHVILEEAPAATRAESGASQPASQPLPAAWPVVLSARSEAAVRAQAQRLREHLLAQ
GDLTLADVAYSLATTRAHFEHRAALVAHDRDELLSALDSLAQDKPAPSTVLGRSGSHGKV
VFVFPGQGSQWEGMALSLLDSSPVFRTQLEACERALRPHVEWSLLAVLRRDEGAPSLDRV
DVVQPALFAVMVSLAALWRSLGVEPAAVVGHSQGEIAAAFVAGALSLEDAARIAALRSKA
SPPSPATGMAAVELGASDLQTYLAPWGDRLSIAAVNSPRATLVSGEPAAVDALIDSLTAA
QVFARRVRVDYASHSAQMDAVQDELAAGLANIAPRTCELPLYSTVTGTRLDGSELDGAYW
YRNLRQTVLFSSATERLLDDGHRFFVEVSPHPVLTLALRETCERSPLDPVVVGSIRRDEG
HLPRLLALLGRALWPGLTPEWKAFFAPFAPRKVSLPTYAFQRERFWLDAPNAHPEGVAPA
APIDGRFWQAIERGDLDALSGQLHADGDEQRAALALLLPTLSSFHHQRQEQSTVDTWRYR
ITWRPLTTAATPADLAGTWLLVVPSALGDDALPATLTDALTRRGARVLALRLSQVHIGRA
ALTEHLREAVAETAPIRGVLSLLALDERPLADHAALPAGLALSLALVQALGDLALEAPLW
LFTRGAVSIGHSDPLAHPTQAMIWGLGRVVGLEHPERWGGLVDLGAALDASAAGRLLPAL
AQRHDEDQLALRPAGLYARRFVRAPLGDAPAARGFMPRGTILITGGTGAIGAHVARWLAR
KGAEHLVLISRRGAQAEGAVELHAELTALGARVTFAACDVADRSAVATLLEQLDAGGPQV
SAVFHAGGIEPHAPLAATSMEDLAEVVSGKVQGARHLHDLLGSRPLDAFVLFSSGAVVWG
GGQQGGYAAANAFLDALAEQRRSLGLTATSVAWGVWGGGGMATGLLAAQLEQRGLSPMAP
SLAVATLALALEHDETTLTVADIDWARFAPSFSAARSRPLLRDLPEAQRALEASADASSE
QDGATGLLDKLRNRSESEQIHLLSSLVRHEAALVLGHTDASQVDPHKGFMDLGLDSLMTV
ELRRRLQQATGIKLPATLAFDHPSPHRVALFLRDSLAHALGARLSVERDAAALPALRSAS
DEPIAIVGMALRLPGGIGDVDALWEFLAQGRDAVEPIPHARWDAGALYDPDPDAKAKSYV
RHAAMLDQVDLFDPAFFGISPREAKYLDPQHRLLLESAWLALEDAGIVPSTLKDSPTGVF
VGIGASEYALRNTSSEEVEAYALQGTAGSFAAGRLAYTLGLQGPALSVDTACSSSLVALH
LACQALRQGECNLALAAGVSVMASPGLFVVLSRMRALAPDGRSKTFSTNADGYGRGEGVV
VLALERLGDALARGHRVLALVRGTAMNHDGASSGITAPNGTSHQKVLRAALHDAHIGPAD
VDVVECHGTGTSLGDPIEVQALAAVYADGRPAEKPLLLGALKTNIGHLEAASGLAGVAKI
VASLRHDALPPTLHTTPRNPLIEWDALAIDVVDATRAWARHEDGSPRRAGVSAFGLSGTN
AHVILEEAPAIPQAEPTAAQLASQPLPAAWPVLLSARSEPAVRAQAQRLRDHLLAHDDLA
LADVAYSLATTRATFEHRAALVVHDREELLSALDSLAQGRPAPSTVVERSGSHGKVVFVF
PGQGSQWEGMALSLLDTSPVFRAQLEACERALAPHVDWSLLAVLRGEEGAPPLDRVDVVQ
PALFSMMVSLAALWRSMGVEPDAVVGHSQGEIAAACVAGALSLEDAAKLVALRSRALVEL
AGQGAMAAVELPEAEVARRLQRYGDRLSIGAINSPRFTTISGEPPAVAALLRDLESEGVF
ALKLSYDFASHSAQVESIRDELLDLLSWLEPRSTAVPFYSTVSGAAIDGSELDAAYWYRN
LRQPVRFADAVQGLLAGEHRFFVEVSPSPVLTLALHELLEASERSAAVVGSLWSDEGDLR
RFLVSLSELYVNGFALDWTTILPPGKRVPLPTYPFQRERFWLDASTAPAAGVNHLAPLEG
RFWQAIESGNIDALSGQLHVDGDEQRAALALLLPTLASFRHERQEQGTVDAWRYRITWKP
LTTATTPADLAGTWLLVVPAALDDDALPSALTEALARRGARVLAVRLSQAHLDREALAEH
LRQACAETAPPRGVLSLLALDESPLADHAAVPAGLAFSLTLVQALGDIALDAPLWLFTRG
AVSVGHSDPIAHPTQAMTWGLGRVVGLEHPERWGGLVDVGAAIDASAVGRLLPVLALRND
EDQLALRPAGFYARRLVRAPLGDAPPARTFKPRGTLLITGGTGAAGAHVARWLAREGAEH
LVLISRRGAQAEGASELHAELTALGARVTFAACDVADRSAVATLLEQLDAEGSQVRAVFH
AGGIGRHAPLAATSLMELADVVSAKVLGAGNLHDLLGPRPLDAFVLFSSIAGVWGGGQQA
GYAAGNAFLDALADQRRSLGQPDTSVVWGAWGGGGGIFTGPLAAQLEQRRLSPMAPSLAV
AALAQALEHDETTVTVADIDWARFAPSISVARSRRSCATCPSSAPSKTEKARPPPSTARP
PDLLDKLRSRSESEQLRLLAALVCDETALVLGHEGRFPARPRQGFFDLGLDSIMTVELRR
RLQQATGIKLPATLAFDHPSPHRVALFMRDSLAHALGTRLSAEATPPRSGRASSDEPIAI
VGMALRLPGGVGDVDALWEFLHQGRDAVEPIPQSRWDAGALYDPDPDADAKSYVRHAAML
DQIDLFDPAFFGISPREAKHLDPQHRLLLESAWLALEDAGIVPTSLKDSLTGVFVGICAG
EYAMQEASSEGSEVYFIQGTSASFGAGGLAYTLGLQGPRSSVDTACSSSLVSLHLACQAL
RQGECNLALAAGVSLMVSPQTFVILSRLRALAPDGRSKTFSDNADGYGRGEGVVVLALER
IGDALARRHRVLVLVRGTAINHDGASSGITAPNGTSQQKVLRAALHDARITPADVDVVEC
HGTGTSLGDPIEVQALAAVYADGRPAEKPLLLGALKTNIGHLEAASGLAGVAKMVASLRH
DALPPTLHATPRNPLIEWEALAIDVVDTPRPWPRHEDGSPRRAGISAFGFSGTNAHVILE
EAPAALPAEPATSQPASQAAPAAWPVLLSARSEAAVRAQAKRLRDHLVAHDDLTLADVAY
SLATTRAHFEHRAALVAHNRDELLSALDSLAQDKPAPSTVLGRSGSHGKLVFVFPGQGSQ
WEGMALSLLDSSPVFRAQLEACERALAPHVEWSLLAVLRRDEGAPSLDRVDVVQPALFAV
MVSLAALWRSLGVEPAAVVGHSQGEIAAAFVAGALSLEDAARIAALRSKALTTVAGNGAM
AAVELGASDLQTYLAPWGDRLSIAAVNSPRATLVSGEPAAIDALIDSLTAAQVFARKVRV
DYASHSAQMDAVQDELAAGLANIAPRTCELPLYSTVTGTRLDGSELDGAYWYRNLRQTVL
FSSATERLLDDGHRFFVEVSPHPVLTLALRETCERSPLDPVVVGSIRRDEGHLARLLLSW
AELSTRGLALDWNAFFAPFAPRKVSLPTYPFQRERFWLDASTAHAADVASAGLTSADHPL
LGAAVALADRDGFVFTGRLSLAEHPWLEDHVVFGIPCPARRRLLELALHVAHLVGLDTVE
DVTLDPPLALPSQGAVLLQISVGPADGAGRRALSVHSRRHDALQDGPWTRHASGSLAQAS
PSHCLRCSANGPPSGATQVDTQGFYAALESAGLAYGPEFQGLRRRLQARRRALRRSQAPG
RRRRGRRSFCPPPRPARQRLAGARLCRRPGKGLQDALLVERSIAALRSEPPPCACVSTVL
RANPRARSSSPTPEANPSPRCKRSPCAPRPPSSSADPGASHLDALFRIDWSELQSPTSPP
IAPSGALLGTEGLDLGTRVPLDRYTDLAALRSALDQGASPPSLVIAPFIALPEGDLIASA
RETTAHALALLQAWLADERLASSRLALVTRRAVATHAEEDVKGLAHAPLWGLARSAQSEH
PERPLVLVDLDDSEASQHALLGALDAREPEIALRNGKPLVPRLSRLPQAPTDTASPAGLG
GTVLITGGTGTLGALVARRLVVNHDAKHLLLTSRQGASAPGADVLRSELEALGASVTLAA
CDVADPRALKDLLDNIPSAHPVAAVVHAASVLDGDLLGAMSLERIDRVFAPKIDAAWHLH
QLTQDKPLAAFILFSSVAGVLGSSGHSNYAAASAFLDALAHHRRAQGLPASSLAWSHWAE
RSAMTEHVSAAGAPRMERAGLPSTSEERLALFDAALFRTETALVPARFDLSALRANAGSV
PPLFQRLVRARTVRKAASNTAQASSLTERLSALPPAERERALLDLIRTEAAAVLGLASFE
SLDPDRLLQELGLDSIIALDLRNRLAAATGVRLPATLLFEHPTPAALAALLLARLEPGMR
RGPAKDGASPTDTESDGALLGMVQPANEIGAIEEARNLIAAALKVRLAVEDASKRSAVAI
AEEPPTRLARGQATPQLICFPAFVVPSAPIQYARFASHLRDRRDIWFIPHPGYRHKTPLT
RSLDELVSSHARTTLACARNSPFVLFGHSSGGNIAHMVAEHLESIGHGPAGVVLLDSYDY
ASPAVEAGLKIFHVEQLQTWGASDAGLTAEAWYYEHIGLETWKPRQLAAPTLHVRATEPM
KQFVGSEGAPAEWRASWKLPHVAIDAPGDHATVVDHPFLAQAVDDWLSSLSNEPSNQ
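
To pull an individual domain out of the protein sequence, the 1-based inclusive coordinates in the PKS/NRPS Module section have to be converted to a 0-based slice. The sketch below shows one way to do that with the standard library only. `read_fasta` and `slice_1based` are hypothetical helpers, and the toy record contains just the first 20 residues of the sequence above, so the coordinates used are for illustration and do not correspond to a real domain.

```python
from io import StringIO


def read_fasta(handle):
    """Read FASTA text into a dict of header -> concatenated sequence."""
    records = {}
    header = None
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            header = line[1:]
            records[header] = []
        elif header is not None:
            records[header].append(line)
    return {name: "".join(parts) for name, parts in records.items()}


def slice_1based(sequence, start, end):
    """Return residues start..end (1-based, inclusive)."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("coordinates fall outside the sequence")
    return sequence[start - 1:end]


# Toy record: the first 20 residues of the protein, split over two lines.
toy = StringIO(">toy record\nMNNDEKLVSY\nLQQAMNELQR\n")
sequences = read_fasta(toy)
fragment = slice_1based(sequences["toy record"], 3, 6)
print(fragment)  # NDEK
```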
>polyketide synthase [soraphen polyketide synthase B]
ATGAATAACGACGAGAAGCTTGTCTCCTACCTACAGCAGGCGATGAATGAGCTTCAGCGT
GCTCATCAGCGCCTCCGCGCGGTCGAAGAGAAGGAGCACGAGCCCATCGCCATCGTGGCG
ATGAGCTGCCGCTTCCCGGGCGACGTGCGCACGCCCGAGGATCTCTGGAAGCTCTTGCTC
GATGGGAAAGATGCTATCTCCGACCTTCCCCCAAACCGTGGTTGGAAGCTCGACGCGCTC
GACGTCCACGGTCGCTCCCCAGTCCGAGAGGGAGGCTTCTTCTACGACGCAGACGCCTTC
GATCCGGCCTTCTTCGGGATCAGCCCACGCGAGGCGCTCGCCATCGATCCCCAGCAGCGG
CTCCTCCTCGAGATCTCATGGGAAGCCTTCGAGCGTGCGGGCATCGACCCTGCCTCGCTC
CAAGGGAGCCAAAGCGGCGTCTTCGTCGGCGTGATACACAACGACTACGACGCATTGCTG
GAGAACGCAGCTGGCGAACACAAAGGATTCGTTGCCACCGGCAGCACAGCGAGCGTCGCC
TCCGGCCGGATCGCGTATACATTCGGCTTTCAAGGGCCCGCCATCAGCGTGGACACGGCG
TGCAGCTCCTCGCTCGTCGCGGTTCACCTCGCCTGCCAGGCCCTGCGCCGTGGCGAATGC
TCCCTGGCGCTCGCCGGCGGCGTGACCGTCATGGCCACGCCAGCAGTCTTCGTCGCGTTC
GATTCCGAGAGCGCGGGCGCCCCCGATGGTCGCTGCAAGTCGTTCTCGGTGGAGGCCAAC
GGTTCGGGCTGGGCCGAGGGCGCCGGGATGCTCCTGCTCGAGCGCCTCTCCGATGCCGTC
CAAAACGGTCATCCCGTCCTCGCCGTCCTTCGAGGCTCCGCCGTCAACCAGGACGGCCGG
AGCCAAGGCCTCACCGCGCCCAATGGCCCTGCCCAAGAGCGCGTCATCCGGCAAGCGCTC
GACAGCGCGCGGCTCACTCCAAAGGACGTCGACGTCGTCGAGGCTCACGGCACGGGAACC
ACCCTCGGAGACCCCATCGAGGCACAGGCCATTCTTGCCACCTATGGCGAGGCCCATTCC
CAAGACAGACCCCTCTGGCTTGGAAGTCTCAAGTCCAACCTGGGACATGCTCAGGCCGCG
GCCGGCGTGGGAAGCGTCATCAAGATGGTGCTCGCGTTGCAGCAAGGCCTCTTGCCCAAG
ACCCTCCATGCCCAGAATCCCTCCCCCCACATCGACTGGTCTCCGGGCACGGTAAAGCTC
CTGAACGAGCCCGTCGTCTGGACGACCAACGGGCATCCTCGCCACGCCGGCGTCTCCGCC
TTCGGCATCTCCGGCACCAACGCCCACGTCATCCTCGAAGAGGCCCCCGCCATCGCCCGG
GTCGAGCCCGCAGCGTCACAGCCCGCGTCCGAGCCGCTTCCCGCAGCGTGGCCCGTGCTC
CTGTCGGCCAAGAGCGAGGCGGCCGTGCGCGCCCAGGCAAAGCGGCTCCGCGACCACCTC
CTCGCCAAAAGCGAGCTCGCCCTCGCCGATGTGGCCTATTCGCTCGCGACCACGCGCGCC
CACTTCGAGCAGCGCGCCGCTCTCCTCGTCAAAGGCCGCGACGAGCTCCTCTCCGCCCTC
GATGCGCTGGCCCAAGGACATTCCGCCGCCGTGCTCGGACGAAGCGGGGCCCCAGGAAAG
CTCGCCGTCCTCTTCACGGGGCAAGGAAGCCAGCGGCCCACCATGGGCCGCGGCCTCTAC
GACGTTTTCCCCGTCTTCCGGGACGCCCTCGACACCGTCGGCGCCCACCTCGACCGCGAG
CTCGACCGCCCCCTGCGCGACGTCCTCTTCGCTCCCGACGGCTCCGAGCAGGCCGCGCGC
CTCGAGCAAACCGCCTTCACCCAGCCGGCCCTGTTTGCCCTCGAAGTCGCCCTCTTTCAG
CTTCTACAATCCTTCGGTCTGAAGCCCGCTCTCCTCCTCGGACACTCCATTGGCGAGCTC
GTCGCCGCCCACGTCGCCGGCGTCCTTTCTCTCCAGGACGGCTGCACCCTCGTCGCCGCC
CGCGCAAAGCTCATGCAAGCGCTCCCACAAGGCGGCGCCATGGTCACCCTCCGAGCCTCC
GAGGAGGAAGTCCGCGACCTTCTCCAGCCCTACGAAGGCCGAGCTAGCCTCGCCGCCCTC
AATGGGCCTCTCTCCACCGTCGTCGCTGGCGATGAAGACGCGGTGGTGGAGATCGCCCGC
CAGGCCGAAGCCCTCGGACGAAAGACCACACGCCTGCGCGTCAGCCACGCCTTCCATTCC
CCGCACATGGACGGAATGCTCGACGACTTCCGCCGCGTCGCCCAGAGCCTCACCTACCAT
CCCGCACGCATCCCCATCATCTCCAACGTCACCGGCGCGCGCGCCACGGACCACGAGCTC
GCCTCGCCCGACTACTGGGTCCGCCACGTTCGCCACACCGTCCGCTTCCTCGACGGCGTA
CGTGCCCTTCACGCCGAAGGGGCACGTGTCTTTCTCGAGCTCGGGCCTCACGCTGTCCTC
TCCGCCCTTGCGCAAGACGCCCTCGGACAGGACGAAGGCACGTCGCCATGCGCCTTCCTT
CCCACCCTCCGCAAGGGACGCGACGACGCCGAGGCGTTCACCGCCGCGCTCGGCGCTCTC
CACTCCGCAGGCATCACACCCGACTGGAGCGCTTTCTTCGCCCCCTTCGCTCCACGCAAG
GTCTCCCTCCCCACCTATGCCTTCCAGCGCGAGCGCTTCTGGCCCGACGCCTCCAAGGCA
CCCGGCGCCGACGTCAGCCACCTTGCTCCGCTCGAGGGGGGGCTCTGGCAAGCCATCGAG
CGCGGGGACCTCGATGCGCTCAGCGGTCAGCTCCACGTGGACGGCGACGAGCGGCGCGCC
GCGCTCGCCCTGCTCCTTCCCACCCTCTCGAGCTTTCGCCACGAGCGGCAAGAGCAGAGC
ACGGTCGACGCCTGGCGCTACCGTATCACCTGGAAGCCTCTGACCACCGCCGAAACACCC
GCCGACCTCGCCGGCACCTGGCTCGTCGTCGTGCCGGCCGCTCTGGACGACGACGCGCTC
CCCTCCGCGCTCACCGAGGCGCTCACCCGGCGCGGCGCGCGCGTCCTCGCCTTGCGCCTG
AGCCAGGCCCACCTGGACCGCGAGGCTCTCGCCGAGCATCTGCGCCAGGCTTGCGCCGAG
ACCGCCCCGATTCGCGGCGTGCTCTCGCTCCTCGCCCTCGACGAGCGCCCCCTCGCAGAC
CGTCCTGCCCTGCCCGCCGGACTCGCCCTCTCGCTTTCTCTCGCTCAAGCCCTCGGCGAC
CTCGACCTCGAGGCGCCCTTGTGGTTCTTCACGCGCGGCGCCGTCTCCATTGGACACTCT
GACCCCCTCGCCCATCCCGCCCAGGCCATGACCTGGGGCTTGGGCCGCGTCATCGGCCTC
GAGCACCCCGACCGGTGGGGAGGTCTCGTCGACGTCTGCGCTGGGGTCGACGAGAGCGCC
GTGGGCCGCTTGCTGCCGGCCCTCGCCGAGCGCCACGACGAAGACCAGCTCGCTCTCCGC
CCGGCCGGACTCTACGCTCGCCGCATCGTCCGCGCCCCGCTCGGCGATGCGCCTCCCGCG
CGCGACTTCACGCCCGGAGGCACCATTCTCATCACCGGCGGCACCGGCGCCATTGGCGCT
CACGTCGCCCGATGGCTCGCTCGAAGAGGCGCTCAGCACCTCGTCCTCATCAGCCGCCGA
GGCGCCGAGGCCCCTGGCGCCTCGGAGCTCCACGACGAGCTCTCGGCCCTCGGCGCGCGC
ACCACCCTCGCCGCGTGCGATGTCGCCGACCGGAATGCTGTCGCCACGCTTCTTGAGCAG
CTCGACGCCGAAGGGTCGCAGGTCCGCGCCGTGTTCCACGCGAGCGGCATCGAACACCAC
GCTCCGCTCGACGCCACCTCTTTCAGGGATCTCGCCGAGGTTGTCTCCGGCAAGGTCGAA
GGTGCAAAGCACCTCCACGACCTGCTCGGCTCTCGACCCCTCGACGCCTTTGTTCTCTTT
TCGTCCGGCGCGGCCGTCTGGGGCGGCGGACAGCAAGGCGGCTACGCGGCCGCAAACGCC
TTCCTCGACGCCCTTGCCGAGCATCGGCGCAGCGCTGGATTGACAGCGACGTCGGTGGCC
TGGGGCGCGTGGGGCGGCGGCGGCATGGCCACCGATCAGGCGGCAGCCCACCTCCAACAG
CGCGGTCTGTCGCGGATGGCCCCCTCGCTTGCCCTGGCGGCGCTCGCGCTGGCTCTGGAG
CACGACGAGACCACCGTCACCGTCGCCGACATCGACTGGGCGCGCTTTGCGCCTTCGTTC
AGCGCCGCTCGCCCCCGCCCGCTCCTGCGCGATTTGCCCGAGGCGCAGCGCGCTCTCGAG
ACCAGCGAAGGCGCGTCCTCCGAGCATGGCCCGGCCCCCGACCTCCTCGACAAGCTCCGG
AGCCGCTCGGAGAGCGAGCAGCTTCGTCTGCTCGTCTCGCTGGTGCGCCACGAGACGGCC
CTCGTCCTCGGCCACGAAGGCGCCTCCCATGTCGACCCCGACAAGGGCTTCCTCGATCTC
GGTCTCGATTCGCTCATGGCCGTCGAGCTTCGCCGGCGCTTGCAACAGGCCACCGGCATC
AAGCTCCCGGCCACCCTCGCCTTCGACCATCCCTCTCCTCATCGAGTCGCGCTCTTCTTG
CGCGACTCGCTCGCCCACGCCCTCGGCACGAGGCTCTCCGTCGAGCCCGACGCCGCCGCG
CTCCCGGCGCTTCGCGCCGCGAGCGACGAGCCCATCGCCATCGTCGGCATGGCCCTCCGC
CTGCCGGGCGGCGTCGGCGATGTCGACGCTCTTTGGGAGTTCCTGGCCCAGGGACGCGAC
GGCGTCGAGCCCATTCCAAAGGCCCGATGGGATGCCGCTGCGCTCTACGACCCCGACCCC
GACGCCAAGACCAAGAGCTACGTCCGGCATGCCGCCATGCTCGACCAGGTCGACCTCTTC
GACCCTGCCTTCTTTGGCATCAGCCCCCGGGAGGCCAAACACCTCGACCCCCAGCACCGC
CTGCTCCTCGAATCTGCCTGGCAGGCCCTCGAAGACGCCGGCATCGTCCCCCCCACCCTC
AAGGATTCCCCCACCGGCGTCTTCGTCGGCATCGGCGCCAGCGAATACGCATTGCGAGAG
GCGAGCACCGAAGATTCCGACGCTTATGCCCTCCAAGGCACCGCCGGGTCCTTTGCCGCG
GGGCGCTTGGCCTACACGCTCGGCCTGCAAGGGCCCGCGCTCTCGGTCGACACCGCCTGC
TCCTCCTCGCTCGTCGCCCTCCACCTCGCCTGCCAAGCCCTCCGACAGGGCGAGTGCAAC
CTCGCCCTCGCCGCGGGCGTCTCCGTCATGGCCTCCCCCGAGGGCTTCGTCCTCCTTTCC
CGCCTGCGCGCCTTGGCGCCCGACGGCCGCTCCAAGACCTTCTCGGCCAACGCCGACGGC
TACGGACGCGGAGAAGGCGTCATCGTCCTTGCCCTCGAGCGGCTCGGTGACGCCCTCGCC
CGAGGACACCGCGTCCTCGCCCTCGTCCGCGGCACCGCCATCAACCACGACGGCGCGTCG
AGCGGTATCACCGCCCCCAACGGCACCTCCCAGCAGAAGGTCCTCCGCGCCGCGCTCCAC
GACGCCCGCATCACCCCCGCCGACGTCGACGTCGTCGAGTGCCATGGCACCGGCACCTCC
TTGGGAGACCCCATCGAGGTGCAAGCCCTGGCCGCCGTCTACGCCGACGGCAGACCCGCT
GAAAAGCCTCTCCTTCTCGGCGCGCTCAAGACCAACATCGGCCATCTCGAGGCCGCCTCC
GGCCTCGCGGGCGTCGCCAAGATCGTCGCCTCCCTCCGCCATGACGCCCTGCCCCCCACC
CTCCACACGGGCCCGCGCAATCCCTTGATTGATTGGGATACACTCGCCATCGACGTCGTT
GATACCCCGAGGTCTTGGGCCCGCCACGAAGATAGCAGTCCCCGCCGCGCCGGCGTCTCC
GCCTTCGGACTCTCCGGCACCAACGCCCACGTCATCCTCGAGGAGGCTCCCGCCGCCCTG
TCGGGCGAGCCCGCCACCTCACAGACGGCGTCGCGACCGCTCCCCGCGGCGTGTGCCGTG
CTCCTGTCGGCCAGGAGCGAGGCCGCCGTCCGCGCCCAGGCGAAGCGGCTCCGCGACCAC
CTCCTCGCCCACGACGACCTCGCCCTTATCGATGTGGCCTATTCGCAGGCCACCACCCGC
GCCCACTTCGAGCACCGCGCCGCTCTCCTGGCCCGCGACCGCGACGAGCTCCTCTCCGCG
CTCGACTCGCTCGCCCAGGACAAGCCCGCCCCGAGCACCGTTCTCGGCCGGAGCGGAAGC
CACGGCAAGGTCGTCTTCGTCTTTCCTGGGCAAGGCTCGCAGTGGGAAGGGATGGCCCTC
TCCCTGCTCGACTCCTCGCCGGTCTTCCGCGCTCAGCTCGAAGCATGCGAGCGCGCGCTC
GCTCCTCACGTCGAGTGGAGCCTGCTCGCCGTCCTGCGCCGCGACGAGGGCGCCCCCTCC
CTCGACCGCGTCGACGTCGTACAGCCCGCCCTCTTTGCCGTCATGGTCTCCCTGGCCGCC
CTCTGGCGCTCGCTCGGCGTCGAGCCCGCCGCCGTCGTCGGCCACAGCCAGGGCGAGATC
GCCGCCGCCTTCGTCGCAGGCGCTCTCTCCCTCGAGGACGCGGCGCGCATCGCCGCCCTG
CGCAGGAAAGCGCTCACCACCGTCGGCGGCAACGGCGGCATGGCCGCCGTCGAGCTCGGC
GCCTCCGACCTCCAGACCTACCTCGCTCCCTGGGGCGACAGGCTCTCCACCGCCGCCGTC
AACAGCCCCAGGGCTACCCTCGTATCCGGCGAGCCCGCCGCCGTCGACGCGCTGCTCGAC
GTCCTCACCGCCACCAAGGTGTTCGCCCGCAAGATCCGCGTCGACTACGCCTCCCACTCC
GCCCAGATGGACGCCGTCCAAGACGAGCTCGCCGCAGGTCTAGCCAACATCGCTCCTCGG
ACGTGCGAGCTCCCTCTTTATTCGACCGTCACCGGCACCAGGCTCGACGGCTCCGAGCTC
GACGGCGCGTACTGGTATCGAAACCTCCGGCAAACCGTCCTGTTCTCGAGCGCGACCGAG
CGGCTCCTCGACGATGGGCATCGCTTCTCCGTCGAGGTCAGCCCCCATCCCGTGCTCACG
CTCGCCCTCCGCGAGACCTGCGAGCGCTCACCGCTCGATCCCGTCGTCGTCGGCTCCATT
CGACGAGAAGAAGGCCACCTCGCCCGCCTGCTCCTCTCCTGGGCGGAGCTCTCTACCCGA
GGCCTCGCGCTCGACTGGAAGGACTTCTTCGCGCCCTACGCTCCCCGCAAGGTCTCCCTC
CCCACCTACCCCTTCCAGCGAGAGCGGTTCTGGCTCGACGTCTCCACGGACGAACGCTTC
CGACGTCGCCTCCGCAGGCCTGACCTCGGCCGACCAATCCCGCTGCTCGGCGCCGCCGTC
GCCTTCGCCGACCGCGGTGGCTTTCTCTTTACAGGGCGGCTCTCCCTCGCAGAGCACCCG
TGGCTCGAAGGCCATGCCGTCTTCGGCACACCCATCCTACCGGGCACCGGCTTTCTCGAG
CTCGCCCTGCACGTCGCCCACCGCGTCGGCCTCGACACCGTCGAAGAGCTCACGCTCGAG
GCCCCTCTCGCTCTCCCATCGCAGGACACCGTCCTCCTCCAGATCTCCGTCGGGCCCGTG
GACGACGCAGGACGAAGGGCGCTCTCTTTCCATAGCCGACAAGAGGACGCGCTTCAGGAT
GGCCCCTGGACTCGCCACGCCAGCGGCTCTCTCTCGCCGGCGACCCCATCCCTCTCCGCC
GATCTCCACGAGTGGCCTCCCTCGAGTGCCATCCCGGTGGACCTCGAAGGCCTCTACGCA
ACCCTCGCCAACCTCGGGCTTGCCTACGGCCCCGAGTTCCAGGGCCTCCGCTCCGTCTAC
AAGCGCGGCGACGAGCTCTTTGCCGAAGCCAAGCTCCCGGAAGCGGCCGAAAAGGATGCC
GCCCGGTTTGCCCTCCACCCTGCGCTGCTCGACAGCGCCCTGCATGCACTGGCCTTTGAG
GACGAGCAGAGAGGGACGGTCGCTCTGCCCTTCTCGTGGAGCGGAGTCTCGCTGCGCTCC
GTCGGTGCCACCACCTTGCGCGTGCGCTTCCACCGTCCCAAGGGTGAATCCTCCGTCTCG
ATCGTCCTGGCCGACGCCGCAGGTGACCCTCTTGCCTCGGTGCAAGCGCTCGCCATGCGG
ACGACGTCCGCCGCGCAGCTCCGCACCCCGGCAGCTTCCCACCATGATGCGCTCTTCCGC
GTCGACTGGAGCGAGCTCCAAAGCCCCACTTCACCGCCTGCCGCCCCGAGCGGCGTCCTT
CTCGGCACAGGCGGCCACGATCTCGCGCTCGACGCCCCGCTCGCCCGCTACGCCGACCTC
GCTGCCCTCCGAAGCGCCCTCGACCAGGGCGCTTCGCCTCCCGGCCTCGTCGTCGCCCCC
TTCATCGATCGACCGGCAGGCGACCTCGTCCCGAGCGCCCACGAGGCCACCGCGCTCGCA
CTCGCCCTCTTGCAAGCCTGGCTCGCCGACGAACGCCTCGCCTCGTCGCGCCTCGTCCTC
GTCACCCGACGCGCCGTCGCCACCCACACCGAAGACGACGTCAAGGACCTCGCTCACGCG
CCGCTCTGGGGGCTCGCGCGCTCCGCGCAAAGTGAGCACCCAGACCTCCCGCTCTTCCTC
GTCGACATCGACCTCAGCGAGGCCTCCCAGCAGGCCCTGCTAGGCGCGCTCGACACAGGA
GAACGCCAGCTCGCCCTCCGCAACGGGAAACCCCTCATCCCGAGGTTGGCGCAACCACGC
TCGACGGACGCGCTCATCCCGCCGCAAGCACCCACGTGGCGCCTCCATATTCCGACCAAA
GGCACCTTCGACGCGCTCGCCCTCGTCGACGCCCCCGAGGCCCAGGCGCCCCTCGCACAC
GGCCAAGTCCGCATCGCCGTGCACGCGGCAGGGCTCAACTTCCGCGATGTCGTCGACACC
CTTGGCATGTATCCGGGCGACGCGCCGCCGCTCGGAGGCGAAGGCGCGGGCATCGTTACT
GAAGTCGGTCCAGGTGTCTCCCGATACACCGTAGGCGACCGGGTGATGGGGGTCTTCGGC
GCAGCCTTTGGTCCCACGGCCATCGCCGACGCCCGCATGATCTGCCCCATCCCCCACGCC
TGGTCCTTCGCCCAAGCCGCCAGCGTCCCCATCATCTATCTCACCGCCTACTATGGACTC
GTCGATCTCGGGCATCTGAAACCCAATCAACGTGTCCTCATCCATGCGGCCGCCGGCGGC
GTCGGGACGGCCGCCGTTCAGCTCGCACGCCACCTCGGCGCCGAGGTCTTTGCCACCGCC
AGTCCAGGGAAGTGGAGCGCTCTCCGCGCGCTCGGCTTCGACGATGCGCACCTCGCGTCC
TCACGTGACCTGGGCTTCGAGCAGCACTTCCTGCGCTCCACGCATGGGCGCGGCATGGAT
GTCGTCCTCGACTGTCTGGCACGCGAGTTCGTCGACGCCTCGCTGCGCCTCATGCCGAGC
GGTGGACGCTTCATCGAGATGGGAAAGACGGACATCCGTGAGCCCGACGCGATCGGCCTC
GCCTACCCTGGCGTCGTTTACCGCGCCTTCGACGTCACAGAGGCCGGACCGGATCGAATT
GGGCAGATGCTCGCAGAGCTGCTCAGCCTCTTCGAGCGCGGTGTGCTTCGTCTGCCACCC
ATCACATCCTGGGACATCCGTCATGCCCCCCAGGCCTTCCGCGCGCTCGCCCAGGCGCGG
CATGTTGGGAAGTTCGTCCTCACCATTCCCCGTCCGATCGATCCCGAGGGGACCGTCCTC
ATCACGGGAGGCACCGGGACGCTAGGAGTCCTGGTCGCACGCCACCTCGTCGCGAAACAC
AGCGCCAAACACCTGCTCCTCACCTCGAGGAAGGGCGCGCGTGCTCCGGGCGCGGAGGCT
CTGCGAAGCGAGCTCGAAGCGCTGGGGGCCTCGGTCACCCTCGTCGCGTGCGACGTGGCC
GACCCACGCGCCCTCCGGACCCTCCTGGACAGCATCCCGAGGGATCATCCGATCACGGCC
GTCGTGCACGCCGCCGGCGCCCTCGACGACGGGCCGCTCGGTAGCATGAGCGCCGAGCGC
ATCGCTCGCGTCTTTGACCCCAAGCTCGATGCCGCTTGGTACTTGCATGAGCTCACCCAG
GACGAGCCGGTCGCGGCCTTCGTCCTCTTCTCGGCCGCCTCCGGCGTCCTTGGTGGTCCA
GGTCAGTCGAACTACGCCGCTGCCAATGCCTTCCTCGATGCGCTCGCACATCACCGGCGC
GCCCAAGGACTCCCAGCCGCTTCGCTCGCCTGGGGCTACTGGGCCGAGCGCAGTGGGATG
ACCCGGCACCTCAGCGCCGCCGACGCCGCTCGCATGAGGCGCGCCGGCGTCCGGCCCCTC
GACACTGACGAGGCGCTCTCCCTCTTCGATGTGGCTCTCTTGCGACCCGAGCCCGCTCTG
GTCCCCGCCCCCTTCGACTACAACGTGCTCAGCACGAGTGCCGACGGCGTGCCCCCGCTG
TTCCAGCGTCTCGTCCGCGCTCGCATCGCGCGCAAGGCCGCCAGCAATACTGCCCTCGCC
TCGTCGCTTGCAGAGCACCTCTCCTCCCTCCCGCCCGCCGAACGCGAGCGCGTCCTCCTC
GATCTCGTCCGCACCGAAGCCGCCTCCGTCCTCGGCCTCGCCTCGTTCGAATCGCTCGAT
CCCCATCGCCCTCTACAAGAGCTCGGCCTCGATTCCCTCATGGCCCTCGAGCTCCGAAAT
CGACTCGCCGCCGCCGCCGGGCTGCGGCTCCAGGCTACTCTCCTCTTCGACTATCCAACC
CCGACTGCGCTCTCACGCTTTTTCACGACGCATCTCTTCGGGGGAACCACCCACCGCCCC
GGCGTACCGCTCACCCCGGGGGGGAGCGAAGACCCTATCGCCATCGTGGCGATGAGCTGC
CGCTTCCCGGGCGACGTGCGCACGCCCGAGGATCTCTGGAAGCTCTTGCTCGACGGACAA
GATGCCATCTCCGGCTTTCCCCAAAATCGCGGCTGGAGTCTCGATGCGCTCGACGCCCCC
GGTCGCTTCCCAGTCCGGGAGGGGGGCTTCGTCTACGACGCAGACGCCTTCGATCCGGCC
TTCTTCGGGATCAGTCCACGTGAAGCGCTCGCCGTTGATCCCCAACAGCGCATTTTGCTC
GAGATCACATGGGAAGCCTTCGAGCGTGCAGGCATCGACCCGGCCTCCCTCCAAGGAAGC
CAAAGCGGGGTCTTCGTTGGCGTATGGCAGAGCGACTACCATGCATCGCTGGTGAACGCG
ACTGGCGAATACAAAGGACTCGTTGCCACCGGTAGCGCAGCGAGCGTCGCCTCCGGCCGA
ATCGCATACACGTTCGGACTTCAAGGGCCCGCCATCAGCGTGGACACGGCGTGCAGCTCT
TCGCTCGTCGCGGTTCACCTCGCCTGCCAGGCCCTCCGCCACGGCGAATGCTCCCTGGCG
CTCGCTGGCGGCGTGACCATCATGGCCACGCCAGGCATATTCATCGCGTTCGACTCCGAG
AGCGCGGGTGCCCCCGACGGTCGCTGCAAGGCCTTCTCGGCGGAAGCCGACGGTTCGGGC
TGGGCCGAAGGCGCCGGGATGCTCCTGCTCGAGCGCCTCTCCGATGCCGTCCAAAACGGT
CATCCCGTCCTCGCCGTCCTTCGAGGCTCCGCCGTCAACCAGGACGGCCGGAGCCAAGGC
CTCACCGCGCCCAATGGCCCTGCCCAGGAGCGCGTCATCCGGCAAGCGCTCGACAGCGCG
CGGCTCACTCCAAAGGACGTCGACGTCGTCGAGGCTCACGGCACGGGAACCACCCTCGGA
GACCCCATCGAGGCACAGGCCGTTTTTGCCACCTATGGCGAGGCCCATTCCCAAGACAGA
CCCCTCTGGCTTGGAAGCCTCAAGTCCAACCTGGGACATACTCAGGCCGCGGCCGGCGTC
GGCGGCATCATCAAGATGGTGCTCGCGTTGCAGCACGGTCTCTTGCCCAAGACCCTCCAT
GCCCAGAATCCCTCCCCCCACATCGACTGGTCTCCAGGCATCGTAAAGCTCCTGAACGAG
GCCGTCGCCTGGACGACCAGCGGACATCCTCGCCGCGCCGGTGTTTCCTCGTTCGGCGTC
TCCGGCACCAACGCCCATGTCATCCTCGAAGAGGCTCCCGCCGCCACGCGGGCCGAGTCA
GGCGCTTCACAGCCTGCATCGCAGCCGCTCCCCGCGGCGTGGCCCGTCGTCCTGTCGGCC
AGGAGCGAGGCCGCCGTCCGCGCCCAGGCTCAAAGGCTCCGCGAGCACCTGCTCGCCCAA
GGCGACCTCACCCTCGCCGATGTGGCCTATTCGCTGGCCACCACCCGCGCCCACTTCGAG
CACCGCGCCGCTCTCGTAGCCCACGACCGCGACGAGCTCCTCTCCGCGCTCGACTCGCTC
GCCCAGGACAAGCCCGCACCGAGCACCGTCCTCGGACGGAGCGGAAGCCACGGCAAGGTC
GTCTTCGTCTTTCCTGGGCAAGGCTCGCAGTGGGAAGGGATGGCCCTCTCCCTGCTCGAC
TCCTCGCCCGTCTTCCGCACACAGCTCGAAGCATGCGAGCGCGCGCTCCGTCCTCACGTC
GAGTGGAGCCTGCTCGCCGTCCTGCGCCGCGACGAGGGCGCCCCCTCCCTCGACCGCGTC
GACGTCGTGCAGCCCGCCCTCTTTGCCGTCATGGTCTCCCTGGCCGCCCTCTGGCGCTCG
CTCGGCGTCGAGCCCGCCGCCGTCGTCGGCCACAGCCAGGGCGAGATAGCCGCCGCCTTC
GTCGCAGGCGCTCTCTCCCTCGAGGACGCGGCCCGCATCGCCGCCCTGCGCAGCAAAGCG
TCACCACCGTCGCCGGCAACGGGCATGGCCGCCGTCGAGCTCGGCGCCTCCGACCTCCAG
ACCTACCTCGCTCCCTGGGGCGACAGGCTCTCCATCGCCGCCGTCAACAGCCCCAGGGCC
ACGCTCGTATCCGGCGAGCCCGCCGCCGTCGACGCGCTGATCGACTCGCTCACCGCAGCG
CAGGTCTTCGCCCGAAGAGTCCGCGTCGACTACGCCTCCCACTCAGCCCAGATGGACGCC
GTCCAAGACGAGCTCGCCGCAGGTCTAGCCAACATCGCTCCTCGGACGTGCGAGCTCCCT
CTTTATTCGACCGTCACCGGCACCAGGCTCGACGGCTCCGAGCTCGACGGCGCGTACTGG
TATCGAAACCTCCGGCAAACCGTCCTGTTCTCGAGCGCGACCGAGCGGCTCCTCGACGAT
GGGCATCGCTTCTTCGTCGAGGTCAGCCCTCATCCCGTGCTCACGCTCGCCCTCCGCGAG
ACCTGCGAGCGCTCACCGCTCGATCCCGTCGTCGTCGGCTCCATTCGACGCGACGAAGGC
CACCTCCCCCGTCTCCTTGCTCTCTTGGGCCGAGCTCTATGGCCGGGCCTCACGCCCGAG
TGGAAGGCCTTCTTCGCGCCCTTCGCTCCCCGCAAGGTCTCACTCCCCACCTACGCCTTC
CAGCGCGAGCGTTTCTGGCTCGACGCCCCCAACGCACACCCCGAAGGCGTCGCTCCCGCT
GCGCCGATCGATGGGCGGTTTTGGCAAGCCATCGAACGCGGGGACCTCGACGCGCTCAGC
GGCCAGCTCCACGCGGACGGCGACGAGCAGCGCGCCGCCCTCGCCCTGCTCCTTCCCACC
CTCTCGAGCTTTCACCACCAGCGCCAAGAGCAGAGCACGGTCGACACCTGGCGCTACCGC
ATCACGTGGAGGCCTCTGACCACCGCCGCCACGCCCGCCGACCTCGCCGGCACCTGGCTC
CTCGTCGTGCCGTCCGCGCTCGGCGACGACGCGCTCCCTGCCACGCTCACCGATGCGCTT
ACCCGGCGCGGCGCGCGTGTCCTCGCGCTGCGCCTGAGCCAGGTTCACATAGGCCGCGCG
GCTCTCACCGAGCACCTGCGCGAGGCTGTTGCCGAGACTGCCCCGATTCGCGGCGTGCTC
TCCCTCCTCGCCCTCGACGAGCGCCCCCTCGCGGACCATGCCGCCCTGCCCGCGGGCCTT
GCCCTCTCGCTCGCCCTCGTCCAAGCCCTCGGCGACCTCGCCCTCGAGGCTCCCTTGTGG
CTCTTCACGCGCGGCGCCGTCTCGATTGGACACTCCGACCCACTCGCCCATCCCACCCAG
GCCATGATCTGGGGCTTGGGCCGCGTCGTCGGCCTCGAGCACCCCGAGCGGTGGGGCGGG
CTCGTCGACCTCGGCGCAGCGCTCGACGCGAGCGCCGCAGGCCGCTTGCTCCCGGCCCTC
GCCCAGCGCCACGACGAAGACCAGCTCGCGCTGCGCCCGGCCGGCCTCTACGCACGCCGC
TTCGTCCGCGCCCCGCTCGGCGATGCGCCTGCCGCTCGCGGCTTCATGCCCCGAGGCACC
ATCCTCATCACCGGTGGTACCGGCGCCATTGGCGCTCACGTCGCCCGATGGCTCGCTCGA
AAAGGCGCTGAGCACCTCGTCCTCATCAGCCGACGAGGGGCCCAGGCCGAAGGCGCCGTG
GAGCTCCACGCCGAGCTCACCGCCCTCGGCGCGCGCGTCACCTTCGCCGCGTGCGATGTC
GCCGACAGGAGCGCTGTCGCCACGCTTCTCGAGCAGCTCGACGCCGGAGGGCCACAGGTG
AGCGCCGTGTTCCACGCGGGCGGCATCGAGCCCCACGCTCCGCTCGCCGCCACCTCCATG
GAGGATCTCGCCGAGGTTGTCTCCGGCAAGGTACAAGGTGCAAGACACCTCCACGACCTG
CTCGGCTCTCGACCCCTCGACGCCTTTGTTCTCTTCTCGTCCGGCGCGGTCGTCTGGGGC
GGCGGACAACAAGGCGGCTATGCCGCTGCGAACGCCTTCCTCGATGCCCTGGCCGAGCAG
CGGCGCAGCCTTGGGCTGACGGCGACATCGGTGGCCTGGGGCGTGTGGGGCGGCGGCGGC
ATGGCTACCGGGCTCCTGGCAGCCCAGCTAGAGCAACGCGGTCTGTCGCCGATGGCCCCC
TCGCTGGCCGTGGCGACGCTCGCGCTGGCGCTGGAGCACGACGAGACCACCCTCACCGTC
GCCGACATCGACTGGGCGCGCTTTGCGCCTTCGTTCAGCGCCGCTCGCTCCCGCCCGCTC
CTGCGCGATTTGCCCGAGGCGCAGCGCGCTCTCGAAGCCAGCGCCGATGCGTCCTCCGAG
CAAGACGGGGCCACAGGCCTCCTCGACAAGCTCCGAAACCGCTCGGAGAGCGAGCAGATC
CACCTGCTCTCCTCGCTGGTGCGCCACGAAGCGGCCCTCGTCCTGGGCCATACCGACGCC
TCCCAGGTCGACCCCCACAAGGGCTTCATGGACCTCGGCCTCGATTCGCTCATGACCGTC
GAGCTTCGTCGGCGCTTGCAGCAGGCCACCGGCATCAAGCTCCCGGCCACCCTCGCCTTC
GACCATCCCTCTCCTCATCGCGTCGCGCTCTTCTTGCGCGACTCGCTCGCCCACGCCCTC
GGCGCGAGGCTCTCCGTCGAGCGCGACGCCGCCGCGCTCCCGGCGCTTCGCTCGGCGAGC
GACGAGCCCATCGCCATCGTCGGCATGGCCCTCCGCTTGCCGGGCGGCATCGGCGATGTC
GACGCTCTTTGGGAGTTCCTCGCCCAAGGACGCGACGCCGTCGAGCCCATTCCCCATGCC
CGATGGGATGCCGGTGCCCTCTACGACCCCGACCCCGACGCCAAGGCCAAGAGCTACGTC
CGGCATGCCGCCATGCTCGACCAGGTCGACCTCTTCGATCCTGCCTTCTTTGGCATCAGC
CCTCGCGAGGCCAAATACCTCGACCCCCAGCACCGCCTGCTCCTCGAATCTGCCTGGCTG
GCCCTCGAGGACGCCGGCATCGTCCCCTCCACCCTCAAGGATTCTCCCACCGGCGTCTTC
GTCGGCATCGGCGCCAGCGAATACGCACTGCGAAACACGAGCTCCGAAGAGGTCGAAGCG
TATGCCCTCCAAGGCACCGCCGGGTCCTTTGCCGCGGGGCGCTTGGCCTACACGCTCGGC
CTGCAAGGGCCCGCGCTCTCGGTCGACACCGCCTGCTCCTCCTCGCTCGTCGCCCTCCAC
CTCGCCTGCCAAGCCCTCCGACAGGGCGAGTGCAACCTCGCCCTCGCCGCGGGCGTCTCC
GTCATGGCCTCCCCCGGGCTCTTCGTCGTCCTTTCCCGCATGCGTGCTTTGGCGCCCGAT
GGCCGCTCCAAGACCTTCTCGACCAACGCCGACGGCTACGGACGCGGAGAGGGCGTCGTC
GTCCTTGCCCTCGAGCGGCTCGGCGACGCCCTCGCCCGAGGACACCGCGTCCTCGCCCTC
GTCCGCGGCACCGCCATGAACCATGACGGCGCGTCGAGCGGCATCACCGCCCCCAATGGC
ACCTCCCACCAGAAGGTCCTCCGCGCCGCGCTCCACGACGCCCATATCGGCCCTGCCGAC
GTCGACGTCGTCGAATGCCATGGCACCGGCACCTCCTTGGGAGACCCCATCGAGGTGCAA
GCCCTGGCCGCCGTCTACGCCGATGGCAGACCCGCTGAAAAGCCTCTCCTTCTCGGCGCA
CTCAAGACCAACATTGGCCATCTCGAGGCCGCCTCCGGCCTCGCGGGCGTCGCCAAGATC
GTCGCCTCCCTCCGCCATGACGCCCTGCCCCCCACCCTCCACACGACCCCGCGCAATCCC
CTGATCGAGTGGGATGCGCTCGCCATCGACGTCGTCGATGCCACGAGGGCGTGGGCCCGC
CACGAAGATGGCAGTCCCCGCCGCGCCGGCGTCTCCGCCTTCGGACTCTCCGGCACCAAC
GCCCACGTTATCCTCGAAGAGGCTCCCGCGATCCCGCAGGCCGAGCCCACCGCGGCACAG
CTCGCGTCGCAGCCGCTTCCCGCAGCCTGGCCCGTGCTCCTGTCGGCCAGGAGCGAGCCG
GCCGTGCGCGCCCAGGCCCAGAGGCTCCGCGACCACCTCCTCGCCCACGACGACCTCGCC
CTGGCCGATGTAGCCTACTCGCTCGCCACCACCCGGGCTACCTTCGAGCACCGTGCCGCT
CTCGTGGTCCACGACCGCGAAGAGCTCCTCTCCGCGCTCGATTCGCTCGCCCAGGGAAGG
CCCGCCCCGAGCACCGTCGTCGAACGAAGCGGAAGCCACGGCAAGGTCGTCTTCGTCTTT
CCTGGGCAAGGCTCGCAGTGGGAAGGGATGGCCCTCTCCCTGCTCGATACCTCGCCGGTC
TTCCGGGCACAGCTCGAAGCGTGCGAGCGCGCCCTCGCGCCCCACGTGGACTGGTCGCTG
CTCGCGGTGCTCCGCGGCGAGGAGGGCGCGCCCCCGCTCGACCGGGTCGACGTGGTCCAG
CCCGCGCTGTTCTCGATGATGGTCTCGCTGGCCGCCCTGTGGCGCTCCATGGGCGTCGAG
CCCGACGCGGTGGTCGGCCATAGCCAGGGCGAGATCGCCGCGGCCTGTGTGGCGGGCGCG
CTGTCGCTCGAGGACGCTGCCAAGCTGGTGGCGCTGCGCAGCCGTGCGCTCGTGGAGCTC
GCCGGCCAGGGGGCCATGGCCGCGGTGGAGCTGCCGGAGGCCGAGGTCGCACGGCGCCTC
CAGCGCTATGGCGATCGGCTCTCCATCGGGGCGATCAACAGCCCTCGTTTCACGACGATC
TCCGGCGAGCCCCCTGCCGTCGCCGCCCTGCTCCGCGATCTGGAGTCCGAGGGCGTCTTC
GCCCTCAAGCTGAGTTACGACTTCGCCTCCCACTCCGCGCAGGTCGAGTCGATTCGCGAC
GAGCTCCTCGATCTCCTGTCGTGGCTCGAGCCGCGCTCGACGGCGGTCCCGTTCTACTCC
ACGGTGAGCGGCGCCGCGATCGACGGGAGCGAGCTCGACGCCGCCTACTGGTACCGGAAC
CTCCGGCAGCCGGTCCGCTTCGCAGACGCTGTGCAAGGCCTCCTTGCCGGAGAACATCGC
TTCTTCGTGGAGGTGAGCCCCAGTCCTGTGCTGACCTTGGCCTTGCACGAGCTCCTCGAA
GCGTCGGAGCGCTCGGCGGCGGTGGTCGGCTCTCTGTGGAGCGACGAAGGGGATCTACGG
CGCTTCCTCGTCTCGCTCTCCGAGCTCTACGTCAACGGCTTCGCCCTGGATTGGACGACG
ATCCTGCCCCCCGGGAAGCGGGTGCCGCTGCCCACCTACCCCTTCCAGCGCGAGCGCTTC
TGGCTCGACGCCTCCACGGCACCCGCCGCCGGCGTCAACCACCTTGCTCCGCTCGAGGGG
CGGTTCTGGCAGGCCATCGAGAGCGGGAATATCGACGCGCTCAGCGGCCAGCTCCACGTG
GACGGCGACGAGCAGCGCGCCGCCCTTGCCCTGCTCCTTCCCACCCTCGCGAGCTTTCGC
CACGAGCGGCAAGAGCAGGGCACGGTCGACGCCTGGCGCTACCGCATCACGTGGAAGCCT
CTGACCACCGCCACCACGCCCGCCGACCTGGCCGGCACCTGGCTCCTCGTCGTGCCGGCC
GCTCTGGACGACGACGCGCTCCCCTCCGCGCTCACCGAGGCGCTCGCCCGGCGCGGCGCG
CGCGTCCTCGCCGTGCGCCTGAGCCAGGCCCACCTGGACCGCGAGGCTCTCGCCGAGCAC
CTGCGCCAGGCTTGCGCCGAGACCGCGCCGCCTCGCGGCGTGCTCTCGCTCCTCGCCCTC
GACGAAAGTCCCCTCGCCGACCATGCCGCCGTGCCCGCGGGACTCGCCTTCTCGCTCACC
CTCGTCCAAGCCCTCGGCGACATCGCCCTCGACGCGCCCTTGTGGCTCTTCACCCGCGGC
GCCGTCTCCGTCGGACACTCCGACCCCATCGCCCATCCGACGCAGGCGATGACCTGGGGC
CTGGGCCGCGTCGTCGGCCTCGAGCACCCCGAGCGCTGGGGAGGGCTCGTCGACGTCGGC
GCAGCGATCGACGCGAGCGCCGTGGGCCGCTTGCTCCCGGTCCTCGCCCTGCGCAACGAT
GAGGACCAGCTCGCTCTCCGCCCGGCCGGGTTCTACGCTCGCCGCCTCGTCCGCGCTCCG
CTCGGCGACGCGCCGCCCGCACGTACCTTCAAGCCCCGAGGCACCCTCCTCATCACCGGA
GGCACCGGCGCCGCTGGCGCTCACGTCGCCCGATGGCTCGCTCGAGAAGGCGCAGAGCAC
CTCGTCCTCATCAGCCGCCGAGGGGCCCAGGCCGAGGGCGCCTCGGAGCTCCACGCCGAG
CTCACGGCCCTGGGCGCGCGCGTCACCTTCGCCGCGTGTGATGTCGCCGACAGGAGCGCT
GTCGCCACGCTTCTCGAGCAGCTCGACGCCGAAGGGTCGCAGGTCCGCGCCGTGTTCCAC
GCGGGCGGCATCGGGCGCCACGCTCCGCTCGCCGCCACCTCTCTCATGGAGCTCGCCGAC
GTTGTCTCTGCCAAGGTCCTAGGCGCAGGGAACCTCCACGACCTGCTCGGTCCTCGACCC
CTCGACGCCTTCGTCCTTTTCTCGTCCATCGCAGGCGTCTGGGGCGGCGGACAACAAGCC
GGATACGCCGCCGGAAACGCCTTCCTCGACGCCCTGGCCGACCAGCGGCGCAGTCTTGGA
CAGCCGGACACGTCCGTGGTGTGGGGCGCGTGGGGCGGCGGCGGTGGTATATTCACGGGG
CCCCTGGCAGCCCAGCTGGAGCAACGTCGTCTGTCGCCGATGGCCCCTTCGCTGGCCGTG
GCGGCGCTCGCGCAAGCCCTGGAGCACGACGAGACCACCGTCACCGTCGCCGACATCGAC
TGGGCGCGCTTTGCGCCTTCGATCAGCGTCGCTCGCTCCCGCCGCTCCTGCGCGACTTGC
CCGAGCAGCGCGCCCTCGAAGACAGAGAAGGCGCGTCCTCCTCCGAGCACGGCCCGGCCC
CCCGACCTCCTCGACAAGCTCCGGAGCCGCTCGGAGAGCGAGCAGCTCCGTCTGCTCGCC
GCGCTGGTGTGCGACGAGACGGCCCTCGTCCTCGGCCACGAAGGCCGCTTCCCAGCTCGA
CCCCGACAAGGCTTCTTCGACCTCGGTCTCGATTCGATCATGACCGTCGAGCTTCGTCGG
CGCTTGCAACAGGCCACCGGCATCAAGCTCCCGGCCACCCTCGCCTTCGACCATCCCTCT
CCTCATCGCGTCGCGCTCTTCATGCGCGACTCGCTCGCCCACGCCCTCGGCACGAGGCTC
TCCGCCGAGGCGACGCCGCCGCGCTCCGGCCGCGCCTCGAGCGACGAGCCCATCGCCATC
GTCGGCATGGCCCTGCGCCTGCCGGGCGGCGTCGGCGATGTCGACGCTCTTTGGGAGTTC
CTCCACCAAGGGCGCGACGCGGTCGAGCCCATTCCACAGAGCCGCTGGGACGCCGGTGCC
CTCTACGACCCCGACCCCGACGCCGACGCCAAGAGCTACGTCCGGCATGCCGCGATGCTC
GACCAGATCGACCTCTTCGACCCTGCCTTCTTCGGCATCAGCCCCCGGGAGGCCAAACAC
CTCGACCCCCAGCACCGCCTGCTCCTCGAATCTGCCTGGCTGGCCCTCGAGGACGCCGGC
ATCGTCCCCACCTCCCTCAAGGACTCCCTCACCGGCGTCTTCGTCGGCATCTGCGCCGGC
GAATACGCGATGCAAGAGGCGAGCTCGGAAGGTTCCGAGGTTTACTTCATCCAAGGCACT
TCCGCGTCCTTTGGCGCGGGGGGCTTGGCCTATACGCTCGGGCTCCAGGGGCCGCGATCT
TCGGTCGACACCGCCTGCTCCTCCTCGCTCGTCTCCCTCCACCTCGCCTGCCAAGCCCTC
CGACAGGGCGAGTGCAACCTCGCCCTCGCCGCGGGCGTGTCGCTCATGGTCTCCCCCCAG
ACCTTCGTCATCCTTTCCCGTCTGCGCGCCTTGGCGCCCGACGGCCGCTCCAAGACCTTC
TCGGACAACGCCGACGGCTACGGACGCGGAGAAGGCGTCGTCGTCCTTGCCCTCGAGCGG
ATCGGCGACGCCCTCGCCCGGAGACACCGCGTCCTCGTCCTCGTCCGCGGCACCGCCATC
AACCACGACGGCGCGTCGAGCGGTATCACCGCCCCCAACGGCACCTCCCAGCAGAAGGTC
CTCCGGGCCGCGCTCCACGACGCCCGCATCACCCCCGCCGACGTCGACGTCGTCGAGTGC
CATGGCACCGGCACCTCGCTGGGAGACCCCATCGAGGTGCAAGCCCTGGCCGCCGTCTAC
GCCGACGGCAGACCCGCTGAAAAGCCTCTCCTTCTCGGCGCGCTCAAGACCAACATCGGC
CATCTCGAGGCCGCCTCCGGCCTCGCGGGCGTCGCCAAGATGGTCGCCTCGCTCCGCCAC
GACGCCCTGCCCCCCACCCTCCACGCGACCCCACGCAATCCCCTCATCGAGTGGGAGGCG
CTCGCCATCGACGTCGTCGATACCCCGAGGCCTTGGCCCCGCCACGAAGATGGCAGTCCC
CGCCGCGCCGGCATCTCCGCCTTCGGATTCTCGGGCACCAACGCCCACGTCATCCTCGAA
GAGGCTCCCGCCGCCCTGCCGGCCGAGCCCGCCACCTCACAGCCGGCGTCGCAAGCCGCT
CCCGCGGCGTGGCCCGTGCTCCTGTCGGCCAGGAGCGAGGCCGCCGTCCGCGCCCAGGCG
AAGCGGCTCCGCGACCACCTCGTCGCCCACGACGACCTCACCCTCGCGGATGTGGCCTAT
TCGCTGGCCACCACCCGCGCCCACTTCGAGCACCGCGCCGCTCTCGTAGCCCACAACCGC
GACGAGCTCCTCTCCGCGCTCGACTCGCTCGCCCAGGACAAGCCCGCCCCGAGCACCGTC
CTCGGACGGAGCGGAAGCCACGGCAAGCTCGTCTTCGTCTTTCCTGGGCAAGGCTCGCAG
TGGGAAGGGATGGCCCTCTCGCTGCTCGACTCCTCGCCCGTCTTCCGCGCTCAGCTCGAA
GCATGCGAGCGCGCGCTCGCTCCTCACGTCGAGTGGAGCCTGCTCGCCGTCCTGCGCCGC
GACGAGGGCGCCCCCTCCCTCGACCGCGTCGACGTCGTACAGCCCGCCCTCTTTGCCGTC
ATGGTCTCCCTGGCGGCCCTCTGGCGCTCGCTCGGCGTAGAGCCCGCCGCCGTCGTCGGC
CACAGTCAGGGCGAGATCGCCGCCGCCTTCGTCGCAGGCGCTCTCTCCCTCGAGGACGCG
GCCCGCATCGCCGCCCTGCGCAGCAAAGCGCTCACCACCGTCGCCGGCAACGGGGCCATG
GCCGCCGTCGAGCTCGGCGCCTCCGACCTCCAGACCTACCTCGCTCCCTGGGGCGACAGG
CTCTCCATCGCCGCCGTCAACAGCCCCAGGGCCACGCTCGTGTCCGGCGAGCCCGCCGCC
ATCGACGCGCTGATCGACTCGCTCACCGCAGCGCAGGTCTTCGCCCGAAAAGTCCGCGTC
GACTACGCCTCCCACTCCGCCCAGATGGACGCCGTCCAAGACGAGCTCGCCGCAGGTCTA
GCCAACATCGCTCCTCGGACGTGCGAGCTCCCTCTTTATTCGACCGTCACCGGCACCAGG
CTCGACGGCTCCGAGCTCGACGGCGCGTACTGGTATCGAAACCTCCGGCAAACCGTCCTG
TTCTCGAGCGCGACCGAGCGGCTCCTCGACGATGGGCATCGCTTCTTCGTCGAGGTCAGC
CCCCATCCCGTGCTCACGCTCGCCCTCCGCGAGACCTGCGAGCGCTCACCGCTCGATCCC
GTCGTCGTCGGCTCCATTCGACGCGACGAAGGCCACCTCGCCCGCCTGCTCCTCTCCTGG
GCGGAGCTCTCTACCCGAGGCCTCGCGCTCGACTGGAACGCCTTCTTCGCGCCCTTCGCT
CCCCGCAAGGTCTCCCTCCCCACCTACCCCTTCCAACGCGAGCGCTTCTGGCTCGACGCC
TCCACGGCGCACGCTGCCGACGTCGCCTCCGCAGGCCTGACCTCGGCCGACCACCCGCTG
CTCGGCGCCGCCGTCGCCCTCGCCGACCGCGATGGCTTTGTCTTCACAGGACGGCTCTCC
CTCGCAGAGCACCCGTGGCTCGAAGACCACGTCGTCTTCGGCATACCCTGTCCTGCCAGG
CGCCGCCTCCTCGAGCTCGCCCTGCATGTCGCCCATCTCGTCGGCCTCGACACCGTCGAA
GACGTCACGCTCGACCCCCCCCTCGCTCTCCCATCGCAGGGCGCCGTCCTCCTCCAGATC
TCCGTCGGGCCCGCGGACGGTGCTGGACGAAGGGCGCTCTCCGTTCATAGCCGGCGCCAC
GACGCGCTTCAGGATGGCCCCTGGACTCGCCACGCCAGCGGCTCTCTCGCGCAAGCTAGC
CCGTCCCATTGCCTTCGATGCTCCGCGAATGGCCCCCCCTCGGGCGCCACCCAGGTGGAC
ACCCAAGGTTTCTACGCAGCCCTCGAGAGCGCTGGGCTTGCTTATGGCCCCGAGTTCCAG
GGCCTCCGCCGCCGTCTACAAGCGCGGCGACGAGCTCTTCGCCGAAGCCAAGCTCCCGGA
CGCCGCCGAAGAGGACGCCGCTCGTTTTGCCCTCCACCCCGCCCTGCTCGACAGCGCCTT
GCAGGCGCTCGCCTTTGTAGACGACCAGGCAAAGGCCTTCAGGATGCCCTTCTCGTGGAG
CGGAGTATCGCTGCGCTCCGGTCGGAGCCACCACCCTGCGCGTGCGTTTCCACCGTCCTG
AGGGCGAATCCTCGCGCTCGCTCCTCCTCGCCGACGCCAGAGGCGAACCCATCGCCTCGG
TGCAAGCGCTCGCCATGCGCGCCGCGTCCGCCGAGCAGCTCCGCAGACCCGGGAGCGTCC
CACCTCGATGCCCTCTTCCGCATCGACTGGAGCGAGCTGCAAAGCCCCACCTCACCGCCC
ATCGCCCCGAGCGGTGCCCTCCTCGGCACAGAAGGTCTCGACCTCGGGACCAGGGTGCCT
CTCGACCGCTATACCGACCTTGCTGCTCTACGCAGCGCCCTCGACCAGGGCGCTTCGCCT
CCAAGCCTCGTCATCGCCCCCTTCATCGCTCTGCCCGAAGGCGACCTCATCGCGAGCGCC
CGCGAGACCACCGCGCACGCGCTCGCCCTCTTGCAAGCCTGGCTCGCCGACGAGCGCCTC
GCCTCCTCGCGCCTCGCCCTCGTCACCCGACGCGCCGTCGCCACCCACGCTGAAGAAGAC
GTCAAGGGCCTCGCTCACGCGCCTCTCTGGGGTCTCGCTCGCTCCGCGCAGAGCGAGCAC
CCAGAGCGCCCTCTCGTCCTCGTCGACCTCGACGACAGCGAGGCCTCCCAGCACGCCCTG
CTCGGCGCGCTCGACGCAAGAGAGCCAGAGATCGCCCTCCGCAACGGCAAACCCCTCGTT
CCAAGGCTCTCACGCCTGCCCCAGGCGCCCACGGACACAGCGTCCCCCGCAGGCCTCGGA
GGCACCGTCCTCATCACGGGAGGCACCGGCACGCTCGGCGCCCTGGTCGCGCGCCGCCTC
GTCGTAAACCACGACGCCAAGCACCTGCTCCTCACCTCGCGCCAGGGCGCGAGCGCTCCG
GGTGCTGATGTCTTGCGAAGCGAGCTCGAAGCTCTGGGGGCTTCGGTCACCCTCGCCGCG
TGCGACGTGGCCGATCCACGCGCTCTAAAGGACCTTCTGGATAACATTCCGAGCGCTCAC
CCGGTCGCCGCCGTCGTGCATGCCGCCAGCGTCCTCGACGGCGATCTGCTCGGCGCCATG
AGCCTCGAGCGGATCGACCGCGTCTTCGCCCCCAAGATCGATGCCGCCTGGCACTTGCAT
CAGCTCACCCAAGATAAGCCCCTTGCCGCCTTCATCCTCTTCTCGTCCGTCGCCGGCGTC
CTCGGCAGCTCAGGTCACTCCAACTACGCCGCTGCGAGCGCCTTCCTCGATGCGCTTGCG
CACCACCGGCGCGCGCAAGGGCTCCCTGCCTCATCGCTCGCGTGGAGCCACTGGGCCGAG
CGCAGCGCAATGACAGAGCACGTCAGCGCCGCCGGCGCCCCTCGCATGGAGCGCGCCGGC
CTTCCCTCGACCTCTGAGGAGAGGCTCGCCCTCTTCGATGCGGCGCTCTTCCGAACCGAG
ACCGCCCTGGTCCCCGCGCGCTTCGACTTGAGCGCGCTCAGGGCGAACGCCGGCAGCGTC
CCCCCGTTGTTCCAACGTCTCGTCCGCGCTCGCACCGTACGCAAGGCCGCCAGCAACACC
GCCCAGGCCTCGTCGCTTACAGAGCGCCTCTCAGCCCTCCCGCCCGCCGAACGCGAGCGT
GCCCTGCTCGATCTCATCCGCACCGAAGCCGCCGCCGTCCTCGGCCTCGCCTCCTTCGAA
TCGCTCGATCCCGATCGCCTCCTCCAAGAGCTTGGCCTCGACTCCATCATCGCGCTCGAT
CTCCGAAATCGGCTCGCCGCCGCCACCGGCGTGCGACTCCCAGCCACCCTCCTCTTCGAG
CATCCAACCCCAGCTGCGCTCGCAGCCTTGCTCTTGGCTCGACTCGAACCTGGAATGCGA
AGAGGACCGGCGAAGGACGGCGCCTCTCCCACGGACACAGAGAGCGACGGCGCGCTCCTT
GGAATGGTTCAACCAGCGAACGAGATCGGAGCGATCGAAGAGGCCCGAAATCTCATCGCC
GCGGCCTTGAAAGTCCGCCTGGCGGTCGAAGACGCGTCGAAGCGGTCAGCGGTCGCGATC
GCGGAGGAGCCGCCCACTCGACTCGCAAGGGGTCAAGCGACACCCCAGTTGATTTGCTTT
CCGGCGTTCGTGGTTCCATCGGCGCCTATTCAGTACGCGCGCTTCGCTTCACACCTCAGG
GACCGGCGCGACATCTGGTTCATACCTCATCCAGGCTACCGCCATAAGACGCCGCTCACA
CGGAGCCTCGACGAGCTCGTTTCCTCGCACGCAAGAACGACATTGGCGTGCGCGCGCAAT
TCCCCCTTCGTGCTGTTCGGCCACTCTTCGGGTGGAAACATCGCCCACATGGTGGCCGAG
CACCTGGAGAGCATCGGACACGGCCCCGCCGGAGTCGTGCTCCTGGACAGCTATGATTAC
GCCAGTCCAGCGGTAGAGGCTGGGCTGAAGATCTTCCATGTAGAGCAGCTGCAAACTTGG
GGCGCCTCGGACGCCGGCCTGACCGCCGAGGCGTGGTACTATGAACACATCGGACTCGAG
ACCTGGAAGCCTAGACAGCTGGCCGCTCCGACATTGCATGTCCGCGCGACCGAACCCATG
AAGCAGTTCGTGGGGAGCGAAGGCGCTCCTGCGGAATGGCGCGCGAGCTGGAAATTGCCG
CATGTCGCGATAGATGCTCCAGGAGACCACGCTACGGTGGTAGATCACCCTTTCTTGGCG
CAAGCGGTCGACGACTGGCTGTCCTCGCTCTCCAACGAGCCGTCCAACCAATAG
[4] KS 34..404
[4] AT 564..886
[4] malonyl-CoA 756..760
[4] KR 1207..1387
[4] ACP 1491..1561
[5] KS 1589..1963
[5] AT 2125..2442
[5] methylmalonyl-CoA 2316..2320
[5] DH 2493..2656
[5] ER 2981..3287
[5] KR 3297..3477
[5] ACP 3579..3649
[6] KS 3671..4041
[6] AT 4201..4512
[6] methylmalonyl-CoA 4391..4395
[6] KR 4839..5019
[6] ACP 5123..5193
[7] KS 5221..5595
[7] AT 5757..6074
[7] methoxymalonyl-ACP 5948..5952
[7] KR 6394..6574
[7] ACP 6675..6749
[8] KS 6775..7149
[8] AT 7311..7628
[8] methylmalonyl-CoA 7502..7506
[8] DH 7678..7842
[8] KR 8161..8341
[8] ACP 8442..8512
[8] TE 8596..8807

[4] KS 100..1212
[4] AT 1690..2658
[4] malonyl-CoA 2266..2280
[4] KR 3619..4161
[4] ACP 4471..4683
[5] KS 4765..5889
[5] AT 6373..7326
[5] methylmalonyl-CoA 6946..6960
[5] DH 7477..7968
[5] ER 8941..9861
[5] KR 9889..10431
[5] ACP 10735..10947
[6] KS 11011..12123
[6] AT 12601..13536
[6] methylmalonyl-CoA 13171..13185
[6] KR 14515..15057
[6] ACP 15367..15579
[7] KS 15661..16785
[7] AT 17269..18222
[7] methoxymalonyl-ACP 17842..17856
[7] KR 19180..19722
[7] ACP 20023..20247
[8] KS 20323..21447
[8] AT 21931..22884
[8] methylmalonyl-CoA 22504..22518
[8] DH 23032..23526
[8] KR 24481..25023
[8] ACP 25324..25536
[8] TE 25786..26421
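
The two domain lists above appear to give the same features twice: first in amino-acid positions on the translated protein, then in nucleotide positions on the CDS. The record does not label them, so this is an inference from the numbers. A minimal sketch of the arithmetic, assuming 1-based inclusive ranges and a reading frame that starts at nucleotide 1 (the function name `aa_to_nt` is hypothetical):

```python
from typing import Tuple


def aa_to_nt(aa_start: int, aa_end: int) -> Tuple[int, int]:
    """Map a 1-based inclusive amino-acid range to the 1-based inclusive
    nucleotide range of the codons that encode it (frame starts at nt 1)."""
    # Residue n is encoded by nucleotides 3n-2 .. 3n.
    return 3 * aa_start - 2, 3 * aa_end


# (module, feature, aa_start, aa_end, nt_start, nt_end), copied from the lists above
examples = [
    (4, "KS", 34, 404, 100, 1212),
    (5, "ER", 2981, 3287, 8941, 9861),
    (7, "methoxymalonyl-ACP", 5948, 5952, 17842, 17856),
    (8, "TE", 8596, 8807, 25786, 26421),
]

for module, feature, a0, a1, n0, n1 in examples:
    # Each listed nucleotide range should match the converted amino-acid range.
    assert aa_to_nt(a0, a1) == (n0, n1), (module, feature)
```

The same relation accounts for the substrate-specificity entries (for example malonyl-CoA 756..760 against 2266..2280), which span five residues and fifteen nucleotides.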

Feature

BLASTP
Database:UniProtKB:2011_09
InterPro
Database:interpro:38.0
IPR001031 Thioesterase (Domain)
 [8596-8807]  6.00000000000001e-25 PF00975
PF00975   Thioesterase
IPR001227 Acyl transferase domain (Domain)
 [557-688]  G3DSA:3.40.366.10 [756-875]  G3DSA:3.40.366.10 [2118-2248]  G3DSA:3.40.366.10 [2316-2431]  G3DSA:3.40.366.10 [4194-4324]  G3DSA:3.40.366.10 [4391-4506]  G3DSA:3.40.366.10 [5747-5880]  G3DSA:3.40.366.10 [5948-6063]  G3DSA:3.40.366.10 [7304-7434]  G3DSA:3.40.366.10 [7502-7617]  G3DSA:3.40.366.10
G3DSA:3.40.366.10   Ac_transferase_reg
IPR002198 Short-chain dehydrogenase/reductase SDR (Family)
 [3297-3464]  1.9e-62 PF00106 [6394-6560]  1.5e-54 PF00106 [8161-8328]  3.09999999999998e-58 PF00106
PF00106   adh_short
IPR002364 Quinone oxidoreductase/zeta-crystallin, conserved site (Conserved_site)
 [3109-3130]  PS01162
PS01162   QOR_ZETA_CRYSTAL
IPR006162 Phosphopantetheine attachment site (PTM)
 [1519-1534]  PS00012 [3607-3622]  PS00012 [5151-5166]  PS00012 [6707-6722]  PS00012 [8470-8485]  PS00012
PS00012   PHOSPHOPANTETHEINE
IPR009081 Acyl carrier protein-like (Domain)
 [1500-1557]  7.1e-12 PF00550 [3582-3647]  1.5e-10 PF00550 [5127-5189]  1.6e-11 PF00550 [6700-6745]  3.4e-09 PF00550 [8445-8511]  3.90000000000001e-12 PF00550
PF00550   PP-binding
 [1484-1606]  1.20000117458134e-25 SSF47336 [3572-3687]  9.19998414420358e-27 SSF47336 [5116-5238]  4.80000830876748e-25 SSF47336 [6672-6792]  2.19999900980708e-21 SSF47336 [8434-8529]  3.89999861795218e-21 SSF47336
SSF47336   ACP_like
 [1491-1561]  PS50075 [3579-3649]  PS50075 [5123-5193]  PS50075 [6675-6749]  PS50075 [8442-8512]  PS50075
PS50075   ACP_DOMAIN
 [1492-1564]  1.99999999999999e-97 G3DSA:1.10.1200.10 [3579-3651]  1.99999999999999e-97 G3DSA:1.10.1200.10 [5125-5196]  1.99999999999999e-97 G3DSA:1.10.1200.10 [6679-6752]  1.99999999999999e-97 G3DSA:1.10.1200.10 [8440-8516]  1.99999999999999e-97 G3DSA:1.10.1200.10
G3DSA:1.10.1200.10   ACP_like
IPR011032 GroES-like (Domain)
 [2972-3116]  3.49999466863949e-30 SSF50129
SSF50129   GroES_like
IPR013149 Alcohol dehydrogenase, C-terminal (Domain)
 [3120-3211]  2.4e-16 PF00107
PF00107   ADH_zinc_N
IPR013154 Alcohol dehydrogenase GroES-like (Domain)
 [3001-3060]  3.6e-07 PF08240
PF08240   ADH_N
IPR013968 Polyketide synthase, KR (Domain)
 [1207-1386]  1.99999999999999e-62 PF08659 [4839-5018]  7.10000000000008e-62 PF08659
PF08659   KR
IPR014030 Beta-ketoacyl synthase, N-terminal (Domain)
 [34-279]  1e-90 PF00109 [1589-1838]  1.5e-89 PF00109 [3671-3916]  2.20000000000002e-89 PF00109 [5221-5470]  3.49999999999995e-89 PF00109 [6775-7020]  5.59999999999999e-86 PF00109
PF00109   ketoacyl-synt
IPR014031 Beta-ketoacyl synthase, C-terminal (Domain)
 [287-404]  1.6e-45 PF02801 [1846-1963]  2.19999999999997e-42 PF02801 [3924-4041]  9.50000000000005e-46 PF02801 [5478-5595]  1.09999999999999e-40 PF02801 [7034-7149]  1.9e-42 PF02801
PF02801   Ketoacyl-synt_C
IPR014043 Acyl transferase (Domain)
 [564-886]  1.39999999999997e-68 PF00698 [2125-2442]  1.69999999999998e-104 PF00698 [4201-4512]  4.89999999999996e-102 PF00698 [5757-6074]  5.69999999999998e-104 PF00698 [7311-7628]  2.79999999999994e-107 PF00698
PF00698   Acyl_transf_1
IPR015083 Polyketide synthase, docking (Domain)
 [1-27]  5.10000000000001e-08 PF08990
PF08990   Docking
IPR016035 Acyl transferase/acyl hydrolase/lysophospholipase (Domain)
 [560-855]  1.59998835313644e-71 SSF52151 [2123-2427]  7.29994178862692e-67 SSF52151 [4199-4502]  3.79997359309398e-67 SSF52151 [5755-6067]  1.79999754022378e-68 SSF52151 [7309-7613]  1.59998835313644e-68 SSF52151
SSF52151   Acyl_Trfase/lysoPlipase
IPR016036 Malonyl-CoA ACP transacylase, ACP-binding (Domain)
 [690-755]  9.00000407957274e-17 SSF55048 [2250-2315]  7.89999609889417e-17 SSF55048 [4325-4390]  4.70000326671386e-16 SSF55048 [5882-5947]  1e-15 SSF55048 [7436-7501]  2.49999811956465e-17 SSF55048
SSF55048   Malonyl_transacylase_ACP-bd
IPR016038 Thiolase-like, subgroup (Domain)
 [34-289]  G3DSA:3.40.47.10 [291-457]  G3DSA:3.40.47.10 [1592-1850]  G3DSA:3.40.47.10 [1851-2018]  G3DSA:3.40.47.10 [3670-3926]  G3DSA:3.40.47.10 [3928-4094]  G3DSA:3.40.47.10 [5224-5482]  G3DSA:3.40.47.10 [5483-5650]  G3DSA:3.40.47.10 [6775-7032]  G3DSA:3.40.47.10 [7036-7204]  G3DSA:3.40.47.10
G3DSA:3.40.47.10   Thiolase-like_subgr
IPR016039 Thiolase-like (Domain)
 [26-403]  1.69998802370953e-97 SSF53901 [1582-2018]  2.1000026783403e-102 SSF53901 [3670-4040]  2.80000504261099e-98 SSF53901 [5214-5650]  7.0999832511256e-100 SSF53901 [6768-7204]  3.39999972437719e-97 SSF53901
SSF53901   Thiolase-like
IPR016040 NAD(P)-binding domain (Domain)
 [1206-1389]  G3DSA:3.40.50.720 [1453-1465]  G3DSA:3.40.50.720 [3072-3261]  5.10000000000004e-62 G3DSA:3.40.50.720 [3297-3497]  G3DSA:3.40.50.720 [4839-5021]  G3DSA:3.40.50.720 [6394-6610]  G3DSA:3.40.50.720 [8160-8344]  G3DSA:3.40.50.720
G3DSA:3.40.50.720   NAD(P)-bd
IPR018201 Beta-ketoacyl synthase, active site (Active_site)
 [192-208]  PS00606 [1751-1767]  PS00606 [3829-3845]  PS00606 [5383-5399]  PS00606
PS00606   B_KETOACYL_SYNTHASE
IPR020801 Polyketide synthase, acyl transferase domain (Domain)
 [564-867]  SM00827 [2127-2424]  5.50001754266469e-114 SM00827 [4203-4499]  1.69998802370953e-120 SM00827 [5759-6056]  2.50000909916183e-118 SM00827 [7313-7610]  3.89999861795218e-123 SM00827
SM00827   PKS_AT
IPR020802 Polyketide synthase, thioesterase domain (Domain)
 [8598-8807]  8.70005082121738e-103 SM00824
SM00824   PKS_TE
IPR020806 Polyketide synthase, phosphopantetheine-binding domain (Domain)
 [1492-1564]  3.80000697093588e-33 SM00823 [3580-3652]  1.60000240695997e-27 SM00823 [5124-5196]  1.89999859865865e-31 SM00823 [6680-6752]  6.59999852526376e-26 SM00823 [8443-8515]  3.39999972437717e-29 SM00823
SM00823   PKS_PP
IPR020807 Polyketide synthase, dehydratase domain (Domain)
 [2493-2656]  3.99998544139406e-74 SM00826 [7678-7842]  3.39999972437719e-61 SM00826
SM00826   PKS_DH
IPR020841 Polyketide synthase, beta-ketoacyl synthase domain (Domain)
 [36-457]  SM00825 [1592-2018]  SM00825 [3673-4094]  SM00825 [5224-5650]  SM00825 [6778-7204]  SM00825
SM00825   PKS_KS
IPR020842 Polyketide synthase/Fatty acid synthase, KR (Domain)
 [1207-1387]  4.69998262513195e-54 SM00822 [3297-3477]  6.20001486442668e-62 SM00822 [4839-5019]  8.20001307135067e-55 SM00822 [6394-6574]  6.70002630158883e-48 SM00822 [8161-8341]  3.70001063537244e-57 SM00822
SM00822   PKS_KR
IPR020843 Polyketide synthase, enoylreductase (Domain)
 [2981-3287]  SM00829
SM00829   PKS_ER
SignalP No significant hit
TMHMM No significant hit
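
The InterPro hit lines above follow a repeating pattern of `[start-end]`, an optional E-value, and a member-database signature. For readers who want to tabulate them, here is a minimal parsing sketch. The pattern is inferred from the lines in this record only, not from any InterPro format specification, and the names `Hit`, `HIT_RE` and `parse_hits` are hypothetical.

```python
import re
from typing import List, NamedTuple, Optional


class Hit(NamedTuple):
    start: int                # first residue of the match (amino-acid position)
    end: int                  # last residue of the match
    evalue: Optional[float]   # None when the line gives no E-value
    signature: str            # member-database accession, e.g. PF00975


# "[start-end]", then an optional numeric E-value, then the signature token.
HIT_RE = re.compile(
    r"\[(\d+)-(\d+)\]\s+"
    r"(?:(\d+(?:\.\d+)?(?:[eE][+-]?\d+)?)\s+)?"
    r"(\S+)"
)


def parse_hits(line: str) -> List[Hit]:
    """Extract every hit from one InterPro hit line of this record."""
    hits = []
    for start, end, evalue, signature in HIT_RE.findall(line):
        hits.append(
            Hit(int(start), int(end), float(evalue) if evalue else None, signature)
        )
    return hits


# Example lines copied from the record above.
print(parse_hits(" [8596-8807]  6.00000000000001e-25 PF00975"))
print(parse_hits(" [557-688]  G3DSA:3.40.366.10 [756-875]  G3DSA:3.40.366.10"))
```

Parsing the E-values as floats also absorbs the long decimal tails in the listing (such as `6.00000000000001e-25`), which look like floating-point formatting noise rather than meaningful precision.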