Oxzol_00140 : CDS information

close this sectionLocation

Organism
StrainJA3453
Entry nameOxazolomycin
Contig
Start / Stop / Direction20,505 / 43,718 / + [in whole cluster]
20,505 / 43,718 / + [in contig]
Location20505..43718 [in whole cluster]
20505..43718 [in contig]
TypeCDS
Length23,214 bp (7,737 aa)
Click on the icon to see Genetic map.

close this sectionAnnotation

Category1.3 PKS/NRPS hybrid
Productpolyketide synthase/non-ribosomal peptide synthetase
Product (GenBank)NRPS/PKS
Gene
Gene (GenBank)ozmH
EC number
Keyword
  • glycine
Note
Note (GenBank)
Reference
ACC
PmId
[16707707] Utilization of the methoxymalonyl-acyl carrier protein biosynthesis locus for cloning the oxazolomycin biosynthetic gene cluster from Streptomyces albus JA3453. (J Bacteriol. , 2006)
[18401530] Alternative method for site-directed mutagenesis of complex polyketide synthase in Streptomyces albus JA3453. (Acta Biochim Biophys Sin (Shanghai). , 2008)
[20406823] Oxazolomycin biosynthesis in Streptomyces albus JA3453 featuring an "acyltransferase-less" type I polyketide synthase that incorporates two distinct extender units. (J Biol Chem. , 2010)
comment
※論文中ではLoading moduleをmodule 1としているが、本DBではLoading moduleはmodule 0とするため論文のmodule No.とずれが生じる。


[PMID:16707707]
methoxymalonyl-ACP生合成遺伝子の同定とクローニング

OzmB, OzmC, OzmD, OzmE, OzmF, OzmGが、methoxymalonyl-ACP生合成遺伝子であることを同定し、これらがoxazolomycin productionに必須であることを示した。
OzmHも一緒にクローニングされているが、機能解析はされていない。


[PMID:18401530]
module 9(※論文中module10と記載) KS domainについての報告。

module9にKS domainが2つ保存されているが、1st KS domainは触媒三残基がCys-Asn-Hisで変異しているのに対し、2nd KS doaminはCys-His-Hisで変異していなかった。
His->Asnに変化したactive siteのsingle AA基質はKSの不活性化を与えると予測し検証した。1st KS domainのCys->Gly変異はoxazolomycin生産量に影響しないが2nd KS doaminのCys->Gly変異はoxazolomycin生産が出来なかった。
1st KS domainの触媒三残基のHis->Asn変異したdomainは特に機能がないと結論づけている。


[PMID:20406823]
oxazolomycin生合成遺伝子クラスターの報告。

ozmH: Hybrid NRPS/PKS

module5(KS, KR, MT, ACP)  ※論文中module6と記載
module6(C, A, PCP)  ※論文中module7と記載
module7(KS, DH, KR, ACP)  ※論文中module8と記載
module8(KS, DH, ACP, MT, KR, ACP) ※論文中module9と記載
module9(KS, KS, KR, ACP)  ※論文中module10と記載

module8のDH domainはInterProではヒットしてこない。

A doaminの基質特異性に関して、FIGURE 4.にてATP-PPi exchange reaction(in vitro amino acid-dependent radiolabel exchange assay betweenpyrophosphate ([32P]PPi) and ATP)を測定している。この結果もGlycineに対して特異性を示した。

close this sectionPKS/NRPS Module

5
6 glycine
7
8
9
KS35..409
KR975..1155
MT1417..1520
ACP1621..1689
C1707..1993
A2171..2561
PCP2661..2730
KS2765..3149
DH3390..3546
KR3856..4022
ACP4104..4172
KS4224..4596
ACP5075..5144
MT5364..5466
KR5714..5912
ACP6008..6077
ks6122..6474
KS6551..6917
KR7350..7531
ACP7598..7671

close this sectionSequence

selected fasta
>polyketide synthase/non-ribosomal peptide synthetase [NRPS/PKS]
MRRRSMDDKRALLLKLLASVRQQAAPATAPRRTAEDIAVIGLAGRYPRARTPDELWRNVV
EGRNCVSEVPADRWDVDAHYHPDAKDGRAYSKWGGWLDDVDKFDPLLFQISPSDAEEMDP
QERLFLETAWASIEDAGYRPRGLGEHDAVGVFAGVMNNDYEWLAGHSSAFGADTHARSAH
WSIANRVSYVLDLRGPSLTVDTACSASLTAVHLACESLRRGECATAIAGGVNLILHPMHL
RMLADRQMISRGDRCRSFGARADGFVDGEGVGAVLLKPLDAAEADGDRVYAVLKGSAVNA
GGRTSGYTVPNPTAQAEVITAALRRAGVAPHTVGYVEAHGTGTPLGDPIEIAGLREAFRD
GTGESALPGGCAVGSLKSNIGHLESAAGIAGLTKVLMQLKHGVLAPSLHSAELNPGIDLT
GTPFRIQQEAEPWHRPVLRDRDGRETEGPRRAGVSSFGGGGANAHLIVEEYLGEPRDRRA
PAHGGAAEELIVLSAMSEERLRAYARDLAGFLDRQPVAGEALDACVRVAADVLRVLPHDL
DADVELAEYGPGAAELAGLSERLGLATTISGRTTLRELARDRGGPALSLADVAHTLRVGR
EQLDVRLAFPARELSDVRQVLRDVADGVESGVALHDTARDRRPAPDGAAERLTRALSDGD
LTEAARLWAQGVEARWPDSGARRVGLPTYPFARKRYWIPSPPERKALGGERARMAATPRD
ASAPVEPPRPDEVRTDSLHAPARDLIPGPVRGSLNGSAAGSAAGDDTPELGFYQPVRVPE
ELAEAGTGDRAVATTAGHEVAVLVAGEPGALAEALLRHHPGARLVRLDRDDPAELLTSRP
VRHLYHLGGLHRPDGLEAALRDGVLALFRTHGAGPRITVVTAGAHQDNPHAAGILGYAQV
LAAECPHLDITCVDIDDTDSDSDIVDVVATLPALLAEPPHPAGRAVLLRAGRRHVRQLVR
TPVPAPDRPPYRTGGTYLMVGGSGGIGRALSQELAQRYRANVVWISRGKLDAAQRACADR
VREAGGRLLHLRADASDSAALRVAVAEARRQFGALHGVIHAAMTFNASTIAELTEPELRA
ALAAKVDGSLALVGALGDEPLDFLAFFSSVGSFVSAAGNAAYVAASSFLDSYGRHLATRL
PYPVRVVNWGYWGRVGSGAQPGLQEVFRRTGVAEFTVREGLDCLERVLANGPVQVMPIRA
DRRALEALGHRPSPLGERYTAPAPAGPGADAVIEGYDRLSALCDAALLGVYRRMGALTRQ
GERDSVSGLADRLGIVPKYHRLHAALLTILADAGHLTVRGDTVEVLAADADTEVNADAGT
GTEHVERELDRIAAGHPDIRATVELTRLFLRSYPQVLRGETGATEIMFPHASMELVQDFY
RGNPLTDSLNELVADMVAEHAAHRVAGLAAGERLHVVEFGAGTGATTERVLPVLSEWADR
AEYVFTDISPQFLESAEERFGARHPFTRFRTLNLERGLAEQGYTPGGVDIIVATNVVHAT
GDLRATLRKARELLRPGGRLVLNELTAIRSSITVTGGVLDGWWAFTDQELRIKDAPLATA
QTWQRLLLEEGFADALVLDRGTHLGQHVIVGVNADAGHRAPAAAPAGTAPSVRGSLPGRG
PLDRLGDIVTATLKLDEPVDPDRHLSDYGFDSLSGMKIAAVLEEDLGVRLRLSDLLEHAT
LRELSDHIGGLTDEVPGTAPAATRPARFPLSAGQRALSVIERTAPGTYAYNLPLAFWLRP
ETDPAALREALQSMVDRHPQLRARIDGDHQVIEERQQLALDIREMGTSSEDAVREAAREC
VRHPFDLRQGPLFRVTLFILGDGRQVLLLTFHHIVFDGLSIAVFLRELAAFHRGQAPAAA
PPATFADFVDWQTELLRSERGERLRAYWLDRLSGDLPSLRLPLDRPRPAVPSYRGASVEG
ELGAARIRAARQLAGEERTSLFVVLLAVYATLLHRYSQQDRVLIGTPVAGRPSPRYADVL
GYFMNMVVLKHDFDEGQDFRGLLRQVRDTTLEALEHSDYPLFSLAQELRASRLFDTAFYF
QNWVEDDTDARPVAGVFHGVHQEGEFDLTLEVVEEPEGARYCLKYNPDLFDEDTVRRLGE
HFRLLLDSALATPDQGLGALSLRSAEDTASARQRLTRRDHPAGRVLPALLTDQIRRTPDA
VAVTDRGTTLTYRELGARVEALAARLRGRGVAPGRNVGVLVDRSADMLVALLGVLAAGGA
YVPLDPDYPAERLRYMAEDAGLHLLITGPGARPDLGAPVLVVDAEDGTADGPGTAGSPAL
PVPGPDDTAYVIYTSGSTGRPKGVQVPHRALANLLLSMAEEPGLTADDHLLALTTVCFDI
AALELFLPLVTGGRVEIVPAEVARDGVLLRRLLDSSPATVVQATPATWKMLLAAGWTGGR
GLKVLCGGEALDQDTAELLLARADQVWNMFGPTETTIWSAVCRLAPGERVTIGRPVANTG
LYVLDARGRAVPPGVPGELYIGGAGLATGYLGRPELTAERFVTLDGERRYRTGDLVRELA
DGRIEYLGRLDAQVKVRGFRIEPGEVEAVLRAQEGVREAAVVARRVGGDTVLHAFLVLDE
NAAAPRREALAQRLPAHMIPDVLVELAALPQTLNGKVDRTRLSGAPLTELRGGDTGSTSR
GPGAESVPAASARRAAGDAGRIGELCDLIAGILGTDAAEVPVDVPLGQLGMNSISFTVLS
TRVSERYGTEVLPTLFYRRPTVAAVAAHLGEVFGDADVTGTTEPEALAQAGTTAVAPRPT
ARGTDIAVVGVAGRFPGSADLAEFWDHLEQGRDLVTEIPGDRWDWRARTGTSRSRWGGFV
PGVDRFDAAFFGISPREAELMDPQQRLLLEVVWTAVEDAGYRASDLAGRRVGVFIGTTNS
DYAEVQRAGGRPAEAHTLTGAALSVIPNRISYLLDLRGPSVAVDTACSSSLTAVHQAVGA
LRDGTCDLAIAGGVSLILDPRLYDALSQNEMLSEDGRCKAFDASANGYVRGEGVGVVVLK
DHAAARADGDRVAAVIRAAAVNHGGRTTSLTSPNPDAQAELLVEAYRTAGVDPRTVGYIE
AHGTGTALGDPIEITGLTEAFQRLGGDGGPGDGGGSGGGGAAPGAGRASCGIGSVKTNIG
HLEAAAGIAGLLKVVLALRHRTIPASLHFRERNPYLDLDGSPFEIVGATRPWPAPLAADG
TALPRRAGVSSFGFGGANAHVVIEEAPADAYAAPAADSTEAELFVLSARTGAALRSQAGR
LAAHVRAELPAPADIAHTLRVGREAMEERLAFVARGHAELLDRLDACADGRTPAGALRGR
VSAGRRRAATTRGTAWREFVRALSAEGDLESLAQLWADGADVDWSWLPQRGRRTGLPTYP
FEPTRHWIDAGTDARAAARPAPAGPPGAPASLLDENVSTFGEAAFVKHLTGSEFYLTDHR
VGDELVLPGVVYLEMARLAGERAHGSAPVRRVDEIVWAAPVTLPPGRSRDVRVAVAPSGA
FEVTGERPHARGRLVFGQDGTGAGAAPAPVDPAAVRARCGERRTGQECYAYFAGLGFRYG
PAFQVIEELHLGEGEALARLRRPEVGDHRFHPSLLDGALQAAGMLVRGSTAHLPYAIGSV
RLFGELPADCLAHVVAVEGRADSQVFDISLAAPDGTVVARVERFTLRAVPDARRAPQETA
GPGVLAFEPFWREAPASAADAAPVELLSVIDAGDGRAEALRDELAVLAPELAVVVGDRSR
ASHLVHLAGEGRGTGLDEALRDGFHTALDVCGTRIAERGGPLRYLFVHEDRTGAAGAAHA
ALDGFARSIGQEHPGIRLSVLTRTGSPSVRDLAEFVLAELPGHTPEVRNDGRRRLVRGWR
ESTLPAAGDSPLAATGAHLVTGGTGRLGLLVAERIARHPGAGVVLVGRSAPAGPLPEGWL
HVRADVAVREDVERAVAEARHRFGRVAGVVHAAGTLRDGLALHKSGEDADAVLAPKVRGL
VHLDEATRADTPDYFVAFGSTAAVFGNVGQTDYAYANSFLAHYLERRPGGGLTVDWSLWR
DGGMTLTAEAREAMRREFGMEPLPSEAALDALEAALRGGASRVLLTAGDRVRIGEALRRT
AEPPREAATPPAASTDGGDLRAPMVTYLRELLADELKMALEDVAEDEAFDHYGVDSLLVL
SLTRALEERFGPLSKTLFFEYLTIGELADFLVAQHPAEARDLVAPAAVAPTAAPAAAAVE
PSAVAPAPAVRVTLPAPHPAPDDDEIVIVGVAGRYPKADDLAQFWRNLREGRDCVEEVPE
DRWDHGRFYDPDPAAPGKAYAKWGGWLSDVASFDPMFFRMSQVEAEHIDPQERIFLQTVW
HLLEDAGTSRAALSKVRTGVFVGLMYGHYQLYGVEEALRGTGAATSSSYASVANRVSYFF
DFDGPSIALDTMCSSSLTALHLACRAIRDGDCEVAVAGGVNVSSHPLKYLQLAKGGFLST
DGRCRSFGEGGDGYVPAEGSGAVLLKRRSAAEADGDRVLAVVRSTAVNHGGAGKGFSVPN
PRAQGVLIGEALERAGLAPADLGYLEAHGTGTSLGDPVEITGLVRAFQGHDLTGVRIPIG
SVKSGIGHAESAAGMAALTKVLLQFRHQELVPSLHAERLNPHLDLDATPFRLQRDLAPWT
PRVDATGRALPRTAAISAFGAGGSNAHVILEESVPPTQTPAQEPPYVCALSARDAERLHE
HTARTAEFLRGEGRAAHPAAVAATLLTREPMAHRLAVVFDTVDDLADALEDHLAGAGSPR
VLTGTASRAAAPATGRTAPELAEAWVRGAPVAAPAGAPRVSLPGYPFARERCWLPAADAV
RRPAATEPHGEVLLSTATPVIAGHRVQGRSLLPALAYVDLIAKVFRDHGHAVEHLTVRDL
TALRPLDVTDGPVAAEIRCTRAGDDRWQVTVTDGEPYATADVLLTAAPGFRDRLDGHPLG
TPVPLAETYARDGGNGQHYGGAARADGLVRADQDRLTVELDAPAGDFLIHPALLLGGAVA
AGSLLDTGGQAFLPLHIGSFRAAGPLTGACTARVRRASVSRRGEVARFSVDFFDPQGRQV
AELSELSSKAVAGAPTAAPPPARDADAGPAAAAHAENFLRRLLAERLGTDPETVPPTAGY
YELGLASVQVLGLVEAVRDVVGQELEPTLLFEYTTVRDLAAHLAARFPHAFGEASEDAGG
VAATAGPGAGTGLGRLRESAVIPPPPAPAALPAPELDRLIARQVLLRLRACGLFDEARTE
TVDGMARRLGVVGKYRRWLEEAARLFTAAGLTRRHGDTLELADERPPAHDTEWTAVRERF
AADPYWDAQLSLVQECVERLPEILSGTVPATDVLFPGGSLAKLTAVYQGNAVADRLNDVV
AEVTAAAVRERVAGDPSATVRVAEVGAGTGGTTAVVLPRLDPCADRLEYWYTDLSPAFLD
QAERRFGPGRDYLRYGRWDVTQEGAGERLTGGGCDVVVATNVLHATPDIRLVLRNLAAAL
RPGGVLVVNEVTRKSAVLTLTFGLLDGWWLYDDEDVRLPGAPLLSAPRWQEVLHDSGFGE
VWRPVAEPDAFGEVLVARPGTREVPPPEGTRLLVREWEPSPALPGRGPAAVAVIGAGREA
ARLAEALPGGRLIATAADLDDRFDALVDLGGAEPADWLPVLQRMAGRQALLLGVGRGDAR
AGLYRMLQSEYGRIRSRYLEADPADPGLTGLVARELADGGHDTEVTYRRGVRHRAVLEAL
PAATGAPVRFPAGEPLLITGGTRGIGLALARHAVAEWGARTLVLTGREQLPPRAEWDRHG
TDTALGRKLRGLRALEDDGVRLKVLALPLGEDAGAVHAALDGIRGEFGPIGGVLHAAGLV
DRDNLAFVRKPVEAVRAVLAPKTAGLDALVDALAGDPLRFCVLFSSVASMVPAAAVGQSD
YAMANAHLDAVARRAPHGLPVVSVAWPSWRGVGMGSERPGPGYLATGLGELSEAQGLRLL
DHILSTGAGPVVLPAIVAPEWTPGALRAPGAGTPASVPAPVAAHAENPAAPAAVGTEPGA
GADAAAKAESWLLGLLAEELGFDRARLAADVPISDYGTDSIMMVQILRTVGAELDADLDP
SVLVDHPTVRSFVGWLTAHHGQALAAAFGATPPAPVLPAAAPPVTAAVPAARAEAPADTP
YDIAVVGMSGRFPGAPDLDAYWRLLSEGRSAIAPVPARRWADGTKYTAGLLDLEGFDPGH
FHLSDADAAAMDPQALLLLEETLFAFCDAGYAPDELKGRGIGVYVGGRSRHVPDEATLGR
SRNPVVAVGQNYLAANLSHHFDLRGPSTVVDTACSSALVALHHAAQALRSGDVEAAVVAG
VTLLPDAGGHRLFDRRGLLNTGTEFHVFDRRARGFTPAEGVGVLLLKPLAAAEAAGDRVH
AVLKGIAVNNDGRTAGPATPNPAAQRGVMARALAKAGVAADDVTYIETNAAGSQIPDLIE
LKAIAAVYRDGSDTPCSLGSVKPNIGHPQCAEGIAGVIKTVLMLRNRAIVPFLSGRQPLE
HFDFAATPLRFERALTPWPDAPLLAAVSSFADGGTNAHAVLAGRTNGTTGRRAPLDRPRL
ARRGLPAAGAERFAVIGMAGHYPGAEDLDAFWANLKDGRDSVTEVPAQRWTPGDGDGSRW
GGFLDDVGRFDADFFRISRPEAEITDPQERWFLRTCWEAIEDSGYTPEGLTGAKGPDRRR
AVGVFAGVMHKDYTLVAAEASAPVPLSLNQGQIANRVSFVCDFHGPSMTVDTLCSSSLTA
LHLAVESLRRGECEVAVAGGVNLSLHPGKYRTYGAVGMHSSDGRCRSFGEGGDGYVSAEG
VGAVVLKPLAAAEADGDHIYAVVAGSAVNHVGSASGFSVPSPVGQAAVITAALERAGVDA
RTIGYLEAHGTGTSLGDPVEIRGLSTAFGRHTDDRGFCAIGSVKSNIGHAESAAGVAGLT
KAVLQLHHRTLVPSLHADTVNPLLGLDGTPFRLQRATEAWPAPPEGPRRAGLSSFGATGA
GAHIVLEEYVPAAEAAAASRTPGEPVVVPLSARTRDALRQSAARLRDALTQGGRTLRDVA
YTLQVGRVEWPERVAFVARDVRELLEQLAEFAVHGVRPALGAGEPHDVARRWADGSSVDW
DARHGKDRPRRVSLPTYPFAGEWHWVPGGVPAAPAGAPGADRAVSVTAAEAPVTPPSGLL
SVPRWEEAPVAPATGPAPRRVLIVTDEPGAGLAGALAEHYRRHGGSEVTERPLDDLRIPP
GAAPDRVLLVTGSGPADGGAAAGPELALLRLVKAVQRLDGGRTDLCVVTRDTQSVTGERG
AAHGAGLTGLAYFVARDSGRFAVRNIDVAAADLRTPGDRAAVAALVADEPASPAADLVAL
RGGRRYRQKVGPVTAAEAATPGLPGIRPGGTYVVVGGSGFVGRVVSRHLIDRYDAKVVCV
GRRPQSDPAVREAVHGDRVGYVQGDVTDPREARRVIAGAKGLLGEIHGVLFAGATRITGA
PGALAGLGEDEFRAHYEIKASGARNVYEAVADEPLDFLCYFSSAQAFSFGGAGTHAAYAA
GITAADAFARAVAPTAAFPVGIVNWGAWRASFGEAARDYPTLGFLDDDEGAACFDTAVRL
LRAGRHRQVIGMRAPARSAARAEAADRAHPAAPAGRDRRPELRRLLVERLARTLRVPAED
LSPSTAFADLGVDSITGSTFVAAIAEELGVELNAAALYEFSSAERLAEHLDALLGPAAQE
PPAPPAPPASSASSASSGTSAPDDLIVKLEARFAAGELSAAEVLDLLDAELATREQR
selected fasta
>polyketide synthase/non-ribosomal peptide synthetase [NRPS/PKS]
ATGAGACGAAGGTCTATGGACGACAAGCGTGCTCTTCTCCTGAAGCTGCTGGCATCCGTG
CGGCAGCAGGCCGCTCCCGCCACCGCACCGCGGCGCACCGCCGAGGACATCGCGGTGATC
GGGCTCGCCGGGCGCTACCCGCGGGCGCGGACACCGGACGAGCTGTGGCGCAACGTCGTC
GAGGGCCGCAACTGCGTGAGCGAGGTGCCCGCCGACCGCTGGGACGTGGACGCCCACTAC
CACCCGGACGCCAAGGACGGGCGCGCGTACAGCAAATGGGGCGGCTGGCTCGACGACGTC
GACAAGTTCGACCCGCTGCTCTTCCAGATCTCCCCCTCCGACGCCGAGGAGATGGACCCC
CAGGAGCGCCTCTTCCTGGAGACCGCCTGGGCGAGCATCGAGGACGCCGGATACCGCCCC
CGCGGGCTCGGCGAGCACGACGCGGTGGGAGTGTTCGCCGGGGTGATGAACAACGACTAC
GAATGGCTCGCTGGGCACAGCAGCGCCTTCGGCGCGGACACGCACGCGCGCTCCGCCCAC
TGGTCGATCGCCAACCGGGTCTCGTACGTCCTGGACCTGCGCGGGCCGAGCCTGACCGTC
GACACCGCGTGCTCGGCGTCGCTGACCGCCGTCCACCTGGCCTGCGAGAGCCTGCGCCGC
GGCGAGTGCGCCACGGCGATCGCCGGCGGAGTCAACCTCATCCTGCACCCGATGCACCTG
CGGATGCTGGCGGACCGACAGATGATCTCCCGGGGCGACAGGTGCCGCAGCTTCGGTGCC
CGCGCGGACGGCTTCGTCGACGGCGAGGGCGTCGGCGCCGTCCTGCTCAAGCCGCTGGAC
GCCGCGGAGGCCGACGGCGACCGCGTCTACGCCGTCCTCAAGGGTTCGGCGGTCAACGCG
GGCGGCAGGACCAGTGGTTACACCGTGCCCAACCCCACCGCCCAGGCCGAGGTGATCACC
GCGGCCCTGCGCCGCGCGGGCGTCGCACCGCACACCGTCGGCTATGTGGAGGCGCACGGC
ACCGGCACCCCCCTCGGCGACCCCATCGAGATCGCCGGGCTGCGCGAGGCCTTCCGCGAC
GGTACGGGTGAGAGCGCCTTGCCCGGCGGCTGCGCCGTCGGCTCCCTCAAGTCCAACATC
GGGCACCTGGAGTCGGCCGCCGGCATCGCGGGTCTGACCAAGGTGCTCATGCAGCTGAAG
CACGGTGTCCTGGCGCCGTCCCTGCACTCGGCGGAGCTGAACCCCGGCATCGACCTCACC
GGCACCCCCTTCCGGATCCAGCAGGAGGCCGAGCCCTGGCACCGCCCCGTGCTGCGGGAC
CGGGACGGGCGGGAGACGGAGGGGCCCCGCCGGGCGGGCGTCAGCTCCTTCGGCGGCGGC
GGGGCCAACGCCCATCTGATCGTCGAGGAGTACCTGGGGGAGCCCCGTGACCGCCGCGCA
CCCGCGCACGGCGGTGCGGCCGAGGAACTGATCGTCCTGTCCGCGATGAGCGAGGAGCGG
CTGCGGGCCTACGCGCGCGACCTCGCGGGCTTCCTCGACAGGCAGCCGGTCGCGGGGGAG
GCGCTCGACGCGTGCGTGCGGGTGGCCGCCGATGTCCTACGGGTGCTGCCCCACGATCTC
GACGCCGACGTCGAACTGGCCGAGTACGGGCCGGGCGCGGCCGAACTCGCCGGGCTCAGC
GAGCGCCTCGGCCTCGCGACGACGATCTCCGGCAGGACCACACTGCGCGAGCTGGCCCGC
GACCGCGGCGGGCCCGCGCTGTCCCTCGCCGACGTCGCGCACACGCTGCGCGTCGGCAGG
GAGCAGCTCGACGTGCGACTCGCCTTCCCGGCGCGTGAACTCAGCGACGTACGGCAGGTG
TTGCGGGACGTCGCGGACGGCGTGGAGAGCGGCGTCGCGCTGCACGACACCGCCCGCGAC
CGGCGCCCGGCACCGGACGGCGCCGCCGAGCGGCTCACGCGCGCCCTGTCCGACGGGGAC
CTCACCGAAGCCGCCCGGCTGTGGGCCCAGGGTGTCGAGGCCCGGTGGCCGGACAGCGGC
GCGCGCCGCGTCGGGCTGCCCACGTATCCGTTCGCACGCAAGCGGTACTGGATTCCGTCG
CCGCCGGAACGGAAGGCCCTGGGAGGGGAGCGGGCGCGGATGGCGGCGACCCCTCGGGAC
GCCTCCGCGCCCGTGGAACCGCCGCGGCCGGACGAGGTGCGCACCGATTCGCTCCACGCC
CCCGCCAGGGACCTCATACCCGGCCCCGTGAGGGGGTCCCTGAACGGCTCCGCCGCCGGG
AGCGCGGCGGGTGACGACACTCCGGAGCTCGGCTTCTACCAGCCCGTCCGGGTTCCGGAG
GAGCTCGCCGAGGCCGGGACCGGTGACCGCGCCGTCGCCACGACCGCGGGGCACGAGGTG
GCCGTCCTCGTCGCCGGTGAACCCGGCGCCCTCGCCGAGGCGTTACTCCGGCACCACCCC
GGCGCCCGCCTCGTCCGGCTCGACCGGGACGACCCGGCGGAACTGCTCACCTCCCGGCCC
GTGCGGCATCTCTACCACCTCGGCGGACTGCACAGGCCCGACGGCCTCGAAGCGGCCCTG
CGCGACGGCGTCCTCGCCCTCTTCCGTACCCACGGCGCAGGGCCCCGGATCACCGTGGTC
ACCGCGGGCGCGCACCAGGACAACCCCCACGCGGCCGGCATCCTCGGATACGCCCAGGTC
CTCGCCGCCGAATGCCCCCACCTCGACATCACCTGCGTCGACATCGACGACACCGACAGC
GACAGCGACATCGTCGACGTGGTGGCCACGCTGCCCGCCCTGCTGGCCGAGCCCCCGCAC
CCCGCGGGGCGTGCCGTCCTGCTGCGGGCCGGCCGCCGCCACGTACGACAGCTCGTGCGC
ACCCCCGTGCCCGCGCCGGACCGCCCGCCCTACCGGACCGGCGGCACCTACCTCATGGTC
GGCGGCAGCGGCGGTATCGGCCGCGCCCTCTCCCAGGAGCTCGCCCAGCGGTACCGGGCC
AATGTCGTCTGGATCAGCCGCGGCAAACTCGACGCCGCGCAGCGGGCCTGTGCCGACCGT
GTGCGCGAGGCCGGGGGCCGGCTGCTGCACCTGCGGGCCGACGCGTCCGACTCCGCGGCC
CTCCGCGTGGCCGTCGCCGAGGCCAGGCGGCAGTTCGGCGCCCTGCACGGTGTGATCCAC
GCGGCCATGACGTTCAACGCGAGCACCATCGCCGAGCTCACCGAACCGGAGCTGCGGGCG
GCGCTCGCCGCCAAGGTGGACGGTTCACTCGCGCTCGTCGGCGCGCTCGGCGACGAGCCC
CTGGACTTCCTCGCGTTCTTCAGCTCGGTCGGCTCCTTCGTCAGCGCCGCGGGCAACGCC
GCTTACGTGGCGGCCAGTTCGTTCCTCGACTCCTACGGCCGCCACCTCGCGACCCGCCTC
CCGTACCCCGTGCGCGTGGTGAACTGGGGCTACTGGGGGCGCGTCGGCTCCGGGGCGCAG
CCCGGGCTCCAGGAGGTGTTCCGCAGAACGGGCGTCGCCGAGTTCACCGTGCGCGAGGGC
CTCGACTGCCTGGAGCGGGTGCTGGCCAACGGCCCCGTACAGGTCATGCCGATCCGAGCC
GACCGGCGGGCCCTGGAAGCCCTCGGCCACCGTCCTTCGCCCCTCGGCGAGCGGTACACC
GCGCCCGCACCGGCCGGGCCCGGCGCCGACGCCGTCATCGAGGGCTACGACCGCCTCTCC
GCCCTCTGCGACGCGGCACTGCTCGGCGTCTACCGGCGGATGGGCGCGCTCACCCGGCAG
GGCGAGCGCGACAGCGTCTCGGGGCTCGCCGACCGGCTCGGCATCGTACCCAAGTACCAC
CGGCTGCACGCGGCTCTGCTCACCATCCTCGCCGACGCCGGACACCTGACCGTGCGCGGC
GACACCGTGGAGGTCCTCGCCGCGGACGCGGACACGGAGGTGAACGCGGACGCGGGCACC
GGAACCGAGCACGTGGAGCGCGAGCTCGACCGGATCGCCGCCGGCCATCCGGACATCAGG
GCCACGGTGGAGCTGACCCGGCTGTTCCTGCGGAGCTACCCGCAGGTGCTGCGCGGCGAG
ACCGGCGCCACCGAGATCATGTTCCCCCACGCGTCGATGGAGCTCGTCCAGGACTTCTAC
CGGGGCAACCCGCTCACCGACTCCCTCAACGAACTCGTCGCCGACATGGTCGCCGAGCAC
GCCGCGCACCGCGTCGCCGGTCTCGCGGCGGGCGAGCGGCTCCATGTCGTCGAGTTCGGC
GCGGGCACCGGCGCCACCACCGAGCGCGTCCTGCCCGTGCTCTCGGAGTGGGCCGACCGG
GCCGAATACGTCTTCACCGACATCTCGCCGCAGTTCCTGGAGAGCGCCGAGGAGCGGTTC
GGCGCGCGCCACCCGTTCACCCGGTTCCGTACCCTCAACCTCGAACGAGGTCTTGCGGAG
CAGGGGTACACCCCCGGCGGCGTCGACATCATCGTGGCCACCAACGTCGTGCATGCCACC
GGTGACCTGCGCGCCACCCTGCGCAAGGCACGGGAACTGCTGCGGCCCGGCGGCCGTCTG
GTGCTCAACGAACTGACCGCGATCCGCAGCAGCATCACCGTCACCGGCGGTGTCCTCGAC
GGCTGGTGGGCGTTCACCGACCAGGAACTGCGCATCAAGGACGCACCCCTGGCCACCGCC
CAGACCTGGCAGCGGCTCCTCCTGGAGGAGGGCTTCGCCGACGCTCTGGTCCTGGACCGG
GGCACCCACCTCGGCCAGCACGTCATCGTCGGCGTGAACGCGGACGCCGGGCACCGCGCA
CCGGCCGCCGCTCCGGCGGGAACCGCGCCGTCCGTCCGTGGGTCCCTGCCCGGTCGCGGA
CCGCTCGACCGGCTCGGCGACATCGTGACGGCCACCCTCAAGCTGGACGAACCCGTCGAC
CCCGACCGGCACCTCTCCGACTACGGCTTCGACTCGTTGAGCGGCATGAAGATCGCCGCC
GTGCTGGAGGAGGATCTGGGTGTCCGGCTGCGCCTGAGCGACCTGCTCGAACACGCCACG
CTGCGCGAGCTGAGCGACCACATCGGCGGACTGACCGACGAGGTGCCCGGCACCGCCCCG
GCGGCGACCCGGCCCGCACGGTTCCCCCTGTCGGCCGGGCAGCGCGCCCTGAGCGTCATC
GAGCGGACGGCGCCCGGCACCTACGCCTACAACCTGCCGCTCGCCTTCTGGCTGCGACCG
GAGACCGACCCGGCCGCCCTCCGCGAGGCCCTTCAGTCCATGGTGGACCGCCACCCCCAG
CTCCGCGCCAGGATCGACGGCGACCACCAGGTCATCGAAGAGCGGCAGCAACTGGCCCTC
GACATACGCGAGATGGGCACCTCGTCCGAGGACGCCGTGCGCGAGGCGGCCAGGGAGTGC
GTCCGCCACCCCTTCGACCTGCGGCAGGGGCCCCTGTTCCGGGTCACCCTGTTCATCCTC
GGCGACGGCCGGCAGGTGCTGCTGCTCACCTTCCACCACATCGTCTTCGACGGCCTCTCC
ATCGCCGTCTTCCTGCGCGAACTCGCCGCGTTCCACCGCGGGCAGGCCCCCGCCGCCGCG
CCGCCGGCCACCTTCGCCGACTTCGTCGACTGGCAGACGGAACTGCTCCGGTCGGAGCGC
GGCGAGCGGCTGCGCGCGTACTGGCTGGACCGTCTCTCCGGTGACCTGCCCAGCCTGCGG
CTCCCCCTCGACCGGCCGCGCCCCGCGGTGCCCAGCTACCGCGGTGCCTCCGTCGAGGGC
GAACTCGGCGCGGCACGGATCCGGGCCGCGCGACAGCTCGCGGGCGAGGAACGCACCTCG
CTCTTCGTGGTCCTCCTCGCCGTCTACGCGACGCTGCTGCACCGCTACTCGCAGCAGGAC
CGCGTGCTCATCGGCACCCCGGTGGCCGGACGGCCCTCACCCCGCTACGCCGACGTGCTC
GGCTACTTCATGAACATGGTCGTCCTCAAGCACGACTTCGACGAAGGACAGGACTTCCGG
GGCCTGCTGCGCCAGGTGCGGGACACCACCCTGGAGGCCCTGGAGCACAGCGACTACCCG
CTGTTCAGCCTCGCCCAGGAACTACGGGCCTCGCGCCTGTTCGACACCGCCTTCTACTTC
CAGAACTGGGTGGAGGACGACACCGACGCCCGCCCCGTCGCCGGAGTGTTCCACGGCGTC
CACCAGGAGGGAGAGTTCGACCTCACCCTGGAGGTCGTCGAGGAGCCCGAGGGCGCCCGC
TACTGCCTGAAGTACAACCCGGACCTCTTCGACGAGGACACCGTGCGCCGGCTCGGCGAG
CACTTCCGCCTGCTGCTCGACTCGGCCCTCGCCACCCCGGACCAAGGCCTCGGCGCGCTC
TCCCTGCGGAGCGCCGAGGACACCGCCAGCGCCCGGCAGCGGCTCACACGGCGCGACCAC
CCCGCGGGCCGCGTCCTGCCCGCGCTCCTCACCGACCAGATCCGCCGCACCCCCGACGCC
GTGGCGGTGACCGACCGCGGCACCACACTGACGTACCGCGAACTGGGCGCCCGCGTCGAA
GCCCTCGCGGCCCGGCTGCGGGGGCGCGGGGTCGCGCCGGGGCGCAACGTCGGCGTGCTC
GTCGACCGTTCCGCCGACATGCTCGTCGCCCTGCTCGGCGTTCTCGCCGCGGGCGGCGCC
TATGTGCCCCTCGACCCCGACTACCCGGCCGAGCGGCTGCGGTACATGGCCGAGGACGCC
GGACTCCACCTCCTGATCACCGGCCCCGGGGCCCGCCCGGACCTCGGCGCTCCGGTCCTG
GTGGTCGACGCCGAGGACGGGACCGCGGACGGGCCGGGGACGGCCGGGTCTCCCGCGCTT
CCCGTGCCGGGCCCGGACGACACGGCGTACGTGATCTACACCTCCGGGTCGACCGGCCGC
CCCAAGGGCGTCCAGGTGCCGCACCGGGCGCTGGCCAACCTGCTCCTCTCCATGGCCGAG
GAGCCGGGACTCACCGCCGACGACCACCTGCTCGCGCTCACCACCGTCTGCTTCGACATC
GCCGCGCTCGAACTGTTCCTGCCGCTGGTCACCGGCGGCCGGGTGGAGATCGTGCCCGCG
GAGGTCGCGCGGGACGGGGTGCTCCTGCGCAGGCTGCTCGACTCCAGCCCGGCCACGGTC
GTCCAGGCCACCCCCGCCACGTGGAAGATGCTGCTCGCCGCGGGCTGGACGGGCGGACGG
GGGCTCAAGGTGCTGTGCGGGGGAGAGGCCCTCGACCAGGACACCGCGGAGCTGCTGCTC
GCCCGCGCGGACCAGGTGTGGAACATGTTCGGGCCCACCGAGACCACCATCTGGTCGGCC
GTGTGCAGGCTCGCCCCCGGTGAGCGCGTCACCATCGGGCGTCCCGTCGCCAACACCGGG
CTGTACGTCCTCGACGCGCGGGGCAGGGCGGTGCCGCCCGGGGTGCCCGGCGAGCTGTAC
ATCGGCGGTGCGGGGCTCGCCACCGGCTATCTGGGCCGCCCCGAACTGACCGCCGAACGG
TTCGTCACCCTCGACGGCGAGCGCCGCTACCGCACCGGTGACCTGGTGCGCGAGCTGGCC
GACGGCCGCATCGAGTATCTGGGCAGGCTGGACGCCCAGGTGAAGGTGCGCGGCTTCCGC
ATCGAGCCCGGCGAGGTCGAGGCCGTGCTGCGCGCACAGGAGGGGGTCCGCGAGGCCGCG
GTGGTCGCTCGCCGGGTCGGCGGTGACACCGTCCTGCATGCCTTCCTCGTCCTCGACGAG
AACGCCGCGGCTCCCCGCCGCGAGGCGCTCGCGCAGCGCCTGCCCGCCCATATGATCCCG
GACGTCCTGGTCGAACTGGCCGCGCTCCCGCAGACGCTCAACGGCAAGGTGGACCGCACC
CGCCTGAGCGGTGCGCCCCTGACGGAGCTGCGCGGCGGGGACACGGGGAGCACGTCTCGC
GGCCCGGGGGCCGAGTCCGTGCCCGCGGCCTCCGCCCGGCGTGCGGCAGGTGATGCCGGC
CGCATCGGCGAGCTGTGCGATCTCATCGCCGGGATTCTCGGTACCGATGCCGCCGAGGTG
CCCGTCGACGTGCCGCTCGGGCAGCTCGGCATGAACTCGATCAGTTTCACCGTGCTCAGC
ACGCGCGTCAGTGAGCGGTACGGGACCGAGGTCCTGCCCACGCTGTTCTACCGCCGGCCG
ACCGTCGCCGCGGTCGCCGCCCACCTGGGCGAGGTCTTCGGGGACGCCGACGTCACCGGG
ACCACGGAACCGGAGGCCCTCGCTCAGGCGGGCACCACCGCCGTGGCACCCCGCCCAACC
GCCCGCGGCACGGACATCGCCGTCGTCGGCGTGGCCGGACGGTTTCCCGGCTCCGCGGAC
CTCGCGGAGTTCTGGGACCATCTGGAGCAGGGCAGGGACCTCGTCACCGAGATCCCGGGC
GACCGCTGGGACTGGCGGGCCCGCACCGGCACGTCCCGCTCGCGCTGGGGCGGCTTCGTC
CCCGGCGTGGACCGCTTCGACGCCGCCTTCTTCGGCATCTCGCCCCGCGAGGCCGAGCTG
ATGGACCCCCAGCAGCGGCTGCTCCTGGAGGTGGTGTGGACGGCGGTCGAGGACGCCGGG
TACCGCGCGAGCGACCTCGCGGGCAGACGCGTCGGGGTCTTCATCGGCACCACCAACTCC
GACTACGCGGAGGTGCAGCGCGCCGGTGGCCGCCCGGCCGAGGCGCACACGCTCACCGGG
GCCGCGCTCTCGGTCATCCCCAACCGCATCTCGTACCTCCTGGACCTGCGCGGACCCAGC
GTCGCGGTCGACACCGCCTGTTCCAGCTCGCTCACCGCAGTCCACCAGGCGGTCGGCGCG
CTGCGCGACGGCACCTGCGACCTCGCGATCGCGGGCGGCGTCAGTCTCATCCTCGACCCC
CGGCTCTACGACGCCCTGAGCCAGAACGAGATGCTCAGCGAGGACGGCCGGTGCAAGGCC
TTCGACGCCTCCGCCAACGGCTATGTGCGGGGCGAGGGCGTGGGCGTCGTGGTGCTCAAG
GACCACGCGGCGGCCCGCGCCGACGGCGACCGCGTGGCGGCGGTGATCAGGGCCGCCGCC
GTCAACCACGGGGGCCGCACCACCTCCCTCACCTCGCCCAACCCCGACGCACAGGCCGAA
CTGCTCGTCGAGGCCTACCGCACGGCGGGCGTGGACCCCCGCACGGTCGGCTACATCGAG
GCACACGGCACCGGCACGGCCCTGGGCGACCCCATCGAGATCACCGGCCTGACCGAGGCG
TTCCAGCGGCTCGGCGGGGACGGCGGGCCCGGGGACGGCGGAGGGAGCGGCGGGGGCGGT
GCGGCGCCCGGAGCCGGGCGCGCGTCCTGCGGCATCGGCTCGGTGAAGACCAACATCGGT
CACCTGGAGGCCGCCGCGGGCATCGCGGGGCTGCTCAAGGTGGTCCTCGCGCTGCGCCAC
CGGACCATCCCCGCCAGCCTGCACTTCCGTGAGCGCAACCCGTACCTCGACCTGGACGGG
AGTCCCTTCGAGATCGTCGGCGCCACCAGGCCCTGGCCCGCGCCGCTCGCCGCGGACGGC
ACGGCGCTGCCCCGCCGCGCGGGCGTCAGCTCCTTCGGCTTCGGCGGCGCCAACGCCCAC
GTGGTGATCGAGGAGGCCCCGGCGGACGCGTACGCCGCACCCGCGGCGGACAGCACGGAG
GCCGAGCTGTTCGTCCTGTCCGCGCGGACCGGGGCGGCCCTGCGGTCGCAGGCCGGGCGG
CTCGCCGCCCACGTGCGCGCCGAGCTGCCCGCGCCCGCCGACATCGCCCACACCCTGCGC
GTCGGCCGTGAGGCCATGGAGGAACGGCTCGCCTTCGTGGCCCGCGGCCACGCCGAACTC
CTCGACCGGCTCGACGCCTGCGCCGACGGCCGTACCCCCGCGGGTGCCCTGCGCGGCCGG
GTGAGTGCGGGCAGGCGCCGCGCCGCCACCACCCGCGGCACCGCCTGGCGCGAGTTCGTC
CGAGCCCTGTCCGCCGAAGGCGACCTGGAGTCCCTGGCCCAGCTCTGGGCCGACGGGGCG
GACGTGGACTGGTCGTGGCTGCCGCAGCGCGGGCGCAGGACCGGGCTCCCGACCTATCCG
TTCGAGCCGACCCGGCACTGGATCGACGCGGGCACCGATGCCCGGGCCGCCGCGCGTCCC
GCTCCGGCGGGACCGCCCGGCGCCCCCGCGAGCCTGCTCGACGAGAACGTCTCGACGTTC
GGCGAAGCCGCCTTCGTGAAGCACCTGACGGGCTCGGAGTTCTATCTGACCGACCACCGG
GTCGGCGACGAACTCGTGCTCCCCGGCGTCGTCTACCTGGAGATGGCGCGGCTGGCGGGC
GAGCGCGCCCACGGGAGCGCCCCGGTGCGGCGCGTCGACGAGATCGTCTGGGCCGCACCC
GTCACGCTGCCGCCCGGCCGGTCGCGTGACGTCCGGGTGGCGGTCGCGCCCTCCGGCGCC
TTCGAGGTCACGGGCGAACGCCCGCACGCCCGGGGGCGCCTGGTGTTCGGGCAGGACGGC
ACCGGGGCCGGCGCGGCGCCCGCGCCGGTCGACCCGGCCGCCGTACGCGCCCGGTGCGGC
GAGCGCAGGACCGGGCAGGAGTGCTACGCCTACTTCGCGGGCCTCGGCTTCCGGTACGGC
CCCGCCTTCCAGGTCATCGAGGAGCTCCACCTCGGCGAGGGCGAGGCCCTGGCACGGCTG
CGCCGTCCCGAGGTCGGCGACCACCGCTTCCACCCCTCGCTGCTCGACGGGGCCCTCCAG
GCCGCGGGCATGCTGGTGCGGGGATCGACGGCGCACCTGCCGTACGCGATCGGCTCCGTA
CGCCTCTTCGGCGAGCTGCCCGCCGACTGCCTCGCCCATGTCGTGGCGGTGGAGGGCCGG
GCCGACTCCCAGGTCTTCGACATCAGCCTGGCCGCCCCCGACGGCACGGTCGTCGCCCGT
GTCGAGCGGTTCACCCTGCGGGCCGTGCCGGACGCCCGGCGCGCCCCGCAGGAGACCGCG
GGGCCAGGTGTGCTCGCCTTCGAGCCGTTCTGGCGCGAGGCACCGGCCAGCGCCGCCGAT
GCCGCGCCGGTGGAGCTGCTGTCCGTCATCGACGCCGGGGACGGCAGGGCCGAGGCCCTG
CGCGACGAACTCGCCGTGCTCGCACCGGAGCTGGCCGTGGTGGTCGGCGACCGGTCGCGC
GCCTCCCACCTCGTCCACCTCGCCGGGGAAGGCCGGGGAACCGGCCTCGACGAGGCGCTG
CGCGACGGATTCCACACCGCGCTCGACGTGTGCGGGACCCGGATCGCGGAGCGCGGCGGG
CCGCTGCGCTACCTGTTCGTCCACGAGGACCGCACCGGGGCCGCGGGGGCCGCGCACGCC
GCGCTCGACGGGTTCGCCCGCAGCATCGGCCAGGAGCACCCCGGCATCCGGCTGAGCGTG
CTGACGCGCACCGGATCGCCCTCCGTCCGTGACCTCGCCGAATTCGTGCTCGCCGAACTC
CCGGGCCACACGCCGGAGGTGCGGAACGACGGGCGGCGCAGGCTCGTCCGCGGCTGGCGG
GAGAGCACGCTGCCCGCGGCAGGGGACTCGCCCCTGGCCGCGACCGGGGCGCATCTCGTC
ACGGGCGGTACGGGCAGGCTGGGCCTGCTGGTCGCCGAGCGGATCGCCCGGCATCCGGGC
GCCGGCGTGGTCCTGGTCGGCAGGTCCGCCCCGGCAGGTCCGCTGCCGGAGGGCTGGCTG
CACGTCCGGGCCGACGTCGCGGTGCGGGAGGACGTCGAGCGTGCGGTGGCCGAGGCCAGG
CACCGGTTCGGACGCGTCGCCGGAGTGGTGCACGCGGCGGGAACGCTGCGTGACGGGCTC
GCCCTGCACAAGAGCGGCGAGGACGCCGACGCGGTGCTCGCGCCCAAGGTGCGCGGCCTG
GTCCACCTCGACGAGGCGACCCGTGCCGACACCCCCGACTACTTCGTCGCGTTCGGCTCG
ACCGCCGCGGTCTTCGGCAACGTGGGCCAGACCGACTACGCGTACGCCAACAGCTTCCTC
GCCCACTACCTGGAGCGCCGCCCCGGCGGCGGCCTCACCGTCGACTGGTCGCTGTGGCGC
GACGGCGGCATGACCCTCACCGCCGAGGCCCGCGAGGCCATGCGGCGCGAGTTCGGCATG
GAGCCGCTCCCGTCGGAGGCGGCACTCGACGCCCTGGAGGCCGCGCTGCGCGGCGGCGCC
TCCCGCGTCCTGCTGACCGCCGGCGACCGCGTGCGGATCGGCGAGGCGCTGCGGAGGACC
GCCGAGCCGCCCAGGGAAGCGGCCACGCCCCCGGCCGCGAGCACGGACGGCGGCGACCTG
CGGGCGCCCATGGTCACGTATCTGCGGGAACTCCTCGCCGACGAGCTGAAGATGGCCCTG
GAGGACGTCGCGGAGGACGAGGCCTTCGACCACTACGGAGTCGACTCGCTCCTGGTCCTG
AGCCTCACGCGCGCGCTGGAGGAACGTTTCGGGCCGCTCTCCAAGACGCTGTTCTTCGAG
TACCTGACCATCGGCGAGCTTGCCGACTTCCTGGTCGCACAGCACCCCGCGGAAGCGCGT
GACCTCGTGGCCCCCGCCGCCGTCGCCCCCACCGCGGCCCCCGCCGCGGCCGCCGTCGAA
CCGTCGGCGGTCGCCCCCGCACCCGCCGTGCGCGTCACCCTGCCCGCGCCGCACCCCGCA
CCGGACGACGACGAGATCGTGATCGTCGGTGTGGCGGGCCGCTACCCCAAGGCCGACGAC
CTCGCACAGTTCTGGCGCAACCTGCGCGAGGGCAGGGACTGCGTGGAGGAGGTCCCCGAG
GACCGCTGGGACCACGGCCGGTTCTACGACCCCGACCCCGCGGCGCCCGGCAAGGCGTAC
GCCAAGTGGGGCGGCTGGCTCTCCGACGTCGCCTCCTTCGACCCGATGTTCTTCCGCATG
TCCCAGGTCGAGGCGGAACACATCGACCCGCAGGAGCGGATCTTCCTCCAGACGGTCTGG
CACCTCCTGGAGGACGCGGGCACCTCGCGCGCCGCCCTGTCCAAGGTCCGCACCGGCGTC
TTCGTCGGCCTGATGTACGGCCACTACCAGCTCTACGGCGTCGAGGAGGCGCTCCGCGGC
ACCGGCGCGGCCACCTCGTCGTCGTACGCCTCGGTGGCCAACCGCGTCTCGTACTTCTTC
GACTTCGACGGTCCGAGCATCGCCCTCGACACCATGTGCTCGTCCTCGCTCACCGCCCTG
CACCTCGCCTGCCGGGCGATCAGGGACGGCGACTGCGAGGTCGCCGTGGCGGGCGGCGTC
AACGTCTCCAGCCACCCGCTGAAGTACCTCCAGCTCGCCAAGGGCGGGTTCCTCTCCACC
GACGGCCGGTGCCGCAGCTTCGGCGAGGGCGGCGACGGCTACGTGCCCGCCGAGGGGTCG
GGGGCCGTCCTGCTCAAGCGCCGCTCGGCGGCCGAGGCCGACGGCGACCGCGTCCTCGCG
GTCGTCAGGTCGACGGCCGTCAACCACGGCGGGGCGGGCAAGGGCTTCAGCGTGCCCAAC
CCCAGGGCGCAGGGCGTGCTGATCGGCGAGGCCCTCGAACGGGCCGGTCTCGCCCCGGCG
GACCTCGGCTACCTGGAGGCGCACGGCACCGGCACCTCGCTCGGCGACCCGGTGGAGATC
ACCGGCCTCGTCCGCGCCTTCCAGGGGCACGACCTGACCGGCGTACGCATCCCCATCGGC
TCGGTGAAGTCCGGCATAGGGCACGCCGAGTCGGCCGCGGGCATGGCCGCGCTCACCAAG
GTCCTGCTCCAGTTCCGCCACCAGGAGCTGGTGCCCTCGCTGCATGCCGAGCGCCTCAAC
CCGCACCTCGACCTGGACGCCACCCCCTTCCGCCTGCAACGCGACCTCGCCCCCTGGACG
CCTCGCGTCGACGCCACGGGCCGTGCGCTGCCCCGCACCGCCGCCATCAGCGCGTTCGGC
GCGGGCGGCAGCAACGCCCACGTGATCCTGGAGGAGTCCGTGCCCCCGACGCAGACGCCC
GCGCAGGAGCCGCCGTACGTGTGCGCGCTCTCGGCGCGCGACGCCGAGCGGCTCCACGAA
CACACCGCGCGCACGGCGGAGTTCCTGCGCGGCGAGGGGCGTGCCGCCCACCCCGCCGCC
GTCGCCGCGACGCTGCTGACCCGCGAACCCATGGCGCACCGCCTGGCCGTGGTCTTCGAC
ACCGTGGACGACCTCGCCGACGCCCTTGAGGACCACCTCGCGGGAGCGGGCTCGCCGCGC
GTCCTGACCGGCACCGCGAGCCGCGCCGCCGCACCGGCCACCGGCCGCACCGCACCCGAA
CTCGCCGAGGCCTGGGTGCGCGGCGCCCCCGTCGCCGCCCCCGCCGGCGCACCCCGCGTC
TCCCTGCCCGGCTACCCCTTCGCCCGGGAGCGCTGCTGGCTGCCCGCGGCGGACGCCGTG
CGGCGGCCCGCGGCCACCGAACCCCACGGCGAGGTGCTGCTCTCCACCGCCACGCCCGTC
ATCGCGGGACACCGCGTCCAGGGCCGCTCACTGCTGCCCGCCCTCGCCTACGTCGACCTG
ATCGCCAAGGTCTTCCGCGACCACGGCCACGCCGTCGAACACCTCACGGTGCGCGACCTC
ACCGCGCTGCGTCCGCTCGACGTGACCGACGGCCCCGTGGCCGCCGAGATCCGCTGCACC
AGGGCCGGGGACGACCGCTGGCAGGTCACGGTCACCGACGGCGAGCCCTACGCCACGGCC
GACGTGCTCCTCACCGCGGCGCCCGGCTTCCGCGACCGGCTCGACGGCCACCCGCTCGGC
ACACCGGTCCCGCTCGCCGAGACCTACGCCAGGGACGGCGGGAACGGCCAGCACTACGGC
GGGGCGGCGCGTGCCGACGGCCTCGTGCGCGCGGACCAGGACCGGTTGACGGTCGAACTC
GACGCTCCCGCGGGCGACTTCCTGATCCATCCCGCCCTGCTGCTCGGCGGCGCGGTCGCG
GCCGGGTCGCTGCTCGACACCGGCGGTCAGGCGTTCCTGCCGCTGCACATCGGTTCGTTC
CGCGCGGCGGGGCCGCTGACCGGGGCCTGCACGGCGCGGGTACGGCGTGCCTCGGTGAGC
CGCCGGGGCGAGGTCGCCCGCTTCTCCGTGGACTTCTTCGACCCTCAGGGACGTCAGGTG
GCCGAACTGTCCGAGCTGTCGAGCAAGGCCGTGGCAGGTGCGCCCACCGCGGCTCCGCCC
CCCGCACGGGACGCGGACGCCGGGCCCGCGGCTGCCGCGCACGCCGAGAACTTCCTGCGG
CGGCTGCTCGCCGAACGCCTGGGCACCGACCCGGAGACGGTCCCCCCGACCGCCGGCTAC
TACGAACTGGGCCTGGCATCGGTGCAGGTGCTCGGCCTGGTCGAGGCCGTGCGGGACGTG
GTGGGGCAGGAGCTCGAACCCACGCTGCTCTTCGAGTACACCACCGTCCGCGACCTTGCG
GCCCATCTCGCCGCCCGCTTCCCGCACGCCTTCGGGGAGGCCTCCGAGGACGCCGGGGGC
GTGGCGGCCACCGCGGGCCCGGGCGCGGGCACCGGGCTCGGCCGCCTCCGCGAGAGCGCC
GTCATCCCGCCCCCGCCCGCGCCCGCCGCCCTCCCGGCGCCGGAGCTGGACCGTCTGATC
GCCCGCCAGGTACTGCTCCGCCTGCGCGCGTGCGGCCTCTTCGACGAGGCCCGCACCGAG
ACCGTGGACGGCATGGCCCGGCGCCTCGGTGTCGTCGGCAAGTACCGGCGCTGGCTCGAA
GAGGCCGCCCGGCTGTTCACCGCCGCCGGGCTGACGCGGCGGCACGGCGACACCCTTGAG
CTCGCCGACGAGCGGCCCCCCGCGCACGACACCGAGTGGACGGCCGTGCGCGAGCGCTTC
GCGGCCGACCCGTACTGGGACGCGCAGCTCTCCCTGGTCCAGGAGTGCGTGGAGCGGCTC
CCCGAGATCCTGTCCGGCACCGTGCCCGCGACCGACGTGCTCTTCCCCGGCGGCTCCCTC
GCCAAGCTCACCGCGGTCTACCAGGGCAACGCCGTCGCCGACCGGCTCAACGACGTGGTG
GCCGAGGTGACCGCCGCGGCGGTCCGCGAACGCGTCGCCGGCGACCCGTCGGCCACGGTC
CGCGTGGCCGAGGTCGGCGCGGGCACCGGGGGCACCACCGCCGTCGTCCTGCCCCGTCTC
GACCCCTGCGCCGACCGCCTGGAGTACTGGTACACCGACCTGTCCCCGGCCTTCCTCGAC
CAGGCCGAACGACGCTTCGGACCCGGCAGGGACTACCTGCGCTACGGCCGCTGGGACGTG
ACGCAGGAGGGCGCCGGTGAACGGCTCACCGGCGGCGGCTGCGACGTCGTCGTGGCGACC
AACGTCCTGCACGCCACCCCCGACATCCGCCTCGTCCTGCGCAACCTGGCCGCCGCGCTG
CGCCCCGGCGGGGTCCTCGTCGTCAACGAGGTGACCCGCAAGTCCGCCGTCCTCACCCTC
ACCTTCGGCCTGCTCGACGGGTGGTGGCTCTACGACGACGAGGACGTACGGTTGCCCGGC
GCACCGCTCCTCTCCGCGCCCCGCTGGCAGGAGGTGCTGCACGACAGCGGATTCGGCGAG
GTGTGGCGGCCGGTCGCCGAACCGGACGCGTTCGGCGAGGTCCTGGTCGCCCGGCCGGGC
ACGCGCGAGGTGCCACCGCCCGAGGGCACGCGGCTGCTCGTACGGGAGTGGGAGCCCTCT
CCCGCCCTCCCGGGCCGGGGCCCGGCCGCCGTGGCCGTCATCGGGGCCGGCCGGGAAGCC
GCCCGGCTCGCCGAAGCCCTGCCCGGTGGCCGGCTGATCGCCACCGCCGCCGACCTCGAC
GACCGCTTCGACGCGCTGGTCGACCTCGGCGGCGCCGAACCCGCCGACTGGCTGCCGGTG
CTCCAGCGCATGGCGGGCCGCCAGGCACTGCTGCTCGGCGTCGGCCGGGGCGACGCGCGC
GCCGGGCTCTACCGGATGCTCCAGAGCGAGTACGGGCGTATCCGCTCCCGCTACCTGGAG
GCCGACCCCGCGGACCCCGGCCTCACCGGCCTCGTGGCACGTGAACTGGCCGACGGCGGC
CACGACACCGAGGTCACCTACCGCCGGGGCGTACGGCACCGGGCCGTCCTCGAAGCGCTG
CCCGCCGCGACAGGGGCGCCCGTCCGCTTCCCCGCGGGCGAGCCGCTGCTCATCACCGGC
GGCACCCGCGGCATCGGCCTCGCGCTCGCCCGGCACGCGGTCGCCGAGTGGGGCGCCAGG
ACCCTCGTCCTCACCGGCAGGGAGCAGTTGCCGCCGCGCGCGGAGTGGGACCGGCACGGC
ACGGACACGGCGCTCGGCCGCAAGCTCCGGGGGCTGCGCGCCCTGGAGGACGACGGGGTC
AGGCTCAAGGTGCTCGCGCTGCCCCTGGGCGAGGACGCGGGCGCCGTCCACGCGGCCCTG
GACGGGATCCGCGGGGAGTTCGGGCCCATCGGCGGCGTGCTGCACGCCGCCGGGCTCGTG
GACCGGGACAACCTCGCCTTCGTCCGGAAGCCGGTCGAGGCCGTGCGCGCCGTGCTCGCT
CCCAAGACGGCCGGGCTCGACGCCCTGGTCGACGCCCTGGCCGGGGATCCACTGCGTTTC
TGCGTGCTGTTCTCCTCGGTCGCCTCCATGGTCCCCGCGGCGGCCGTGGGGCAGAGCGAC
TACGCGATGGCCAACGCCCACCTGGACGCCGTCGCCCGCCGCGCGCCGCACGGACTGCCC
GTGGTGAGCGTGGCCTGGCCGAGCTGGCGGGGCGTCGGCATGGGCAGCGAGCGGCCGGGC
CCCGGCTATCTGGCCACCGGCCTCGGTGAACTCTCCGAGGCGCAGGGGCTGCGGCTGCTC
GACCACATCCTGTCCACCGGGGCGGGTCCTGTCGTCCTGCCCGCGATCGTCGCTCCGGAG
TGGACGCCGGGGGCCCTGCGGGCGCCTGGAGCCGGCACGCCGGCCTCCGTCCCGGCACCC
GTGGCCGCGCACGCCGAAAACCCTGCGGCACCCGCGGCGGTCGGCACGGAGCCGGGGGCC
GGCGCGGACGCCGCCGCGAAGGCCGAGTCATGGCTGCTCGGCCTGCTCGCCGAGGAACTG
GGATTCGACCGCGCCCGCCTCGCCGCCGACGTGCCGATCTCCGACTACGGCACCGACTCG
ATCATGATGGTGCAGATCCTGCGCACGGTCGGCGCCGAACTGGACGCCGACCTCGATCCC
TCGGTCCTCGTGGACCATCCCACCGTGCGGTCCTTCGTCGGCTGGCTCACCGCCCACCAC
GGGCAGGCCCTGGCCGCGGCCTTCGGCGCCACGCCCCCGGCCCCCGTGCTCCCGGCCGCC
GCGCCCCCGGTCACCGCTGCCGTGCCCGCCGCGCGCGCCGAGGCCCCCGCCGACACCCCG
TACGACATCGCGGTCGTCGGCATGTCGGGCCGCTTCCCCGGCGCCCCGGACCTCGACGCC
TACTGGCGGCTGCTCAGCGAGGGGCGTTCCGCGATCGCCCCCGTGCCCGCCCGGCGCTGG
GCGGACGGGACCAAGTACACCGCGGGCCTGCTCGACCTCGAAGGGTTCGACCCCGGCCAC
TTCCACCTTTCCGACGCCGACGCCGCCGCGATGGACCCGCAGGCGCTGCTCCTGCTCGAA
GAGACCCTGTTCGCGTTCTGCGACGCGGGTTACGCCCCCGACGAGCTGAAGGGACGCGGC
ATCGGCGTGTACGTGGGCGGACGCTCCCGGCACGTCCCGGACGAGGCGACCCTCGGCCGC
AGCCGCAACCCCGTGGTCGCCGTCGGCCAGAACTACCTGGCGGCCAACCTCTCGCACCAC
TTCGACCTGCGCGGCCCCAGCACGGTCGTGGACACCGCCTGCTCCTCCGCGCTCGTGGCC
CTGCACCACGCCGCCCAAGCCCTGCGCTCCGGCGACGTCGAGGCGGCCGTGGTCGCCGGC
GTCACCCTGCTGCCCGACGCGGGAGGGCACCGCCTCTTCGACCGGCGGGGACTGCTCAAC
ACCGGCACGGAGTTCCACGTCTTCGACCGGCGGGCCCGCGGCTTCACGCCCGCCGAGGGC
GTCGGCGTCCTCCTGCTCAAGCCGCTCGCCGCGGCCGAGGCGGCGGGCGACCGTGTGCAC
GCGGTGCTCAAAGGGATCGCCGTCAACAACGACGGCAGGACCGCCGGCCCCGCCACCCCC
AACCCCGCGGCCCAGCGCGGCGTCATGGCCCGGGCCCTCGCGAAGGCCGGGGTCGCCGCC
GACGACGTCACGTACATCGAGACCAACGCGGCGGGCTCGCAGATCCCCGACCTGATCGAG
CTGAAGGCCATCGCGGCCGTCTACCGCGACGGCTCCGACACACCCTGCTCGCTCGGCTCG
GTCAAGCCCAACATCGGCCACCCGCAGTGCGCCGAGGGCATCGCCGGGGTCATCAAGACC
GTCCTGATGCTCCGCAACCGCGCCATCGTGCCGTTCCTGAGCGGGCGGCAGCCGCTGGAG
CACTTCGACTTCGCCGCGACGCCCCTGCGCTTCGAGCGCGCGCTCACGCCGTGGCCCGAC
GCGCCGCTGCTCGCCGCCGTGTCCAGCTTCGCGGACGGCGGCACCAACGCCCACGCCGTC
CTGGCCGGGCGGACGAACGGCACCACCGGCCGCCGTGCGCCGCTGGACCGCCCCCGCCTC
GCACGGCGCGGCCTGCCCGCGGCGGGCGCCGAACGCTTCGCGGTCATCGGCATGGCGGGG
CACTACCCCGGCGCGGAAGACCTCGACGCGTTCTGGGCGAACCTGAAGGACGGCAGGGAC
AGCGTCACCGAGGTGCCCGCCCAGCGCTGGACCCCGGGGGACGGTGACGGCTCGCGCTGG
GGCGGATTCCTCGACGACGTCGGCCGGTTCGACGCCGACTTCTTCCGGATCTCCCGCCCC
GAGGCGGAGATCACCGACCCCCAGGAGCGCTGGTTCCTGCGCACCTGCTGGGAGGCGATC
GAGGACTCCGGATACACCCCGGAGGGGCTCACCGGGGCCAAGGGCCCCGACCGGCGGCGC
GCGGTCGGCGTGTTCGCCGGGGTCATGCACAAGGACTACACCCTGGTCGCCGCCGAGGCG
TCCGCACCGGTCCCGCTCAGCCTCAACCAGGGCCAGATCGCCAACCGCGTCTCCTTCGTC
TGCGACTTCCACGGCCCGAGCATGACGGTGGACACCCTGTGCTCCTCCTCGCTCACCGCG
CTGCACCTCGCGGTGGAGTCGCTGCGCCGGGGCGAGTGCGAGGTCGCCGTCGCGGGCGGC
GTCAACCTCTCGCTGCACCCGGGCAAGTACCGCACCTACGGAGCCGTGGGCATGCACTCC
TCCGACGGCCGCTGCCGCTCCTTCGGAGAGGGCGGGGACGGCTATGTGTCGGCGGAGGGC
GTCGGCGCGGTCGTCCTCAAACCGCTCGCCGCGGCCGAGGCCGACGGTGACCACATCTAC
GCCGTCGTCGCCGGCTCCGCGGTCAACCACGTCGGTTCCGCGAGCGGGTTCAGCGTGCCG
AGCCCGGTCGGCCAGGCCGCCGTCATCACGGCGGCGCTCGAAAGGGCGGGCGTCGACGCC
CGCACCATCGGCTACCTGGAGGCGCACGGCACCGGCACCTCGCTCGGCGACCCGGTCGAG
ATCAGAGGCCTGTCCACCGCCTTCGGCAGGCACACCGACGACCGGGGGTTCTGCGCCATC
GGATCGGTGAAGTCCAACATCGGGCACGCGGAGTCCGCGGCCGGTGTCGCGGGCCTGACC
AAGGCCGTGCTGCAACTGCACCACCGCACGCTTGTCCCCTCGCTGCACGCGGACACCGTG
AATCCGCTGCTCGGCCTGGACGGCACCCCCTTCCGGCTCCAGCGCGCCACCGAGGCGTGG
CCCGCGCCGCCCGAGGGGCCGCGCCGGGCGGGGCTCAGCTCGTTCGGGGCGACCGGAGCG
GGCGCGCACATCGTCCTTGAGGAGTACGTGCCCGCCGCCGAGGCGGCGGCCGCGTCCCGT
ACCCCCGGGGAGCCCGTCGTCGTACCGCTCTCCGCGCGCACCCGCGACGCGTTGCGGCAG
AGTGCCGCACGGCTGCGGGACGCGCTCACCCAGGGCGGCCGGACGCTGCGTGATGTTGCG
TACACCTTGCAGGTGGGACGGGTCGAGTGGCCCGAGCGCGTGGCTTTCGTGGCACGTGAC
GTGAGGGAACTTCTGGAGCAGTTGGCCGAGTTCGCGGTCCACGGGGTCCGGCCGGCGCTC
GGCGCCGGTGAGCCCCACGACGTCGCGCGGCGGTGGGCGGACGGCTCCTCGGTCGACTGG
GACGCCCGCCACGGGAAGGACCGGCCGCGGCGGGTCTCCCTGCCGACGTACCCGTTCGCG
GGGGAGTGGCACTGGGTTCCCGGTGGTGTGCCCGCCGCCCCCGCCGGGGCGCCCGGCGCG
GACCGTGCCGTGAGCGTCACCGCCGCGGAGGCGCCCGTCACCCCGCCGAGCGGACTGCTG
AGCGTGCCGCGCTGGGAGGAGGCGCCCGTCGCGCCCGCCACGGGGCCCGCGCCCCGCAGG
GTGCTGATCGTCACCGACGAGCCGGGGGCGGGTCTCGCAGGGGCGCTGGCCGAGCACTAC
CGGCGCCACGGCGGCAGTGAGGTGACGGAGCGCCCGCTCGACGACCTCCGAATCCCTCCC
GGGGCGGCGCCCGACCGGGTGCTCCTGGTGACCGGTTCGGGACCGGCGGACGGCGGGGCC
GCGGCCGGGCCCGAACTCGCGCTGCTGCGACTGGTCAAGGCCGTCCAGCGGCTGGACGGC
GGACGCACCGACCTCTGCGTCGTCACCCGGGACACCCAGTCCGTCACGGGTGAGCGCGGC
GCCGCGCACGGCGCGGGCCTCACCGGTCTCGCCTACTTCGTGGCGCGGGACTCGGGCCGG
TTCGCCGTACGCAACATCGACGTGGCGGCGGCCGACCTGCGCACGCCCGGCGACCGCGCC
GCCGTGGCCGCCCTGGTGGCCGACGAGCCGGCCTCCCCCGCCGCCGACCTCGTCGCACTG
CGCGGCGGGCGCAGGTACCGTCAGAAGGTCGGCCCGGTGACCGCGGCCGAGGCCGCCACG
CCCGGCCTGCCCGGCATCAGGCCCGGCGGCACCTACGTGGTGGTCGGCGGCAGCGGCTTC
GTCGGACGCGTCGTCAGCCGCCACCTCATCGACCGCTACGACGCCAAGGTGGTCTGCGTC
GGACGCCGCCCGCAGAGCGACCCCGCGGTGCGCGAGGCGGTCCACGGCGACCGCGTCGGC
TACGTACAGGGCGATGTCACCGACCCGCGCGAGGCACGGCGGGTCATCGCCGGGGCCAAG
GGGCTGCTCGGCGAGATCCACGGAGTGCTGTTCGCGGGCGCCACCCGCATCACCGGGGCG
CCCGGCGCGCTCGCCGGGCTCGGCGAGGACGAGTTCCGCGCGCACTACGAGATCAAGGCC
TCGGGCGCGCGCAATGTGTACGAGGCCGTGGCGGACGAACCGCTCGACTTCCTCTGCTAC
TTCTCCTCCGCGCAGGCCTTCTCCTTCGGCGGGGCGGGCACCCACGCCGCCTACGCGGCC
GGCATCACCGCGGCCGACGCCTTCGCCAGGGCCGTCGCACCGACCGCCGCCTTCCCCGTC
GGCATCGTCAACTGGGGTGCCTGGCGCGCCTCGTTCGGCGAGGCCGCGCGGGACTACCCG
ACGCTCGGCTTCCTCGACGACGACGAGGGGGCCGCCTGCTTCGACACGGCGGTGCGGCTG
CTGCGGGCGGGCCGCCACCGGCAGGTCATCGGCATGCGGGCCCCGGCACGGTCCGCTGCC
CGCGCCGAGGCCGCCGACCGGGCGCACCCGGCCGCTCCCGCGGGACGTGACCGGCGACCG
GAGCTCAGGCGGCTGCTGGTGGAGCGGCTCGCCAGGACCCTGCGGGTGCCGGCGGAGGAC
CTGTCGCCCTCGACGGCCTTCGCCGACCTCGGCGTCGACTCGATCACCGGCTCCACGTTC
GTCGCGGCCATCGCCGAGGAACTGGGCGTCGAGCTGAACGCCGCCGCCCTCTACGAGTTC
TCCTCCGCGGAGCGGCTGGCCGAGCACCTCGACGCACTGCTCGGCCCTGCGGCGCAAGAA
CCACCGGCACCACCGGCACCACCGGCGTCGTCGGCGTCGTCGGCGTCGTCGGGGACGTCC
GCGCCGGACGACCTGATCGTGAAGTTGGAGGCACGGTTCGCGGCGGGTGAGCTGTCCGCC
GCGGAGGTGCTCGACCTGCTCGACGCAGAGCTGGCCACGAGGGAGCAACGATGA
[5] KS35..409
[5] KR975..1155
[5] MT1417..1520
[5] ACP1621..1689
[6] C1707..1993
[6] A2171..2561
[6] glycine2339..2438
[6] PCP2661..2730
[7] KS2765..3149
[7] DH3390..3546
[7] KR3856..4022
[7] ACP4104..4172
[8] KS4224..4596
[8] ACP5075..5144
[8] MT5364..5466
[8] KR5714..5912
[8] ACP6008..6077
[9] ks6122..6474
[9] KS6551..6917
[9] KR7350..7531
[9] ACP7598..7671
[5] KS103..1227
[5] KR2923..3465
[5] MT4249..4560
[5] ACP4861..5067
[6] C5119..5979
[6] A6511..7683
[6] glycine7015..7314
[6] PCP7981..8190
[7] KS8293..9447
[7] DH10168..10638
[7] KR11566..12066
[7] ACP12310..12516
[8] KS12670..13788
[8] ACP15223..15432
[8] MT16090..16398
[8] KR17140..17736
[8] ACP18022..18231
[9] ks18364..19422
[9] KS19651..20751
[9] KR22048..22593
[9] ACP22792..23013

close this sectionFeature

BLASTP
Database:UniProtKB:2011_09
show BLAST table
InterPro
Database:interpro:38.0
IPR000873 AMP-dependent synthetase/ligase (Domain)
 [2171-2561]  2.39999999999997e-106 PF00501
PF00501   AMP-binding
IPR001242 Condensation domain (Domain)
 [1707-1993]  1.40000000000001e-63 PF00668
PF00668   Condensation
IPR009081 Acyl carrier protein-like (Domain)
 [1623-1689]  3.99999999999998e-84 G3DSA:1.10.1200.10 [2664-2731]  3.99999999999998e-84 G3DSA:1.10.1200.10 [4105-4177]  2.79999999999994e-84 G3DSA:1.10.1200.10 [5076-5146]  3.99999999999998e-84 G3DSA:1.10.1200.10 [6009-6082]  3.99999999999998e-84 G3DSA:1.10.1200.10 [7600-7675]  3.99999999999998e-84 G3DSA:1.10.1200.10
G3DSA:1.10.1200.10   ACP_like
 [1625-1688]  1.2e-13 PF00550 [2664-2729]  3.1e-09 PF00550 [4107-4171]  4.40000000000001e-11 PF00550 [5079-5143]  3.5e-08 PF00550 [6011-6076]  2.3e-10 PF00550 [7606-7670]  1.2e-11 PF00550
PF00550   PP-binding
 [1621-1689]  PS50075 [2661-2730]  PS50075 [4104-4172]  PS50075 [5075-5144]  PS50075 [6008-6077]  PS50075 [7598-7671]  PS50075
PS50075   ACP_DOMAIN
 [1614-1722]  1.39999892049878e-15 SSF47336 [2654-2780]  7.19999557654549e-16 SSF47336 [4101-4240]  1.70000295590054e-18 SSF47336 [5068-5146]  2.19999900980708e-15 SSF47336 [6008-6137]  2.90000354677981e-18 SSF47336 [7595-7708]  1.20000117458134e-19 SSF47336
SSF47336   ACP_like
IPR010071 Amino acid adenylation domain (Domain)
 [2171-2561]  2.59999999999995e-128 TIGR01733
TIGR01733   AA-adenyl-dom
IPR013217 Methyltransferase type 12 (Domain)
 [1417-1520]  1.6e-18 PF08242 [5364-5466]  1.3e-15 PF08242
PF08242   Methyltransf_12
IPR013968 Polyketide synthase, KR (Domain)
 [975-1153]  2.99999999999998e-42 PF08659 [3856-4021]  3.2e-40 PF08659 [5717-5911]  1.5e-37 PF08659 [7350-7530]  8.20000000000006e-28 PF08659
PF08659   KR
IPR014030 Beta-ketoacyl synthase, N-terminal (Domain)
 [35-282]  7.90000000000016e-84 PF00109 [2765-3004]  3.99999999999998e-88 PF00109 [4224-4471]  1.10000000000001e-84 PF00109 [6122-6352]  2.29999999999998e-58 PF00109 [6551-6792]  2.59999999999995e-73 PF00109
PF00109   ketoacyl-synt
IPR014031 Beta-ketoacyl synthase, C-terminal (Domain)
 [290-409]  1.6e-40 PF02801 [3014-3149]  2.80000000000001e-38 PF02801 [4479-4596]  5.69999999999998e-34 PF02801 [6360-6474]  9e-33 PF02801 [6800-6917]  1.9e-39 PF02801
PF02801   Ketoacyl-synt_C
IPR016038 Thiolase-like, subgroup (Domain)
 [35-292]  G3DSA:3.40.47.10 [294-471]  G3DSA:3.40.47.10 [2765-3016]  G3DSA:3.40.47.10 [3020-3208]  G3DSA:3.40.47.10 [4226-4483]  G3DSA:3.40.47.10 [4485-4653]  G3DSA:3.40.47.10 [6122-6362]  G3DSA:3.40.47.10 [6364-6522]  G3DSA:3.40.47.10 [6552-6803]  G3DSA:3.40.47.10 [6804-6970]  G3DSA:3.40.47.10
G3DSA:3.40.47.10   Thiolase-like_subgr
IPR016039 Thiolase-like (Domain)
 [27-470]  3.49999466863949e-90 SSF53901 [2756-3208]  1e-89 SSF53901 [4216-4652]  6.29998179434741e-88 SSF53901 [6113-6472]  3.70001063537244e-64 SSF53901 [6543-6972]  2.1000026783403e-81 SSF53901
SSF53901   Thiolase-like
IPR016040 NAD(P)-binding domain (Domain)
 [973-1152]  1.10000000000001e-101 G3DSA:3.40.50.720 [3859-4005]  1.10000000000001e-101 G3DSA:3.40.50.720 [5716-5914]  1.10000000000001e-101 G3DSA:3.40.50.720 [5990-5993]  1.10000000000001e-101 G3DSA:3.40.50.720 [7349-7544]  2.50000000000001e-57 G3DSA:3.40.50.720
G3DSA:3.40.50.720   NAD(P)-bd
IPR018201 Beta-ketoacyl synthase, active site (Active_site)
 [195-211]  PS00606 [2918-2934]  PS00606 [6265-6281]  PS00606
PS00606   B_KETOACYL_SYNTHASE
IPR020806 Polyketide synthase, phosphopantetheine-binding domain (Domain)
 [1622-1692]  1.20000117458134e-13 SM00823 [2662-2733]  5.90000104995095e-12 SM00823 [4105-4175]  4.39999967439747e-16 SM00823 [5076-5147]  9.59999914716171e-09 SM00823 [6009-6080]  3.59999643435987e-11 SM00823 [7603-7674]  2.70000183580794e-15 SM00823
SM00823   PKS_PP
IPR020807 Polyketide synthase, dehydratase domain (Domain)
 [3390-3546]  5.8999880940569e-29 SM00826
SM00826   PKS_DH
IPR020841 Polyketide synthase, beta-ketoacyl synthase domain (Domain)
 [37-473]  SM00825 [2766-3208]  SM00825 [4226-4655]  SM00825 [6123-6524]  1.19999063421924e-112 SM00825 [6553-6971]  SM00825
SM00825   PKS_KS
IPR020842 Polyketide synthase/Fatty acid synthase, KR (Domain)
 [975-1155]  4.10001399699315e-33 SM00822 [3856-4022]  1.79999754022375e-31 SM00822 [5714-5912]  1.20000117458134e-34 SM00822 [7350-7531]  1.29999924468179e-17 SM00822
SM00822   PKS_KR
IPR020845 AMP-binding, conserved site (Conserved_site)
 [2291-2302]  PS00455
PS00455   AMP_BINDING
IPR023213 Chloramphenicol acetyltransferase-like domain (Domain)
 [1718-1862]  3.4e-07 G3DSA:3.30.559.10
G3DSA:3.30.559.10   CAT-like_dom
SignalP
 [1-24]  0.778 Signal
Eukaryota   
 [1-24]  0.756 Signal
Bacteria, Gram-negative   
 [1-29]  0.411 Signal
Bacteria, Gram-positive   
TMHMM No significant hit
Page top