Rubra_00070 : CDS information

Location

Organism:
Strain: NRRL 3061 (=NBRC 14000)
Entry name: Rubradirin
Contig:
Start / Stop / Direction: 23,472 / 7,423 / - [in whole cluster]
23,472 / 7,423 / - [in contig]
Location: complement(7423..23472) [in whole cluster]
complement(7423..23472) [in contig]
Type: CDS
Length: 16,050 bp (5,349 aa)
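The reported length can be recovered from the location string. The following is a minimal Python sketch, assuming 1-based inclusive coordinates (as in GenBank feature locations) and that the last codon of the CDS is a stop codon; the helper name `cds_length` is hypothetical and not part of the database.

```python
import re


def cds_length(location):
    """Return (length in bp, length in aa) for a simple 'start..stop' location.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is not counted in the protein length.
    """
    m = re.search(r"(\d+)\.\.(\d+)", location)
    if m is None:
        raise ValueError(f"no start..stop range in {location!r}")
    start, stop = int(m.group(1)), int(m.group(2))
    bp = stop - start + 1      # inclusive range
    aa = bp // 3 - 1           # drop the stop codon
    return bp, aa


print(cds_length("complement(7423..23472)"))  # (16050, 5349)
```

The strand (`complement`) does not affect the length, so the sketch ignores it.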

Annotation

Category: 1.1 PKS
Product: polyketide synthase
Product (GenBank): putative polyketide synthase
Gene:
Gene (GenBank): rubA
EC number:
Keyword:
Note:
Note (GenBank):
Reference
ACC:
PmId: [18080113] Biosynthesis of rubradirin as an ansamycin antibiotic from Streptomyces achromogenes var. rubradiris NRRL3061. (Arch Microbiol., 2008)
comment
Report of the rubradirin biosynthetic gene cluster.
RubA(5349aa): Polyketide synthase

Loading domain: CoA ligase ACP
Module1: KS ATmm DH KR ACP
Module2: KS ATm ACP
Module3: KS ATmm KR* ACP

The starter unit is 3-amino-5-hydroxybenzoic acid (AHBA).

Modules 1, 4, 5, and 6 appear to contain DH domains, but the DH consensus sequence is not conserved in any of them except module 4.
KR domains were detected in modules 1, 3, 4, 5, and 6. The KR consensus sequence was identified in modules 1, 4, and 5, while modules 3 and 6 showed modifications.

Disruption of rubA abolished rubradirin production.

PKS/NRPS Module

A0
1 methylmalonyl-CoA
2 malonyl-CoA
3 methylmalonyl-CoA
A 38..445
PCP 908..982
KS 1002..1378
AT 1537..1855
dh 1904..2085
KR 2304..2484
ACP 2595..2665
KS 2694..3069
AT 3232..3555
ACP 3631..3703
KS 3725..4102
AT 4269..4594
kr 4914..5091
ACP 5196..5266

Sequence

>polyketide synthase [putative polyketide synthase]
MRDSVRAELIRALPLVLREHADNFGGKVAFEDAERAVTYADLEARTRRLAGHLAGLGVRR
GDRVMICLRNSVEMLESYLAILRADAIGVPVNPASTDFELDYLLADSEAAVVITDPVHVA
GFLRSPSLPRGARLLVTGDTPAHASVHAYQELVRTEPAEPARDDLGLDDVAWTFYTSGTT
REPKGVLSSQRNCLYSVAASYVPIPGLSADDRVLWPLPLFHSLSHIACVLAVTAVGATAR
IMDSPSGDEFLEAARETRATFVAGVPTTYHYLLEARRQRRITLPDLRIGLVGGAVAGPGL
CRSFREEFGVPLVDAYGSTETCGAITMNPPGGVRVDGSCGLPVPGVDVRIVDPETGRDVP
AGAEGEVWVRGPNVTPGYHNKPEATAAAFQDGWYRTGDLARRDAAGYFTISGRINDLIVR
GGENVHPEEIEAVIRAVPGIADVGVAGRPHEVLGEVPVAYVVAGPSGVEADAVIERCRRE
LSTFKLPEEVYEVAGVPRTASGKIQRRLLADQPAVLRCTASGVHDGVLRLEWVPAPARGT
AGPAPTTWAFVGSAAAGLAAAAGSSSCYADLATAGAAEMTVLLAPDMPGTWAERRAGIEQ
LVEELTAWAARAGSGQLVLVTRRAVAVSARDDPADPAQAMLAAAAGQVLGSRAILVDLDG
ASPVDAVPYATLRDEPRWALRSGELLVARLTRQAVPDRSIRDSWSGSDGVVVLTGAHTVR
GAAVARHLASVLPNERLLLIAGEDAAEWEPAIAAGPAAVILADGDPELAAKLSTAADFAR
TSFITIADSYEVTGAADPGRAVSAAMMGAVVHDRRRRGLAGSVLTWCEPAGDTPGSPWLR
DGLFALDTLLATSDPTPLFALRARRPEPGAGVPALLRALFPKPLVQDGLEDTSSALRAEL
ARLDRHAQHTLLQALVRDESAASLNQPDTVAVPVDRAFRDLGYNSVALIELRTRLMARTG
LKLPTAAVFDHPTPRALAAYLHAELVGDTAADTAAGKPSDPDEPVVIVGMACRLPGGITG
PDELWRLVSEGRDASGPFPADRGWDLERLFDADPGRAGTSYVDRGGFLDGAAEFDADFFG
ISPREALAMDPQQRLLLETAWEALEHAGIDASSLRGTDAGVFAGLMEQGYGTGGPVPEEL
EAFQTTGTAGSVASGRISYLLGLRGPAITVDTACSSSLVALHLAAHALRRGECSLALAGG
ATVMATPQSFVEFSRQRALAADGRCKSYASAADGTAWAEGAGMLVLERLGDARRNGHRIL
AVLRGSSVNSDGASNGLTAPSGEAQQRVIRQALTSAGLGPADIDAVEGHGTGTVLGDPIE
AEALLATYGRDRDAGQPLWLGSLKSNVGHTQAAAGVAGVIKMIQALRHGVLPATLNVDEP
TSRVDWSSGAVALLTENRTWPDLGRPRRAGISGFGLSGTNAHVIVEEAPRQPAVRRPAPP
APSAPRIVPLVLSARGTRALNRLADRLVPLIQTADDSSWADIAVALATRRATMDERAVVL
AGSRDEALAGLRALARDEASPLVVRDSATSDASSGVVLIFPGQGAQWAGMGRELLDTSPA
FAGVVQECARLLTEWADWSLIDVLRGEADQELLERVDVIQVASFAMMAGLAALWASAGVR
PDAVVGHSQGEIAAAYVAGALSLRDAMRVVVVRSQAIAGTLSGRGGMASVRLDASAAAAW
LAERDGRVQVAAVNGPSSVVLSGDPEALADALSELAESGVDVRRVAVDYASHSDQVEAVR
DHLAVALADLSPAEPTVPFHSTVTEGRATGALDGGYWYRNLRQQVRFGPTIDRLLEQGFG
AFVEVSPHPVLVQPVAEAVHHAPAEAVVVGSLRRDDGGLGRLVRSMAELFVHGVPVDWRG
LLADDVARAAWVDLPTYPFERQHFWRRPVPSAGASGMGLDDAGHPLLGALVRLPESGGLV
ATGRWSASTLAWAAGTEATDGSDGAPVRVPEAVLVELALWAGGEADAPVVEELTIDPAVV
LPRHGGRDIRIVVGRPDASRRRPITVYSGPPGAAVTQEWTRHATGTVTPSAGPPGEFSLG
PAAEVALDDAEQPDAHRYGIHPALLDAAVRAVVSPDERPKTWRGVQLLASGATGIDVRSA
SAPTRPAWHIELTDPAGQRVLEVAELVTEQREQAQPAASCAGLFGIDWVDHPLPPNSSAA
GSAARYITDATDLVGAPATVLIHEVDISAGDPRGAVVAELELLQAFLARPAGDDDRLVIV
TPDGEDVVTSAVRGLVRSAQSEHPGRFVLVAVDDAAGATNAVIAAALSGEPQLRLRDGRA
QVPRLDRLHPAASSSQARPLDPDGTVLITGGTGTLAAVTARHLVTEHGVRHLILAGRRGP
EADGAAALHDELVGLGATVRLVAADVADREQVVALLAMVEENHPLTAVVHTAGVLDDAVI
TELDPHRVDAVFGAKVDAARHLDDLTRDLGLAAFVLYSSAAGVLGNPGQGNYTAANAALD
AVARERRHAGLAAVSIAWGYWDRASGLTRHLGHADHRRNRDIGMVALSDGDGMALLDQCL
RAGDGPDAALVAAGFDLAALRAADQSRVPPVLRSLVRGKRPIGRATTNARPAPAAPASPS
GRISALPPAEQFDALVDLIRRQCSLVLGHASTDAIRLDRTFKDSGFNSLTALELRNRLAT
ATGLSLPPSMIFDFPRPRSLAAHLQAKLLDNAQAQPATAQPRATAAEPRATAGEPIAIVA
MACRFPGGVHSPEDLWRILSTETDTVTEFPTDRGWDTARLFDPDPDHPGTTYVRHGAFLD
DPGAFDAGFFGISPQEALAKDPQQRLVMETSWELLERAGIDPHSLHAQDVGVFVGVNSHD
WTVRTHHASGVEGFRLTGSSGSVLSGRVAYHLGLEGPAITVDTACSSSLVALHLAVQAIR
NGECSMALAGGVMVMGTVETFIEFSRQPGLAPDGRCKGLADTAGGTGWSEGVGLLLVEPL
SRAREQGHRVLALVRGTAVNSDGASNGLTAPNGPSQQRVIRKALAAAGLRGDEVDSVEGH
GTGTTLGDPIEAQALLETYGRDRPADRPLWLGSVKSNIGHTQAAAGVAGVIKMVMAMRHG
VLPRTLHVDRPSTHVDWSAGAVRLLTERRDWPRAGQPRRAGVSSFGIGGTNAHVILEEAG
EESAQHDADEMMTGDPALDRPSDEGCVPVPVSGRTPAALRAQAGRLADFVASRPELEPAD
IALALATGRAQLDRRAVLVVRHRDELVGDLRRLAKGAAPAIGGAPVDGKLAFLFTGQGSQ
WPGMGRELAARFPVFAEAFTDACRAVETHWAGHATAPLSDVVFAAPGAQDGRLIDQTIYT
QAGLFALETALYRLYESWGLRPDLVVGHSIGEITAAHVSGVLDLADAGKLVATRARLMQA
LPAGGAMVAVQASEAEVAPELVEAPDVVHVAAVNSARSLVLSGAEAAVLWVAGELARFGH
KTRRLPVSHAFHSPLMRPMLAEFREVAESLTYRPGRVPVVSTVTGRPDTGQRLATADYWV
EQVQATVRFGDAVRSLRDQGVTTFLELGPGGALTTMASESLELSAASCIATLRADGAEVA
DVMAALGELHVRGVALDWPAVVGRPAGSASALATELPTYAFQSRRYWLDQAALGDAVPAA
EAGPADGREDVQESVAARLANRSRNERRRAALEVVRESVAVVLGYQPADLDAMDDDQSFR
SLGFDSLGGVRLRNRLRDLTGVDLPVTAVFDHPTPKVLAAHVADEVAGEVAEVTDRAPVS
PADPDEPIAIIGMGVRLPGGVESPDDLWRLVIERRDAISGFPTDRGWDTDGLYHPDPAHP
GTTYTRSGGFLYDAAQFDPGLFGISPREALAMDPQQRLLLEASWEAMERAGIDPLSARGE
EIGVFTGIVHHDYATRLNRIPEEVRGYVMTGTSPSVASGRIAYVFGFEGPAVTLDTACSS
SLVAIHQAAQALRHGECTMALAGGATVMASPEAFVEFSSQRGLSADGRCKTFSSTADGTG
WSEGAGVVLLERLSVARQRGHRVLATIRGSAVNSDGVSNGLTAPNGAAQQRVIRRALASA
GIGPADVDAVEAHGTGTVLGDPIEAGALLATYGEARDAGQPLWLGSLKSNIGHTQAAAGV
AGVIKMVEALRHRVLPPTIHVEEPTGQVDWSAGTLALLTEPREWPRTGRPRRAGVSAFGA
SGTNAHLILEQAPAEEAATAPSEDPEPLKALATNIVPLVVSAKSARSLGAQATRLRMFLD
SPAQSGEPTLPAVADALVRGRATLPERAVVLAGSRDEALTGLAALARGEVTPNLTTGSVT
GSGAPGRLVLVFPGQGAQWAGMGRTLLEGSAVFRARIDECARALRPWVDWSLTDVLRGEA
DEQTLGRVDVVQPASFAMMVGLAALWESLGVRPDAVVGHSQGEIAAACVAGVLSLADAAR
IVAVRSQAIATTLSGRGGMASVMLGADEANDRLAAWAGRLEVAVVNSPSSVVVAGDEESL
AEALESLTRDGVRTRPVPVDYASHTHHVERIRDVLATALSGIDTRPPRIPFYSTVTGGWV
DDGHCLDADYWYQNLRRPVGFGPAVASLLTEGFGVFLEVSGHPVLVQPMTDVIDDPDGPL
RGAGRPIVSGTLRRGADGPASFLAAAARLFAGGVPLDWSATLPAQPSRHTVELPTYAFDR
QHFWLAEAAADNDQPAGRATDAGFWNAIDDADPAALATMLKLSAHQSQALDAVLPALADW
RKARENWSVSERLRYAIGWASPPREALGVPAGRWLVITPETADVLRDGLLDQLRANGLDV
VPCAVETGLSRDELARRLGGFLTDDGIAGILSLLALPDRMESHRPDAAALTTSTLTLIQA
LAAAGATAPLWCLTQGAVSVGVRSAVAGPAHVAQAAVWGLGRAAALERLNQWGGLIDLPA
EPDGRAVRHLLGVLTGVSGEDQVAIRRTGVHVRRLRRAPHPAGTGGERQWRPHGTVLVTG
GAEGLGRYASLWLARAGAERLVVTTSGREPGERVESLRDEVAKLDKGTVVESCADADRDA
LAGLVHAPGRPLTAVVHAADLSLTSLIDETGDAEVTEVFRAKVNTAVWLSELTADLPLDA
FIVFSSIAGIWGGGGQAAYGAANAVLDALVRRLRADGVPAQAIAWGALTAGGAGMDEETL
AQLRRRGVIPMTPDTATAALDQAVQAATESVVIADMDWSAFIVPFTSARTSPLFDDLPEA
AAAIEAAQPSDDFAESASSLMTSLRAVGEAEQDRILLRLVRSQASMVLGHGGADGIGAAQ
AFQEAGFDSLAAVNFRNSLITATGLRPPATLIFDCPTPQAVVAYLRSELLEAEDDADVRG
EDVRRILASVPYQRLKEAGVLETLLGLADAEAGGARASDEGPGPEAAADELIDVMDVDSL
IKRALGSGS
>polyketide synthase [putative polyketide synthase]
ATGCGTGATTCCGTGCGCGCCGAGCTGATCAGAGCGCTCCCCCTGGTGTTGCGGGAGCAC
GCGGACAACTTCGGTGGGAAGGTCGCCTTCGAGGACGCGGAGCGGGCGGTCACGTACGCC
GATCTGGAGGCGCGGACCCGGCGGCTGGCCGGGCATCTGGCGGGACTCGGTGTGCGGCGC
GGCGATCGGGTGATGATCTGCCTGCGCAACAGCGTGGAGATGCTGGAGAGTTACCTCGCG
ATCCTTCGCGCGGACGCGATCGGGGTGCCGGTCAACCCCGCGTCCACCGACTTCGAACTG
GACTATCTCCTGGCCGACAGCGAGGCGGCCGTCGTCATCACCGACCCCGTGCACGTGGCC
GGCTTCCTGCGCTCGCCGTCCCTGCCCCGCGGCGCCAGGCTGCTGGTGACCGGCGACACA
CCCGCGCACGCGTCGGTCCACGCCTACCAGGAACTCGTCAGGACCGAGCCCGCGGAACCC
GCACGGGACGATCTCGGCCTGGACGACGTGGCATGGACGTTCTACACCTCGGGGACCACC
CGGGAACCGAAGGGAGTGCTGTCCAGCCAGCGCAACTGCTTGTACTCGGTGGCGGCCAGC
TATGTGCCGATCCCCGGGCTGTCGGCCGACGATCGCGTGCTGTGGCCGCTGCCGTTGTTC
CACAGCCTCTCCCACATCGCGTGCGTGCTGGCGGTGACCGCGGTCGGCGCGACCGCCCGG
ATCATGGACAGCCCTTCGGGCGACGAGTTCCTGGAGGCCGCCCGGGAAACCCGGGCCACC
TTCGTGGCGGGTGTGCCGACCACCTACCACTACCTTCTGGAGGCGCGGCGCCAGCGCCGG
ATCACCCTGCCGGACCTGCGGATCGGGCTGGTCGGGGGAGCGGTCGCCGGCCCAGGGCTG
TGCCGGTCGTTCCGCGAGGAGTTCGGGGTGCCGCTGGTCGATGCGTACGGCAGCACCGAG
ACGTGCGGGGCGATCACCATGAACCCGCCGGGAGGCGTCCGCGTCGACGGGTCGTGCGGG
CTTCCGGTGCCCGGCGTGGATGTCCGCATCGTCGACCCGGAGACCGGCCGCGACGTGCCG
GCCGGCGCCGAGGGCGAGGTGTGGGTGCGCGGCCCGAACGTGACGCCCGGCTACCACAAC
AAACCGGAGGCGACCGCTGCCGCGTTCCAGGACGGCTGGTATCGCACGGGTGACCTCGCC
CGCCGGGACGCCGCCGGGTACTTCACCATCAGCGGCCGGATCAACGATCTGATCGTCCGG
GGCGGGGAGAACGTCCATCCGGAGGAGATCGAGGCGGTTATCCGCGCCGTTCCGGGGATC
GCCGACGTCGGCGTGGCCGGCCGGCCGCACGAGGTGCTGGGCGAGGTGCCCGTCGCCTAC
GTGGTCGCCGGCCCGTCAGGCGTCGAGGCCGACGCCGTGATCGAACGGTGCCGCCGGGAG
CTGTCCACGTTCAAGCTTCCCGAAGAGGTCTATGAGGTGGCCGGCGTTCCGCGGACTGCG
TCGGGGAAGATCCAGCGGCGCCTGCTGGCCGATCAGCCCGCCGTACTGCGATGCACGGCA
AGCGGGGTGCACGACGGAGTGCTGCGCCTGGAGTGGGTGCCCGCGCCGGCCCGCGGCACC
GCCGGCCCCGCGCCGACGACCTGGGCGTTCGTCGGCTCGGCCGCGGCGGGCCTCGCCGCG
GCCGCGGGCTCGTCCTCCTGCTACGCCGATCTGGCCACGGCCGGCGCCGCCGAGATGACG
GTGCTGCTGGCGCCCGACATGCCGGGCACCTGGGCGGAGCGCCGCGCCGGGATCGAACAG
CTGGTCGAGGAGCTGACGGCGTGGGCCGCGCGAGCCGGATCGGGGCAGCTGGTCCTCGTG
ACCCGGCGGGCAGTGGCGGTCTCGGCGCGGGACGATCCGGCCGATCCGGCTCAGGCCATG
CTGGCGGCCGCGGCGGGCCAGGTGCTGGGGAGCCGGGCCATCCTGGTCGACCTGGACGGC
GCATCGCCGGTCGACGCCGTGCCGTACGCGACCCTGCGTGACGAGCCGCGCTGGGCTCTG
CGGTCCGGCGAGCTCCTGGTGGCCCGGCTGACCCGTCAGGCGGTCCCGGACCGGTCGATA
CGCGATTCATGGTCCGGCTCCGACGGCGTCGTGGTGCTGACGGGTGCGCACACCGTGCGC
GGCGCCGCGGTGGCACGCCACCTGGCCTCCGTCCTCCCGAACGAGCGGCTGCTGCTGATC
GCCGGCGAGGACGCCGCCGAGTGGGAGCCGGCGATCGCGGCGGGCCCCGCCGCGGTGATC
CTCGCCGACGGCGACCCCGAGCTGGCCGCGAAGCTGAGCACCGCAGCGGACTTTGCCCGG
ACCTCGTTCATCACGATCGCGGACTCGTACGAGGTCACCGGGGCGGCAGACCCCGGCCGG
GCGGTCTCGGCGGCCATGATGGGCGCCGTGGTCCACGACCGCCGACGGCGCGGTCTGGCC
GGCTCGGTCCTGACCTGGTGCGAACCGGCCGGGGACACGCCGGGTTCGCCCTGGTTGCGT
GACGGGCTGTTCGCGCTCGACACGCTCCTTGCCACATCGGACCCGACGCCGTTGTTCGCC
CTGCGTGCGCGCCGGCCGGAGCCGGGGGCCGGCGTGCCCGCCCTGCTGCGTGCCCTCTTC
CCAAAGCCGCTCGTCCAGGACGGACTCGAGGACACCTCCTCCGCTCTGCGGGCAGAACTG
GCCCGCCTGGACCGGCACGCGCAGCACACGCTGCTCCAGGCCCTGGTCCGGGACGAGTCC
GCGGCGTCGCTCAACCAGCCCGACACCGTCGCCGTCCCGGTGGACCGGGCGTTCCGGGAC
CTCGGGTACAACTCGGTCGCACTCATCGAACTGCGTACCCGGCTGATGGCCAGGACCGGT
CTGAAGCTGCCGACCGCAGCGGTGTTCGACCATCCGACGCCGCGTGCGCTGGCCGCGTAC
CTGCACGCCGAACTGGTCGGTGACACCGCTGCCGACACCGCCGCCGGCAAACCGTCCGAC
CCCGACGAGCCGGTGGTGATCGTCGGGATGGCCTGCCGCCTGCCGGGCGGGATCACCGGG
CCGGACGAGCTGTGGCGTCTGGTCAGCGAGGGCCGGGATGCGTCAGGGCCGTTTCCGGCC
GACCGCGGCTGGGACCTGGAGCGGCTGTTCGACGCCGACCCGGGCCGGGCCGGGACCTCG
TATGTCGACCGGGGCGGGTTCCTCGACGGCGCGGCCGAGTTCGACGCCGACTTCTTCGGG
ATCTCCCCGCGCGAGGCACTCGCGATGGACCCGCAGCAGCGGCTGCTGCTGGAGACCGCG
TGGGAGGCGCTGGAGCACGCGGGAATCGACGCGTCGTCGTTGCGGGGCACCGACGCCGGG
GTCTTCGCCGGCCTGATGGAGCAGGGCTACGGCACCGGCGGTCCGGTGCCCGAGGAGCTC
GAGGCCTTCCAGACGACCGGAACCGCGGGCAGCGTCGCGTCGGGCCGGATCTCCTATCTG
CTCGGCCTGCGGGGACCGGCGATCACCGTGGACACGGCGTGCTCGTCCTCACTGGTCGCG
CTGCATCTGGCCGCCCACGCGCTGCGCCGCGGCGAGTGCTCCCTGGCCTTGGCCGGTGGT
GCCACGGTGATGGCCACCCCCCAGTCATTCGTCGAGTTCTCCCGGCAGCGCGCACTGGCC
GCTGACGGCCGGTGCAAATCCTATGCCTCGGCCGCCGACGGCACCGCGTGGGCAGAGGGG
GCCGGGATGCTGGTCCTGGAACGCCTCGGCGATGCCCGCCGCAACGGGCACCGGATCCTG
GCGGTGCTGCGCGGCTCCTCGGTGAACTCCGACGGCGCGTCGAACGGCCTCACCGCACCG
AGTGGTGAGGCGCAGCAACGGGTCATCCGGCAGGCACTGACGTCGGCCGGCCTGGGCCCG
GCGGACATCGACGCGGTGGAGGGGCACGGGACCGGGACCGTTCTGGGCGACCCGATCGAG
GCCGAGGCGTTGCTGGCCACCTACGGCCGGGACCGCGACGCCGGCCAGCCCCTGTGGCTG
GGATCGCTCAAATCGAACGTCGGCCACACCCAGGCCGCGGCCGGTGTCGCCGGCGTGATC
AAGATGATCCAGGCGCTTCGGCACGGTGTGCTGCCGGCGACACTCAACGTGGACGAGCCC
ACCTCACGCGTCGACTGGTCGTCGGGCGCGGTCGCGCTGCTGACGGAGAACAGGACCTGG
CCGGACCTCGGCCGTCCGCGTCGCGCGGGAATCTCCGGCTTCGGCCTGAGCGGAACGAAC
GCCCACGTGATCGTGGAGGAAGCACCCCGGCAGCCGGCCGTCCGGCGGCCGGCGCCACCG
GCGCCATCGGCGCCCCGGATCGTGCCTCTGGTCCTGTCGGCCCGCGGAACGCGGGCGCTC
AACCGGCTGGCGGACCGGCTCGTCCCGCTCATCCAGACCGCCGACGACTCGTCCTGGGCC
GACATCGCCGTGGCGCTGGCCACTCGACGAGCAACCATGGACGAGCGCGCGGTCGTACTG
GCCGGCTCCCGGGACGAGGCGCTGGCCGGCCTGCGCGCACTGGCCCGCGACGAGGCCAGT
CCCCTCGTGGTGCGCGACAGCGCCACCTCTGACGCGTCGTCCGGCGTCGTGCTGATCTTT
CCCGGGCAGGGCGCGCAGTGGGCCGGGATGGGCCGTGAGCTGCTCGATACCTCCCCCGCG
TTCGCCGGGGTCGTCCAGGAGTGCGCTCGGCTGCTGACCGAATGGGCGGACTGGTCGCTG
ATCGACGTGCTGCGCGGCGAGGCCGACCAGGAACTGCTGGAACGCGTCGACGTCATCCAG
GTGGCGAGCTTCGCGATGATGGCCGGGCTGGCCGCGCTGTGGGCGTCGGCGGGAGTCCGG
CCCGACGCGGTCGTGGGCCACTCGCAGGGTGAGATCGCCGCGGCATACGTCGCCGGGGCG
CTCTCCCTGCGCGACGCCATGCGGGTCGTCGTGGTCCGCAGCCAGGCGATCGCCGGGACC
CTGTCCGGCCGAGGAGGCATGGCGTCGGTGCGCCTCGACGCCTCCGCCGCGGCGGCCTGG
CTGGCGGAAAGGGACGGCCGGGTCCAGGTCGCCGCGGTGAACGGTCCGTCCTCGGTGGTG
CTCTCCGGTGACCCGGAGGCGCTGGCGGACGCGTTGAGCGAACTGGCCGAGTCCGGTGTC
GACGTACGGCGGGTGGCAGTCGACTACGCATCGCACAGCGACCAGGTCGAGGCCGTCCGG
GACCACCTGGCGGTGGCTCTGGCCGATCTGTCGCCCGCCGAGCCCACGGTGCCCTTCCAC
TCGACCGTGACCGAGGGCCGGGCCACCGGAGCGCTGGATGGCGGCTACTGGTACCGCAAC
CTGCGGCAACAGGTCCGTTTCGGTCCCACGATCGACCGCCTGCTGGAGCAGGGATTCGGC
GCGTTCGTCGAGGTCAGCCCCCACCCGGTGCTGGTCCAGCCGGTCGCTGAGGCCGTACAC
CACGCGCCCGCGGAAGCGGTCGTCGTCGGTTCGTTGCGCCGGGACGACGGCGGGCTCGGC
CGTCTGGTGCGATCGATGGCCGAGCTGTTCGTCCACGGCGTCCCGGTGGACTGGCGCGGC
CTGCTCGCCGACGACGTCGCCCGGGCCGCGTGGGTCGACCTGCCGACGTACCCGTTCGAG
CGCCAGCACTTCTGGCGCCGTCCGGTGCCGAGCGCCGGCGCGTCGGGCATGGGCCTGGAC
GACGCCGGCCACCCGCTGCTCGGTGCGCTCGTCCGGCTCCCCGAATCCGGTGGCCTGGTC
GCGACCGGCCGATGGTCCGCGTCGACGCTCGCGTGGGCGGCCGGCACCGAGGCCACCGAC
GGGTCCGACGGCGCCCCGGTCCGGGTACCGGAGGCGGTACTGGTGGAGCTGGCACTGTGG
GCAGGCGGAGAGGCCGACGCCCCGGTCGTCGAGGAGCTGACCATCGATCCGGCCGTCGTG
CTGCCCCGCCACGGCGGCCGCGACATACGGATCGTCGTGGGGCGGCCGGACGCTTCTCGC
CGGCGGCCGATCACGGTCTACTCGGGGCCGCCCGGAGCGGCCGTGACGCAGGAATGGACA
CGTCACGCCACCGGCACGGTGACACCGTCGGCCGGCCCACCCGGCGAGTTCTCCCTCGGC
CCCGCGGCCGAGGTCGCGCTCGATGACGCCGAGCAGCCCGACGCGCACCGCTACGGCATT
CATCCGGCTCTGCTCGACGCGGCGGTCCGCGCCGTTGTCTCCCCGGATGAGCGGCCGAAG
ACCTGGCGAGGCGTCCAGCTGCTGGCGTCCGGAGCCACCGGGATTGACGTCCGCTCGGCG
TCCGCGCCGACCCGTCCGGCCTGGCACATCGAGTTGACGGATCCGGCCGGGCAGCGGGTG
CTGGAGGTGGCTGAACTGGTCACCGAGCAGCGGGAGCAGGCGCAGCCCGCTGCTTCCTGC
GCGGGCCTCTTCGGCATCGACTGGGTCGACCACCCGCTACCGCCAAACTCCTCGGCGGCC
GGCTCTGCCGCCCGCTACATCACCGACGCCACCGACCTCGTCGGCGCGCCGGCGACTGTG
CTGATCCACGAGGTCGACATCTCCGCCGGGGACCCTCGGGGCGCGGTCGTCGCGGAACTC
GAACTGCTACAGGCGTTCCTCGCCCGGCCGGCCGGGGACGACGACCGACTGGTGATCGTC
ACGCCGGACGGGGAAGACGTCGTCACCAGCGCCGTGCGGGGCCTGGTGCGGTCGGCCCAG
TCGGAGCACCCGGGCCGGTTCGTCCTGGTGGCGGTGGACGACGCGGCCGGTGCGACGAAT
GCTGTGATCGCGGCGGCCCTGAGCGGCGAACCGCAGCTCCGGCTCCGGGACGGCCGGGCA
CAGGTGCCACGGCTCGACCGGCTGCACCCGGCCGCTTCGTCATCGCAGGCCCGACCGCTC
GATCCCGACGGCACCGTGCTGATCACCGGTGGTACGGGGACGCTCGCGGCGGTGACCGCG
CGGCACCTCGTCACCGAGCACGGTGTCCGCCATCTGATCCTGGCCGGCCGCCGCGGCCCC
GAGGCCGACGGCGCCGCGGCGCTGCACGACGAACTGGTCGGGCTCGGCGCGACCGTTCGG
CTCGTGGCCGCCGATGTGGCGGACCGCGAGCAGGTGGTGGCGCTGCTGGCCATGGTCGAA
GAGAACCACCCGCTGACCGCGGTCGTGCACACCGCCGGGGTACTGGACGACGCGGTCATC
ACCGAGCTCGACCCGCACCGCGTCGACGCCGTGTTCGGGGCGAAGGTGGATGCGGCCCGC
CACCTGGACGACCTCACCCGTGACCTCGGTCTCGCCGCCTTCGTTCTCTACTCCTCGGCC
GCGGGCGTGCTCGGCAACCCGGGGCAGGGAAACTACACCGCGGCCAACGCGGCACTGGAC
GCCGTGGCGCGGGAACGGCGGCACGCCGGCCTGGCGGCGGTCTCGATCGCCTGGGGTTAC
TGGGACCGGGCCAGTGGCCTGACCCGCCACCTCGGCCACGCCGACCACCGGCGGAACCGG
GACATCGGCATGGTCGCCCTGTCGGACGGCGACGGGATGGCGTTGCTCGACCAGTGCCTG
CGCGCCGGTGACGGCCCGGACGCGGCGCTCGTCGCCGCCGGGTTCGACCTGGCCGCGCTC
CGGGCGGCCGATCAGTCGCGGGTGCCGCCCGTCCTGCGGAGCCTGGTCCGGGGCAAACGG
CCGATCGGCCGGGCGACCACGAACGCACGGCCGGCTCCGGCGGCGCCGGCCTCCCCCTCC
GGCCGGATCAGCGCGCTGCCCCCGGCCGAGCAGTTCGACGCCCTGGTCGACCTGATCCGC
CGGCAGTGCTCCCTCGTCCTTGGCCACGCGAGCACCGACGCGATCCGGCTCGACCGGACG
TTCAAGGACTCGGGCTTCAACTCGCTCACCGCGCTCGAACTGCGCAACCGGCTCGCCACC
GCGACCGGGCTGTCCCTGCCCCCCTCGATGATCTTCGACTTTCCGCGACCGAGGTCGCTG
GCGGCCCACCTGCAGGCCAAGCTCCTCGACAACGCGCAGGCCCAGCCGGCCACCGCGCAG
CCGAGGGCAACGGCCGCGGAGCCGAGGGCAACGGCCGGGGAACCGATCGCCATCGTCGCG
ATGGCCTGCCGGTTCCCCGGTGGCGTACACAGCCCCGAAGACCTGTGGCGGATCCTGTCC
ACGGAGACAGACACGGTCACCGAGTTCCCCACCGACCGCGGATGGGACACCGCCCGGCTG
TTCGATCCCGACCCGGACCACCCCGGCACCACCTACGTCCGCCACGGCGCCTTCCTAGAC
GACCCGGGCGCCTTCGACGCCGGCTTCTTCGGTATCTCCCCGCAAGAGGCGTTGGCGAAG
GACCCGCAGCAGCGACTGGTCATGGAGACCTCGTGGGAGCTGCTCGAACGCGCAGGCATC
GACCCGCACAGCCTGCACGCCCAGGATGTGGGCGTCTTCGTCGGGGTGAACAGCCATGAC
TGGACCGTACGCACGCACCACGCGTCCGGCGTCGAGGGTTTCCGCCTGACCGGCAGCTCG
GGCAGCGTCCTGTCCGGAAGGGTGGCCTACCACCTGGGCCTGGAGGGCCCGGCGATCACC
GTGGACACCGCCTGCTCGTCCTCACTGGTAGCCCTGCACCTGGCCGTGCAGGCCATCCGC
AACGGCGAGTGCTCGATGGCGCTGGCGGGCGGCGTGATGGTGATGGGCACGGTGGAGACC
TTCATCGAGTTCTCCCGGCAGCCGGGGCTCGCCCCCGACGGCCGCTGCAAGGGGTTGGCC
GACACGGCCGGAGGCACCGGCTGGTCGGAGGGGGTCGGACTCCTTCTGGTCGAGCCGCTG
TCCCGGGCCCGTGAGCAGGGCCACCGGGTGCTGGCCCTGGTCCGCGGAACGGCGGTGAAC
TCCGACGGCGCCTCCAACGGCCTCACCGCGCCGAACGGCCCGTCGCAGCAGCGCGTGATC
CGCAAGGCGCTCGCCGCGGCCGGACTGCGGGGCGATGAGGTCGACTCCGTCGAGGGCCAC
GGCACCGGGACCACGCTCGGCGACCCGATCGAGGCGCAGGCGCTGCTGGAGACCTACGGC
CGGGACCGGCCGGCCGACCGACCGCTCTGGCTCGGGTCGGTGAAGTCGAACATCGGCCAC
ACCCAGGCCGCGGCCGGCGTCGCGGGCGTGATCAAGATGGTCATGGCGATGCGGCACGGC
GTCCTGCCGAGGACGCTGCACGTCGACCGGCCGTCGACCCACGTGGACTGGTCGGCGGGC
GCGGTCCGGCTGCTCACCGAACGCCGCGACTGGCCTCGTGCCGGACAGCCGCGCCGGGCT
GGTGTGTCGTCGTTCGGTATCGGCGGGACCAATGCTCACGTCATCCTGGAAGAGGCCGGT
GAGGAGAGCGCCCAGCATGACGCCGACGAGATGATGACCGGGGATCCCGCCCTGGACCGG
CCCTCGGACGAGGGCTGCGTGCCGGTTCCGGTCTCCGGTCGCACCCCCGCCGCGCTGCGG
GCCCAGGCCGGCCGGCTGGCGGACTTCGTGGCATCGCGGCCGGAGCTGGAGCCTGCCGAC
ATCGCCCTCGCGCTGGCCACCGGCCGGGCGCAGCTCGACCGGCGGGCGGTCCTGGTGGTG
CGGCATCGTGACGAGTTGGTCGGCGATCTGCGCCGTCTGGCCAAGGGTGCCGCTCCGGCC
ATCGGCGGCGCCCCGGTCGACGGCAAGCTGGCCTTCCTGTTCACCGGACAGGGCAGCCAG
TGGCCCGGCATGGGCCGGGAACTCGCGGCCAGGTTCCCGGTCTTCGCCGAGGCGTTCACC
GACGCCTGCCGCGCCGTCGAGACCCACTGGGCCGGTCATGCCACGGCGCCGCTGTCCGAC
GTCGTCTTCGCCGCCCCGGGCGCGCAGGACGGCCGGCTGATCGATCAGACGATCTACACC
CAGGCCGGCCTGTTCGCTCTGGAAACCGCCCTGTATCGGCTCTACGAGTCATGGGGACTG
CGCCCGGACCTGGTCGTCGGCCACTCCATCGGAGAGATCACCGCCGCGCACGTCAGCGGG
GTGCTCGACCTGGCCGACGCGGGAAAGCTGGTCGCCACGCGGGCCCGGCTGATGCAGGCG
CTACCGGCCGGCGGAGCCATGGTCGCTGTCCAGGCGAGCGAGGCCGAAGTGGCACCGGAA
CTGGTGGAGGCACCCGACGTCGTCCACGTCGCGGCGGTCAACAGCGCCCGGTCGCTGGTG
CTGTCCGGTGCCGAGGCAGCCGTGCTGTGGGTGGCGGGTGAGCTCGCCCGCTTCGGCCAC
AAGACCCGCCGGCTGCCGGTGAGTCATGCCTTCCACTCACCGTTGATGCGACCGATGCTC
GCCGAGTTCCGGGAGGTCGCCGAGTCGCTGACCTACCGGCCGGGCCGGGTGCCCGTCGTG
TCCACGGTCACCGGCCGGCCGGACACCGGACAGCGATTGGCCACCGCCGACTACTGGGTC
GAGCAGGTGCAGGCCACGGTCCGGTTCGGGGATGCGGTCAGATCGCTGCGCGATCAGGGC
GTGACGACCTTCCTGGAGCTCGGGCCGGGGGGTGCGCTCACCACGATGGCGTCGGAGAGC
CTGGAGTTGTCGGCGGCGTCCTGCATCGCGACGCTGCGCGCGGACGGCGCAGAAGTCGCC
GACGTGATGGCGGCCCTGGGCGAACTGCACGTCCGCGGCGTGGCCCTGGACTGGCCCGCG
GTCGTCGGCCGGCCGGCCGGATCCGCCTCGGCCCTGGCCACCGAACTGCCCACCTACGCG
TTCCAGAGCCGGCGGTACTGGCTCGACCAGGCGGCGCTCGGGGACGCGGTCCCCGCGGCC
GAGGCCGGGCCGGCCGACGGCCGGGAGGACGTTCAGGAGAGCGTCGCCGCCCGGTTGGCG
AACCGGTCCCGGAACGAACGCCGGCGGGCCGCGCTCGAGGTGGTCCGGGAGTCGGTGGCG
GTCGTGCTGGGCTATCAGCCGGCCGATCTCGACGCGATGGACGACGACCAGTCGTTCAGG
AGCCTGGGCTTCGATTCGCTGGGCGGAGTACGGCTGCGTAACCGGCTGCGTGACCTCACG
GGCGTCGACCTGCCCGTGACGGCGGTCTTCGACCACCCGACGCCGAAGGTCCTCGCGGCG
CACGTGGCCGACGAGGTGGCGGGCGAGGTGGCCGAGGTGACCGATCGGGCTCCGGTGTCC
CCGGCCGACCCCGACGAGCCCATCGCGATCATCGGCATGGGCGTGCGGCTGCCCGGCGGC
GTGGAGAGCCCGGACGACCTGTGGCGCCTGGTCATCGAGCGACGCGACGCGATCTCCGGC
TTCCCCACCGACCGGGGCTGGGACACCGACGGGCTCTACCACCCGGATCCCGCGCACCCC
GGGACCACCTACACCCGATCGGGCGGCTTCCTCTACGACGCCGCCCAGTTCGACCCTGGG
CTGTTCGGGATCTCCCCGCGCGAGGCACTGGCCATGGACCCGCAGCAACGGCTGCTCCTG
GAGGCGTCCTGGGAGGCCATGGAGCGGGCCGGCATCGACCCGCTGTCCGCCCGGGGCGAA
GAGATCGGTGTGTTCACCGGGATCGTGCACCACGACTACGCGACCAGGCTGAACCGGATT
CCCGAGGAGGTCCGTGGATACGTCATGACCGGCACGTCGCCGAGCGTCGCCTCGGGCCGG
ATCGCGTACGTCTTCGGCTTCGAGGGGCCGGCGGTGACCCTGGACACCGCGTGCTCGTCC
TCGCTGGTCGCCATCCACCAGGCCGCGCAGGCCCTCCGGCACGGCGAGTGCACGATGGCG
CTGGCGGGCGGGGCCACGGTGATGGCCAGCCCGGAGGCGTTCGTCGAGTTCTCCAGCCAG
CGCGGGCTCTCCGCCGACGGCCGGTGCAAGACGTTCTCGTCCACCGCGGACGGCACCGGC
TGGTCCGAGGGCGCCGGCGTCGTGCTTCTGGAGCGGCTCTCGGTGGCCCGGCAGCGCGGT
CACCGTGTGCTGGCCACGATTCGCGGGTCGGCGGTCAACTCCGACGGTGTGTCCAACGGG
CTGACCGCGCCGAACGGGGCGGCCCAGCAAAGGGTGATCCGGCGGGCACTCGCCTCGGCC
GGCATAGGTCCCGCCGACGTCGACGCGGTGGAGGCCCACGGGACCGGCACGGTGCTCGGC
GACCCGATCGAGGCGGGGGCACTGCTTGCCACGTACGGGGAGGCCCGGGATGCAGGGCAG
CCGCTGTGGCTCGGGTCCTTGAAATCCAACATCGGGCACACCCAGGCGGCCGCCGGTGTG
GCCGGCGTGATCAAGATGGTCGAGGCGCTGCGGCACCGTGTGCTGCCGCCGACCATCCAT
GTCGAGGAACCGACCGGACAGGTGGACTGGTCGGCAGGCACCCTCGCCCTGCTGACCGAG
CCGCGGGAGTGGCCGCGGACCGGCCGGCCCCGGCGGGCCGGCGTCTCCGCGTTCGGTGCC
AGCGGAACCAACGCGCACCTGATCCTCGAGCAGGCCCCGGCCGAGGAGGCCGCGACCGCC
CCCTCCGAGGACCCGGAGCCGTTGAAGGCGCTCGCGACGAACATCGTGCCGCTGGTGGTG
TCCGCCAAGAGCGCCCGGTCACTCGGCGCGCAGGCCACCCGGCTGCGAATGTTCCTGGAC
AGCCCCGCTCAGAGCGGCGAACCGACGCTGCCGGCGGTGGCGGACGCGCTGGTGCGTGGC
CGGGCGACGCTGCCGGAGCGAGCGGTCGTGCTGGCCGGCTCGCGGGACGAGGCGCTGACC
GGCCTCGCCGCGCTCGCCCGCGGCGAGGTGACGCCCAACCTGACCACGGGCAGCGTGACC
GGCTCCGGAGCACCCGGCCGACTGGTTCTCGTCTTCCCGGGCCAGGGAGCGCAATGGGCG
GGCATGGGGCGCACGCTGCTGGAGGGTTCCGCGGTCTTCCGGGCACGGATCGACGAGTGC
GCGCGGGCACTGCGACCCTGGGTGGACTGGTCGCTGACCGATGTCCTGCGCGGCGAGGCG
GACGAACAGACCCTCGGCCGCGTGGACGTCGTCCAGCCGGCGAGTTTCGCGATGATGGTC
GGCCTGGCCGCGCTCTGGGAGTCGCTGGGCGTGCGGCCCGACGCGGTGGTGGGCCATTCC
CAGGGGGAGATCGCGGCGGCGTGCGTCGCCGGTGTCCTGTCGCTGGCCGACGCCGCCCGC
ATCGTCGCGGTGCGCAGCCAGGCGATCGCGACCACACTGTCCGGCCGGGGCGGCATGGCG
TCGGTGATGCTCGGCGCGGACGAGGCCAACGACCGGCTGGCGGCCTGGGCCGGCCGGCTC
GAGGTGGCCGTGGTGAACAGCCCGTCCTCGGTGGTCGTCGCCGGTGACGAGGAGTCGTTG
GCGGAGGCACTGGAGTCCTTGACCCGGGACGGCGTACGGACCCGCCCGGTGCCGGTGGAC
TACGCCTCGCACACCCACCACGTCGAACGCATCAGGGACGTCCTGGCGACGGCGCTGAGC
GGGATCGACACCCGGCCTCCGCGCATACCGTTCTACTCGACGGTGACCGGTGGCTGGGTC
GATGACGGCCATTGCCTCGACGCGGACTACTGGTACCAGAACCTGCGGCGGCCAGTCGGC
TTCGGCCCGGCCGTGGCGAGTCTGCTGACCGAGGGCTTCGGAGTGTTCCTGGAAGTCAGC
GGGCATCCGGTCCTGGTGCAGCCGATGACCGACGTCATCGACGACCCGGACGGCCCGCTG
CGCGGCGCCGGGCGACCGATCGTCTCCGGCACGCTGCGCCGGGGCGCCGACGGTCCGGCA
TCCTTCCTCGCCGCGGCCGCGCGGCTCTTCGCCGGCGGTGTGCCGCTCGACTGGAGCGCG
ACACTGCCGGCGCAACCGTCCCGGCACACGGTCGAGTTGCCCACCTACGCCTTCGACCGC
CAGCACTTCTGGCTGGCCGAAGCCGCCGCGGACAACGACCAACCGGCCGGCCGGGCGACG
GATGCCGGCTTCTGGAACGCCATCGACGACGCCGATCCGGCGGCTCTGGCCACCATGCTC
AAACTGTCCGCCCACCAGAGCCAGGCCCTGGACGCGGTACTGCCGGCCCTCGCCGACTGG
CGCAAGGCACGCGAGAACTGGTCGGTCTCGGAGCGACTCCGGTACGCGATCGGCTGGGCC
TCGCCCCCGCGCGAGGCCCTGGGCGTGCCCGCCGGCCGCTGGCTCGTCATCACGCCGGAG
ACCGCGGACGTCTTGCGCGACGGTCTGCTCGACCAGCTGCGGGCCAACGGCCTCGACGTC
GTGCCCTGCGCGGTGGAAACCGGCCTGTCCAGGGACGAACTCGCCCGCAGGCTGGGCGGT
TTCCTCACCGACGACGGGATCGCCGGCATCCTGTCCCTGCTCGCGCTCCCGGACCGGATG
GAGAGCCACCGGCCGGATGCCGCGGCGCTCACCACCTCCACCCTGACCCTGATCCAGGCG
CTCGCCGCGGCCGGTGCCACCGCGCCCCTGTGGTGCCTGACCCAGGGCGCGGTCAGCGTC
GGTGTCCGCAGCGCCGTCGCCGGGCCGGCCCACGTGGCCCAGGCGGCGGTGTGGGGACTC
GGCCGGGCGGCCGCGCTCGAACGCCTGAACCAGTGGGGCGGTTTGATCGATCTGCCCGCC
GAACCGGACGGCCGTGCCGTCCGGCACCTGCTCGGCGTGCTGACCGGCGTGTCCGGTGAG
GACCAGGTGGCGATCCGACGGACCGGCGTCCATGTCCGGCGCCTGCGGCGTGCCCCCCAT
CCGGCCGGGACCGGGGGCGAACGGCAGTGGCGGCCGCACGGGACGGTCCTGGTCACCGGC
GGTGCGGAGGGCCTCGGACGCTACGCCTCGCTGTGGCTGGCGCGGGCAGGCGCCGAGCGA
TTGGTGGTCACGACCAGCGGGCGCGAGCCCGGCGAGCGGGTCGAGTCACTGCGCGACGAG
GTGGCGAAGCTGGACAAGGGCACCGTCGTCGAGTCGTGCGCGGACGCCGACCGGGACGCG
CTCGCCGGCCTGGTCCACGCTCCGGGCCGCCCGTTGACCGCGGTCGTCCACGCCGCCGAC
CTGTCCCTGACCAGCTTGATCGACGAGACCGGTGACGCGGAGGTCACGGAGGTCTTCCGG
GCCAAGGTGAACACCGCGGTCTGGCTCAGCGAGCTGACGGCGGACCTTCCGCTCGACGCG
TTCATCGTCTTCTCCTCGATCGCCGGCATCTGGGGCGGTGGCGGTCAAGCCGCCTACGGC
GCGGCGAACGCGGTCCTCGACGCGCTGGTGCGGCGTCTGCGGGCGGACGGCGTCCCGGCG
CAGGCGATCGCCTGGGGTGCGCTGACCGCGGGCGGCGCGGGAATGGACGAGGAGACGCTG
GCCCAGCTCCGGCGGCGCGGCGTCATCCCGATGACACCCGACACGGCGACAGCCGCGCTG
GACCAGGCGGTCCAGGCCGCCACGGAATCGGTGGTCATCGCCGACATGGACTGGAGCGCC
TTCATCGTGCCGTTCACGTCGGCCCGCACCAGCCCGCTCTTCGACGACCTGCCGGAGGCC
GCGGCGGCGATCGAGGCGGCACAGCCCTCCGACGACTTTGCCGAGAGCGCGTCGTCGCTG
ATGACGTCGCTGCGCGCGGTCGGGGAGGCCGAACAGGACCGGATCCTGCTCCGGCTGGTG
CGCAGCCAGGCGTCCATGGTCCTCGGTCACGGTGGTGCCGACGGGATCGGCGCGGCCCAG
GCGTTCCAGGAGGCCGGTTTCGACTCGCTGGCGGCCGTCAACTTCCGCAACAGCCTGATC
ACCGCCACCGGGCTGAGGCCGCCGGCCACACTGATCTTCGACTGTCCGACCCCGCAGGCG
GTGGTCGCATACCTGCGCTCGGAACTGCTCGAGGCCGAGGACGACGCGGATGTCCGCGGG
GAGGACGTACGGCGGATCCTGGCCTCGGTGCCCTACCAGCGCCTCAAGGAGGCCGGTGTC
CTGGAGACGCTGCTCGGCCTGGCCGACGCCGAGGCGGGCGGGGCCAGGGCATCGGACGAG
GGCCCCGGGCCGGAGGCGGCCGCCGACGAGCTCATCGACGTCATGGACGTCGACAGTCTG
ATCAAGCGGGCGCTCGGCTCCGGCAGCTGA
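As a quick consistency check between the two FASTA records, the first and last codons of the nucleotide sequence can be translated and compared with the ends of the protein sequence. This is a minimal Python sketch, assuming the standard codon assignments (the compact `TCAG`-ordered table below is the usual NCBI encoding of the standard genetic code); the function name `translate` is illustrative only.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (first base slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nt):
    """Translate an in-frame nucleotide string; '*' marks a stop codon."""
    nt = nt.upper()
    if len(nt) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[nt[i:i + 3]] for i in range(0, len(nt), 3))


# First 21 nt and last 15 nt of the CDS as listed in this record.
print(translate("ATGCGTGATTCCGTGCGCGCC"))  # MRDSVRA
print(translate("GGCTCCGGCAGCTGA"))        # GSGS*
```

The translated fragments agree with the start (`MRDSVRA...`) and end (`...GSGS`) of the protein record, and the CDS terminates in a `TGA` stop codon.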
Domain positions in protein (aa):
[0] A 38..445
[0] PCP 908..982
[1] KS 1002..1378
[1] AT 1537..1855
[1] methylmalonyl-CoA 1729..1733
[1] dh 1904..2085
[1] KR 2304..2484
[1] ACP 2595..2665
[2] KS 2694..3069
[2] AT 3232..3555
[2] malonyl-CoA 3429..3433
[2] ACP 3631..3703
[3] KS 3725..4102
[3] AT 4269..4594
[3] methylmalonyl-CoA 4461..4465
[3] kr 4914..5091
[3] ACP 5196..5266
Domain positions in CDS (nt):
[0] A 112..1335
[0] PCP 2722..2946
[1] KS 3004..4134
[1] AT 4609..5565
[1] methylmalonyl-CoA 5185..5199
[1] dh 5710..6255
[1] KR 6910..7452
[1] ACP 7783..7995
[2] KS 8080..9207
[2] AT 9694..10665
[2] malonyl-CoA 10285..10299
[2] ACP 10891..11109
[3] KS 11173..12306
[3] AT 12805..13782
[3] methylmalonyl-CoA 13381..13395
[3] kr 14740..15273
[3] ACP 15586..15798
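The two coordinate lists above describe the same domains, first in amino-acid positions and then in nucleotide positions within the CDS. The relationship appears to be the usual codon mapping: residue n spans nucleotides 3n-2 to 3n. A small Python sketch (the function name `aa_to_nt` is hypothetical) reproduces the listed values:

```python
def aa_to_nt(aa_start, aa_end):
    """Map a 1-based inclusive residue range to CDS nucleotide coordinates.

    Residue n occupies nucleotides 3n-2 .. 3n, so the range starts at the
    first base of the first codon and ends at the last base of the last codon.
    """
    return 3 * aa_start - 2, 3 * aa_end


# Module 1 KS and AT domains, and the loading-module A domain, from this record.
print(aa_to_nt(1002, 1378))  # (3004, 4134)
print(aa_to_nt(1537, 1855))  # (4609, 5565)
print(aa_to_nt(38, 445))     # (112, 1335)
```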

Feature

BLASTP
Database:UniProtKB:2011_09
InterPro
Database:interpro:38.0
IPR000873 AMP-dependent synthetase/ligase (Domain)
 [38-445]  1.5e-112 PF00501
PF00501   AMP-binding
IPR001227 Acyl transferase domain (Domain)
 [1532-1660]  G3DSA:3.40.366.10 [1729-1843]  G3DSA:3.40.366.10 [3228-3361]  G3DSA:3.40.366.10 [3429-3543]  G3DSA:3.40.366.10 [4263-4392]  G3DSA:3.40.366.10 [4461-4585]  G3DSA:3.40.366.10
G3DSA:3.40.366.10   Ac_transferase_reg
IPR002198 Short-chain dehydrogenase/reductase SDR (Family)
 [2304-2471]  2.29999999999998e-56 PF00106
PF00106   adh_short
IPR006162 Phosphopantetheine attachment site (PTM)
 [940-955]  PS00012 [2623-2638]  PS00012 [3661-3676]  PS00012
PS00012   PHOSPHOPANTETHEINE
IPR009081 Acyl carrier protein-like (Domain)
 [908-982]  PS50075 [2595-2665]  PS50075 [3631-3703]  PS50075 [5196-5266]  PS50075
PS50075   ACP_DOMAIN
 [905-1019]  1.79999754022375e-24 SSF47336 [2588-2710]  1.70000295590054e-24 SSF47336 [3624-3742]  1.50000306971104e-26 SSF47336 [5188-5283]  4.80000830876748e-17 SSF47336
SSF47336   ACP_like
 [921-981]  1.1e-08 PF00550 [2599-2664]  1.9e-10 PF00550 [3634-3702]  8.40000000000001e-13 PF00550 [5205-5265]  1.7e-06 PF00550
PF00550   PP-binding
 [912-985]  3.79999999999991e-74 G3DSA:1.10.1200.10 [2593-2668]  3.79999999999991e-74 G3DSA:1.10.1200.10 [3633-3733]  7.5e-58 G3DSA:1.10.1200.10 [5196-5270]  3.79999999999991e-74 G3DSA:1.10.1200.10
G3DSA:1.10.1200.10   ACP_like
IPR013968 Polyketide synthase, KR (Domain)
 [4914-5089]  3.7e-43 PF08659
PF08659   KR
IPR014030 Beta-ketoacyl synthase, N-terminal (Domain)
 [1002-1252]  8.1e-94 PF00109 [2694-2943]  1.5e-87 PF00109 [3725-3975]  4.2e-96 PF00109
PF00109   ketoacyl-synt
IPR014031 Beta-ketoacyl synthase, C-terminal (Domain)
 [1260-1378]  2.1e-45 PF02801 [2951-3069]  9.8e-47 PF02801 [3984-4102]  7.4e-47 PF02801
PF02801   Ketoacyl-synt_C
IPR014043 Acyl transferase (Domain)
 [1537-1855]  1e-100 PF00698 [3232-3555]  3.40000000000004e-58 PF00698 [4269-4594]  6.7999999999999e-99 PF00698
PF00698   Acyl_transf_1
IPR016035 Acyl transferase/acyl hydrolase/lysophospholipase (Domain)
 [1535-1827]  1e-60 SSF52151 [3229-3539]  8.99992502689852e-69 SSF52151 [4267-4571]  1.3000049540733e-66 SSF52151
SSF52151   Acyl_Trfase/lysoPlipase
IPR016036 Malonyl-CoA ACP transacylase, ACP-binding (Domain)
 [1663-1728]  1.60000240695997e-17 SSF55048 [3363-3428]  1.20000117458134e-15 SSF55048 [4395-4460]  3.69999438558106e-18 SSF55048
SSF55048   Malonyl_transacylase_ACP-bd
IPR016038 Thiolase-like, subgroup (Domain)
 [1002-1262]  G3DSA:3.40.47.10 [1264-1431]  G3DSA:3.40.47.10 [2696-2955]  G3DSA:3.40.47.10 [2956-3121]  G3DSA:3.40.47.10 [3734-3988]  G3DSA:3.40.47.10 [3989-4155]  G3DSA:3.40.47.10
G3DSA:3.40.47.10   Thiolase-like_subgr
IPR016039 Thiolase-like (Domain)
 [995-1374]  7.00003808422647e-103 SSF53901 [2686-3121]  5.60004908290974e-99 SSF53901 [3718-4100]  8.50003024809049e-103 SSF53901
SSF53901   Thiolase-like
IPR016040 NAD(P)-binding domain (Domain)
 [2304-2490]  2.4e-63 G3DSA:3.40.50.720 [4913-5110]  2.80000000000001e-59 G3DSA:3.40.50.720
G3DSA:3.40.50.720   NAD(P)-bd
IPR018201 Beta-ketoacyl synthase, active site (Active_site)
 [1165-1181]  PS00606 [2856-2872]  PS00606 [3889-3905]  PS00606
PS00606   B_KETOACYL_SYNTHASE
IPR020801 Polyketide synthase, acyl transferase domain (Domain)
 [1539-1836]  9.30000048475962e-111 SM00827 [3233-3536]  3.49999466863949e-108 SM00827 [4271-4576]  1.59998835313644e-124 SM00827
SM00827   PKS_AT
IPR020806 Polyketide synthase, phosphopantetheine-binding domain (Domain)
 [913-985]  4.19999771339581e-25 SM00823 [2596-2668]  8.49999291746323e-27 SM00823 [3632-3706]  4.19999771339581e-29 SM00823 [5197-5269]  4.40000933643891e-25 SM00823
SM00823   PKS_PP
IPR020807 Polyketide synthase, dehydratase domain (Domain)
 [1904-2085]  5.29999657550189e-28 SM00826
SM00826   PKS_DH
IPR020841 Polyketide synthase, beta-ketoacyl synthase domain (Domain)
 [1005-1430]  SM00825 [2696-3121]  SM00825 [3728-4154]  SM00825
SM00825   PKS_KS
IPR020842 Polyketide synthase/Fatty acid synthase, KR (Domain)
 [2304-2484]  5.00000909915354e-54 SM00822 [4914-5091]  4.80000830876748e-26 SM00822
SM00822   PKS_KR
IPR025110 Domain of unknown function DUF4009 (Domain)
 [484-511]  3.4e-06 PF13193
PF13193   DUF4009
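The InterPro hit lines above follow a regular pattern: a bracketed residue range, an optional E-value, and a member-database signature accession. The sketch below shows one way to parse such a line in Python. The regular expression and the function name `parse_hits` are assumptions based only on the lines in this record, not a documented export format.

```python
import re

# [start-end], optional E-value, then the signature accession.
HIT = re.compile(
    r"\[(\d+)-(\d+)\]\s+"
    r"(?:([0-9.]+(?:e[+-]?\d+)?)\s+)?"
    r"(\S+)"
)


def parse_hits(line):
    """Return a list of (start, end, evalue_or_None, signature) tuples."""
    hits = []
    for m in HIT.finditer(line):
        start, end, evalue, signature = m.groups()
        hits.append(
            (int(start), int(end), float(evalue) if evalue else None, signature)
        )
    return hits


kr_line = " [2304-2484]  5.00000909915354e-54 SM00822 [4914-5091]  4.80000830876748e-26 SM00822"
for start, end, evalue, signature in parse_hits(kr_line):
    print(signature, start, end, end - start + 1, evalue)
```

Lines without E-values, such as the PROSITE pattern hits, yield `None` in the third field.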
SignalP No significant hit
TMHMM No significant hit