LOCUS Utopia 15346 bp DNA linear 31-MAY-2025 DEFINITION Arthrobacter phage Utopia. ACCESSION VERSION KEYWORDS . SOURCE Arthrobacter phage Utopia ORGANISM Arthrobacter phage Utopia Viruses; dsDNA viruses, no RNA stage; Caudovirales; Siphoviridae. REFERENCE 1 (bases 1 to 15346) AUTHORS Choudry,H., Labib,S., Hafeez,A., Ali,R., Hussain,K., Khan,M., Perez,M.T., Pollenz,R.S., Garlena,R.A., Russell,D.A., Jacobs-Sera,D. and Hatfull,G.F. TITLE Direct Submission JOURNAL Submitted (31-MAY-2025) Molecular Biosciences, University of South Florida, 4202 E Fowler Ave ISA2015, Tampa, FL 33620, USA COMMENT Phage isolation, DNA preparation and annotation were performed by Benedictine University, Lisle, IL. Genome sequencing performed by Pittsburgh Bacteriophage Institute by Illumina sequencing, coverage is 25899x on December 27, 2024. Assembly performed using Consed 29.0 with Unicycler version 0.5.0 as of April, 2024. Supported by Howard Hughes Medical Institute's Science Education Alliance - Phage Hunters Advancing Genomics and Evolutionary Science (SEA-PHAGES) program, Chevy Chase, MD. FEATURES Location/Qualifiers source 1..15346 /organism="Arthrobacter phage Utopia" /mol_type="genomic DNA" /isolation_source="Soil, Oak Brook, IL" /lab_host="Arthrobacter globiformis B-2979" /country="USA" /lat_lon="41.83 N 87.98 W" /collection_date="01-Sep-2024" /collected_by="Ibrahim Muhammad" /identified_by="Ibrahim Muhammad" /note="complete genome" gene 76..1755 /gene="1" /locus_tag="SEA_UTOPIA_1" CDS 76..1755 /gene="1" /locus_tag="SEA_UTOPIA_1" /codon_start=1 /transl_table=11 /product="terminase" /translation="METLERVFWDHTRDRLVGPPGGPLTISKSRPRFMSEVPTDTDIT NAEIGIRMLGLTLFPQGRDVAAVMEAKSTEIDMLTGRPAPLYAHCTVQEPRRSSKTTA IQAVYLGRCETIPGHRVVQTAQDGTRASGFFMNMVRMLERVTPEPADRNWKVYKSTGR EYLEWTNGSRWWVVPPDPAAFRGEAADDIWFDEGGEFKPETAKELRNGVLALMDTRSD AQITISGTPGIVRAGMFWEDLEAARKDPNALGIVDYSAADNDLLVLPDGTANEDLWWL RHPGLACGLTTIKKMRERWEKLGPVGFGQEYLCIWSSDNTISALDPVQWVDTTTEPVP FAGQDFDLTFDCHIQGAFSAIVATWLDAAGEVHAQVMDYRPGIDWVAGELARANRAHP KVKINYDAIGNNSAVAMSLQRIPGFKINALRPVSMRDAAAGAALVSSGLANGTWHHAL SPALDEAAKNVTWRYSGESRLFGRKSANVCVAAVVAASIGTAMTTGRRSRRKGSIPAP ILGNGIQDVPVPTPDDPFPDGFGTPDDPVRNETVPGTPANKLRQLVAAGRI" gene 2063..3193 /gene="2" /locus_tag="SEA_UTOPIA_2" CDS 2063..3193 /gene="2" /locus_tag="SEA_UTOPIA_2" /codon_start=1 /transl_table=11 /product="portal protein" /translation="MGLTDRISNILGYPSRPGLFDGVAVRSPESALLESIVFFGGGSI QMPVTSRTALKVPGIARAIQLYTAVCAQLPLKASTESADSQWLNWSTGTISPALRNAL TVQDIIFHGGSCYAVARNDAGHVTNGIRIPIGFWTVDAAGRIQVNTALTDPDATTGPH RFVDVDQSQILYIPSLMPCGLLDMADDGIQTYLQIGRTIKDRASNPTPLIALVVKDDA IADPDELKQAQSNWHDARTSPNGAVAIVPAGVGIETPGADRDDSAMLIGARNASRLDV ANWLNLPAAMLDGNSGTSDQYSNTLQNQNEFLALSVSLPIRAIEARLSQDDVTPAGVT VSFDTAVFDSMPEPATGNTGAAVEPSPAGAAVEPEEKREINA" gene 3190..4737 /gene="3" /locus_tag="SEA_UTOPIA_3" CDS 3190..4737 /gene="3" /locus_tag="SEA_UTOPIA_3" /codon_start=1 /transl_table=11 /product="major capsid and protease fusion protein" /translation="MTIQVVGDLLTANDDTLTLSYTLLTFGEPGRTNKGTITVDPGVL TIPAGALPVNDEHDPSVSVGYMLAAEAPERITTDVNFYRTPAGEAAYKAAKAGDKRGI SMEVLAPVIRAGKLLAGRLAGSGTVKKPAFPSSLLTAAEDTGDVSPELQTALDDLAAA AAAVQAAATNPPADPADPPADANTNNPPTDKVTASMTNPPAVPPLLAGNTGSGNTAPD MSFAALATALAGGKKSPELLAAFATVTEANVFDVATPPTYVGEITAQSTYKRRYADLV TNNPLTGSKVQGWRFVPGKTPVVRPYAGDLAEIPSNEVVLEEVSFDAGRFAGGNKVDR KHYDFPTPGFWAGYLRESTGDYDKQIDLLVRDTLLGTTNEILAGASIAGVAEGWSKLV DGALAVWDFGTPDWALVGVDLYREMALTSEKDRLAFLNASLGLDEAAAAGFRIRPVSG PGTDGTVTVGTKGGVELQELPGLIRVSAVDVSHAGVDEGVYGYGGVFTRDARAIRKVV DVLTPEG" gene 4741..5064 /gene="4" /locus_tag="SEA_UTOPIA_4" CDS 4741..5064 /gene="4" /locus_tag="SEA_UTOPIA_4" /codon_start=1 /transl_table=11 /product="head-to-tail adaptor" /translation="MLVGWVDYDPAKELWADSILAGEDKVRELLENAYESCVIYAPKL PDGAPVPQRYKDAQVLQARAIWQMQRQGPGDQFGADGISIAIYPLDARIRQMLRPKQV LGGML" gene 5061..5441 /gene="5" /locus_tag="SEA_UTOPIA_5" CDS 5061..5441 /gene="5" /locus_tag="SEA_UTOPIA_5" /codon_start=1 /transl_table=11 /product="tail terminator" /translation="MSLRSELSAVLKTELGDGFAVLASKRAIDNTSIPVVMVHRKAVT PGPERKRLATDVEVLVLVAEAYGDGAEDAADEALDDVLRVLERIEDPIVWSRAERDNF EGGFVGYVVTLEATTDNYLLPGRE" gene 5448..5840 /gene="6" /locus_tag="SEA_UTOPIA_6" CDS 5448..5840 /gene="6" /locus_tag="SEA_UTOPIA_6" /codon_start=1 /transl_table=11 /product="major tail protein" /translation="MPALFLKNATITIEGVDASEDVDNVQFTPTTTPATFTPISGKTQ SDAGATSWVCTMNIAQNYTAGSLFMLMFAAGAPLDVVLKPRGTAVGGPTISAEIVPVP ATIGGGSGALTASVTCQVNGKPTIAAGA" gene 5844..6371 /gene="7" /locus_tag="SEA_UTOPIA_7" CDS 5844..6371 /gene="7" /locus_tag="SEA_UTOPIA_7" /codon_start=1 /transl_table=11 /product="Hypothetical Protein" /translation="MVVRVQPSVEALDAYKAVVLAMKVIDKPIRQAINVDARTTLSPV WKKLVTEHAGTLLDQRVLNTGTRIAAGNPPAAIAGASKRRLSGGLVPAEYNRMVEFGV DPKDRNVPSEYSRKTKSGKTSTVRRRTRIGLPPARSKGRVVYPAFADFAPRAIAYWVQ SVVRITHETLEKGTR" gene 6373..8088 /gene="8" /locus_tag="SEA_UTOPIA_8" CDS 6373..8088 /gene="8" /locus_tag="SEA_UTOPIA_8" /codon_start=1 /transl_table=11 /product="tape measure protein" /translation="MAAGINLPFFADVRNFLKGTDSIADALDDVAGSLDDLDTSSKAS ADTAGDAIADGIQDGAKDASKAVDKLERDVSDNMTGIAKDAKTAGNKAGDALKDGLKD GGTAADKLEQKVSDTFRGIAADAKAAGDKVGKSMKDGSDRAGEGLDDMKSEAASTAKE TAASFDGSADSIVGSFQEVAANAFAAFGPAGLAAGLAAAAGIGVAMTVLQGVADEAND TGDAVTDMATKIRDAGGDMSNVDLTEGMIDYGFAIQDTKEWFEFFQDSAETGFEQIKK KSEEAGIGWVEAFRGTKGTAEESRDALATVVEKLEAARDGATMWVDAASGMQGIDLAD QRKIDALEDLKKKFETNIETLDRAESANRDLAAAGIKTTEQIEAEKEAVDAANESLLA HRDALDAAAGAAIDADKAELDYVKTLAEGNADIKKNGETIDINTEKGRANRQTLLDMA GASNSLIAAQIEQGDSTASVTARTQQARDAFIRSAEAAGYTKDEAKKLADQYGLIPKN VATKVEARNVEKTKREIDGVAAPRDVPLNLVRGNESVTSWIAGLSGRTIPVNIAVRGG RGVTD" gene 8104..9495 /gene="9" /locus_tag="SEA_UTOPIA_9" CDS 8104..9495 /gene="9" /locus_tag="SEA_UTOPIA_9" /codon_start=1 /transl_table=11 /product="minor tail protein" /translation="MSSITAEVLPDSAAVRLSINAAAGIRSITRRDANGINPVRVTDG VLDVVPELAFAGTNLILNPTFEVGTSNWTAQRCTLSRYTWTASWFNYTPGPAGAYGMR LTADGVAGGTYAYAASFAVTPGDYAAGMAVGSSSSSQVYTSVWFYDAGGAVIGNHDSP LSAAGQSVYNLQTLVVAPVLVPANAVTARLLLRFSGNPPASSFSYWDRAMVVTASTSA GAATAIASYFDGSYSPSMEYKVAWTGTAHASTSTRMVPASPAVVYDYEAADGPIRYDV TDLDGRLESLDVTGFVLDAPWLFTPVIPGYSRKAVSVTGIDTEFEDRSTVHSGLLGRP DPVVVLRPLGLRSGTMELYAGTYADALEILSPLQRATVMMLRQPEHAGLDMYFAAAGG SPKIVSLVTAGGSTVWGVHVPYVEVKRPEGPIAGALGWTYADLAAAVPRYSDLRLTFA TYADMRLNQRITP" gene 9492..10610 /gene="10" /locus_tag="SEA_UTOPIA_10" CDS 9492..10610 /gene="10" /locus_tag="SEA_UTOPIA_10" /codon_start=1 /transl_table=11 /product="minor tail protein" /translation="MTITGLYQDRALDIVKQTHRQDGSAALHFAAGDSLDIVLRDPQI AFSEDWSPYMQVTADAVTPADPSVLARIDPRAGVDLEVRAGYVYDDGTSDVQPLAIGH LRSRRALLPEGTMPVTAASAEQLAQDAKWLNATTTKVFGGVLEALEYLTTYATGTNTA GTFQSTINPVHRPDLVSGVVLEQGADLWGPISAIALSAGLRVWADENNVWNLAPKTTL AGVTSAFLKQGPATTVSKVEDVLTRENWFNAAALTYTWKDSGGVEQTIVGTYAPTPAP GTEKGAGCRTFTDDRPGPISQYQANENARLTVNNLSTRGGSYAVESVAMYWLRPGMTV QIDLANGITVRHIIRKITFNVGAGSMSVVTREPSNLGE" gene 10614..11012 /gene="11" /locus_tag="SEA_UTOPIA_11" CDS 10614..11012 /gene="11" /locus_tag="SEA_UTOPIA_11" /codon_start=1 /transl_table=11 /product="hypothetical protein" /translation="MATTTKRSYRYPGQNGEPDVAGDIQRLAEDLDNDVAKLFDGLPA KIEFGSDSISIGAGAATITKVISLPAGFTAAPSVSLQNTANVAGRASLLGLYVTAKTA TQFTVKMQTSDNANAGTSYAISFDWIAVGN" gene 11012..11740 /gene="12" /locus_tag="SEA_UTOPIA_12" CDS 11012..11740 /gene="12" /locus_tag="SEA_UTOPIA_12" /codon_start=1 /transl_table=11 /product="endolysin" /translation="MVSFCRPARGRETQPFATIQLQDGLPHAGTDFGFYDAAGNACPE VFAAEAGTVLFAGDSRSLGWPNPWYFNPDFDRNDGRDSSAGNVLVIGHRDGVTTYSHL AGFNVAKGATVSRGQHVATIGNTGNANGKHLHFEWIPYPFDFGTATFGRVRPTFKKGL FMYLTEKQELEILAAAKLLNARAKYLDAPVSAVPKKAAAAVLDAPIPRKGGRTGTTTP RNVFAYSDANLDAADDMPEADPAA" gene 11737..11985 /gene="13" /locus_tag="SEA_UTOPIA_13" CDS 11737..11985 /gene="13" /locus_tag="SEA_UTOPIA_13" /codon_start=1 /transl_table=11 /product="membrane protein" /translation="MSQETTPRPAPVNPARFIPSPRLRAYLYGILVPAGALLVFRGII TAGELGLWLSLAGAVLAVSNGLALANTPKGSTDDTAGR" gene 11966..12277 /gene="14" /locus_tag="SEA_UTOPIA_14" CDS 11966..12277 /gene="14" /locus_tag="SEA_UTOPIA_14" /codon_start=1 /transl_table=11 /product="membrane protein" /translation="MTPPAADEITTGELGRRLDSFGTTLQDGFRELSKKIDDRPDWQD VRRIEAGLVERVTSESEARKIAQGIADRAILALEDGQKWATRLILGAVGLGVINLIWT R" gene 12409..12717 /gene="15" /locus_tag="SEA_UTOPIA_15" CDS 12409..12717 /gene="15" /locus_tag="SEA_UTOPIA_15" /codon_start=1 /transl_table=11 /product="helix-turn-helix DNA binding domain protein" /translation="MTRDGPEVCNDRRMTKKDKPPGAADEARIVVPSDLISTGEAAKI LGVDRATIVRRARTGKIPIVAQLDAKTGHGAYVFDRNEIKDSGEDPKNGMAHNGTSGD " gene 12698..12883 /gene="16" /locus_tag="SEA_UTOPIA_16" CDS 12698..12883 /gene="16" /locus_tag="SEA_UTOPIA_16" /codon_start=1 /transl_table=11 /product="MerR-like helix-turn-helix DNA binding domain protein" /translation="MAQAATRVGVSVRTVERYVDAGKLDAHKLPSGRRRVRIGDVDAL LRPVRRPAGDAAKVGSK" gene 12880..13512 /gene="17" /locus_tag="SEA_UTOPIA_17" CDS 12880..13512 /gene="17" /locus_tag="SEA_UTOPIA_17" /codon_start=1 /transl_table=11 /product="helix-turn-helix DNA binding domain protein" /translation="MSWEVLAWAMKKGRDYQLQPTTRHVMLTLANYADPEGNDIYPSL SRLELDTGLSERTIRRQIQHLMGCRLLDYGDQRVVEQNPRIRPDQRPKVYRFILEAAP AVVDNSPERPDTMSTRGYGFAPKVVDNSGHARTLSPERPDKRPVTVSNEPINQKLKPG AGFADEAADAVLDVLPVVSGSDFVEEIRRRRAEREAEKQKDALDHARSRG" gene 13493..14047 /gene="18" /locus_tag="SEA_UTOPIA_18" CDS 13493..14047 /gene="18" /locus_tag="SEA_UTOPIA_18" /codon_start=1 /transl_table=11 /product="Hypothetical Protein" /translation="MHDHAANDAVLPSQRITMNTYETRYEARSKRRGDLYPAAADQYV PAAVITDHVEAGGFAWPVSGDLTTIEHGVPVPKLRRRLKVVAAGRLRSIDPNGIASLA VLWNLDARDLNERVRSGEENPVAVDGDGHLRARASLIASTEQMNMAWAQARAECFATD AGADRRRDEARTRYREALDAARSV" gene 14050..14391 /gene="19" /locus_tag="SEA_UTOPIA_19" CDS 14050..14391 /gene="19" /locus_tag="SEA_UTOPIA_19" /codon_start=1 /transl_table=11 /product="HNH endonuclease" /translation="MEDMLFDIPGLQKEPEKPKGRRWGGNDSAKARAIVAPTLPRPCT RCGETVTADMKWHADHILEDVFGGTSTPDNLGPAHKHCNESAGGKIGAAMTNGFKQAQ DQRREVTVKWW" gene 14385..14588 /gene="20" /locus_tag="SEA_UTOPIA_20" CDS 14385..14588 /gene="20" /locus_tag="SEA_UTOPIA_20" /codon_start=1 /transl_table=11 /product="Hypothetical Protein" /translation="MVKRKTTERAVPGEVNEWVVLGFLNSCMEHSGRITAKDWNDAIA AGRDELADELARTNTNTTTKGTK" gene 14585..14977 /gene="21" /locus_tag="SEA_UTOPIA_21" CDS 14585..14977 /gene="21" /locus_tag="SEA_UTOPIA_21" /codon_start=1 /transl_table=11 /product="hypothetical protein" /translation="MRITGINRNDLEADVVGAALAGDRIVVLADTRELVRPIFDTIAD AFQDDDGKVMDGVTIRKQNGRERIELRNGSRITFGSVRRPDSLRGLVADRIYTPSDTK VEVLRILEAVTSTAAKPATLLYGIRVDA" gene 14974..15249 /gene="22" /locus_tag="SEA_UTOPIA_22" CDS 14974..15249 /gene="22" /locus_tag="SEA_UTOPIA_22" /codon_start=1 /transl_table=11 /product="membrane protein" /translation="MTVPVATGVAVLIIILAAVGAGAAMWWLVRLIQWLWERRPHGLA GRHDWIEWLTINGEVRRRLCLKCGRDERVVQGEEITAQSTGKVEPVD" ORIGIN 1 gcgccgtttc cgcgcgtaga cgtgatataa gatgcgaatt agtaggcttc aacaattcgc 61 gaagggatcg cacaaatgga aactctcgaa cgggtctttt gggatcacac ccgggaccgg 121 ctcgttggtc ctcccggcgg tccgctgaca atctcgaaaa gccggccgcg gtttatgtcc 181 gaagtcccga cagacacgga tattaccaac gccgaaatcg gaatccggat gctcggactc 241 acattgttcc cccagggccg ggacgtcgcc gccgtgatgg aagcgaagtc gacggaaatc 301 gacatgctca ccggccggcc ggcgccgctc tatgcacact gcacagtcca ggaaccgcgc 361 cggtcctcga agacgaccgc gatccaggcg gtctatctcg gccggtgcga gacgatcccc 421 ggacaccggg tcgtccagac ggcacaggac gggacccgcg cgtccgggtt ttttatgaac 481 atggtccgga tgctcgaacg cgtcacgccg gagccggccg atcgcaactg gaaagtctac 541 aaatcgaccg gccgcgaata cctcgaatgg actaacgggt cccgctggtg ggtcgtcccg 601 ccggacccgg ccgcgttccg cggcgaagcg gccgacgaca tttggtttga cgaaggcggg 661 gaattcaagc cggagaccgc gaaggaactc cgaaacggcg tcctcgcgct catggacacc 721 cggtcggatg cacagatcac gatctccggg acgccgggga tcgtccgggc cggcatgttt 781 tgggaagacc tcgaagccgc caggaaggac ccgaacgcgc tcgggatcgt cgattactcc 841 gcggccgata acgacttgct cgtcctcccg gacgggaccg ccaacgaaga cctctggtgg 901 ctccgtcacc ccgggttggc ttgtggtctg acgacgataa agaagatgcg cgaacgctgg 961 gaaaagctcg ggccggtcgg cttcgggcaa gagtacctgt gcatatggtc cagtgataac 1021 acgatctccg cgctcgaccc ggtccagtgg gtcgacacca caacggaacc ggtccccttc 1081 gccggccagg acttcgacct caccttcgat tgccatatcc agggagcttt ttccgcgatc 1141 gtcgcgacgt ggctcgacgc cgccggcgaa gtccacgcgc aggtgatgga ctaccgtccc 1201 gggattgatt gggtcgccgg cgaactcgcc agggcaaacc gggcacaccc gaaggtcaag 1261 atcaactatg acgcgatcgg gaacaactcc gcggtcgcca tgtcgttgca gcggatcccc 1321 ggtttcaaga tcaacgcgct ccggccggtc tccatgaggg acgccgcggc cggcgccgcg 1381 ctcgtgtcct ccgggctcgc taatggcaca tggcaccatg ccctctcgcc ggcgctcgac 1441 gaagcggcca aaaacgtgac atggcggtac tccggcgaat cccgtctctt cggccggaag 1501 tctgccaacg tgtgtgtcgc tgcggtcgtc gcggcgtcga tcggtacggc catgacgacc 1561 ggccggcggt cccggcggaa ggggtcgatc cccgcgccga tccttgggaa cgggatccag 1621 gacgtcccgg tcccgacccc ggacgacccg ttcccggacg gcttcgggac cccggacgac 1681 ccggtccgta acgagaccgt ccccgggacg ccggcgaaca agctccggca actggtcgcg 1741 gccggccgga tttgagtcct tcgggaaaac ctcgagaatt ttgtgggatc ccgctaagcc 1801 ggcgcgccag gttgcagctg taaaaaccgc ggaacggcgc ggatttgcca aacgccggct 1861 tgtgtggatc cgacaaagac ccggctaggc cggcccgggg accgcgacaa agtccgacaa 1921 acccggccat aacctcgaca attcgataaa gcgtttggag taacttcgcc ggcggtcccg 1981 aaaacggggt cgccggcgct ttttgtgtgg tgtcatttgg ttgttcgatt cccccaacga 2041 cccgaatgga gaataagccc ggatgggact taccgacaga atctctaaca ttctcggata 2101 tccgtcccgt ccgggcctat tcgacggggt cgcggtccgc tccccggaat cggctttgct 2161 cgaatccatt gttttcttcg gcggcggaag tatccaaatg ccggtcacgt cgcggaccgc 2221 gctaaaggtc cccgggatcg cccgggcaat tcagctttac accgccgtct gtgcacaact 2281 cccgttgaaa gcgtcgaccg aatccgcgga ctctcagtgg ctcaactggt cgaccgggac 2341 gatctcgccg gcgctccgga acgcgctcac ggtccaggac attattttcc acggcggctc 2401 gtgctacgcc gtcgccagga acgacgccgg ccatgtcaca aacgggatcc ggatcccgat 2461 cggtttttgg acggtcgacg ccgccggccg gatccaggtc aacaccgcgc tcaccgaccc 2521 ggacgcgacg accggtcctc accgcttcgt cgacgtcgac caatcacaga tcctctacat 2581 cccgtctctt atgccgtgtg ggttgctcga catggcagac gacgggatcc agacgtattt 2641 gcagatcggc cggacgatca aagaccgggc ttcgaacccg accccgttga tcgcgctcgt 2701 ggtcaaagac gacgcgatcg cggacccgga cgaactgaaa caagcgcaaa gcaattggca 2761 tgacgccagg acgtccccta acggcgcggt cgcgatcgtc ccggccggcg tcgggatcga 2821 aaccccggga gcggaccgcg acgattcggc gatgctgatc ggagcccgga acgcgtcccg 2881 gctcgacgtc gccaactggt tgaacctccc cgcggcgatg ctcgacggga actccggaac 2941 gtccgaccaa tactccaaca cgttgcagaa tcaaaacgaa tttctcgcgc tctcggtctc 3001 tttgccgatc cgggcgatcg aagctcgttt gagccaggac gacgtgacgc cggccggcgt 3061 gaccgtgtcc ttcgataccg ccgtctttga ctccatgccc gaacccgcga ccggcaacac 3121 cggcgcagcg gtcgaaccct ccccggccgg cgccgcggtc gaacccgaag agaaacgaga 3181 aatcaacgca tgacaatcca ggtagtcggc gatctgctca ccgccaacga cgacacgctc 3241 acgttgtcct atacgctgct gactttcggc gaacccggcc ggaccaacaa agggacgatc 3301 acggtcgacc ccggggtcct cacgatcccc gccggcgctc tcccggtcaa cgacgaacac 3361 gatccgtccg tctcggtcgg ctacatgctc gccgccgaag ctccggagcg gatcacaacg 3421 gacgtcaact tttaccggac cccggccggc gaagccgcat acaaagcggc gaaagccggc 3481 gacaaacgcg ggatttcgat ggaagtcctc gcgccggtga tccgggccgg gaagctgctc 3541 gccggccggc tcgccggatc cgggaccgtc aagaaacccg cgttcccgtc ctcgctgctc 3601 accgcggccg aagacaccgg cgacgtatcg cccgaactgc aaaccgcgct cgacgacctc 3661 gcggccgcgg ccgctgctgt ccaggccgcg gcaacgaatc cgccggcgga tcccgccgat 3721 ccgccggcgg acgccaacac caacaaccca cctacagaca aggtaactgc cagcatgacg 3781 aaccctcccg ccgtcccgcc gctgctcgcc ggcaacaccg gatccggcaa caccgcgccg 3841 gatatgtcct tcgccgctct cgcgaccgct ctcgccggcg gtaagaagtc tccggaactg 3901 ctcgccgctt tcgcgaccgt gaccgaagcg aacgtcttcg acgtcgcgac cccgccgacc 3961 tacgtcggag agatcacggc gcagtcgacg tacaagcgcc gttacgccga cctcgtgacc 4021 aacaacccgc tcaccggctc gaaggtccag ggatggcgct tcgtgcccgg gaagacgccg 4081 gtcgtccgtc cctacgccgg ggacctcgcc gaaatcccgt ccaacgaagt cgtcctcgaa 4141 gaagtgtcct tcgacgccgg ccgcttcgcc ggcggtaaca aggtcgaccg caagcactac 4201 gatttcccga ccccgggttt ttgggccggc tacctccggg agtccaccgg cgactatgac 4261 aagcagatcg atttgctcgt ccgcgacacg ctgctcggga cgaccaacga aatcctcgcc 4321 ggcgcttcga tcgccggcgt cgccgaaggt tggtcgaagc tcgtcgacgg cgctctcgcc 4381 gtgtgggatt tcgggacccc ggattgggct ctcgtcggcg tcgacctgta ccgcgaaatg 4441 gcgctcacgt cggagaaaga ccggcttgcg ttcctcaacg cgtcgctcgg tctcgacgaa 4501 gccgcggccg cgggattccg gatccgtccg gtctccggcc cgggaacgga cgggacggtc 4561 acggtcggca cgaagggcgg ggtcgaattg caggaactcc ccggattgat ccgggtctcc 4621 gcggtcgacg tctctcacgc cggcgtcgac gaaggcgtct acggttacgg cggcgtcttc 4681 acccgcgacg cccgggcgat ccgcaaggtc gtcgacgtcc tcaccccgga aggctagtca 4741 atgctggtgg gatgggtcga ctatgacccc gctaaagagt tgtgggcgga ctccattctc 4801 gccggcgaag acaaagtccg ggagcttcta gaaaacgcgt atgagtcttg tgtgatctac 4861 gcgccgaagc tcccggacgg cgctcccgtc ccgcaacggt acaaagacgc gcaggtcctc 4921 caagcccggg cgatctggca aatgcagcgc caggggcccg gggatcaatt cggcgccgac 4981 gggatctcta tcgcgatcta ccctctcgac gcccggatcc ggcaaatgct ccggcccaaa 5041 caagtcctag gcggaatgct atgagtctcc ggtccgaact ctccgcggtc ctcaaaaccg 5101 aactcgggga cggcttcgcg gtcctcgcgt ccaaacgcgc gatcgacaac acgtcgattc 5161 cggtcgtcat ggttcaccgg aaagcggtca cgccagggcc ggagcggaag cggctcgcga 5221 ccgacgtcga agtgttggtc ctcgtcgccg aagcttacgg ggacggcgcc gaagacgccg 5281 cggacgaagc cctcgacgac gtcctccggg tcctcgaacg gatcgaagat ccgatcgttt 5341 ggtcccgggc cgaacgcgac aacttcgaag gtggcttcgt cggctacgtc gtcaccctcg 5401 aagccaccac ggataactac ctactccccg gaagggaata accgaaaatg ccagctctct 5461 ttctcaaaaa cgcgacgatc acgatcgaag gcgtcgacgc gtccgaagac gtcgataacg 5521 tccagttcac cccgacgacg accccggcga cgttcacccc gatttccggg aagacgcagt 5581 ccgacgccgg cgcgacctca tgggtctgca ccatgaacat tgcacagaac tacaccgccg 5641 ggtcgttgtt catgctcatg tttgccgccg gcgctcccct cgacgtcgtc ctgaaaccgc 5701 gcgggaccgc ggtcggcggt ccgacgatct ccgcggaaat cgtcccggtc ccggcgacga 5761 tcggcggcgg ctccggcgcg ctcaccgcgt ccgtgacgtg ccaggtcaac ggcaaaccga 5821 caatcgccgc cggagcctag gacatggtcg tccgggtgca accttcggtc gaagccctcg 5881 acgcttacaa agccgtggtc ctcgccatga aggtgatcga caagccgatc cggcaagcga 5941 tcaacgtcga cgcccggacg accctctccc cggtctggaa aaagctcgtc accgaacacg 6001 ccgggacgct gctcgaccaa cgtgtcctca acaccgggac ccggatcgcg gccggcaacc 6061 cgccggccgc gatcgccgga gcgtccaagc gccggctctc cggcggtctc gtcccggccg 6121 aatataaccg tatggtcgaa ttcggcgtcg acccgaagga ccgtaacgtc ccgtccgaat 6181 actcccggaa gacgaagtcc gggaagacct cgaccgtccg gcgccggacc cggatcgggc 6241 tcccgccggc ccggtcgaag ggacgcgtcg tctatcccgc gtttgccgac ttcgctcccc 6301 gggcaatcgc ctattgggtg caatccgtag tccgtatcac tcacgaaaca ctagaaaagg 6361 ggacccgctg acatggcggc cggaatcaat cttccgtttt tcgcggacgt ccgtaacttt 6421 ctgaagggga cagattcgat cgcggacgcg ctcgacgacg tcgccgggtc cctcgacgac 6481 ctcgacacgt cctcgaaggc gtccgctgac accgccggcg acgcgatcgc ggacgggatc 6541 caggacggcg cgaaggacgc gtccaaagcc gtcgacaagc tcgaacgcga cgtctcggac 6601 aatatgaccg ggatcgccaa agacgcgaag acggccggca acaaagccgg cgacgcgctc 6661 aaagacggct tgaaagacgg cgggaccgcg gcggacaagc tcgaacagaa ggtctcggac 6721 accttccgcg ggatcgccgc ggacgccaaa gccgccggcg acaaagtcgg caagtccatg 6781 aaggacggat ccgaccgggc cggcgaaggt ctcgacgaca tgaagtccga agcggcgtcg 6841 accgcgaagg agaccgcggc gtcgttcgac ggatccgcgg actccattgt tgggtccttc 6901 caggaggtcg cggcgaacgc gttcgcagcg ttcgggccgg ccggactcgc cgccggactc 6961 gcggccgcgg ccgggatcgg cgtcgctatg accgtcctgc agggggtcgc ggacgaagcc 7021 aacgacaccg gcgacgccgt gaccgacatg gcgacgaaga tccgggacgc cggcggggac 7081 atgtccaacg tcgacctcac ggaagggatg atcgactacg gcttcgcgat ccaggacacg 7141 aaggaatggt ttgagttttt ccaggactcc gcggaaaccg gcttcgaaca gattaagaag 7201 aaatccgaag aagccgggat cgggtgggtc gaagcgttcc gcgggacgaa gggcacagcc 7261 gaagagtccc gggacgctct cgcgaccgtg gtcgaaaagc tcgaagccgc cagggacggt 7321 gcgaccatgt gggtcgacgc cgcgtccggg atgcagggga tcgacctcgc cgaccaacga 7381 aaaatcgacg cgctcgaaga cctcaaaaag aaattcgaaa ccaatattga aaccctcgac 7441 cgggccgaat ccgccaaccg ggatctcgcc gcggccggga tcaaaacgac cgaacaaatc 7501 gaagcggaaa aagaagcggt cgacgccgcc aacgaatccc tcctcgctca ccgggacgcg 7561 ctcgacgccg cggccggcgc cgcgatcgac gccgacaaag ccgaactcga ctacgtcaag 7621 accttggccg aagggaacgc cgatatcaaa aagaacggcg aaacgatcga catcaacact 7681 gagaagggcc gggcaaaccg gcaaacgctg ctcgacatgg ccggcgcgtc taactcgctg 7741 atcgccgcgc agatcgaaca gggcgactcg accgcgtccg tcaccgccag gacccagcaa 7801 gcccgggacg cgtttatccg atccgccgaa gccgccggct acacgaagga cgaagcgaag 7861 aaactcgccg accaatacgg tctgatcccg aagaacgtcg cgacgaaggt cgaagcccgg 7921 aacgtggaaa aaaccaaacg ggaaatcgac ggcgtcgccg ctccccggga cgtccctctc 7981 aacctcgtcc gcggcaacga gtccgtaacc tcatggattg ccggcctctc gggccggacg 8041 atccccgtca acatagctgt ccgcggcggt cgcggcgtga ccgactgaaa agggttttag 8101 cacatgtctt ccattacagc cgaagtcctc ccggactccg ccgcggtccg gctctccatc 8161 aacgcagcgg ccgggatccg gtcaatcacc cggcgcgacg cgaacgggat caacccggtc 8221 cgcgtgacag acggggtcct cgacgtcgtc cccgaactcg cgttcgccgg gaccaaccta 8281 atcctcaatc cgaccttcga agtcggtacg agtaattgga cggcgcagcg ctgtacgctc 8341 tcccgctaca cgtggacggc tagttggttc aactacacac ccgggccggc cggagcgtac 8401 gggatgcggc tcaccgcgga cggcgtcgcc ggcggcactt acgcctatgc cgcatccttc 8461 gccgtcacgc caggggacta cgccgcgggt atggcggtag ggtcctcgtc cagttctcag 8521 gtttacacgt cggtatggtt ctatgacgcc ggcggcgccg tgattggcaa ccatgactcg 8581 ccgctctccg cggccggcca atctgtgtat aacctccaaa ctctcgtcgt cgctccggtc 8641 ctcgtcccgg cgaacgccgt caccgcgcgg ctattgttgc gcttctccgg gaaccctccc 8701 gctagctcgt tttcgtactg ggatcgggca atggtcgtca ccgccagcac gtcagccggc 8761 gcggcgacgg cgatcgccag ctacttcgac gggtcctatt ccccgtcgat ggagtacaaa 8821 gtcgcatgga ccgggaccgc tcacgcgtcg acgtcgaccc gtatggtccc ggcgtccccg 8881 gcggtcgtct atgactatga agccgcggac ggtccgatcc gttatgacgt gaccgacctc 8941 gacggccggc tcgaatccct cgacgtgacc ggcttcgtcc tcgacgcgcc gtggttgttc 9001 accccggtta ttcccgggta ctcccggaaa gcggtgtccg tgaccgggat cgacaccgaa 9061 ttcgaagacc ggtcaacggt ccactccggg ttgctcggac gcccggaccc ggtcgtcgtc 9121 ctccggccgc tcggactccg ctccgggacg atggaactct acgccgggac ctacgcggac 9181 gcgctcgaga tcctctcccc gctgcaacgt gcgaccgtga tgatgctccg gcaacccgaa 9241 cacgccggac tggatatgta cttcgccgcg gccggcggat ccccgaagat cgtctcgctc 9301 gtcaccgccg gcggctcgac cgtgtgggga gtccacgtcc cctacgtcga agtaaagcgg 9361 ccggaaggac cgatcgccgg cgctctcgga tggacgtatg ccgacctcgc ggccgcggtc 9421 ccgcgctact cggatcttcg actcaccttc gccacgtatg ccgacatgcg gcttaaccaa 9481 aggatcacgc catgacgata accgggcttt accaggaccg ggccctcgac atagtcaagc 9541 agacccaccg ccaggacgga tccgccgcgc tgcacttcgc cgccggcgac tcgctcgaca 9601 ttgtcctccg ggatccgcag atcgcttttt ccgaagactg gtccccgtat atgcaagtca 9661 ccgcggacgc cgtgacgccg gcggacccgt ccgtcctcgc caggattgat cctagggccg 9721 gcgtcgacct cgaagtccgg gccggctacg tctatgacga cggcacgtcc gacgtgcaac 9781 cgctcgcgat cggacacctc cggtcccgcc gcgctctact cccggaaggg acgatgccgg 9841 ttacggcggc gtccgccgaa caactcgcgc aagatgcgaa gtggctcaac gcgacgacga 9901 cgaaggtctt cggcggggtc ctcgaagccc tcgaatacct gacgacctac gcgaccggga 9961 ccaacacggc cgggaccttc caatcgacga tcaacccggt ccaccggccc gacctcgtct 10021 ccggcgtcgt cctcgaacaa ggagcggacc tttggggacc gatctccgcg atcgcgcttt 10081 cggccggtct ccgggtgtgg gcagacgaaa acaacgtctg gaacctcgcg cctaaaacga 10141 ccctcgccgg cgtgacgtcc gctttcctca aacaagggcc ggcgacgacc gtttcgaagg 10201 tcgaagacgt cctcacccgg gaaaactggt tcaacgcggc cgcgctcacc tacacgtgga 10261 aggactccgg cggcgtcgaa caaacgatcg tcgggaccta cgcgccgacc ccggcgccag 10321 ggaccgaaaa gggcgccgga tgccggacct tcacggacga ccggcccgga cccatcagtc 10381 agtatcaggc caacgaaaac gcgcggctca cggtcaataa cctctcgacc cgcggcgggt 10441 cctacgcggt cgaatcggtc gctatgtatt ggctccggcc cggaatgacc gtacaaatcg 10501 acctcgcgaa cgggatcacg gtccggcata ttatccgaaa aatcacgttc aacgtcggcg 10561 ccggctccat gtcggtcgtc acccgcgaac cttccaacct aggagaatga cacatggcaa 10621 cgacgaccaa acggtcttac cgctatcccg ggcagaacgg cgaacccgac gtcgccggcg 10681 atatccagcg gctcgccgaa gacctcgata acgacgtcgc gaagctgttc gacggtctcc 10741 ccgcgaagat cgaattcggg tccgattcga tttcgatcgg cgccggcgcc gcgacgatca 10801 cgaaggtgat ctccctcccg gccggcttca ccgcggcgcc gtccgtgtcg ctgcaaaaca 10861 cggcgaacgt ggccggccgc gcgtcgctgc tcgggctcta cgtcaccgcg aagactgcga 10921 cacagttcac ggtcaagatg cagacgtccg ataacgcgaa cgcggggacg tcctacgcga 10981 tttccttcga ctggatcgcg gtgggcaact aatggtctcc ttttgccggc cggcccgggg 11041 acgggagacc cagcccttcg cgacgatcca gttacaggac ggtttgcctc acgccgggac 11101 ggacttcggc ttttacgatg cggccgggaa cgcatgtccg gaagtcttcg ccgccgaagc 11161 cgggaccgtc ctcttcgccg gcgactcccg gtcgctcggg tggcctaatc cgtggtactt 11221 caacccggac tttgaccgga acgacggccg ggactcgtcc gccggcaacg tcctagtgat 11281 cgggcaccgg gacggcgtca ccacgtattc ccatttggcc ggcttcaacg tcgccaaagg 11341 cgcgacggtc tcccgcggcc aacacgtcgc gacgatcggc aacaccggga acgcgaacgg 11401 caaacatttg cacttcgaat ggatccccta tcccttcgac ttcgggacgg ctaccttcgg 11461 ccgggtccga cccaccttca agaaaggctt gttcatgtac ctcacggaaa agcaggaact 11521 cgaaatcctc gccgcggcga agctgctcaa cgccagggcg aaatacctcg acgcgccggt 11581 ctccgccgtc ccgaagaaag ccgcggccgc ggtcctcgac gcgccgatcc cgcggaaggg 11641 cggacggacc gggacgacga ccccgcggaa cgtcttcgcg tattcggacg cgaacctcga 11701 cgccgcggac gacatgcccg aagcggaccc ggccgcatga gccaggaaac gaccccgcgg 11761 ccggctccgg tcaacccggc gcgtttcatc ccgtccccgc ggctccgggc ctacctttac 11821 gggatcctcg tccccgccgg cgctctgctg gtcttccgcg ggatcatcac ggccggggag 11881 ctaggtctat ggctctcgct cgccggcgcc gtcctcgccg tgtcgaacgg gctcgctctc 11941 gccaacacgc cgaaggggtc gacggatgac accgccggcc gctgacgaaa tcaccacagg 12001 ggagctaggc cggcggctcg actcgttcgg gacgaccctt caagacgggt ttcgggagct 12061 ctcgaagaag atcgacgacc gtccggattg gcaggacgtc cggcggatcg aagccggcct 12121 agtggagcgc gtgacgtccg aatccgaagc ccggaagatc gcgcagggga tcgcggaccg 12181 ggcgatcctc gcgctcgaag acggccagaa atgggcaacc cggcttattc tcggcgcggt 12241 cggtctcggt gttatcaatt tgatatggac taggtaagtt gtgggatccc gctaagccgg 12301 cgcgccaggt tgcagctgta aaaaccgcgg aacggcgcgg acttgcaaaa accgggtttg 12361 gagggatccc acaacaccgg ccggccggca acacgcggac cgcgtcaagt gacgcgcgac 12421 ggtccggaag tgtgcaacga tcgacgcatg acaaagaaag acaagccgcc gggagcggca 12481 gacgaagccc ggatcgtagt cccgtccgac ctgatctcca ccggcgaagc cgcgaagatc 12541 ctaggggtcg accgggcaac gatcgtaaga cgcgccagga ccgggaaaat cccgatcgtc 12601 gcgcaactcg acgccaaaac cggtcacggc gcctatgtat tcgaccggaa cgaaataaag 12661 gactctgggg aggatcctaa gaatggaatg gctcacaatg gcacaagcgg cgactagggt 12721 cggcgtctcg gtccggacgg tcgaacgcta cgtcgacgcc ggcaagctcg acgcgcataa 12781 gctcccgtcc ggccggcgcc gggtccggat cggcgacgtc gacgcgctgc tccggccggt 12841 ccgccggccc gccggcgacg ccgcgaaggt gggatcgaaa tgagctggga agtcctcgca 12901 tgggccatga aaaagggccg ggactaccaa ctacagccga cgacccgtca cgtcatgctc 12961 acgctcgcca actacgccga cccggaaggg aacgacattt atccgtccct ctcccggctc 13021 gaactcgaca ccgggctttc cgaacggacg atccgccggc agatccaaca ccttatggga 13081 tgccgtctgc tcgactacgg ggatcaacgg gtcgtcgaac aaaacccgcg gatccggccc 13141 gaccaacgac cgaaggttta ccggtttatc ctcgaagcgg ctcccgccgt tgtggataac 13201 tcaccggagc ggccggacac catgtccacc cgtggatacg gtttcgcgcc gaaggttgtg 13261 gataactccg gtcacgcccg gactttgagt cccgaacgac cggacaaacg accggtcaca 13321 gtgtccaacg aacctataaa ccaaaaactt aaaccagggg ccggcttcgc tgacgaagcg 13381 gcggacgccg tcctcgacgt cctcccggtc gtctccggct cggacttcgt cgaagagatc 13441 cgacgccgtc gcgccgaacg cgaagccgaa aagcaaaaag atgctcttga ccatgcacga 13501 tcacgcggct aacgacgccg tcctaccttc gcaaaggatc accatgaaca cctatgaaac 13561 acgttatgaa gcccggtcga agcgccgcgg cgatctctat cccgcggccg ctgaccaata 13621 cgtccccgcc gcggtcatta ccgatcacgt cgaagccggc ggcttcgctt ggccggtctc 13681 cggggacctc accacgatcg aacatggcgt cccggtcccg aagctccggc gccggctaaa 13741 ggtcgtcgcc gccggccggc tccggtcgat cgacccgaac gggatcgcgt ccctagccgt 13801 cctctggaat ctcgacgcca gggacctcaa cgaacgggtc cggtccgggg aggaaaaccc 13861 ggtcgccgtc gacggggatg gacaccttcg cgccagggct tcgctgatcg cctcaacgga 13921 gcaaatgaat atggcatggg cacaagcccg ggccgaatgc ttcgcgaccg acgccggcgc 13981 ggaccgccgg cgcgacgaag cccggacccg ctaccgcgaa gccctcgacg ccgctaggag 14041 cgtctagaaa tggaagacat gctgttcgat atccccgggc tccaaaaaga gccggagaag 14101 ccaaagggcc ggcgctgggg cgggaatgat tcggcgaagg cccgggcgat tgtcgcgccg 14161 accctcccgc ggccatgtac ccggtgtggg gagaccgtca cggccgacat gaagtggcat 14221 gcggaccata tcctcgaaga cgtcttcggc gggacctcga ccccggacaa tctcgggccg 14281 gcacacaagc actgtaacga gtccgccggc gggaagatcg gcgccgccat gacgaacggc 14341 ttcaaacaag cacaggacca acgaagggaa gtgaccgtca aatggtggta aagagaaaga 14401 cgaccgaacg cgccgtcccc ggcgaagtta acgagtgggt cgtcctaggg tttctaaatt 14461 cgtgcatgga acacagcggc cggatcacgg ccaaagattg gaacgacgct atcgccgccg 14521 gccgggacga actcgccgac gaactcgcca ggaccaacac caacaccaca acgaagggca 14581 cgaaatgaga atcacaggta tcaaccggaa cgacctcgaa gcggacgtcg tcggcgctgc 14641 tctcgccgga gaccggatcg tcgtcctcgc tgacaccagg gaacttgttc ggccgatctt 14701 cgacacgatc gcggacgcgt tccaggacga cgacgggaag gttatggacg gcgtgacgat 14761 ccgtaagcag aacggccggg agcggatcga actccgcaac ggatcccgga tcaccttcgg 14821 gagcgtccgc cggcccgact cgctccgggg attggtcgcg gaccggatct acacgccgtc 14881 agacacgaag gtcgaagtcc tccggatcct cgaagccgtg acgagcacgg cggcgaagcc 14941 ggcgacgctg ctctacggga tccgggtcga cgcgtgaccg tcccggtcgc gaccggcgtc 15001 gccgtcctca tcatcatcct cgccgcggtc ggagccggcg ccgctatgtg gtggctagtg 15061 cgactgatcc agtggctttg ggagcgccgt ccccacggtc tcgccggccg gcatgactgg 15121 atcgaatggc tcacgatcaa cggcgaagtc cggcgccggc tctgtctgaa atgcggccgg 15181 gacgaacggg ttgtccaggg cgaagagatc acggcacagt caacaggtaa ggttgaaccg 15241 gtcgactaat gcgaaggtag accgatcgga cccggtccgg atctctcttg ggaggatccg 15301 gaccgggtct ttttgctgaa accgtcaaag cccacggtcc ccgtcc //