LOCUS NC_016822:16 18774 bp DNA linear BCT 07-SEP-2013 DEFINITION Shigella sonnei 53G, complete genome. ACCESSION NC_016822 SOURCE Shigella sonnei 53G, complete genome. COMMENT locus start: 3420826; locus end: 3439599 COMMENT n1_4mer:GRV/n1_4mer:RV = 1.518469; n0_4mer:D = 29.626702; n0_4mer:PS = 34.836765 FEATURES Location/Qualifiers source 1..18774 /organism="Shigella sonnei 53G, complete genome." gene 1..1020 /gene="" /locus_tag="" /db_xref="3420826..3421845" CDS 1..1020 /gene="" /product="putative transposase, IS110 family protein" /translation="MKYTPVGVDIAKHVIQIHFINEHTGEVVDKQLRRQDFLTFFGNR EPCLIGMEACGGSQHWARELTKLGHKVRLLQARFVKAFVMGNKNDVMDARAIWMAVQQ PGKEIAVKTEEQQSVLVLHRTRMQLVKFRTAQINALHGTLLEFGETIHKGRAAMEREF PEALERMKERLPPYLIMVLENQYNRLNELDSLIEDIEKQLTSVARQNETCKRLLDIPG VGPLIATAAVATMGEASAFKSGREFAAYVGLVPKQTGSGGKVRLLGISKRGDTYLRTL FIHGARAVALVAKEPGPWITELKKRRPASVAIVAMANKLARTVWAITAHDRKYDRNHV SIRPY" gene 1369..1596 /gene="" /locus_tag="" /db_xref="3422194..3422421" CDS 1369..1596 /gene="" /product="Prophage integrase" /translation="MACSALIESGLWSRDAVERQMSHQERNGVRAAYIHKAEHLEERR LMLQWWADFLDANRDKGISPFEYAKINNPLK" gene 2048..5563 /gene="" /locus_tag="" /db_xref="3422873..3426388" CDS 2048..5563 /gene="" /product="superfamily I DNA helicase" /translation="MDENALGFTSYWRNSLADAESGKGSFERKDAKNFTHWHGIAAGR LDEAIVSKFFKGEKDDVETVDVILRPKVYFRLLQHGKDRSAGAPDIVTPIVTPALLSR EGFLYPTPATSIPRDLLEPLPKGAFSIGEIGQYDKYKTTHTTFSINFDDSVDKTAETD EEREARYAALQQEWRQYLYDSERLLKSVAGDWIEKPEQYELAEHGYIVKTAQSGGASS HILSLYDHLIVCNKDVPLFNRFASREVHAAESLLAPGAKFSDRLGHSGDKFPLAKAQR DALSHFLDARHGDILAVNGPPGTGKTTLVLSIIATQWARAALEKSEPPVIIATSTNNQ AVTNIIEAFGKDFSQGSGAMAGRWLPELKSFGAYFPSSSRKAEAAKKYQTEDFFNQVE SKEYVEDALLFYLEKAKAAFPGKECSSPEKVIELLHGQLAAKSEQLIRLNATWQTLSQ IRAARELIANDIEQYLDNLNKLLSGQEQKVTLLKSAKTEWKKYRAGESLIYSLFSWLP AVRNKRQYQIQLFLEDKLGALIAGNQWSDPETIERNIDGLLNSAEREQTTYRQQIDSA HEIVLKEQQAVQEWQRLAFDLGYEGDEELSFSQADELADTQIRFPAFLLTTHYWEGRW LMDMASIDDLQDEKKKKGAKGVTARWQRRMKLTPCVVMTCYMLPGNMQISEHKGQRKF EKSYLYDFADLLIVDEAGQVLPEVAAASFALAKKALVIGDTEQIPPIWSIAPAIDVGN MLAEKILSGSTQEEITEKYTAIADLGKSAASGSVMKIAQFASRYQYDPELARGMYLYE HRRCYDNIIGYCNTLCYHGKLLPKRGREESNLMPAMGYLHIDGKGELASSGSRYNLLE AETIAVWLAENQQNIEAHYGKSLHEVVGIVTPFSAQVSTIKQVLGKQGISTGTNEKSL TVGTVHSLQGAERAIVIFSPVYSKHEDGGFIDSDNSMLNVAVSRAKDSFLVFGDMDLF EVQPASSPRGLLAKYLFESEKNALSFDYKERKDLKTAGTKIYTLHGVEQHDNFLNQTF ENTSKHITIISPWLTWQRLEQTGFLDSMIAACSRGINVTIVTDRSYNTEHNDFEKRKE KQQNFKAALEKLNALGIATKLVNRVHSKIVIGDDGLLCVGSFNWFSATREARYERYDT SMVYCGDNLKGEIEAIYNSLERRQV" gene 5628..5930 /gene="" /locus_tag="" /db_xref="3426453..3426755" CDS 5628..5930 /gene="" /product="IS600 orf1" /translation="MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQW VTAARKGLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR" gene 6071..6784 /gene="" /locus_tag="" /db_xref="3426896..3427609" CDS 6071..6784 /gene="" /product="hypothetical protein" /translation="VAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQ KRKFRATTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDVYT CEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQFGL KTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAISVIREYIEIFYNRQRRHS RLGNISPAAFREKYHQMAA" gene 7191..7535 /gene="" /locus_tag="" /db_xref="3428016..3428360" CDS 7191..7535 /gene="" /product="transposase, IS110 family protein" /translation="VPGEYSSGNSIRPRGITKVGNSELRRLLYEAAWSYRTPAKVGAW LIYYRPDSVTQYSKDIAWKAQQRLCSRYRTLTAKGKKSQVAITAVARELTGFMWDIAL AAQSSFSQQKQN" gene complement(7532..7786) /gene="" /locus_tag="" /db_xref="3428357..3428611" CDS complement(7532..7786) /gene="" /product="truncated transposase" /translation="MQSRPLLTSLYKLMQEKEHTLSKKCRLGDAFRYIRKHWAALCNF CDDGLAEVDNNAAERALRAVCLGKKTFSRITHSVSDCFRV" gene complement(7864..8088) /gene="" /locus_tag="" /db_xref="3428689..3428913" CDS complement(7864..8088) /gene="" /product="hypothetical protein" /translation="VFFQACGSLRNTVVTFDGAVEGLPLITDPVGVVGDIIRIGELRA DGRKQDAGLMRDAKSTMYISVLKARRRKKL" gene 8179..8940 /gene="" /locus_tag="" /db_xref="3429004..3429765" CDS 8179..8940 /gene="" /product="putative plasmid transfer protein" /translation="MVKSHGTVSVDGKVSDADLTYLEEVANSTGQEVDKSRLTSQAFA RAALITDVGIALATELETAGQKWSLGFPPKFQRVDLFNYNVLVRNYDSSAFKGDRYHN TKNGINADIGASTDLDDNWTLGLVAQNLIPRSIETKEVNGITETFRIRPQVTAGVSWH NAMFTTAFDVDLTPASGFTSDSNRQFAAIGAEFNAWKWAQLRAGYRQNLAGNDGSAFT AGVGISPFDVVHLDVAGLIGTDNTYGAVAQFQFTF" gene 9302..13159 /gene="" /locus_tag="" /db_xref="3430127..3433984" CDS 9302..13159 /gene="" /product="serine protease" /translation="MNKIYSLKYSHITGGLVAVSELTRKVSVGTSRKKVILGIILSSI YGSYGETAFAAMLDINNIWTRDYLDLAQNRGEFRPGATNVQLMMKDGKIFHFPELPVP DFSAVSNKGATTSIGGAYSVTATHNGTQHHAITTQSWDQTAYKASNRVSSGDFSVHRL NKFVVETTGVTESADFSLSPEDAMKRYGVNYNGKEQIIGFRAGAGTTSTILNGKQYLF GQNYNPDLLSASLFNLDWKNKSYIYTNRTPFKNSPIFGDSGSGSYLYDKEQQKWVFHG VTSTVGFLSSTNIAWTNYSLFNNILVNNLKKNFTNTMQLDGKKQELSSIIKDKDLSVS GGGELTLKQDTDLGIGGLIFDKNQTYKVYGKDKSYKGAGIDIDNNTTVEWNVKGVAGD NLHKIGSGTLDVKIAQGNNLKIGNGTVILSAEKAFNKIYMAGGKGTVKINAKDALSES GNGEIYFTRNGGTLDLNGYDQSFQKIAATDAGTTVTNSNVKQSTLSLTNTDAYMYHGN VSGNISINHIINTTQQHNNNANLIFDGSVDIKNDISVRNAQLTLQGHATEHAIFKEGN NNCPIPFLCQKDYSAAIKDQESTVNKRYNTEYKSNNQIASFSQPDWESRKFNFRKLNL ENATLSIGRDANVKGHIEAKNSQIVLGNKTAYIDMFSGRNITGEGFGFRQQLRSGDSA GESSFNGSLSAQNSKITVGDKSTVTMTGALSLINTDLIINKGATVTAQGKMYVDKAIE LAGTLTLTGTPTENNKYSPAIYMSDGYNMTEDGATLKAQNYAWVNGNIKSDKKASILF GVDQYKEDNLDKTTHTPLATGLLGGFDTSYTGGIDAPAASASMYNTLWRVNGQSALQS LKTRDSLLLFSNIENSGFHTVTVNTLDATNTAVIMRADLSQSVNQSDKLIVKNQLTGS NNSLSVDIQKVGNNNSGLNVDLITAPKGSNKEIFKASTQAIGFSNISPVISTKEDQEH TTWTLTGYKVAENTASSGAAKSYMSGNYKAFLTEVNNLNKRMGDLRDTNGEAGAWARI MSGAGSASSGYSDNYTHVQIGVDKKHELDGLDLFTGLTMTYTDSHASSNAFSGKTKSV GAGLYASAIFDSGAYIDLISKYVHHDNEYSATFAGLGTKDYSSHSLYVGAEAGYRYHV TEDSWIEPQAELVYGAVSGKRFDWQDRGMSVTMKDKDFNPLIGRTGVDVGKSFSGKDW KVTARAGLGYQFDLFANGETVLRDASGEKRIKGEKDGRILMNVGLNAEIRDNLRFGLE FEKSAFGKYNVDNAINANFRYSF" gene 13206..13787 /gene="" /locus_tag="" /db_xref="3434031..3434612" CDS 13206..13787 /gene="" /product="hypothetical protein" /translation="MYYPVTDYIALALIISFLFLTLFICLLCLKHERIKKETIRQKNA HILEHGWNATEFSWFRYGQYNETGIYISIEKTIIITITVSGECFKKEYSIVSHMLVTD TITEATLYENGLYTRHIRLSRPVSDSKYPLPPGSQLIKNMTLRLRLQDQQEETSVTLF QGKMSTDGNNYYIIKGKVSSVLLLLKMLQINHA" gene 14079..14354 /gene="" /locus_tag="" /db_xref="3434904..3435179" CDS 14079..14354 /gene="" /product="IS1 repressor protein InsA" /translation="VASVSISCPSCSATDGVVRNGKSTAGHQRYLCSHCRKTWQLQFT YTASQPGTHQKIIDMAMNGVGCRATARIMGVGLNTILRHLKNSGRSR" gene 14273..14776 /gene="" /locus_tag="" /db_xref="3435098..3435601" CDS 14273..14776 /gene="" /product="IS1 ORF2" /translation="MPGNCTHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMD EQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDG WPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHY LNIKHYQ" gene complement(14787..15347) /gene="" /locus_tag="" /db_xref="3435612..3436172" CDS complement(14787..15347) /gene="" /product="integrase" /translation="VAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQ KRKFRATTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDVYT CEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQSGL KTSMSRKGNCYDNAPMESFWGTLKNG" gene complement(15488..15790) /gene="" /locus_tag="" /db_xref="3436313..3436615" CDS complement(15488..15790) /gene="" /product="IS600 orf1" /translation="MSRKTQRYSKEFKAEAVRTVLENQLSISEGASRLSLPEGTLGQW VTAARKGLGTPGSRTVAELESEILQLRKALNEARLERDILKKATAYFAQESLKNTR" gene 15957..16640 /gene="" /locus_tag="" /db_xref="3436782..3437465" CDS 15957..16640 /gene="" /product="hypothetical protein" /translation="MTQIESTVTSLHREAEAQFRPELEKIVRGIETGFRGTALYATEN IAGRINARLEDEGFTVKITFPAVSQLQTRIAVKTNLSALMEERTETVTRRRRQSGFWG KICGAFGTSDWGWENYKENVSRSVININSVRKEVMSLTRAYFGELQASIEQNINQPVR QEIDDFFCTFREKVEQLRNTLIQSSEDHKRDQQAQEQLTERLQALNERVPELITDSKA LREELETLL" gene 16637..17542 /gene="" /locus_tag="" /db_xref="3437462..3438367" CDS 16637..17542 /gene="" /product="hypothetical protein" /translation="VTSPFIQQIADNRVCQVLSCLPEKFVVDFANGIDVAQEHNRTAG GRTFFRRLKEGLTGKGAVRQNAINASLAQGVEASLRWLTELTTSLATTNYAITRVNDR VSSLVSDTARLAHYSADTREQLLTLAEQVHQKLNHLEEKLHRVDQVQRAQLHLEQIFS WWSAGRYASFSPAGRCYVALEELRWGAFGDVIRQGETGQVNQLLDILRHKALTQMAQE NGGSATVRLNTLDWLGGQSREQADNEWHEAVNWLGDWCSEERHPVIWSTTQAAEHLPV RMPRLCSAERLSESMVDEIFQKGEA" gene 17539..18609 /gene="" /locus_tag="" /db_xref="3438364..3439434" CDS 17539..18609 /gene="" /product="hypothetical protein" /translation="MSTEMKTGLVLSGGGAVGAYQAGVVKALAECGAQISMVSGASIG ALNGAIITASPDLSEAALRLEALWDHLGNNQVLSVNRSVYFSLLKKLVQAMNLCQIPG RAGALLTTLFRHISTINGFDNPMIQPLLSDEPLTALMDHYLDTDALADGLPLYVSLYP TEGGMQDIIDCIRAELGAGTTKNAVFQHIQSLPRGQQKEALLASAALPLLFRPREVQG TMYGDGGMGGWRNRQGNTPVTPLVDAGCNMVIVTHLSDGSLWDRRAYPDTTILEIRPR KRLKQIGDEGKSGGLLSFTSAHTDAWRQQGYEDTMLTMEHIRKPLAARQALTRSETVL QKSLEITEGADSALRNAMARIK" ORIGIN 1 atgaaatata caccggttgg cgttgatatc gcaaaacatg tcattcagat tcacttcatc 61 aatgagcaca caggtgaagt ggttgataaa cagttgcgta gacaggattt tctgacgttc 121 ttcggcaacc gtgagccatg cctgattggt atggaggcct gtggaggttc tcagcactgg 181 gcacgggaac tgacaaaact tggtcataaa gtccggttgt tgcaggcccg cttcgttaag 241 gcattcgtca tgggcaataa gaatgatgtg atggatgccc gggctatctg gatggcggtt 301 cagcagccgg gtaaagaaat cgccgtaaaa acagaagaac agcagtcggt actggttctg 361 caccgtaccc gcatgcaact ggtgaagttc cggaccgcac aaattaatgc cctgcacggg 421 acgttactgg agtttggtga aaccatccac aaaggccggg cagcgatgga gcgggagttc 481 cccgaagcac tggaacggat gaaagagaga ctgccaccgt atctcattat ggttctggaa 541 aaccagtaca accgactgaa tgagctggac tcactgatag aggatattga aaaacagctt 601 accagcgtgg cgaggcagaa tgaaacctgt aagcggttgc tggatattcc tggcgttgga 661 ccacttattg cgacggcagc ggtggccacc atgggggaag catcagcgtt taaatcgggg 721 cgagagttcg ccgcatatgt tggtctggtt ccaaaacaaa ctggctccgg agggaaagta 781 cgtctgctgg ggataagcaa acgtggtgac acttatctca ggacattatt tatccacggt 841 gcaagagcgg tggcattagt agctaaagag cctggcccgt ggataaccga actgaaaaaa 901 cgtcgtccag ccagtgtggc aatcgtcgcc atggcaaaca agctggcacg aacagtatgg 961 gcgataaccg cccatgaccg taagtatgac aggaaccacg tcagtatcag accatattaa 1021 tcgctgatac cattaaacaa tgaactctta acaaaagggt gaatgctgaa aggttgctat 1081 ggcggccaga gtgatgacaa agacaggtaa gaccgtgact cactaaacct gaacagtatt 1141 ttgggcttga agtccgccgt gaaaataagg ggtgagtcgg cgaattacat aggggctcgc 1201 agcgttacgg ctgcaataaa gccggatata aagctgcaac ctacccgtca tgtcaaaaca 1261 atggatgcct tgcaaacggg atgcgttcat ataaatactg taaataaagc cctgagggtg 1321 atggggtatg acacaaccca ggatgtctgt ggccatggat tccgggcgat ggcgtgcagt 1381 gcattgattg aatcaggttt gtggtcccgc gatgctgtgg aacgtcagat gagccatcag 1441 gagcgtaatg gtgtacgtgc tgcgtatatc cataaagcag aacatctgga agaacgtcga 1501 ctgatgctac agtggtgggc agattttctg gatgcgaaca gagataaggg tatcagcccg 1561 tttgaatatg caaagattaa caatccatta aaatagtaag cagtccgggc tgattgcccg 1621 gactgcttta actgattatt tttctttgta aatgatgacg aagattgatg ttctggcaag 1681 ttactccggc ttccattaaa gtcccttcgt catgttcttc aggctattta tgactgtgat 1741 tagcaccagg aatatctggt ttccatgaga gcaggtaata acccccggtg ttttcggctg 1801 ttctggtatc cttgtccaac acaggcagtc attataaacg gtaaaaaatg tattcactgt 1861 tatgaaatca atagtcctga cgtactgttg aatgtgcttg tgttctgaac cttgtacaac 1921 cagtacattt gataccgaat tataaacaaa acattgcatc caactttttc aatatggaca 1981 atagttgtcc atatggtggt tttatattcg ataaatcaat ggatttcatt gagtcacagg 2041 actggtaatg gatgaaaatg ctttagggtt tacctcatac tggcgcaact cgcttgcgga 2101 tgctgagtca ggaaagggca gttttgaacg gaaagacgcc aaaaatttca ctcactggca 2161 tgggatagcg gcgggacgtc ttgacgaagc gattgtcagt aaatttttta agggagaaaa 2221 agacgatgtc gaaacggtcg atgtcatctt gcgcccaaaa gtttatttcc ggttactgca 2281 gcatggtaag gaccgttctg caggtgcgcc tgatattgtt accccgatag tgacgccagc 2341 cttgctaagc cgtgagggtt ttttatatcc gacgccagcg acctccattc ccagagacct 2401 gcttgaacct ttgccaaaag gagcattttc gattggtgag attgggcagt atgacaaata 2461 caagacgacc cataccacgt tctctatcaa ctttgatgac agcgttgata agactgccga 2521 aacggatgaa gaacgggaag cacgatatgc cgccttgcag caggagtggc gtcaatatct 2581 gtatgactca gagaggctac tgaagagcgt tgccggcgac tggattgaaa aacctgagca 2641 atatgaactc gctgagcacg gttatattgt taaaacggct caatctggcg gtgccagttc 2701 ccatatcctt tctctttatg atcacctgat tgtttgcaat aaggatgtgc cgctcttcaa 2761 tcgcttcgcc tcgcgagagg ttcatgctgc agagtctttg ctggccccag gagcaaaatt 2821 cagcgacagg cttggacact ccggagataa gtttccgctg gcaaaggctc agcgcgatgc 2881 cttaagccat tttctggatg caagacatgg cgatatcctt gctgttaatg gccctccggg 2941 aaccggaaaa accacgctgg tgctttctat catcgccacg cagtgggcca gagcggctct 3001 cgaaaaatct gagcctccgg ttattatcgc gacttcaacg aataaccagg ctgtaacgaa 3061 cattattgag gcattcggga aagacttttc gcaaggttca ggtgcgatgg ccgggcgatg 3121 gttgccagag ctgaaaagct tcggtgctta ttttccctca agcagtcgta aagctgaggc 3181 agccaaaaaa tatcaaactg aagatttctt caaccaggtt gagtcaaaag agtatgtaga 3241 ggatgcactg ctgttttatc tggaaaaggc taaggcagcc tttcctggaa aagagtgttc 3301 atcccctgaa aaggtcattg aactcctgca tggtcagttg gcagcaaaat ctgagcaact 3361 gataagactg aacgcaacat ggcaaacgtt aagccagatt cgggctgcgc gtgagcttat 3421 tgctaatgat attgagcaat atctcgataa tttaaataaa ttactttccg gacaagaaca 3481 aaaagtcact ctactgaaga gtgctaaaac ggaatggaaa aaatatcgcg ccggtgaatc 3541 actgatctat tcattatttt cctggctccc ggcggttcgc aataagcgac agtaccaaat 3601 acagctgttt ctcgaagata aattaggcgc gctgattgca ggaaatcagt ggtctgatcc 3661 tgaaactatc gaacgtaata ttgatgggct gctcaattcc gctgagcgcg agcaaacaac 3721 ataccggcag cagattgact ccgcccatga aatcgttctt aaagaacagc aggcggttca 3781 ggagtggcag aggctggcat ttgatttagg gtatgagggc gacgaggaac tgagcttctc 3841 acaggccgat gaactggctg atacgcagat tcgcttccct gcatttttac tgacgactca 3901 ctactgggaa ggtcgttggc tgatggatat ggccagcatt gatgatctgc aggacgagaa 3961 gaagaaaaaa ggtgctaaag gggtaaccgc ccgttggcaa cgtcgaatga aactcacgcc 4021 atgtgtggta atgacatgct atatgctgcc cggtaatatg cagataagtg agcacaaagg 4081 acaacgtaaa ttcgagaaaa gttatttgta tgattttgcc gatttactca ttgtcgatga 4141 agccgggcag gtgcttcctg aagtggctgc tgcctcgttt gcattagcta agaaggcatt 4201 agtgattggc gatacggagc agatcccgcc aatatggagt attgctcctg cgattgatgt 4261 cggtaacatg ctggcggaaa aaattctgtc tggcagtacg caagaagaga ttaccgagaa 4321 atatacggca atcgcagacc ttggtaaaag tgccgcatct ggcagcgtta tgaaaatagc 4381 gcagtttgct tcgcgctatc aatatgatcc cgaactggct cgtggtatgt acctatatga 4441 acaccgccgg tgctacgaca atattattgg atactgtaat acgctctgct atcacggtaa 4501 gttgttgcct aaaagagggc gtgaagagag caatttaatg cccgcaatgg ggtatctcca 4561 tattgatggt aaaggagagc tggcaagtag tggaagtcga tataatttgc ttgaggctga 4621 aacgatagcg gtctggttgg cagagaacca gcaaaatatt gaagcgcatt acggtaaatc 4681 gcttcatgaa gttgtcggta ttgtgacgcc ttttagcgct caggtatcca ctatcaaaca 4741 ggtgctgggc aaacaaggta tcagtacagg cacgaatgaa aagtcgctca cagtgggcac 4801 cgtgcactct cttcagggag cggaaagagc gattgtgata ttctcgccag tctattcaaa 4861 acatgaagac ggcgggttta ttgatagcga taacagcatg ctgaatgttg cagtctcccg 4921 tgcgaaggac agttttctgg tcttcggcga tatggacctg tttgaggtcc agccagcctc 4981 atcgccacgg ggattactgg caaaatacct ctttgagtca gagaagaatg cgctctcttt 5041 tgattataaa gagcgtaagg atttaaaaac cgccgggacc aaaatctaca cacttcatgg 5101 tgtggagcaa catgataatt tcctgaatca gacatttgaa aataccagta aacacatcac 5161 gataatttct ccatggctga cctggcaaag gctggagcaa accggttttc ttgattccat 5221 gattgcggcg tgttcacgtg gaattaacgt cacgatagtc actgacagaa gctacaacac 5281 tgaacataat gattttgaga agcgaaaaga gaagcagcag aactttaaag cggcgctgga 5341 gaaactgaat gcgctgggta ttgctacaaa gctggtaaac cgtgttcata gcaaaattgt 5401 tattggtgat gatggtttgc tgtgtgtggg atcgttcaac tggtttagtg cgacacggga 5461 agcgcgatat gaacgatacg atacatcaat ggtttattgc ggtgataacc tgaagggtga 5521 gattgaggct atttataata gtcttgagag gcgtcaggtt tagtgaggta gcctgagttt 5581 aacggacact ccttcctgaa atagaatggc atcagaagga gctaataatg agcagaaaaa 5641 cccaacgtta ctctaaagag ttcaaagccg aagctgtcag aacggttctt gaaaatcaac 5701 tttcgatcag tgaaggcgct tcccgattat ctcttcctga aggcacttta ggacaatggg 5761 ttaccgccgc cagaaaaggg ctcggtactc ctggttcccg cacggtggct gaactggaat 5821 ctgaaattct gcaactgcgt aaggcgttaa atgaagctcg ccttgagcga gatatattaa 5881 aaaaagcaac agcgtatttt gcacaggagt cgctgaaaaa tacgcgttaa tcgaacaatg 5941 gcgacaacaa tttcccattg aagcgatgtg tcaggtattt ggtgtatcca ggagcggtta 6001 ttacaactgg gtacagcatg aaccctcaga cagaaaacaa agtgatgagc ggctaaaact 6061 ggagattaag gtggcacata tccgcactcg cgaaacatat ggaacccggc ggctccagac 6121 ggagctggca gagaatggca tcatcgttgg tcgtgaccga ctggcacgtc ttcgtaagga 6181 gctaaggcta cgctgtaagc agaaacgcaa gttcagagcg actacgaact cgaaccacaa 6241 tctgccagtt gcgccaaatc tgctgaacca gacgttcgct cctacagcac caaatcaggt 6301 ctgggtggcg gacctgacgt atgttgccac acaggaggga tggttgtacc tcgctggcat 6361 caaagatgtt tatacgtgcg aaattgtcgg ctacgccatg ggagagcgca tgacaaaaga 6421 gctgacaggt aaagccctgt ttatggcgct caggagccag cgcccacctg ccgggctaat 6481 ccaccactct gatcgaggtt cacagtactg cgcatacgat taccgggtca tacaggagca 6541 gtttggtctg aaaacatcaa tgtcgcgtaa aggtaactgt tacgacaacg ctccgatgga 6601 aagcttctgg ggaacgctga aaaatgagag cctgagccac tatcgtttta ataaccggga 6661 tgaagccatc tcagtaatac gggaatacat tgagattttc tacaatcgtc agcgtcgtca 6721 ctctcgtctg gggaatatct ccccggcagc cttcagggaa aaatatcatc agatggctgc 6781 ttaaaaaaag aacaaatggt agtgtccgct attgccagta cacctcatag agcaggaatg 6841 gtgagtagcc atcttaccga tcgttttcga gcgtaagatg gctgaatgga atggctatta 6901 ttgcacagtc cttaattata acattcatac cgacatgatt atcttctgtc cggaagaatc 6961 agaggctcag aacgctgaaa aatgagagcc tgagccacta tcgttttaat aaccgggatg 7021 aagccatctc agtaatacgg gaatacattg agattttcta caatcgtcag cgtcgtcact 7081 ctcgtctggg gaatatctcc ccggcagcct tcagggaaaa atatcatcag atggctgctt 7141 aaaaaaagaa caaatggtag tgtccgctat tgccagtaca cctcagactg gtgccaggtg 7201 aatattccag tggaaacagt attcgtccca gaggaataac aaaagttgga aacagcgaac 7261 tgagacgtct gctttacgaa gccgcttggt cttatcgtac acctgcaaaa gttggagcat 7321 ggcttatata ttaccgaccg gactctgtaa cacaatattc caaagatatt gcatggaaag 7381 ctcaacaacg attgtgttct cgttaccgaa ctctgacagc aaaagggaaa aaatcacaag 7441 tagccattac ggcggtggct cgtgagttaa ctggatttat gtgggatatt gcacttgctg 7501 cccaatcatc attcagtcag cagaagcaaa attaaaccct gaagcagtca gacacggaat 7561 gagtgatcct cgaaaaagtt ttcttcccga gacagactgc acgaagcgct ctttctgctg 7621 cattattatc tacctcagcc aggccatcat cacagaagtt gcacaacgca gcccagtgct 7681 tcctgatata ccggaacgca tctcccagac gacatttctt cgataacgtg tgttctttct 7741 cctgcatcag cttatacagg gaagtcagga gcggtctact ctgcatttgc ctgaccgcca 7801 ggcgttcaga caccggtaac ccgcgtatat cgtgctctat ggcgtacagt tcaccgatta 7861 gtttcagagc ttcttccgcc gtcgcgcttt tagtactgat atatacatcg tggattttgc 7921 gtcgcgcatg agcccagcat cctgcttccg tccgtcggcg cgcagctctc caatccggat 7981 aatatcaccg acaacgccga ccggatcagt gatgagtgga aggccttcga ccgcgccatc 8041 gaaagtaacc acggtgttcc tgaggctgcc gcacgcctga aagaacacct ccgggacttc 8101 cgtaacacac atgccgccgc acagctgggc gtttctgctg tggcggccct tcccggtgat 8161 cgcctctctg ccgcactgat ggttaaatca cacggcaccg tcagtgtcga cggtaaagtc 8221 tctgatgctg acctcactta cctggaagag gtggcaaaca gcaccgggca ggaggtggac 8281 aaaagccgcc tgacatcaca ggcctttgcc cgcgcggcac tgatcactga tgtgggtatt 8341 gctctggcta cggaactgga aacggccggg cagaaatggt ctctgggctt cccccccaaa 8401 ttccagcgcg tcgacctgtt caactacaac gtgctggtca gaaattatga cagcagcgcc 8461 ttcaaaggcg accgctacca caacacgaaa aatggcatca acgccgacat cggtgccagt 8521 acggacctgg atgacaactg gacgctggga ctggtcgcac agaacctgat cccccgtagt 8581 attgagacaa aagaagtgaa cggtatcacg gaaaccttca ggatccgccc gcaggtgacg 8641 gccggtgtct cctggcacaa cgcgatgttc accaccgcat ttgatgtgga tctgaccccg 8701 gccagcggtt tcacctccga cagcaaccgt cagtttgccg ccattggcgc agaatttaat 8761 gcctggaaat gggcacagct gcgcgccggt taccgtcaga atctggccgg taacgacggc 8821 agtgcattca cggccggggt ggggatctca ccgtttgatg tggttcacct tgatgttgca 8881 ggcctgatcg gcacggacaa cacttacggt gcggtcgcac agttccagtt caccttctga 8941 gttccacctg caaaacagca tgcacattat cgctgcatgt tgtttccatc atcaaaagtc 9001 tgtcggacat cgacagactt aactcccccg gataatacag aaaacctgta aaggattgac 9061 cggaaacaac aaagatcaaa ttttgatcgc aaagttcaaa atacaaacac caagtgatca 9121 atatctgacc aatcaatcac ttgtgcttat cttttttttt cattttgtta catctgtcat 9181 acaaatataa ctgacagtga ttatcattat acccttatca gttacgtacc atgactgata 9241 gttccccgtt gtaattaaat gctatcccat aaccacaact cagaaatatc ggagttcacg 9301 tatgaataaa atttattcac tgaaatatag tcatattaca ggtggattag ttgctgtttc 9361 tgaactgacc cggaaagtta gtgtcggtac atcaagaaag aaagttatcc tcggtattat 9421 tttatcctca atatatggaa gttatggcga aacagcattt gcagcaatgc tggatataaa 9481 taatatatgg acccgcgatt atcttgacct tgctcaaaac agaggagagt tcagaccggg 9541 tgcaacaaat gttcaattaa tgatgaaaga tggaaagata tttcattttc cagaactacc 9601 tgtacctgat ttttctgctg tttccaacaa aggtgcaaca acatcaattg gaggtgcgta 9661 cagtgttact gcgactcata acggtacaca gcatcatgca ataacaacac agtcatggga 9721 tcagacagca tataaagcaa gtaacagagt atcatctggc gacttttcgg ttcatcgtct 9781 gaataaattc gtcgtggaaa caacaggggt tacggagagt gccgacttct cactttctcc 9841 cgaagatgcg atgaaaagat atggcgtaaa ctacaacggt aaggaacaaa taattggctt 9901 cagagcaggt gccggaacaa cctcaacgat attaaacggc aaacaatatc tgtttggaca 9961 aaactataat cccgacttgt taagcgcaag tctttttaat ctggactgga aaaacaagag 10021 ttacatttat accaacagaa ccccttttaa aaactcacca atttttggcg atagtggttc 10081 tggttcttat ctatatgata aagaacaaca aaaatgggtt ttccatggtg ttaccagtac 10141 agttggtttt ctcagtagta ccaatatagc ctggacaaac tactcgttat ttaataatat 10201 tctggtaaac aatttaaaaa agaatttcac aaacactatg cagctggatg gtaaaaaaca 10261 agagttatca tcgattataa aagataagga cctgtctgtc tcaggaggag gggaattaac 10321 gctcaagcag gataccgatc ttggcattgg cgggcttata ttcgataaga accagacata 10381 taaagtgtac ggaaaagata agtcttataa aggtgccggg atagatattg ataataatac 10441 caccgttgaa tggaatgtta agggcgttgc cggagataat ctgcataaaa taggtagtgg 10501 tactctggat gtaaaaatag cacagggaaa taaccttaaa ataggtaatg ggactgtcat 10561 ccttagtgct gaaaaagcct tcaataaaat ttacatggcc ggaggtaaag gtacggtaaa 10621 aataaatgcc aaagacgctt taagcgaaag cggtaatggc gaaatctatt ttaccagaaa 10681 tggcggaaca ctggatctaa acggctatga ccagtcattt cagaaaatcg cagcaacaga 10741 tgcgggaaca accgtaacga actcaaacgt gaagcaatca acattatcac ttactaatac 10801 tgatgcatat atgtaccatg ggaatgtatc aggtaatata agcataaatc atattatcaa 10861 tactacccag caacataaca ataatgccaa tctgatcttt gatggctcag tcgatatcaa 10921 aaacgatatc tctgtccgga atgcacagtt aacattacaa ggacatgcga cagaacatgc 10981 catatttaaa gaaggcaata acaactgtcc aattcctttt ttatgtcaaa aagactattc 11041 tgctgccata aaggaccagg aaagcactgt aaataaacgt tacaatacgg aatataagtc 11101 caacaatcag atagcctctt tttcccagcc cgactgggaa agtcgtaaat ttaatttccg 11161 gaaattaaat ttagaaaacg caaccctgag tataggccgg gatgctaatg taaaaggaca 11221 catagaggct aaaaactctc aaattgttct gggaaataaa actgcataca ttgacatgtt 11281 ctcaggaaga aacattactg gcgaaggttt tggattcaga caacagcttc gctccgggga 11341 ttcagcaggc gaaagtagtt tcaacggcag tctgagtgct caaaacagca aaataactgt 11401 tggtgataaa tcaactgtta ctatgactgg tgcattatcc ttaattaata cagacctgat 11461 tatcaacaaa ggagctactg ttaccgccca gggaaaaatg tatgtagata aagctattga 11521 actggccgga accctgacat taacaggcac ccctacagaa aataataaat acagcccggc 11581 aatctatatg tcagatggat ataatatgac agaagatggt gccacgttaa aggctcaaaa 11641 ttatgcctgg gtcaatggta atataaaatc agacaaaaaa gcatctattc tgtttggtgt 11701 tgaccagtat aaagaagata acctggacaa aaccacacac acaccgctgg ctacaggttt 11761 gctgggtggc tttgatactt cttataccgg aggtattgat gctcctgctg cctcagccag 11821 catgtataac accttatgga gagtaaacgg acagtcagcc ctgcaatcat taaaaacccg 11881 cgacagtctt ttgttgttta gtaacataga gaattcgggt ttccatactg tgacagtaaa 11941 cacactggat gccactaata ctgctgtgat tatgcgggct gatctgagcc agtctgtaaa 12001 tcaatcggat aaactcattg ttaaaaatca gttaaccgga agcaataaca gtctgtcggt 12061 cgatatacag aaagtgggaa ataataactc aggattaaac gttgacctga taacagcccc 12121 aaaaggaagc aataaagaga tatttaaagc cagtactcag gccataggtt tcagcaacat 12181 atctcctgtg atcagcacga aagaggatca ggaacatacc acgtggaccc tgaccggata 12241 taaggtggct gaaaatacag catcttccgg tgcagcaaaa tcgtatatgt ccggtaatta 12301 caaagccttc ctgacagaag tcaacaacct gaataaacga atgggggatc tgcgtgacac 12361 caatggcgag gccggtgcat gggcccgcat catgagcgga gcaggttcag cttctagtgg 12421 atacagtgac aactacaccc atgtgcagat tggtgtggat aaaaaacatg agctggatgg 12481 acttgacctt ttcactggtc tgactatgac gtataccgac agtcatgcca gcagtaatgc 12541 attcagtggc aagacgaagt ccgtcggggc aggtctgtat gcttccgcta tatttgactc 12601 tggtgcctat atcgacctga ttagtaagta tgttcaccat gataatgagt actcggcgac 12661 ctttgctgga ctcggaacaa aagactacag ttctcattcc ttgtatgtgg gtgctgaagc 12721 aggctaccgc tatcatgtaa cagaagactc ctggattgag ccgcaggcag aactggttta 12781 tggggccgta tcaggtaaac ggttcgactg gcaggatcgc ggaatgagcg tgaccatgaa 12841 ggataaggac tttaatccgc tgattgggcg taccggtgtt gatgtgggta aatccttctc 12901 cggtaaggac tggaaagtca cagcccgcgc cggccttggc taccagtttg acctgtttgc 12961 caacggtgaa accgtactgc gtgatgcgtc cggtgagaaa cgtatcaaag gtgaaaaaga 13021 cggtcgtatt ctcatgaatg ttggtctcaa cgccgaaatt cgcgataatc ttcgcttcgg 13081 tcttgagttt gagaaatcgg catttggtaa atacaacgtg gataacgcga tcaacgccaa 13141 cttccgttac tctttctgat aacagcccgg gccgcgtttg cggcccttct tctaccgtag 13201 agaatatgta ttaccctgtg acagactata tcgctcttgc tctcattatt agctttcttt 13261 ttctgacatt atttatctgc ctgttatgcc ttaaacatga gcgaataaaa aaagaaacta 13321 tcaggcaaaa aaatgcacat attctggagc atggctggaa tgcaactgag ttctcatggt 13381 tccgatacgg acagtataac gaaacgggta tttacatctc gatcgagaag accattatca 13441 ttacaatcac cgtttctggt gagtgcttta aaaaagaata cagcattgtc tcccacatgc 13501 tggtcactga caccataaca gaggccaccc tgtacgaaaa cgggctgtat acgcgccaca 13561 tcagactcag tcgcccagta tctgacagca aatacccctt gccacccgga tctcagctca 13621 ttaaaaatat gaccttacga ctgcgcctgc aggatcagca ggaagaaaca tccgtgacat 13681 tatttcaggg aaaaatgagt accgacggaa ataactatta catcatcaaa ggtaaagtaa 13741 gctcagtact tctgttatta aaaatgctgc agattaatca cgcctgatct ctgtcccggt 13801 attttgaggt gtactgaggt gtactggcaa tagcggacac taccatttgt tcttttttta 13861 agcagccatc tgatgatatt tttccctgaa ggctgccggg gagatattcc ccagacgaga 13921 gtgacgacgc tgacgattgt agaaaatctc aatgtattcc cgtattactg agatggcttc 13981 atcccggtta ttaaaacgat agtggctcag gctctcattt ttcggtgatg ctgccaactt 14041 actgatttag tgtatgatgg tgtttttgag gtgctccagt ggcttctgtt tctatcagct 14101 gtccctcctg ttcagctact gacggggtgg tgcgtaacgg caaaagcact gccggacatc 14161 agcgctatct ctgctctcac tgccgtaaaa catggcaact gcagttcact tacaccgctt 14221 ctcaacccgg tacgcaccag aaaatcattg atatggccat gaatggcgtt ggatgccggg 14281 caactgcacg cattatgggc gttggcctca acacgatttt acgtcactta aaaaactcag 14341 gccgcagtcg gtaacctcgc gcatacagcc gggcagtgac gtcatcgtct gcgcggaaat 14401 ggacgaacag tggggctatg tcggggctaa atcgcgccag cgctggctgt tttacgcgta 14461 tgacagtctc cggaagacgg ttgttgcgca cgtattcggt gaacgcacta tggcgacgct 14521 ggggcgtctt atgagcctgc tgtcaccctt tgacgtggtg atatggatga cggatggctg 14581 gccgctgtat gaatcccgcc tgaagggaaa gctgcacgta atcagcaagc gatatacgca 14641 gcgaattgag cggcataacc tgaatctgag gcagcacctg gcacggctgg gacggaagtc 14701 gctgtcgttc tcaaaatcgg tggagctgca tgacaaagtc atcgggcatt atctgaacat 14761 aaaacactat caataagttg gagtcattac ccatttttca gcgttcccca gaagctttcc 14821 atcggagcgt tgtcgtaaca gttaccttta cgcgacattg atgttttcag accagactgc 14881 tcctgtatga cccggtaatc gtatgcgcag tactgtgaac ctcgatcaga gtggtggatt 14941 agcccggcag gtgggcgctg gctcctgagc gccataaaca gggctttacc tgtcagctct 15001 tttgtcatgc gctctcccat ggcgtagccg acaatttcgc acgtataaac atctttgatg 15061 ccagcgaggt acaaccatcc ctcctgtgtg gcaacatacg tcaggtccgc cacccagacc 15121 tgatttggtg ctgtaggagc gaacgtctgg ttcagcagat ttggcgcaac tggcagattg 15181 tggttcgagt tcgtagtcgc tctgaacttg cgtttctgct tacagcgtag ccttagctcc 15241 ttacgaagac gtgccagtcg gtcacgacca acgatgatgc cattctctgc cagctccgtc 15301 tggagccgcc gggttccata tgtttcgcga gtgcggatat gtgccacctt aatctccagt 15361 tttagccgct catcactttg ttttctgtct gagggttcat gctgtaccca gttgtaataa 15421 ccgctcctgg atacaccaaa tacctgacac atcgcttcaa tgggaaattg ttgtcgccat 15481 tgttcgatta acgcgtattt ttcagcgact cctgtgcaaa atacgctgtt gcttttttta 15541 atatatctcg ctcaaggcga gcttcattta acgccttacg cagttgcaga atttcagatt 15601 ccagttcagc caccgtgcgg gaaccaggag taccgagccc ttttctggcg gcggtaaccc 15661 attgtcctaa agtgccttca ggaagagata atcgggaagc gccttcactg atcgaaagtt 15721 gattttcaag aaccgttctg acagcttcag ctttgaactc tttagagtaa cgttgggttt 15781 ttctgctcat tattagctcc ttctgatgcc attctatttc aggaaggagt gtccgttaaa 15841 ctcaggctac ctcacagttt agtgtcagtg ccccgggcca tcattcatgg aaacgagact 15901 ttgatccgga cagccccgag ataaaattca gtgatcgcag ggaagccctt aaactgatga 15961 cgcaaatcga atcgaccgtg accagcctgc accgtgaggc tgaagcacag ttccggcctg 16021 aactggaaaa aatcgtcagg ggtatcgaaa caggttttcg tggtacggcc ctgtatgcca 16081 cagaaaatat tgccggacgt atcaatgccc gcctggagga cgagggcttc actgtaaaaa 16141 tcactttccc ggcagtcagc cagttacaga ccaggatcgc ggtaaaaaca aatctgagtg 16201 cgcttatgga ggaaagaact gagacagtca cccgtcgccg tcggcagagt ggtttctggg 16261 gaaagatttg tggagcgttt ggcaccagtg actggggctg ggaaaactat aaggagaatg 16321 tgagccgcag tgtgatcaat atcaactcgg tcaggaagga agtgatgtca ctgacccggg 16381 catatttcgg ggagttgcag gcatccattg agcagaatat taaccagccc gtccgccagg 16441 agatcgatga ttttttctgt acattcaggg aaaaagtcga acagctacgt aacacactca 16501 ttcagagctc tgaagatcat aaacgcgatc agcaggcaca ggagcagctt acggagcgcc 16561 ttcaggcatt aaatgaaagg gttcccgagc tgattactga cagtaaggcg ttaagggaag 16621 aactggagac actgctgtga cctcaccatt tattcaacaa attgctgata accgggtatg 16681 tcaggtactc agttgccttc ctgaaaaatt tgtggttgat tttgcaaacg gtattgatgt 16741 tgcacaggag cataaccgca cggccggggg gcgcacgttt ttccggcgtt taaaagaagg 16801 tctgactggc aaaggcgctg tacgccagaa tgccatcaat gcttcgcttg cacagggcgt 16861 tgaggcgtcc ctgcgctggc tgacggagct gacaacgtcg cttgccacca caaattacgc 16921 gattacccgg gtaaacgaca gggtcagttc actggtcagt gatactgcca ggctggcgca 16981 ttattcggca gacacgcggg agcagttact caccctggca gaacaggttc atcagaaact 17041 gaatcatctt gaagaaaaac tccaccgtgt tgaccaggtt cagcgggcac agttacatct 17101 tgagcagata ttctcatggt ggagcgccgg gcgatacgcg tcattttccc ctgccggacg 17161 ttgttatgtg gcgcttgaag agcttcgctg gggcgcgttt ggtgatgtga tacgtcaggg 17221 tgaaacaggc caggttaacc agttactgga tatcctcaga cataaagcat taacgcaaat 17281 ggcacaggag aatggcggta gcgcaacagt acgtctgaac acactggact ggcttggtgg 17341 tcagagccgg gaacaggctg ataacgaatg gcatgaagcg gttaactggc tgggggactg 17401 gtgcagtgaa gagcggcatc ctgtgatctg gtcaaccaca caggctgcag aacacttacc 17461 ggttcgtatg ccccgtctct gctctgcaga acgcctctct gaaagtatgg ttgatgaaat 17521 atttcagaag ggggaggcat gagtacagaa atgaaaacgg ggctggtgct gtccggcggc 17581 ggggcggtgg gcgcttatca ggcgggagtg gttaaggcac tggcagagtg cggcgcacag 17641 atcagtatgg tctcaggagc cagtatcggg gctctcaatg gtgccattat cacggcctct 17701 cccgatctgt cagaagccgc cttacgcctg gaggcgctct gggatcatct ggggaataat 17761 caggttctgt cggtaaacag atcggtttac ttttcattgc tgaaaaaact ggttcaggcc 17821 atgaacctct gccagatccc cggacgtgcg ggcgcactgc ttacgacgct ttttcgccat 17881 atatcgacaa tcaacgggtt cgacaatccg atgattcagc cgctgttgtc agatgagccc 17941 ctgacagcgc tgatggatca ttatcttgat acagatgctc tggccgacgg gctaccgctg 18001 tacgtgtcgc tgtaccccac agaaggaggc atgcaggata ttattgactg cattcgtgct 18061 gaactgggtg ccggaaccac gaaaaacgct gtttttcagc atatccagag tctgccccgc 18121 gggcagcaga aagaagcctt actggcgtca gccgcgctgc ctctgctgtt ccgtccccgc 18181 gaggttcagg gaacaatgta cggtgatggt gggatgggcg gatggcgaaa caggcagggg 18241 aatactcccg tgacgccact ggtggatgcc ggatgcaata tggtgattgt gacgcatctg 18301 agtgacggtt ctttatggga tcgtcgggct tatccggaca ccacaatcct tgaaatccgt 18361 ccccggaaaa ggttgaaaca aatcggggac gaaggcaaaa gtggcggtct gctcagtttt 18421 acatcggcac ataccgacgc ctggcgtcag cagggctacg aggacacgat gctgacgatg 18481 gagcatatcc ggaaaccgct ggcagcacgt caggcactga cccggtcaga gacggtattg 18541 cagaaaagcc tggagataac tgaaggtgca gattcggcac tgagaaacgc gatggcccgg 18601 attaaataaa gacgctccgg agaaacgcca ccgccactgt ggtggtgttt ttcctgtctg 18661 gtattattga aaatattttc agatatatat tacctggatg atggttaaga ttaatttatt 18721 taattttggg gagaaaactc atgaaactgt taaacattct tttagctgtt gttt //