Sequence of DPV Grapevine leafroll-associated virus 2

Grapevine leafroll-associated virus 2 methyltransferase/helicase polyprotein gene, partial cds; and RNA-dependent RNA polymerase, putative transmembrane small hydrophobic protein, 65 kDa chaperone protein, 63 kDa protein, 25 kDa diverged coat protein, 22 kDa coat protein, 19 kDa protein, and 24 kDa protein genes, complete cds.

ACC No: AF039204

Dated: 2005-04-15 | Length: 15000 | CRC: 1303634693

                !!NA_SEQUENCE 1.0
ID   AF039204   standard; genomic RNA; VRL; 15000 BP.
XX
AC   AF039204;
XX
SV   AF039204.1
XX
DT   11-MAY-1998 (Rel. 55, Created)
DT   15-APR-2005 (Rel. 83, Last updated, Version 4)
XX
DE   Grapevine leafroll-associated virus 2 methyltransferase/helicase
DE   polyprotein gene, partial cds; and RNA-dependent RNA polymerase, putative
DE   transmembrane small hydrophobic protein, 65 kDa chaperone protein, 63 kDa
DE   protein, 25 kDa diverged coat protein, 22 kDa coat protein, 19 kDa protein,
DE   and 24 kDa protein genes, complete cds.
XX
KW   .
XX
OS   Grapevine leafroll-associated virus 2
OC   Viruses; ssRNA positive-strand viruses, no DNA stage; Closteroviridae;
OC   Closterovirus.
XX
RN   [1]
RP   1-15000
RX   MEDLINE; 98264507.
RX   PUBMED; 9603345.
RA   Zhu H.Y., Ling K.S., Goszczynski D.E., McFerson J.R., Gonsalves D.;
RT   "Nucleotide sequence and genome organization of grapevine
RT   leafroll-associated virus-2 are similar to beet yellows virus, the
RT   closterovirus type member";
RL   J. Gen. Virol. 79(Pt 5):1289-1298(1998).
XX
RN   [2]
RP   1-15000
RA   Zhu H.Y., Ling K.S., Gonsalves D.;
RT   ;
RL   Submitted (18-DEC-1997) to the EMBL/GenBank/DDBJ databases.
RL   Plant Pathology, Cornell University, New York State Agricultural Experiment
RL   Station, Geneva, NY 14456, USA
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .15000
FT                   /db_xref="taxon:64003"
FT                   /mol_type="genomic RNA"
FT                   /organism="Grapevine leafroll-associated virus 2"
FT                   /isolate="Pinot Noir"
FT   CDS             <1. .7423
FT                   /codon_start=2
FT                   /db_xref="GOA:O71209"
FT                   /db_xref="InterPro:IPR000606"
FT                   /db_xref="InterPro:IPR002588"
FT                   /db_xref="InterPro:IPR008749"
FT                   /db_xref="UniProt/TrEMBL:O71209"
FT                   /note="larger than 277 kDa; ORF1a; contains domains for two
FT                   papain-like leader proteases, a methyltransferase and a
FT                   helicase; identified by sequence comparison"
FT                   /function="replication and unknown functions"
FT                   /product="methyltransferase/helicase polyprotein"
FT                   /protein_id="AAC40855.1"
FT                   /translation="ADYVAMLRYVCGGKFPLVLMSRVIYPDGRCYLAHMRYLCAFYCRP
FT                   FRESDYALGMWPTVARLRACVEKNFGVEACGIALRGYYTSRNVYHCDYDSAYVKYFRNL
FT                   SGRIGGGSFDPTSLTSVITVKISGLPGGLPKNIAFGAFLCDIRYVEPVDSGGIQSSVKT
FT                   KREDAHRTVEERAAGGSVEQPRQKRIDEKGCGRVPSGGFSHLLVGNLNEVRRKVAAGLL
FT                   RFRVGGDMDFHRSFSTQAGHRLLVWRRSSRSVCLELYSPSKNFLRYDVLPCSGDYAAMF
FT                   SFAAGGRFPLVLMTRIRYPNGFCYLAHCRYACAFLLRGFDPKRFDIGAFPTAAKLRNRM
FT                   VSELGERSLGLNLYGAYTSRGVFHCDYDAKFIKDLRLMSAVIAGKDGVEEVVPSDITPA
FT                   MKQKTIEAVYDRLYGGTDSLLKLSIEKDLIDFKNDVQSLKKDRPIVKVPFYMSEATQNS
FT                   LTRFYPQFELKFSHSSHSDHPAAAASRLLENETLVRLCGNSVSDIGGCPLFHLHSKTQR
FT                   RVHVCRPVLDGKDAQRRVVRDLQYSNVRLGDDDKILEGPRNIDICHYPLGACDHESSAM
FT                   MMVQVYDASLYEICGAMIKKKSRITYLTMVTPGEFLDGRECVYMESLDCEIEVDVHADV
FT                   VMYKFGSSCYSHKLSIIKDIMTTPYLTLGGFLFSVEMYEVRMGVNYFKITKSEVSPSIS
FT                   CTKLLRYRRANSDVVKVKLPRFDKKRRMCLPGYDTIYLDSKFVSRVFDYVVCNCSAVNS
FT                   KTFEWVWSFIKSSKSRVIISGKIIHKDVNLDLKYVESFAAVMLASGVRSRLASEYLAKN
FT                   LSHFSGDCSFIEATSFVLREKIRNMTLNFNERLLQLVKRVAFATLDVSFLDLDSTLESI
FT                   TDFAECKVAIELDELGCLRAEAENEKIRNLAGDSIAAKLASEIVVDIDSKPSPKQVGNS
FT                   SSENADKREVQRPGLRGGSRNGVVGEFLHFVVDSALRLFKYATDQQRIKSYVRFLDSAV
FT                   SFLDYNYDNLSFILRVLSEGYSCMFAFLANRGDLSSRVRSAVCAVKEVATSCANASVSK
FT                   AKVMITFAAAVCAMMFNSCGFSGDGREYKSYIHRYTQVLFDTIFFEDSSYLPIEVLSSA
FT                   ICGAIVTLFSSGSSISLNAFLLQITKGFSLEVVVRNVVRVTHGLSTTATDGVIRGVFSQ
FT                   IVSHLLVGNTGNVAYQSAFIAGVVPLLVKKCVSLIFILREDTYSGFIKHGISEFSFLSS
FT                   ILKFLKGKLVDELKSIIQGVFDSNKHVFKEATQEAIRTTVMQVPVAVVDALKSAAGKIY
FT                   NNFTSRRTFGKDEGSSSDGACEEYFSCDEGEGPGLKGGSSYGFSILAFFSRIMWGARRL
FT                   IVKVKHECFGKLFEFLSLKLHEFRTRVFGKNRTDVGVYDFLPTGIVETLSSIEECDQIE
FT                   ELLGDDLKGDKDASLTDMNYFEFSEDFLASIEEPPFAGLRGGSKNIAILAILEYAHNLF
FT                   RIVASKCSKRPLFLAFAELSSALIEKFKEVFPRKSQLVAIVREYTQRFLRSRMRALGLN
FT                   NEFVVKSFADLLPALMKRKVSGSFLASVYRPLRGFSYMCVSAERREKFFALVCLIGLSL
FT                   PFFVRIVGAKACEELVSSARRFYERIKIFLRQKYVSLSNFFCHLFSSDVDDSSASAGLK
FT                   GGASRMTLFHLLVRLASALLSLGWEGLKLLLSHHNLLFLCFALVDDVNVLIKVLGGLSF
FT                   FVQPIFSLFAAMLLQPDRFVEYSEKLVTAFEFFLKCSPRAPALLKGFFECVANSTVSKT
FT                   VRRLLRCFVKMLKLRKGRGLRADGRGLHRQKAVPVIPSNRVVTDGVERLSVKMQGVEAL
FT                   RTELRILEDLDSAVIEKLNRRRNRDTNDDEFTRPAHEQMQEVTTFCSKANSAGLALERA
FT                   VLVEDAIKSEKLSKTVNEMVRKGSTTSEEVAVALSDDEAVEEISVADERDDSPKTVRIS
FT                   EYLNRLNSSFEFPKPIVVDDNKDTGGLTNAVREFYYMQELALFEIHSKLCTYYDQLRIV
FT                   NFDRSVAPCSEDAQLYVRKNGSTIVQGKEVRLHIKDFHDHDFLFDGKISINKRRRGGNV
FT                   LYHDNLAFLASNLFLAGYPFSRSFVFTNSSVDILLYEAPPGGGKTTTLIDSFLKVFKKG
FT                   EVSTMILTANKSSQVEILKKVEKEVSNIECQKRKDKRSPKKSIYTIDAYLMHHRGCDAD
FT                   VLFIDECFMVHAGSVLACIEFTRCHKVMIFGDSRQIHYIERNELDKCLYGDLDRFVDLQ
FT                   CRVYGNISYRCPWDVCAWLSTVYGNLIATVKGESEGKSSMRINEINSVDDLVPDVGSTF
FT                   LCMLQSEKLEISKHFIRKGLTKLNVLTVHEAQGETYARVNLVRLKFQEDEPFKSIRHIT
FT                   VALSRHTDSLTYNVLAARRGDATCDAIQKAAELVNKFRVFPTSFGGSVINLNVKKDVED
FT                   NSRCKASSAPLSVINDFLNEVNPGTAVIDFGDLSADFSTGPFECGASGIVVRDNISSSN
FT                   ITDHDKQRV"
FT   CDS             <7422. .8801
FT                   /codon_start=1
FT                   /db_xref="GOA:O71210"
FT                   /db_xref="InterPro:IPR001788"
FT                   /db_xref="InterPro:IPR007094"
FT                   /db_xref="InterPro:IPR007095"
FT                   /db_xref="UniProt/TrEMBL:O71210"
FT                   /note="RdRp; 52 kDa; similar to RNA polymerases of other
FT                   closteroviruses; presumably expressed via +1 ribosomal
FT                   frameshift"
FT                   /function="replication"
FT                   /product="RNA-dependent RNA polymerase"
FT                   /protein_id="AAC40856.1"
FT                   /translation="SVVRSQAIPRRKPSLQENLYSFEARNYNFSTCDRNTSASMFGEAM
FT                   AMNCLRRCFDLDAFSSLRDDVISITRSGIEQWLEKRTPSQIKALMKDVESPLEIDDEIC
FT                   RFKLMVKRDAKVKLDSSCLTKHSAAQNIMFHRKSINAIFSPIFNEVKNRIMCCLKPNIK
FT                   FFTEMTNRDFASVVSNMLGDDDVYHIGEVDFSKYDKSQDAFVKAFEEVMYKELGVDEEL
FT                   LAIWMCGERLSIANTLDGQLSFTIENQRKSGASNTWIGNSLVTLGILSLYYDVRNFEAL
FT                   YISGDDSLIFSRSEISNYADDICTDMGFETKFMSPSVPYFCSKFVVMCGHKTFFVPDPY
FT                   KLFVKLGAVKEDVSMDFLFETFTSFKDLTSDFNDERLIQKLAELVALKYEVQTGNTTLA
FT                   LSVIHCLRSNFLSFSKLYPRVKGWQVFYTSVKKALLKSGCSLFDSFMTPFGQAVMVWDD
FT                   E"
FT   CDS             8865. .9035
FT                   /codon_start=1
FT                   /db_xref="GOA:O39853"
FT                   /db_xref="UniProt/TrEMBL:O39853"
FT                   /note="6 kDa; probably membrane-associated; similar to
FT                   small hydrophobic proteins of other closteroviruses"
FT                   /product="putative transmembrane small hydrophobic protein"
FT                   /protein_id="AAC40857.1"
FT                   /translation="MNQVLQFECLFLLNLAVFAVTFIFILLVFRVIKSFRQKGHEAPVP
FT                   VVRGGGFSTVV"
FT   CDS             9051. .10850
FT                   /codon_start=1
FT                   /db_xref="GOA:O71211"
FT                   /db_xref="HSSP:1DKG"
FT                   /db_xref="InterPro:IPR001023"
FT                   /db_xref="UniProt/TrEMBL:O71211"
FT                   /note="p65; HSP70; similar to heat shock 70 proteins;
FT                   identified by sequence comparison"
FT                   /product="65 kDa chaperone protein"
FT                   /protein_id="AAC40858.1"
FT                   /translation="MVVFGLDFGTTFSTVCVYKDGRVFSFKQNNSAYIPTYLYLFSDSN
FT                   HMTFGYEAESLMSNLKVKGSFYRDLKRWVGCDSSNLDAYLDRLKPHYSVRLVKIGSGLN
FT                   ETVSIGNFGGTVKSEAHLPGLIALFIKAVISCAEGAFACTCTGVICSVPANYDSVQRNF
FT                   TDQCVSLSGYQCVYMINEPSAAALSACNSIGKKSANLAVYDFGGGTFDVSIISYRNNTF
FT                   VVRASGGDLNLGGRDVDRAFLTHLFSLTSLEPDLTLDISNLKESLSKTDAEIVYTLRGV
FT                   DGRKEDVRVNKNILTSVMLPYVNRTLKILESTLKSYAKSMNESARVKCDLVLIGGSSYL
FT                   PGLADVLTKHQSVDRILRVSDPRAAVAVGCALYSSCLSGSGGLLLIDCAAHTVAIADRS
FT                   CHQIICAPAGAPIPFSGSMPLYLARVNKNSQREVAVFEGEYVKCPKNRKICGANIRFFD
FT                   IGVTGDSYAPVTFYMDFSISSVGAVSFVVRGPEGKQVSLTGTPAYNFSSVALGSRSVRE
FT                   LHISLNNKVFLGLLLHRKADRRILFTKDEAIRYADSIDIADVLKEYKSYAASALPPDED
FT                   VELLLGKSVQKVLRGSRLEEIPL"
FT   CDS             10777. .12432
FT                   /codon_start=1
FT                   /db_xref="InterPro:IPR004909"
FT                   /db_xref="UniProt/TrEMBL:O71212"
FT                   /note="p63; putative heat shock protein 90 homolog"
FT                   /product="63 kDa protein"
FT                   /protein_id="AAC40859.1"
FT                   /translation="MSNYSWESLFKKFYGEADWKKYLSRSIAAHSSEIKTLPDIRLYGG
FT                   RVVKKSEFESALPNSFEQELGLFILSEREVGWSKLCGITVEEAAYDLTNPKAYKFTAET
FT                   CSPDVKGEGQKYSMEDVMNFMRLSNLDVNDKMLTEQCWSLSNSCGELINPDDKGRFVAL
FT                   TFKDRDTADDTGAANVECRVGDYLVYAMSLFEQRTQKSQSGNISLYEKYCEYIRTYLGS
FT                   TDLFFTAPDRIPLLTGILYDFCKEYNVFYSSYKRNVDNFRFFLANYMPLISDVFVFQWV
FT                   KPAPDVRLLFELSAAELTLEVPTLSLIDSQVVVGHILRYVESYTSDPAIDALEDKLEAI
FT                   LKSSNPRLSTAQLWVGFFCYYGEFRTAQSRVVQRPGVYKTPDSVGGFEINMKDVEKFFD
FT                   KLQRELPNVSLRRQFNGARAHEAFKIFKNGNISFRPISRLNVPREFWYLNIDYFRHANR
FT                   SGLTEEEILILNNISVDVRKLCAERACNTLPSAKRFSKNHKSNIQSSRQERRIKDPLVV
FT                   LKDTLYEFQHKRAGWGSRSTRDLGSRADHAKGSG"
FT   CDS             12344. .13015
FT                   /codon_start=1
FT                   /db_xref="GOA:O39856"
FT                   /db_xref="InterPro:IPR002679"
FT                   /db_xref="UniProt/TrEMBL:O39856"
FT                   /note="p25; CPd; coat protein duplicate"
FT                   /product="25 kDa diverged coat protein"
FT                   /protein_id="AAC40860.1"
FT                   /translation="MSSNTSVPVGGLEALETSGVVLTTRKEAVDKFFNELKNENYSSVD
FT                   SSRLSDSEVKEVLEKSKESFKSELASTDEHFVYHIIFFLIRCAKISTSEKVKYVGSHTY
FT                   VVDGKTYTVLDAWVFNMMKSLTKKYKRVNGLRAFCCACEDLYLTVAPIMSERFKTKAVG
FT                   MKGLPVGKEYLGADFLSGTSKLMSDHDRAVSIVAAKNAVDRSAFTGGERKIVSLYDLGR
FT                   Y"
FT   CDS             13084. .13680
FT                   /codon_start=1
FT                   /db_xref="GOA:O71213"
FT                   /db_xref="InterPro:IPR002679"
FT                   /db_xref="UniProt/TrEMBL:O71213"
FT                   /evidence=EXPERIMENTAL
FT                   /note="p22"
FT                   /product="22 kDa coat protein"
FT                   /protein_id="AAC40861.1"
FT                   /translation="MELMSDSNLSNLVITDASSLNGVDKKLLSAEVEKMLVQKGAPNEG
FT                   IEVVFGLLLYALAARTTSPKVQRADSDVIFSNSFGERNVVVTEGDLKKVLDGCAPLTRF
FT                   TNKLRTFGRTFTEAYVDFCIAYKHKLPQLNAAAELGIPAEDSYLAADFLGTCPKLSELQ
FT                   QSRKMFASMYALKTEGGVVNTPVSNLRQLGRREVM"
FT   CDS             13680. .14165
FT                   /codon_start=1
FT                   /db_xref="UniProt/TrEMBL:O39858"
FT                   /note="p19"
FT                   /product="19 kDa protein"
FT                   /protein_id="AAC40862.1"
FT                   /translation="MEDYEEKSESLILLRTNLNTMLLVVKSDASVELPKLLICGYLRVS
FT                   GRGEVTCCNREELTRDFEGNHHTVIRSRIIQYDSESAFEEFNNSDCVVKFFLETGSVFW
FT                   FFLRSETKGRAVRHLRTFFEANNFFFGSHCGTMEYCLKQVLTETESIIDSFCEERNR"
FT   CDS             14167. .14784
FT                   /codon_start=1
FT                   /db_xref="UniProt/TrEMBL:O71214"
FT                   /note="p24"
FT                   /product="24 kDa protein"
FT                   /protein_id="AAC40863.1"
FT                   /translation="MRVIVSPYEAEDILKRSTDMLRNIDSGVLSTKECIKAFSTITRDL
FT                   HCAKASYQWGVDTGLYQRNCAEKRLIDTVESNIRLAQPLVREKVAVHFCKDEPKELVAF
FT                   ITRKYVELTGVGVREAVKREMRSLTKTVLNKMSLEMAFYMSPRAWKNAEWLELKFSPVK
FT                   IFRDLLLDVETLNELCAEDDVHVDKVNENGDENHDLELQDEC"
FT   3'UTR           14785. .15000
FT                   /note="similar to 3'UTRs of other monopartite
FT                   closteroviruses"
XX
SQ   Sequence 15000 BP; 3877 A; 3050 C; 3820 G; 4253 T; 0 other;

AF039204  Length: 15000  April 19, 2005 08:57  Type: N  Check: 7450  ..

       1  ggcggattac gtggcgatgc tgcgttatgt gtgtggcggg aaatttccac
      51  tcgtcctcat gagtagagtt atttacccgg atgggcgctg ttacttggcc
     101  catatgaggt atttgtgcgc cttttactgt cgcccgttta gagagtcgga
     151  ttatgccctc ggaatgtggc ctacggtggc gcgtctcagg gcatgcgttg
     201  agaagaactt cggtgtcgaa gcttgtggca tagctcttcg tggctattac
     251  acctctcgca atgtttatca ctgtgattat gactctgctt atgtaaaata
     301  ttttagaaac ctttccggcc gcattggcgg tggttcgttc gatccgacat
     351  ctttaacctc cgtaataacg gtgaagatta gcggtcttcc aggtggtctt
     401  cctaaaaata tagcgtttgg tgccttcctg tgcgatatac gttacgtcga
     451  accggtagac tcgggcggca ttcaatcgag cgttaagacg aaacgtgaag
     501  atgcgcaccg aaccgtagag gaacgggcgg ccggcggatc cgtcgagcaa
     551  ccgcgacaaa agaggataga tgagaaaggt tgcggcagag ttcctagtgg
     601  aggtttttcg catctcctgg tcggcaacct taacgaagtt aggaggaagg
     651  tagctgccgg acttctacgc tttcgcgttg gcggtgatat ggattttcat
     701  cgctcgttct ccacccaagc gggccaccgc ttgctggtgt ggcgccgctc
     751  gagccggagc gtgtgccttg aactttactc accatctaaa aactttttgc
     801  gttacgatgt cttgccctgt tctggagact atgcagcgat gttttctttc
     851  gcggcgggcg gccgtttccc tttagttttg atgactagaa ttagataccc
     901  gaacgggttt tgttacttgg ctcactgccg gtacgcgtgc gcgtttctct
     951  taaggggttt tgatccgaag cgtttcgaca tcggtgcttt ccccaccgcg
    1001  gccaagctca gaaaccgtat ggtttcggag cttggtgaaa gaagtttagg
    1051  tttgaacttg tacggcgcat atacgtcacg cggcgtcttt cactgcgatt
    1101  atgacgctaa gtttataaag gatttgcgtc ttatgtcagc agttatagct
    1151  ggaaaggacg gggtggaaga ggtggtacct tctgacataa ctcctgccat
    1201  gaagcagaaa acgatcgaag ccgtgtatga tagattatat ggcggcactg
    1251  actcgttgct gaaactgagc atcgagaaag acttaatcga tttcaaaaat
    1301  gacgtgcaga gtttgaagaa agatcggccg attgtcaaag tgccctttta
    1351  catgtcggaa gcaacacaga attcgctgac gcgtttctac cctcagttcg
    1401  aacttaagtt ttcgcactcc tcgcattcag atcatcccgc cgccgccgct
    1451  tctagactgc tggaaaatga aacgttagtg cgcttatgtg gtaatagcgt
    1501  ttcagatatt ggaggttgtc ctcttttcca tttgcattcc aagacgcaaa
    1551  gacgggttca cgtatgtagg cctgtgttgg atggcaagga tgcgcagcgt
    1601  cgcgtggtgc gtgatttgca gtattccaac gtgcgtttgg gagacgatga
    1651  taaaattttg gaagggccac gcaatatcga catttgccac tatcctctgg
    1701  gcgcgtgtga ccacgaaagt agtgctatga tgatggtgca ggtgtatgac
    1751  gcgtcccttt atgagatatg tggcgccatg atcaagaaga aaagccgcat
    1801  aacgtactta accatggtca cgcccggcga gtttcttgac ggacgcgaat
    1851  gcgtctacat ggagtcgtta gactgtgaga ttgaagttga tgtgcacgcg
    1901  gacgtcgtaa tgtacaaatt cggtagttct tgctattcgc acaagctttc
    1951  aatcatcaag gacatcatga ccactccgta cttgacacta ggtggttttc
    2001  tattcagcgt ggagatgtat gaggtgcgta tgggcgtgaa ttacttcaag
    2051  attacgaagt ccgaagtatc gcctagcatt agctgcacca agctcctgag
    2101  ataccgaaga gctaatagtg acgtggttaa agttaaactt ccacgtttcg
    2151  ataagaaacg tcgcatgtgt ctgcctgggt atgacaccat atacctagat
    2201  tcgaagtttg tgagtcgcgt tttcgattat gtcgtgtgta attgctctgc
    2251  cgtgaactca aaaactttcg agtgggtgtg gagtttcatt aagtctagta
    2301  agtcgagggt gattattagc ggtaaaataa ttcacaagga tgtgaatttg
    2351  gacctcaagt acgtcgagag tttcgccgcg gttatgttgg cctctggcgt
    2401  gcgcagtaga ctagcgtccg agtaccttgc taagaacctt agtcattttt
    2451  cgggagattg ctcctttatt gaagccacgt ctttcgtgtt gcgtgagaaa
    2501  atcagaaaca tgactctgaa ttttaacgaa agacttttac agttagtgaa
    2551  gcgcgttgcc tttgcgacct tggacgtgag ttttctagat ttagattcaa
    2601  ctcttgaatc aataactgat tttgccgagt gtaaggtagc gattgaactc
    2651  gacgagttgg gttgcttgag agcggaggcc gagaatgaaa aaatcaggaa
    2701  tctggcggga gattcgattg cggctaaact cgcgagcgag atagtggtcg
    2751  atattgactc taagccttca ccgaagcagg tgggtaattc gtcatccgaa
    2801  aacgccgata agcgggaagt tcagaggccc ggtttgcgtg gtggttctag
    2851  aaacggggtt gttggggagt tccttcactt cgtcgtggat tctgccttgc
    2901  gtcttttcaa atacgcgacg gatcaacaac ggatcaagtc ttacgtgcgt
    2951  ttcttggact cggcggtctc attcttggat tacaactacg ataatctatc
    3001  gtttatactg cgagtgcttt cggaaggtta ttcgtgtatg ttcgcgtttt
    3051  tggcgaatcg cggcgactta tctagtcgtg tccgtagcgc ggtgtgtgct
    3101  gtgaaagaag ttgctacctc atgcgcgaac gcgagcgttt ctaaagccaa
    3151  ggttatgatt accttcgcag cggccgtgtg tgctatgatg tttaatagct
    3201  gcggtttttc aggcgacggt cgggagtata aatcgtatat acatcgttac
    3251  acgcaagtat tgtttgacac tatctttttt gaggacagca gttacctacc
    3301  catagaagtt ctgagttcgg cgatatgcgg tgctatcgtc acacttttct
    3351  cctcgggctc gtccataagt ttaaacgcct tcttacttca aattaccaaa
    3401  ggattctccc tagaggttgt cgtccggaat gttgtgcgag tcacgcatgg
    3451  tttgagcacc acagcgaccg acggcgtcat acgtggggtt ttctcccaaa
    3501  ttgtgtctca cttacttgtt ggaaatacgg gtaatgtggc ttaccagtca
    3551  gctttcattg ccggggtggt gcctctttta gttaaaaagt gtgtgagctt
    3601  aatcttcatc ttgcgtgaag atacttattc cggttttatt aagcacggaa
    3651  tcagtgaatt ctctttcctt agtagtattc tgaagttctt gaagggtaag
    3701  cttgtggacg agttgaaatc gattattcaa ggggtttttg attccaacaa
    3751  gcacgtgttt aaagaagcta ctcaggaagc gattcgtacg acggtcatgc
    3801  aagtgcctgt cgctgtagtg gatgccctta agagcgccgc gggaaaaatt
    3851  tataacaatt ttactagtcg acgtaccttt ggtaaggatg aaggctcctc
    3901  tagcgacggc gcatgtgaag agtatttctc atgcgacgaa ggtgaaggtc
    3951  cgggtctgaa agggggttcc agctatggct tctcaatttt agcgttcttt
    4001  tcacgcatta tgtggggagc tcgtcggctt attgttaagg tgaagcatga
    4051  gtgttttggg aaactttttg aatttctatc gctcaagctt cacgaattca
    4101  ggactcgcgt ttttgggaag aatagaacgg acgtgggagt ttacgatttt
    4151  ttgcccacgg gcatcgtgga aacgctctca tcgatagaag agtgcgacca
    4201  aattgaagaa cttctcggcg acgacctgaa aggtgacaag gatgcttcgt
    4251  tgaccgatat gaattacttt gagttctcag aagacttctt agcctctatc
    4301  gaggagccgc ctttcgctgg attgcgagga ggtagcaaga acatcgcgat
    4351  tttggcgatt ttggaatacg cgcataattt gtttcgcatt gtcgcaagca
    4401  agtgttcgaa acgaccttta tttcttgctt tcgccgaact ctcaagcgcc
    4451  cttatcgaga aatttaagga ggttttccct cgtaagagcc agctcgtcgc
    4501  tatcgtgcgc gagtatactc agagattcct ccgaagtcgc atgcgtgcgt
    4551  tgggtttgaa taacgagttc gtggtaaaat ctttcgccga tttgctaccc
    4601  gcattaatga agcggaaggt ttcaggttcg ttcttagcta gtgtttatcg
    4651  cccacttaga ggtttctcat atatgtgtgt ttcagcggag cgacgtgaaa
    4701  agttttttgc tctcgtgtgt ttaatcgggt taagtctccc tttcttcgtg
    4751  cgcatcgtag gagcgaaagc gtgcgaagaa ctcgtgtcct cagcgcgtcg
    4801  cttttatgag cgtattaaaa tttttctaag gcagaagtat gtctctcttt
    4851  ctaatttctt ttgtcacttg tttagctctg acgttgatga cagttccgca
    4901  tctgcagggt tgaaaggtgg tgcgtcgcga atgacgctct tccaccttct
    4951  ggttcgcctt gctagtgccc tcctatcgtt agggtgggaa gggttaaagc
    5001  tactcttatc gcaccacaac ttgttatttt tgtgttttgc attggttgac
    5051  gatgtgaacg tccttatcaa agttcttggg ggtctttctt tctttgtgca
    5101  accaatcttt tccttgtttg cggcgatgct tctacaaccg gacaggtttg
    5151  tggagtattc cgagaaactt gttacagcgt ttgaattttt cttaaaatgt
    5201  tcgcctcgcg cgcctgcact actcaaaggg ttttttgagt gcgtggcgaa
    5251  cagcactgtg tcaaaaaccg ttcgaagact tcttcgctgt ttcgtgaaga
    5301  tgctcaaact tcgaaaaggg cgagggttgc gtgcggatgg taggggtctc
    5351  catcggcaga aagccgtacc cgtcatacct tctaatcggg tcgtgaccga
    5401  cggggttgaa agactttcgg taaagatgca aggagttgaa gcgttgcgta
    5451  ccgaattgag aatcttagaa gatttagatt ctgccgtgat cgaaaaactc
    5501  aatagacgca gaaatcgtga cactaatgac gacgaattta cgcgccctgc
    5551  tcatgagcag atgcaagaag tcaccacttt ctgttcgaaa gccaactctg
    5601  ctggtttggc cctggaaagg gcagtgcttg tggaagacgc tataaagtcg
    5651  gagaaacttt ctaagacggt taatgagatg gtgaggaaag ggagtaccac
    5701  cagcgaagaa gtggccgtcg ctttgtcgga cgatgaagcc gtggaagaaa
    5751  tctctgttgc tgacgagcga gacgattcgc ctaagacagt caggataagc
    5801  gaatacctaa ataggttaaa ctcaagcttc gaattcccga agcctattgt
    5851  tgtggacgac aacaaggata ccgggggtct aacgaacgcc gtgagggagt
    5901  tttattatat gcaagaactt gctcttttcg aaatccacag caaactgtgc
    5951  acctactacg atcaactgcg catagtcaac ttcgatcgtt ccgtagcacc
    6001  atgcagcgaa gatgctcagc tgtacgtacg gaagaacggc tcaacgatag
    6051  tgcagggtaa agaggtacgt ttgcacatta aggatttcca cgatcacgat
    6101  ttcctgtttg acggaaaaat ttctattaac aagcggcggc gaggcggaaa
    6151  tgttttatat cacgacaacc tcgcgttctt ggcgagtaat ttgttcttag
    6201  ccggctaccc cttttcaagg agcttcgtct tcacgaattc gtcggtcgat
    6251  attctcctct acgaagctcc acccggaggt ggtaagacga cgacgctgat
    6301  tgactcgttc ttgaaggtct tcaagaaagg tgaggtttcc accatgatct
    6351  taaccgccaa caaaagttcg caggttgaga tcctaaagaa agtggagaag
    6401  gaagtgtcta acattgaatg ccagaaacgt aaagacaaaa gatctccgaa
    6451  aaagagcatt tacaccatcg acgcttattt aatgcatcac cgtggttgtg
    6501  atgcagacgt tcttttcatc gatgagtgtt tcatggttca tgcgggtagc
    6551  gtactagctt gcattgagtt cacgaggtgt cataaagtaa tgatcttcgg
    6601  ggatagccgg cagattcact acattgaaag gaacgaattg gacaagtgtt
    6651  tgtatgggga tctcgacagg ttcgtggacc tgcagtgtcg ggtttatggt
    6701  aatatttcgt accgttgtcc atgggatgtg tgcgcttggt taagcacagt
    6751  gtatggcaac ctaatcgcca ccgtgaaggg tgaaagcgaa ggtaagagca
    6801  gcatgcgcat taacgaaatt aattcagtcg acgatttagt ccccgacgtg
    6851  ggttccacgt ttctgtgtat gcttcagtcg gagaagttgg aaatcagcaa
    6901  gcactttatt cgcaagggtt tgactaaact taacgttcta acggtgcatg
    6951  aggcgcaagg tgagacgtat gcgcgtgtga accttgtgcg acttaagttt
    7001  caggaggatg aaccctttaa atctatcagg cacataaccg tcgctctttc
    7051  tcgtcacacc gacagcttaa cttataacgt cttagctgct cgtcgaggtg
    7101  acgccacttg cgatgccatc cagaaggctg cggaattggt gaacaagttt
    7151  cgcgtttttc ctacatcttt tggtggtagt gttatcaatc tcaacgtgaa
    7201  gaaggacgtg gaagataaca gtaggtgcaa ggcttcgtcg gcaccattga
    7251  gcgtaatcaa cgactttttg aacgaagtta atcccggtac tgcggtgatt
    7301  gattttggtg atttgtccgc ggacttcagt actgggcctt ttgagtgcgg
    7351  tgccagcggt attgtggtgc gggacaacat ctcctccagc aacatcactg
    7401  atcacgataa gcagcgtgtt tagcgtagtt cggtcgcagg cgattccgcg
    7451  tagaaaacct tctctacaag aaaatttgta ttcgtttgaa gcgcggaatt
    7501  ataacttctc gacttgcgac cgtaacacat ctgcttcaat gttcggagag
    7551  gctatggcga tgaactgtct tcgtcgttgc ttcgacctag atgccttttc
    7601  gtccctgcgt gatgatgtga ttagtatcac acgttcaggc atcgaacaat
    7651  ggctggagaa acgtactcct agtcagatta aagcattaat gaaggatgtt
    7701  gaatcgcctt tggaaattga cgatgaaatt tgtcgtttta agttgatggt
    7751  gaagcgtgac gctaaggtga agttagactc ttcttgttta actaaacaca
    7801  gcgccgctca aaatatcatg tttcatcgca agagcattaa tgctatcttc
    7851  tctcctatct ttaatgaggt gaaaaaccga ataatgtgct gtcttaagcc
    7901  taacataaag ttttttacgg agatgactaa cagggatttt gcttctgttg
    7951  tcagcaacat gcttggtgac gacgatgtgt accatatagg tgaagttgat
    8001  ttctcaaagt acgacaagtc tcaagatgct ttcgtgaagg cttttgaaga
    8051  agtaatgtat aaggaactcg gtgttgatga agagttgctg gctatctgga
    8101  tgtgcggcga gcggttatcg atagctaaca ctctcgatgg tcagttgtcc
    8151  ttcacgatcg agaatcaaag gaagtcggga gcttcgaaca cttggattgg
    8201  taactctctc gtcactttgg gtattttaag tctttactac gacgttagaa
    8251  atttcgaggc gttgtacatc tcgggcgatg attctttaat tttttctcgc
    8301  agcgagattt cgaattatgc cgacgacata tgcactgaca tgggttttga
    8351  gacaaaattt atgtccccaa gtgtcccgta cttttgttct aaatttgttg
    8401  ttatgtgtgg tcataagacg ttttttgttc ccgacccgta caagcttttt
    8451  gtcaagttgg gagcagtcaa agaggatgtt tcaatggatt tccttttcga
    8501  gacttttacc tcctttaaag acttaacctc cgattttaac gacgagcgct
    8551  taattcaaaa gctcgctgaa cttgtggctt taaaatatga ggttcaaacc
    8601  ggcaacacca ccttggcgtt aagtgtgata cattgtttgc gttcgaattt
    8651  cctctcgttt agcaagttat atcctcgcgt gaagggatgg caggtttttt
    8701  acacgtcggt taagaaagcg cttctcaaga gtgggtgttc tctcttcgac
    8751  agtttcatga ccccttttgg tcaggctgtc atggtttggg atgatgagta
    8801  gcgctaactt gtgcgcagtt tctttgttcg tgacatacac cttgtgtgtc
    8851  accgtgcgtt tataatgaat caggttttgc agtttgaatg tttgtttctg
    8901  ctgaatctcg cggtttttgc tgtgactttc attttcattc ttctggtctt
    8951  ccgcgtgatt aagtcttttc gccagaaggg tcacgaagca cctgttcccg
    9001  ttgttcgtgg cgggggtttt tcaaccgtag tgtagtcaaa agacgcgcat
    9051  atggtagttt tcggtttgga ctttggcacc acattctcta cggtgtgtgt
    9101  gtacaaggat ggacgagttt tttcattcaa gcagaataat tcggcgtaca
    9151  tccccactta cctctatctc ttctccgatt ctaaccacat gacttttggt
    9201  tacgaggccg aatcactgat gagtaatctg aaagttaaag gttcgtttta
    9251  tagagattta aaacgttggg tgggttgcga ttcgagtaac ctcgacgcgt
    9301  accttgaccg tttaaaacct cattactcgg tccgcttggt taagatcggc
    9351  tctggcttga acgaaactgt ttcaattgga aacttcgggg gcactgttaa
    9401  gtctgaggct catctgccag ggttgatagc tctctttatt aaggctgtca
    9451  ttagttgcgc ggagggcgcg tttgcgtgca cttgcaccgg ggttatttgt
    9501  tcagtacctg ccaattatga tagcgttcaa aggaatttca ctgatcagtg
    9551  tgtttcactc agcggttatc agtgcgtata tatgatcaat gaaccttcag
    9601  cggctgcgct atctgcgtgt aattcgattg gaaagaagtc cgcaaatttg
    9651  gctgtttacg atttcggtgg tgggaccttc gacgtgtcta tcatttcata
    9701  ccgcaacaat acttttgttg tgcgagcttc tggaggcgat ctaaatctcg
    9751  gtggaaggga tgttgatcgt gcgtttctca cgcacctctt ctctttaaca
    9801  tcgctggaac ctgacctcac tttggatatc tcgaatctga aagaatcttt
    9851  atcaaaaacg gacgcagaga tagtttacac tttgagaggt gtcgatggaa
    9901  gaaaagaaga cgttagagta aacaaaaaca ttcttacgtc ggtgatgctc
    9951  ccctacgtga acagaacgct taagatatta gagtcaacct taaaatcgta
   10001  tgctaagagt atgaatgaga gtgcgcgagt taagtgcgat ttagtgctga
   10051  taggaggatc ttcatatctt cctggcctgg cagacgtact aacgaagcat
   10101  cagagcgttg atcgtatctt aagagtttcg gatcctcggg ctgccgtggc
   10151  cgtcggttgc gcattatatt cttcatgcct ctcaggatct ggggggttgc
   10201  tactgatcga ctgtgcagct cacactgtcg ctatagcgga cagaagttgt
   10251  catcaaatca tttgcgctcc agcgggggca ccgatcccct tttcaggaag
   10301  catgcctttg tacttagcca gggtcaacaa gaactcgcag cgtgaagtcg
   10351  ccgtgtttga aggggagtac gttaagtgcc ctaagaacag aaagatctgt
   10401  ggagcaaata taagattttt tgatatagga gtgacgggtg attcgtacgc
   10451  acccgttacc ttctatatgg atttctccat ttcaagcgta ggagccgttt
   10501  cattcgtggt gagaggtcct gagggtaagc aagtgtcact cactggaact
   10551  ccagcgtata acttttcgtc tgtggctctc ggatcacgca gtgtccgaga
   10601  attgcatatt agtttaaata ataaagtttt tctcggtttg cttctacata
   10651  gaaaggcgga tcgacgaata cttttcacta aggatgaagc gattcgatac
   10701  gccgattcaa ttgatatcgc ggatgtgcta aaggaatata aaagttacgc
   10751  ggccagtgcc ttaccaccag acgaggatgt cgaattactc ctgggaaagt
   10801  ctgttcaaaa agttttacgg ggaagcagac tggaagaaat acctctctag
   10851  gagcatagca gcacactcaa gtgaaattaa aactctacca gacattcgat
   10901  tgtacggcgg tagggttgta aagaagtccg aattcgaatc agcacttcct
   10951  aattcttttg aacaggaatt aggactgttc atactgagcg aacgggaagt
   11001  gggatggagc aaattatgcg gaataacggt ggaagaagca gcatacgatc
   11051  ttacgaatcc caaggcttat aaattcactg ccgagacatg tagcccggat
   11101  gtaaaaggtg aaggacaaaa atactctatg gaagacgtga tgaatttcat
   11151  gcgtttatca aatctggatg ttaacgacaa gatgctgacg gaacagtgtt
   11201  ggtcgctgtc caattcatgc ggtgaattga tcaacccaga cgacaaaggg
   11251  cgattcgtgg ctctcacctt taaggacaga gacacagctg atgacacggg
   11301  tgccgccaac gtggaatgtc gcgtgggcga ctatctagtt tacgctatgt
   11351  ccctgtttga gcagaggacc caaaaatcgc agtctggcaa catctctctg
   11401  tacgaaaagt actgtgaata catcaggacc tacttaggga gtacagacct
   11451  gttcttcaca gcgccggaca ggattccgtt acttacgggc atcctatacg
   11501  atttttgtaa ggaatacaac gttttctact cgtcatataa gagaaacgtc
   11551  gataatttca gattcttctt ggcgaattat atgcctttga tatctgacgt
   11601  ctttgtcttc cagtgggtaa aacccgcgcc ggatgttcgg ctgctttttg
   11651  agttaagtgc agcggaacta acgctggagg ttcccacact gagtttgata
   11701  gattctcaag ttgtggtagg tcatatctta agatacgtag aatcctacac
   11751  atcagatcca gccatcgacg cgttagaaga caaactggaa gcgatactga
   11801  aaagtagcaa tccccgtcta tcgacagcgc aactatgggt tggtttcttt
   11851  tgttactatg gtgagtttcg tacggctcaa agtagagtag tgcaaagacc
   11901  aggcgtatac aaaacacctg actcagtggg tggatttgaa ataaacatga
   11951  aagatgttga gaaattcttc gataaacttc agagagaatt gcctaatgta
   12001  tctttgcggc gtcagtttaa cggagctaga gcgcatgagg ctttcaaaat
   12051  atttaaaaac ggaaatataa gtttcagacc tatatcgcgt ttaaacgtgc
   12101  ctagagagtt ctggtatctg aacatagact acttcaggca cgcgaatagg
   12151  tccgggttaa ccgaagaaga aatactcatc ctaaacaaca taagcgttga
   12201  tgttaggaag ttatgcgctg agagagcgtg caatacccta cctagcgcga
   12251  agcgctttag taaaaatcat aagagtaata tacaatcatc acgccaagag
   12301  cggaggatta aagacccatt ggtagtcctg aaagacactt tatatgagtt
   12351  ccaacacaag cgtgccggtt gggggtctcg aagcactcga gacctcggga
   12401  gtcgtgctga ccacgcgaaa ggaagcggtt gataagtttt ttaatgaact
   12451  aaaaaacgaa aattactcat cagttgacag cagccgatta agcgattcgg
   12501  aagtaaaaga agtgttagag aaaagtaaag aaagtttcaa aagcgaactg
   12551  gcctccactg acgagcactt cgtctaccac attatatttt tcttaatccg
   12601  atgtgctaag atatcgacaa gtgaaaaggt gaagtacgtt ggtagtcata
   12651  cgtacgtggt cgacggaaaa acgtacaccg ttcttgacgc ttgggtattc
   12701  aacatgatga aaagtctcac gaagaagtac aaacgagtga atggtctgcg
   12751  tgcgttctgt tgcgcgtgcg aagatctata tctaaccgtc gcaccaataa
   12801  tgtcagaacg ctttaagact aaagccgtag ggatgaaagg tttgcctgtt
   12851  ggaaaggaat acttaggcgc cgactttctt tcgggaacta gcaaactgat
   12901  gagcgatcac gacagggcgg tctccatcgt tgcagcgaaa aacgctgtcg
   12951  atcgtagcgc tttcacgggt ggggagagaa agatagttag tttgtatgat
   13001  ctagggaggt actaagcacg gtgtgctata gtgcgtgcta taataataaa
   13051  cactagtgct taagtcgcgc agaagaaaac gctatggagt tgatgtccga
   13101  cagcaacctt agcaacctgg tgataaccga cgcctctagt ctaaatggtg
   13151  tcgacaagaa gcttttatct gctgaagttg aaaaaatgtt ggtgcagaaa
   13201  ggggctccta acgagggtat agaagtggtg ttcggtctac tcctttacgc
   13251  actcgcggca agaaccacgt ctcctaaggt tcagcgcgca gattcagacg
   13301  ttatattttc aaatagtttc ggagagagga atgtggtagt aacagagggt
   13351  gaccttaaga aggtactcga cgggtgtgcg cctctcacta ggttcactaa
   13401  taaacttaga acgttcggtc gtactttcac tgaggcttac gttgactttt
   13451  gtatcgcgta taagcacaaa ttaccccaac tcaacgccgc ggcggaattg
   13501  gggattccag ctgaagattc gtacttagct gcagattttc tgggtacttg
   13551  cccgaagctc tctgaattac agcaaagtag gaagatgttc gcgagtatgt
   13601  acgctctaaa aactgaaggt ggagtggtaa atacaccagt gagcaatctg
   13651  cgtcagctag gtagaaggga agttatgtaa tggaagatta cgaagaaaaa
   13701  tccgaatcgc tcatactgct acgcacgaat ctgaacacta tgcttttagt
   13751  ggtcaagtcc gatgctagtg tagagctgcc taaactacta atttgcggtt
   13801  acttacgagt gtcaggacgt ggggaggtga cgtgttgcaa ccgtgaggaa
   13851  ttaacaagag attttgaggg caatcatcat acggtgatcc gttctagaat
   13901  catacaatat gacagcgagt ctgcttttga ggaattcaac aactctgatt
   13951  gcgtagtgaa gtttttccta gagactggta gtgtcttttg gtttttcctt
   14001  cgaagtgaaa ccaaaggtag agcggtgcga catttgcgca ccttcttcga
   14051  agctaacaat ttcttctttg gatcgcattg cggtaccatg gagtattgtt
   14101  tgaagcaggt actaactgaa actgaatcta taatcgattc tttttgcgaa
   14151  gaaagaaatc gttaagatga gggttatagt gtctccttat gaagctgaag
   14201  acattctgaa aagatcgact gacatgttac gaaacataga cagtggggtc
   14251  ttgagcacta aagaatgtat caaggcattc tcgacgataa cgcgagacct
   14301  acattgtgcg aaggcttcct accagtgggg tgttgacact gggttatatc
   14351  agcgtaattg cgctgaaaaa cgtttaattg acacggtgga gtcaaacata
   14401  cggttggctc aacctctcgt gcgtgaaaaa gtggcggttc atttttgtaa
   14451  ggatgaacca aaagagctag tagcattcat cacgcgaaag tacgtggaac
   14501  tcacgggcgt gggagtgaga gaagcggtga agagggaaat gcgctctctt
   14551  accaaaacag ttttaaataa aatgtctttg gaaatggcgt tttacatgtc
   14601  accacgagcg tggaaaaacg ctgaatggtt agaactaaaa ttttcacctg
   14651  tgaaaatctt tagagatctg ctattagacg tggaaacgct caacgaattg
   14701  tgcgccgaag atgatgttca cgtcgacaaa gtaaatgaga atggggacga
   14751  aaatcacgac ctcgaactcc aagacgaatg ttaaacattg gttaagttta
   14801  acgaaaatga ttagtaaata ataaatcgaa cgtgggtgta tctacctgac
   14851  gtatcaactt aagctgttac tgagtaatta aaccaacaag tgttggtgta
   14901  atgtgtatgt tgatgtagag aaaaatccgt ttgtagaacg gtgtttttct
   14951  cttctttatt tttaaaaaaa aaataaaaaa aaaaaaaaaa aagcggccgc