Sequence of DPV Drosophila melanogaster American nodavirus

Drosophila melanogaster American nodavirus (ANV) SW-2009a segment RNA1, complete sequence.

ACC No: GQ342965

Dated: 2010-02-05 | Length: 3107 | CRC: 1655082543

ID   GQ342965; SV 1; linear; genomic RNA; STD; VRL; 3107 BP.
XX
AC   GQ342965;
XX
DT   09-NOV-2009 (Rel. 102, Created)
DT   05-FEB-2010 (Rel. 103, Last updated, Version 2)
XX
DE   Drosophila melanogaster American nodavirus (ANV) SW-2009a segment RNA1,
DE   complete sequence.
XX
KW   .
XX
OS   Drosophila melanogaster American nodavirus (ANV) SW-2009a
OC   Viruses; ssRNA positive-strand viruses, no DNA stage; Nodaviridae;
OC   unclassified Nodaviridae.
XX
RN   [1]
RP   1-3107
RX   PUBMED; 20080648.
RA   Wu Q., Luo Y., Lu R., Lau N., Lai E.C., Li W.X., Ding S.W.;
RT   "Virus discovery by deep sequencing and assembly of virus-derived small
RT   silencing RNAs";
RL   Proc. Natl. Acad. Sci. U.S.A. 107(4):1606-1611(2010).
XX
RN   [2]
RP   1-3107
RA   Wu Q., Ding S.-W.;
RT   ;
RL   Submitted (29-JUN-2009) to the EMBL/GenBank/DDBJ databases.
RL   Plant Pathology & Microbiology, UC Riverside, 900 University Ave,
RL   Riverside, CA 92521, USA
XX
CC   GenBank Accession Numbers GQ342965-GQ342966 represent the complete
CC   genome of Drosophila melanogaster American nodavirus (ANV)
CC   SW-2009a.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..3107
FT                   /organism="Drosophila melanogaster American nodavirus (ANV)
FT                   SW-2009a"
FT                   /segment="RNA1"
FT                   /host="Drosophila melanogaster"
FT                   /lab_host="S2 cells"
FT                   /mol_type="genomic RNA"
FT                   /country="USA"
FT                   /collection_date="Feb-2009"
FT                   /db_xref="taxon:663279"
FT   CDS             40..3036
FT                   /codon_start=1
FT                   /product="protein A"
FT                   /protein_id="ACU32794.1"
FT                   /translation="MTLKVILGEHQITRTELLVGLATVSGCGAVAYCISKFWGYGAIAP
FT                   YPQSGGNRVTRALQRAVIDKTKTPIETRFYPLDSLRTVTPKRAFNNGHAVSGAVRDAAR
FT                   RLIDEAITSVGGSKFEVNPNPNSSTGLRNHFHFAVGDLAQDFRNDTPAEDAFIVGVDVD
FT                   YYVTEPDVLLEHMRPVVLHTFNPKKVSGFDADSPFTIKNNLVEYKVSGGAAWVHPVWDW
FT                   CEAGEFIASRVRTSWKEWFLQLPLRIVGLEKVGYHKIHHCRPWTDCPDRTLVYTIPQYT
FT                   VWRFNWIDTELHVRKLKRIEYQDETKPGWNRLEYVTDKNELMVSIGREGEHAQITIEKE
FT                   KLDMLSGLSATQSVNARLIGMGHKDPKYTSMIVQYYTGKKVVSPISPTVYKPTMPRVHW
FT                   PVTSDADVPEVSARQYTLPIVSDCMMMPMIKRWETMSESIERRVTFVANDKKPSDRIAR
FT                   IADTFVRLMNGPFNNLDPLSIEETIERLNKPSQQLQLRAVFEIIGVEPRQLIESFNKNE
FT                   PGMKSSRIISGFPDILFILKVSRYTLAYSDIVLHAEHNQHWYYPGRNPTEIADGVCEFV
FT                   SDCDAEVIETDFSNLDGRVSGWMQRNIAQKAMVQAFRPEYRDEIISFMDTIIHCPATAK
FT                   RFGFRYEPGVGVKSGSPTTTPHNTQYNACVEFTALTIEHPYAEPEDLFRLIGPKCGDDG
FT                   LSRAIIRNSIDRAAKCYGLELKVERYNPEIGLCFLSRVFVDPLATTTTIQDPLRTLRKL
FT                   HLTARDPTIPLADAACDRVEGYLCTDALTPLISDYCKMVLRLYGPTASTEEVRNQRRSR
FT                   NKEKPYWLMCDGSWPQHPQDVHLMKQVLIKRTRIDEDQVDTLIGRFAAMKDVWEKITRD
FT                   SEESAAACTFDEDGVMPGSVDESLPKLNDAKQTRANPGTSRPHSNGGGSSNGNELSGRT
FT                   EQRAQGPRQPPCLPKQGKANGKPNGNITAGETQRGGIPRRKGPRGGKTDTRRTPPKAGA
FT                   QPQPSNNRK"
FT   CDS             2738..3058
FT                   /codon_start=1
FT                   /product="protein B2"
FT                   /protein_id="ACU32795.1"
FT                   /translation="MPSKLALIQELPDRIQTAAEAAMGMSYQDAPNNVRRDLDNLHACL
FT                   NKAKLTVSRMVTSLLEKPSVVAYLEGRAPEEAKPTLEERLRKLELSHSLPTTGSDPPPA
FT                   KP"
XX
SQ   Sequence 3107 BP; 872 A; 771 C; 767 G; 697 T; 0 other;

gq342965 Length: 3107  05-FEB-2010  Type: N  Check: 7361  ..

       1  gtttgatata caaataaaac accgaagcgc cccaaaacaa tgaccctaaa
      51  agttattctt ggagaacacc agatcacccg gactgaattg ctcgtcgggc
     101  ttgcaaccgt atctgggtgc ggtgccgtag cgtactgcat atccaagttc
     151  tggggctatg gggcaattgc gccctatccc cagagtggag ggaaccgagt
     201  tacacgcgca ttgcaacggg ctgtcattga caaaacgaag accccgatag
     251  aaacacgttt ctatccgctt gacagcctgc gtaccgtaac gcctaagcgt
     301  gcctttaaca acgggcatgc cgtttcagga gccgtacgtg atgccgcccg
     351  tcgtttaatc gacgaggcta taacatctgt aggaggatct aaatttgaag
     401  ttaacccgaa cccaaattcc agcactggac tgcggaacca tttccacttc
     451  gccgtcggtg atctggcgca agactttcgc aatgatacac cagcggaaga
     501  cgccttcatc gttggtgttg atgttgatta ttatgtcacc gagccagatg
     551  tgctgttaga gcacatgcgt ccagtggttt tacacacctt caacccgaag
     601  aaagtgagtg gctttgatgc tgactcacca ttcaccatca agaacaacct
     651  ggttgaatac aaggttagtg gaggtgctgc atgggtccat ccagtttggg
     701  attggtgcga agctggtgag ttcatcgcca gccgcgtccg aacaagttgg
     751  aaggaatggt ttttacaact gccattgaga atagttggtt tagaaaaggt
     801  cggttatcac aaaatccacc attgtagacc ctggactgat tgtccagatc
     851  gaacactcgt ctacacaata ccgcagtata ctgtttggcg gttcaattgg
     901  atcgacaccg agctacacgt acgcaaacta aaacggattg aataccagga
     951  tgaaactaaa cctggttgga acagattgga gtatgtcacc gacaagaacg
    1001  aactaatggt ttccatcggt agagaaggag agcacgctca gattaccatc
    1051  gagaaagaaa agttggatat gctctcggga ttatccgcca cacaatctgt
    1101  caacgctagg cttattggta tggggcacaa ggacccaaag tacacatcta
    1151  tgattgttca gtattacacc ggcaagaaag ttgtgtcacc aattagtcca
    1201  actgtgtata aacctacaat gccacgcgtc cattggccag taaccagtga
    1251  cgcagatgta ccagaagtga gcgcacgcca atacaccttg cctatcgtga
    1301  gtgactgtat gatgatgcca atgattaaac gctgggagac aatgtctgaa
    1351  tcaattgaac gcagggtaac ctttgtcgcc aacgacaaga aaccaagcga
    1401  ccgtatcgcg aggatcgccg acactttcgt acggttgatg aacgggcctt
    1451  tcaacaacct tgacccattg tcaatcgagg aaacgattga gcgtctgaac
    1501  aagccgtccc aacaactaca acttagggcg gttttcgaga taatcggagt
    1551  tgagccgcgt caattgattg agtcattcaa caaaaacgaa ccgggaatga
    1601  aatctagtcg gataatatcc ggctttcctg acatcctttt catcctgaag
    1651  gtctccagat acacattagc gtactcggat atcgttctac atgccgagca
    1701  taatcaacat tggtattatc ctggaaggaa cccaactgag atcgccgatg
    1751  gggtttgtga atttgtcagt gactgtgacg ctgaagtcat agaaacggat
    1801  ttttccaacc ttgatggccg ggtttctggc tggatgcaac gaaatatcgc
    1851  tcaaaaggcg atggttcaag cattccggcc agaatacaga gatgagatca
    1901  tttcatttat ggacacgata atccattgtc cagccacagc taaacgcttt
    1951  ggtttccgat atgagcctgg agttggtgtt aaaagtggta gtccaacaac
    2001  cacgccgcat aacacgcaat ataatgcatg tgtcgaattt acagctctca
    2051  ccattgagca cccttatgcc gaaccagaag atctgttccg attaatcggt
    2101  ccgaagtgcg gtgatgatgg cctgtcgcgg gccatcattc gaaattctat
    2151  cgaccgtgct gccaaatgtt atggcttgga gctcaaagtg gaacgataca
    2201  acccagagat aggtctttgc tttctctcac gtgtttttgt ggacccgctc
    2251  gcaacgacga caacaataca agacccactg cgtacactgc gaaaactgca
    2301  cctcacagca agagatccta cgataccact agctgacgcg gcttgcgacc
    2351  gcgtcgaggg ttatctttgt accgatgcgc ttactccgtt gatatcggat
    2401  tactgcaaaa tggtactacg gttatacggg ccgactgcct ccacagagga
    2451  agtcaggaac caacgtagaa gccggaataa ggagaaaccc tactggttaa
    2501  tgtgcgatgg atcatggcca cagcatccgc aagacgtcca tttgatgaag
    2551  caggttttaa tcaagcgtac tagaattgac gaagatcagg tcgatacact
    2601  cattgggcgt tttgccgcaa tgaaggatgt ctgggagaag attacacgtg
    2651  acagcgagga gagcgccgct gcatgtacgt ttgatgaaga cggcgttatg
    2701  ccgggctccg tggatgaatc gttacctaag cttaacgatg ccaagcaaac
    2751  tcgcgctaat ccaggaactt cccgaccgca ttcaaacggc ggcggaagca
    2801  gcaatgggaa tgagctatca ggacgcaccg aacaacgtgc gcagggacct
    2851  cgacaacctc catgcttgcc taaacaaggc aaagctaacg gtaagccgaa
    2901  tggtaacatc actgctggag aaacccagcg tggtggcata cctcgaagga
    2951  agggcccccg aggaggcaaa accgacactc gaagaacgcc tccgaaagct
    3001  ggagctcagc cacagccttc caacaaccgg aagtgacccc ccacccgcta
    3051  aaccgtagct gactcctagg agcacctaca cccgttctag cccgaaaggg
    3101  cggaggt