Sequence of DPV Cyclovirus-PK5006

Cyclovirus PK5006, complete genome.

ACC No: GQ404844

Dated: 2010-03-15 | Length: 1723 | CRC: 596781439

                
ID   GQ404844; SV 1; circular; genomic DNA; STD; VRL; 1723 BP.
XX
AC   GQ404844;
XX
DT   15-MAR-2010 (Rel. 104, Created)
DT   15-MAR-2010 (Rel. 104, Last updated, Version 1)
XX
DE   Cyclovirus PK5006, complete genome.
XX
KW   .
XX
OS   Cyclovirus PK5006
OC   Viruses; ssDNA viruses; Circoviridae; unclassified Circoviridae.
XX
RN   [1]
RP   1-1723
RX   DOI; 10.1128/JVI.02109-09.
RX   PUBMED; 20007276.
RA   Li L., Kapoor A., Slikas B., Bamidele O.S., Wang C., Shaukat S.,
RA   Masroor M.A., Wilson M.L., Ndjango J.B., Peeters M., Gross-Camp N.D.,
RA   Muller M.N., Hahn B.H., Wolfe N.D., Triki H., Bartkus J., Zaidi S.Z.,
RA   Delwart E.;
RT   "Multiple diverse circoviruses infect farm animals and are commonly found
RT   in human and chimpanzee feces";
RL   J. Virol. 84(4):1674-1682(2010).
XX
RN   [2]
RP   1-1723
RA   Li L., Kapoor A., Delwart E.;
RT   ;
RL   Submitted (23-JUL-2009) to the EMBL/GenBank/DDBJ databases.
RL   Department of Laboratory Medicine, University of California, San Francisco,
RL   Blood Systems Research Institute, 270 Masonic Ave, San Francisco, CA 94118,
RL   USA
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..1723
FT                   /organism="Cyclovirus PK5006"
FT                   /host="Homo sapiens"
FT                   /isolate="PK5006"
FT                   /mol_type="genomic DNA"
FT                   /country="Pakistan"
FT                   /isolation_source="stool"
FT                   /collection_date="2007"
FT                   /db_xref="taxon:742915"
FT   gene            23..859
FT                   /gene="rep"
FT   CDS             23..859
FT                   /codon_start=1
FT                   /gene="rep"
FT                   /product="replication-association protein"
FT                   /db_xref="GOA:D4N3N6"
FT                   /db_xref="InterPro:IPR000605"
FT                   /db_xref="InterPro:IPR003365"
FT                   /db_xref="UniProtKB/TrEMBL:D4N3N6"
FT                   /protein_id="ADD62451.1"
FT                   /translation="MANRTVRRFCFTWNDHPVEAYEKCEKFIEKFCKYGIVGEEYAPTT
FT                   GMPHLQGFCNLNKPTRFSTIKKHLDNSIHIEKANGTDEQNQKYCSKSGIFFESGVPNKQ
FT                   GQRNDLQSLVEFIHEKRPTIRDIATEHPTTYIRYFRGIERMLQLVNPIKQRDFKTEVYY
FT                   YWGPPGTGKSRRALEEAQAFNTESIYYKPRGLWWDGYEQQDSVIIDDFYGWIKYDEMLK
FT                   IMDRYPYKVQVKGAFQEFTSKKIFITSNVDTDELYKFVGYTTAAFERRITNKEYMA"
FT   gene            complement(856..1515)
FT                   /gene="cap"
FT   CDS             complement(856..1515)
FT                   /codon_start=1
FT                   /gene="cap"
FT                   /product="capsid protein"
FT                   /db_xref="UniProtKB/TrEMBL:D4N3N7"
FT                   /protein_id="ADD62452.1"
FT                   /translation="MALKRLRRMPQRRNRFVRSRRRVFKRRLRFRRRPISTLFTKLTRS
FT                   VQVYSDARTGGVASLHQTLEQFSEHKNLAPNFERVKIYRLNVRVFPQQNVANNTSSRVT
FT                   NYAIVPYHRPLVKPATPNFPTCLSIDKSKIRRMTQFGRMSFVPAVRLGTDTETSTLYNT
FT                   TKWRPEFDIGVDADKEILYCGFLCFEGDSTIAESHPVNFTVVMDLFVKYKNQRSFI"
XX
SQ   Sequence 1723 BP; 560 A; 384 C; 368 G; 411 T; 0 other;

gq404844 Length: 1723  15-MAR-2010  Type: N  Check: 4104  ..

       1  atacccgcca cttcgttgca cgatggcaaa ccgtacagtg cgccgattct
      51  gcttcacgtg gaacgaccat ccagtcgaag catacgaaaa gtgcgaaaag
     101  tttattgaaa aattctgcaa atatggaatc gtgggagaag aatacgctcc
     151  aactacagga atgccccatc tccaaggttt ctgtaacctt aataaaccaa
     201  cacgcttcag taccatcaag aagcacctcg ataacagcat ccatattgag
     251  aaggcaaatg ggacagacga acaaaaccaa aaatactgtt caaaatcagg
     301  catatttttt gaatcgggcg tacctaacaa acaagggcaa cgcaatgatc
     351  tgcaatctct ggtggaattc attcacgaaa agaggccaac gatacgagat
     401  attgccaccg aacatcctac gacatacata cgctacttcc gtggaatcga
     451  aagaatgctg caactggtta acccgatcaa acaacgcgat ttcaagactg
     501  aagtatatta ttattgggga ccgccgggga ctggcaaatc tagaagagca
     551  cttgaagagg cacaagcatt caacactgag tcaatatact ataaaccacg
     601  cggattgtgg tgggacggat acgaacaaca ggacagcgtc atcatagacg
     651  acttctacgg atggatcaaa tacgacgaaa tgctgaaaat aatggaccgc
     701  tacccataca aggtacaagt gaaaggagct tttcaagaat ttacaagcaa
     751  aaaaattttt attacatcaa atgttgatac agacgaatta tacaaatttg
     801  taggctatac aacagccgct tttgaacgtc gcattacaaa taaagaatac
     851  atggcttaaa taaatgagcg ttgattttta tatttaacaa acaaatccat
     901  aacaactgta aaattaacag gatgcgattc cgcaatagta ctatctccct
     951  caaaacataa aaatccacaa tataatattt ccttatccgc atcaactcct
    1001  atatcaaact caggtcgcca ttttgtagta ttatataaag tagaggtctc
    1051  cgtatcagta ccaagacgaa cagcaggcac aaaactcata cgtccaaatt
    1101  gcgtcatcct tcgtatcttg ctcttatcaa tagataaaca ggtcggaaag
    1151  ttcggagtag caggtttcac aagaggtcga tgatatggta caatagcata
    1201  attagtaact cgcgatgacg tattattcgc aacgttctgt tgtggaaaaa
    1251  cccttacatt cagtctatat atcttcacac gttcaaaatt tggtgcgagg
    1301  ttcttatgtt cagaaaattg ttccaacgtt tgatgcaggg aagccacacc
    1351  accagttctt gcatcagaat atacttgcac agaacgagta agtttagtaa
    1401  ataatgtaga aattggacgg cggcgaaatc gcagtctacg cttgaaaact
    1451  cgcctgcgag accggacaaa ccggttccta cgctggggca ttcgccgcaa
    1501  acgcttcaag gccatgcctt tcactttaac gctcggtata aaaataactc
    1551  ctcttcgggt gacggcttgt tgccaagtga acgttgcgcc cccccccgtc
    1601  ggccgggcag gctccccggc ggggggaggg cgtgcgtgtt ggaacgtgtt
    1651  acgggcggat gttagcggag cggatagtac atgcggttac gaagtcacgg
    1701  ttacgaagtg gcggggtaat act
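
The CDS locations and the SQ summary line of this entry can be cross-checked mechanically. The following is a minimal Python sketch and not part of the database record. The helper names (parse_location, expected_protein_length, parse_sq_summary) are hypothetical and introduced only for illustration. The sketch handles only the simple `start..end` and `complement(start..end)` location forms used here, and it assumes codon_start=1 and that each CDS span includes its terminal stop codon.

```python
import re


def parse_location(location):
    """Parse 'start..end' or 'complement(start..end)' into (start, end, strand).

    Assumption: only these two simple forms are supported. Stray spaces
    (e.g. '23. .859' from a damaged copy) are removed before parsing.
    """
    text = location.replace(" ", "")
    strand = 1
    if text.startswith("complement(") and text.endswith(")"):
        strand = -1
        text = text[len("complement("):-1]
    match = re.fullmatch(r"(\d+)\.\.(\d+)", text)
    if match is None:
        raise ValueError(f"unsupported location: {location!r}")
    return int(match.group(1)), int(match.group(2)), strand


def expected_protein_length(start, end):
    """Residue count implied by a CDS span (1-based, inclusive coordinates).

    Assumption: codon_start=1 and the span ends with a stop codon,
    so one codon is subtracted from the total.
    """
    span = end - start + 1
    if span % 3 != 0:
        raise ValueError(f"span of {span} bases is not a whole number of codons")
    return span // 3 - 1


def parse_sq_summary(line):
    """Read the counts from an SQ line into a dict keyed by BP, A, C, G, T, other."""
    counts = {}
    for number, label in re.findall(r"(\d+)\s+(BP|A|C|G|T|other);", line):
        counts[label] = int(number)
    return counts


if __name__ == "__main__":
    for name, location in [("rep", "23..859"), ("cap", "complement(856..1515)")]:
        start, end, strand = parse_location(location)
        print(name, strand, expected_protein_length(start, end))

    counts = parse_sq_summary(
        "SQ   Sequence 1723 BP; 560 A; 384 C; 368 G; 411 T; 0 other;"
    )
    total = sum(counts[base] for base in ("A", "C", "G", "T", "other"))
    print(total == counts["BP"])
```

For this entry the spans imply 278 residues for rep and 219 for cap, which agrees with the lengths of the two /translation qualifiers, and the base counts on the SQ line sum to the stated 1723 BP. The two CDS spans share positions 856 to 859 on opposite strands.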