Sequence of DPV Gull circovirus

Gull circovirus, complete genome.

ACC No: DQ845074

Dated: 2006-10-02 | Length: 2035 | CRC: -799810247

                ID   DQ845074; SV 1; circular; genomic DNA; STD; VRL; 2035 BP.
XX
AC   DQ845074;
XX
DT   02-OCT-2006 (Rel. 89, Created)
DT   02-OCT-2006 (Rel. 89, Last updated, Version 1)
XX
DE   Gull circovirus, complete genome.
XX
KW   .
XX
OS   Gull circovirus
OC   Viruses; ssDNA viruses; Circoviridae; Circovirus; unclassified Circovirus.
XX
RN   [1]
RP   1-2035
RA   Todd D., Scott A.N.J., Fringuelli E., Shivraprasad H.L., Gavier-Widen D.,
RA   Smyth J.A.;
RT   "Molecular characterization of novel circoviruses from finch and gull";
RL   Unpublished.
XX
RN   [2]
RP   1-2035
RA   Todd D., Scott A.N.J., Fringuelli E.;
RT   ;
RL   Submitted (11-JUL-2006) to the EMBL/GenBank/DDBJ databases.
RL   Department of Agriculture and Rural Development for Northern Ireland,
RL   Veterinary Sciences Division, Stoney Road, Stormont, Belfast BT4 3SD, UK
XX
FH   Key             Location/Qualifiers
FH
FT   source          1. .2035
FT                   /organism="Gull circovirus"
FT                   /specific_host="gull"
FT                   /mol_type="genomic DNA"
FT                   /db_xref="taxon:400121"
FT   gene            100. .1017
FT                   /gene="rep"
FT   CDS             100. .1017
FT                   /codon_start=1
FT                   /gene="rep"
FT                   /product="replication-associated protein"
FT                   /protein_id="ABI54254.1"
FT                   /translation="MRRRPRSGRYLRSIMAARRDSGARRWCFTLNNYTPEEEETARNLI
FT                   HDADKYAFAIIGKEVGESGTPHLQGFMHFKQKQRLTALKKLFPRAHFEKARGSDQQNAD
FT                   YCGKDGEILTMIGTPSDNNPSDLAGAVAAVKRGSQMSEIAREFSEVYVKYGRGLRDLRL
FT                   LIGCPPRDFKTEVIVLIGPPGCGKSKLANEMEGSKFYKMKGDWWDGYDNQDIVIIDDFY
FT                   GWLPYCECLRLCDRYPHRVPVKGAYVEFTSKKIVFTSNRHVDGWWKGEIEKSAFYRRIN
FT                   VYKFYETGEFKDMPGHMLPHPINY"
FT   CDS             complement(790. .1209)
FT                   /codon_start=1
FT                   /product="hypothetical protein"
FT                   /note="ORF C2"
FT                   /protein_id="ABI54256.1"
FT                   /translation="MLLDLFNKDSILNSKQVVSSLRATRVGRRQSRAPLALRSLRELRD
FT                   SGARDENDCHPEGGVFISRLIIDWVREHVSGHVLEFTSLIKLVHIDPSVEGTLLNFTFP
FT                   PPIHMAIAGKHNLLAGEFHVGTFHRHTVRVPVTQT"
FT   gene            complement(1190. .1927)
FT                   /gene="cap"
FT   CDS             complement(1190. .1927)
FT                   /codon_start=1
FT                   /transl_except=(pos:1925. .1927,aa:Met)
FT                   /gene="cap"
FT                   /product="capsid protein"
FT                   /protein_id="ABI54255.1"
FT                   /translation="MRRRHRLPRNRRGHRLNRIYLFRFRRQFTVDFPSKTDPQKWASDF
FT                   FTFNLNDFIGQEGIPAAWPFEDYRINLAKVVLRPEGVTTTIARGWGYTVPVQDARVKDL
FT                   KLQDQTQDPLANWDGARAWNLVRGFKRLVRPKPQLTISDMTASNFSASMWLNSSRSGWL
FT                   PLQMPAGTQRDGTRVVHYGLAFSWPSPGQALRYIAEITIYVTFRQFAQIMLKKSEEDLS
FT                   KFGDCLTITDNEKDIDWNVVGSL"
XX
SQ   Sequence 2035 BP; 478 A; 506 C; 532 G; 519 T; 0 other;

dq845074 Length: 2035  02-OCT-2006  Type: N  Check: 9408  ..

       1  accggcgggc cggggccatg gggccatctg gcccctggcc aagcatgcgc
      51  agtagtggcg catgcgcaat ggccgcgcag cggcctttat ttgcggagca
     101  tgcgcagaag gccgcgcagc ggccgctatt tgcgaagcat catggcagca
     151  cggcgagaca gcggagcccg tcgctggtgc tttactctga ataactacac
     201  tcccgaggaa gaggagaccg ctcgcaatct gattcatgat gcagataagt
     251  acgcgtttgc gataatcggc aaagaagttg gcgaaagcgg tactcctcat
     301  ctgcaaggct tcatgcattt taagcaaaag cagcggctta ccgctctaaa
     351  gaaacttttt cctcgcgcgc attttgagaa agctcgcggc agtgatcagc
     401  agaatgctga ttattgtggt aaagacggcg aaatacttac catgatcggt
     451  actccgagtg ataataatcc gagtgatctt gcgggagctg ttgccgctgt
     501  gaaacgcgga agtcaaatga gtgaaatcgc gcgagagttc agtgaagtct
     551  acgtcaagta tgggcggggc ctccgtgatc tccggttgct gattggttgc
     601  ccgccccgcg atttcaaaac agaagtcatc gttctgattg gcccacctgg
     651  ctgtggcaag tcaaaattgg ccaatgagat ggaagggtct aagttctaca
     701  agatgaaagg tgattggtgg gatggttatg acaatcaaga tattgtcata
     751  atcgatgact tctacggttg gctgccgtac tgtgagtgcc tacgtctgtg
     801  tgaccggtac cctcaccgtg tgcctgtgaa aggtgcctac gtggaattca
     851  ccagcaagaa gattgtgttt accagcaatc gccatgtgga tgggtggtgg
     901  aaaggtgaaa ttgagaagag tgccttctac cgacggatca atgtgtacaa
     951  gttttatgag actggtgaat tcaaggacat gcccggacac atgctcccgc
    1001  acccaatcaa ttattaggcg cgagatgaaa acacccccct cgggatgaca
    1051  gtcgttttca tctcgcgccc ccgagtcgcg gagttcgcga agcgaacgga
    1101  gcgcgagtgg cgcgcgagat tgtcgacgac cgacgcgcgt agcgcggagg
    1151  gaggagacca cttgctttga atttaatatg ctatctttat taaagagatc
    1201  caacaacatt ccagtctatg tctttttcat tgtcagtaat tgttaagcag
    1251  tccccgaact ttgacaggtc ttcttcagat ttcttcaaca ttatctgagc
    1301  gaactgcctg aatgttacat atatagttat ttctgcaata tatctcagtg
    1351  cctgtccagg tgaaggccaa gagaatgcca gcccgtagtg cactactctt
    1401  gtaccgtccc tctgggtccc tgctggcatt tgtaaaggta accagcctga
    1451  tctactgcta ttcagccaca ttgatgctga aaagttagag gctgtcatat
    1501  ctgatatggt tagttgaggc tttgggcgta ccagcctctt aaaccctctg
    1551  accaagttcc acgccctcgc cccgtcccag ttagccagtg ggtcttgcgt
    1601  ttgatcctgt aattttaaat ctttaactct ggcgtcctga actgggactg
    1651  tgtaccccca gccccttgca attgttgttg tgacaccttc aggcctcagc
    1701  accactttag ctaaatttat cctgtaatcc tcgaacggcc atgctgccgg
    1751  tattccctct tgacctataa agtcattcag gttgaaggtg aagaaatctg
    1801  aggcccactt ttgtgggtca gtcttggatg ggaagtccac tgtgaattgc
    1851  cgtctaaatc tgaacaagta tattctgttc agtctgtgcc ctcttcggtt
    1901  cctgggcaac ctgtgacgtc tgcgcactcg tctgtgaaac cgccttctcc
    1951  ggaaaccgcg agtaacacgc ctccttctgg ttgctgagcg acgtacctaa
    2001  aaatagaatg gccccacggc ccgccgtata gtatt