Sequence of DPV Gull circovirus
Gull circovirus, complete genome.
ACC No: DQ845074
Dated: 2006-10-02 | Length: 2035 | CRC: -799810247
ID DQ845074; SV 1; circular; genomic DNA; STD; VRL; 2035 BP.
XX
AC DQ845074;
XX
DT 02-OCT-2006 (Rel. 89, Created)
DT 02-OCT-2006 (Rel. 89, Last updated, Version 1)
XX
DE Gull circovirus, complete genome.
XX
KW .
XX
OS Gull circovirus
OC Viruses; ssDNA viruses; Circoviridae; Circovirus; unclassified Circovirus.
XX
RN [1]
RP 1-2035
RA Todd D., Scott A.N.J., Fringuelli E., Shivraprasad H.L., Gavier-Widen D.,
RA Smyth J.A.;
RT "Molecular characterization of novel circoviruses from finch and gull";
RL Unpublished.
XX
RN [2]
RP 1-2035
RA Todd D., Scott A.N.J., Fringuelli E.;
RT ;
RL Submitted (11-JUL-2006) to the EMBL/GenBank/DDBJ databases.
RL Department of Agriculture and Rural Development for Northern Ireland,
RL Veterinary Sciences Division, Stoney Road, Stormont, Belfast BT4 3SD, UK
XX
FH Key Location/Qualifiers
FH
FT source 1. .2035
FT /organism="Gull circovirus"
FT /specific_host="gull"
FT /mol_type="genomic DNA"
FT /db_xref="taxon:400121"
FT gene 100. .1017
FT /gene="rep"
FT CDS 100. .1017
FT /codon_start=1
FT /gene="rep"
FT /product="replication-associated protein"
FT /protein_id="ABI54254.1"
FT /translation="MRRRPRSGRYLRSIMAARRDSGARRWCFTLNNYTPEEEETARNLI
FT HDADKYAFAIIGKEVGESGTPHLQGFMHFKQKQRLTALKKLFPRAHFEKARGSDQQNAD
FT YCGKDGEILTMIGTPSDNNPSDLAGAVAAVKRGSQMSEIAREFSEVYVKYGRGLRDLRL
FT LIGCPPRDFKTEVIVLIGPPGCGKSKLANEMEGSKFYKMKGDWWDGYDNQDIVIIDDFY
FT GWLPYCECLRLCDRYPHRVPVKGAYVEFTSKKIVFTSNRHVDGWWKGEIEKSAFYRRIN
FT VYKFYETGEFKDMPGHMLPHPINY"
FT CDS complement(790. .1209)
FT /codon_start=1
FT /product="hypothetical protein"
FT /note="ORF C2"
FT /protein_id="ABI54256.1"
FT /translation="MLLDLFNKDSILNSKQVVSSLRATRVGRRQSRAPLALRSLRELRD
FT SGARDENDCHPEGGVFISRLIIDWVREHVSGHVLEFTSLIKLVHIDPSVEGTLLNFTFP
FT PPIHMAIAGKHNLLAGEFHVGTFHRHTVRVPVTQT"
FT gene complement(1190. .1927)
FT /gene="cap"
FT CDS complement(1190. .1927)
FT /codon_start=1
FT /transl_except=(pos:1925. .1927,aa:Met)
FT /gene="cap"
FT /product="capsid protein"
FT /protein_id="ABI54255.1"
FT /translation="MRRRHRLPRNRRGHRLNRIYLFRFRRQFTVDFPSKTDPQKWASDF
FT FTFNLNDFIGQEGIPAAWPFEDYRINLAKVVLRPEGVTTTIARGWGYTVPVQDARVKDL
FT KLQDQTQDPLANWDGARAWNLVRGFKRLVRPKPQLTISDMTASNFSASMWLNSSRSGWL
FT PLQMPAGTQRDGTRVVHYGLAFSWPSPGQALRYIAEITIYVTFRQFAQIMLKKSEEDLS
FT KFGDCLTITDNEKDIDWNVVGSL"
XX
SQ Sequence 2035 BP; 478 A; 506 C; 532 G; 519 T; 0 other;
dq845074 Length: 2035 02-OCT-2006 Type: N Check: 9408 ..
1 accggcgggc cggggccatg gggccatctg gcccctggcc aagcatgcgc
51 agtagtggcg catgcgcaat ggccgcgcag cggcctttat ttgcggagca
101 tgcgcagaag gccgcgcagc ggccgctatt tgcgaagcat catggcagca
151 cggcgagaca gcggagcccg tcgctggtgc tttactctga ataactacac
201 tcccgaggaa gaggagaccg ctcgcaatct gattcatgat gcagataagt
251 acgcgtttgc gataatcggc aaagaagttg gcgaaagcgg tactcctcat
301 ctgcaaggct tcatgcattt taagcaaaag cagcggctta ccgctctaaa
351 gaaacttttt cctcgcgcgc attttgagaa agctcgcggc agtgatcagc
401 agaatgctga ttattgtggt aaagacggcg aaatacttac catgatcggt
451 actccgagtg ataataatcc gagtgatctt gcgggagctg ttgccgctgt
501 gaaacgcgga agtcaaatga gtgaaatcgc gcgagagttc agtgaagtct
551 acgtcaagta tgggcggggc ctccgtgatc tccggttgct gattggttgc
601 ccgccccgcg atttcaaaac agaagtcatc gttctgattg gcccacctgg
651 ctgtggcaag tcaaaattgg ccaatgagat ggaagggtct aagttctaca
701 agatgaaagg tgattggtgg gatggttatg acaatcaaga tattgtcata
751 atcgatgact tctacggttg gctgccgtac tgtgagtgcc tacgtctgtg
801 tgaccggtac cctcaccgtg tgcctgtgaa aggtgcctac gtggaattca
851 ccagcaagaa gattgtgttt accagcaatc gccatgtgga tgggtggtgg
901 aaaggtgaaa ttgagaagag tgccttctac cgacggatca atgtgtacaa
951 gttttatgag actggtgaat tcaaggacat gcccggacac atgctcccgc
1001 acccaatcaa ttattaggcg cgagatgaaa acacccccct cgggatgaca
1051 gtcgttttca tctcgcgccc ccgagtcgcg gagttcgcga agcgaacgga
1101 gcgcgagtgg cgcgcgagat tgtcgacgac cgacgcgcgt agcgcggagg
1151 gaggagacca cttgctttga atttaatatg ctatctttat taaagagatc
1201 caacaacatt ccagtctatg tctttttcat tgtcagtaat tgttaagcag
1251 tccccgaact ttgacaggtc ttcttcagat ttcttcaaca ttatctgagc
1301 gaactgcctg aatgttacat atatagttat ttctgcaata tatctcagtg
1351 cctgtccagg tgaaggccaa gagaatgcca gcccgtagtg cactactctt
1401 gtaccgtccc tctgggtccc tgctggcatt tgtaaaggta accagcctga
1451 tctactgcta ttcagccaca ttgatgctga aaagttagag gctgtcatat
1501 ctgatatggt tagttgaggc tttgggcgta ccagcctctt aaaccctctg
1551 accaagttcc acgccctcgc cccgtcccag ttagccagtg ggtcttgcgt
1601 ttgatcctgt aattttaaat ctttaactct ggcgtcctga actgggactg
1651 tgtaccccca gccccttgca attgttgttg tgacaccttc aggcctcagc
1701 accactttag ctaaatttat cctgtaatcc tcgaacggcc atgctgccgg
1751 tattccctct tgacctataa agtcattcag gttgaaggtg aagaaatctg
1801 aggcccactt ttgtgggtca gtcttggatg ggaagtccac tgtgaattgc
1851 cgtctaaatc tgaacaagta tattctgttc agtctgtgcc ctcttcggtt
1901 cctgggcaac ctgtgacgtc tgcgcactcg tctgtgaaac cgccttctcc
1951 ggaaaccgcg agtaacacgc ctccttctgg ttgctgagcg acgtacctaa
2001 aaatagaatg gccccacggc ccgccgtata gtatt