Sequence of DPV Torque teno midi virus 4
Small anellovirus isolate 6PoSMA ORF2, ORF1, and ORF3 genes, complete cds.
ACC No: EF538876
Dated: 2007-09-23 | Length: 2454 | CRC: 1136543881
ID EF538876; SV 1; linear; genomic DNA; STD; VRL; 2454 BP.
XX
AC EF538876;
XX
DT 23-SEP-2007 (Rel. 93, Created)
DT 23-SEP-2007 (Rel. 93, Last updated, Version 1)
XX
DE Small anellovirus isolate 6PoSMA ORF2, ORF1, and ORF3 genes, complete cds.
XX
KW .
XX
OS Small anellovirus
OC Viruses; ssDNA viruses; Anelloviridae; unclassified Anelloviridae.
XX
RN [1]
RP 1-2454
RX DOI; 10.1099/vir.0.83071-0.
RX PUBMED; 17872521.
RA Biagini P., Uch R., Belhouchet M., Attoui H., Cantaloube J.F.,
RA Brisbarre N., de Micco P.;
RT "Circular genomes related to anelloviruses identified in human and animal
RT samples by using a combined rolling-circle
RT amplification/sequence-independent single primer amplification approach";
RL J. Gen. Virol. 88(Pt 10):2696-2701(2007).
XX
RN [2]
RP 1-2454
RA Biagini P., Belhouchet M., de Micco P.;
RT ;
RL Submitted (05-APR-2007) to the EMBL/GenBank/DDBJ databases.
RL Service de Virologie Moleculaire, EFS Alpes-Mediterranee, 149, Bd. Baille,
RL Marseille 13005, France
XX
FH Key Location/Qualifiers
FH
FT source 1. .2454
FT /organism="Small anellovirus"
FT /host="Homo sapiens"
FT /isolate="6PoSMA"
FT /mol_type="genomic DNA"
FT /country="France"
FT /isolation_source="plasma"
FT /db_xref="taxon:393049"
FT CDS 200. .550
FT /codon_start=1
FT /product="ORF2"
FT /db_xref="InterPro:IPR004118"
FT /db_xref="UniProtKB/TrEMBL:A8DMP3"
FT /protein_id="ABU55877.1"
FT /translation="MSTFYKPSVYNGPTKEQMWMSVVTDFHDSFCRCFHSFAHILDLIF
FT PDGHRDRDKTIREIIERDLKCHSGGDEEERCGGVLAAAIENTENINQEESQDIADADIR
FT ELLAAAESAERR"
FT CDS 394. .2370
FT /codon_start=1
FT /product="ORF1"
FT /db_xref="InterPro:IPR004219"
FT /db_xref="InterPro:IPR008474"
FT /db_xref="UniProtKB/TrEMBL:A8DMP4"
FT /protein_id="ABU55878.1"
FT /translation="MPFWWRRRRKVWWGAGRRYRKYRKYKPRRKPRYRRRRYKRAPRRR
FT RKRRKKVRRKRKAIPIIQWQPDSIKNCKIKGYNALLLGAEGTQYLCYTNERFTFTPPQY
FT AGGGGFAVQTFSLQYLYEEHKFKNNIWTASNIYSDLCRYLRVKMYFYRHPKTDFILNYA
FT RQPPFELNKYTYTLAHPYMLLQSKHKKIIPSKLTKPNGKLWKKIIVKPPKQMLSKWFFQ
FT KQFAPQSLLQLQAAAASFTYPRLGCCNENRIITVYYLDPQFIQHSTWARTIQSPYKPYD
FT SISSQVTFWYPSKGKTEKYTPNYLSETGTEAYYKSINYDTGFFSSKVLTATKVTQNPTS
FT ETGYALPPINAARYNPEEDNGEGNSVYLVSVVNGFYDRPTEDNLIYKGAPLWLIFHGFY
FT SFISKIKSPSFMSLHMFVIRSPFLKPRPSAITKDFFPIIDLNFTIGNNPYKAYITANQK
FT KLWYPTCEHQIETINNFVECGPYIPKFGNDRDSTWELPYHYIFYFKWGGPTTPQQEISD
FT PNSKNIYTVPDTLQGTVQVSNPLKQSTESLLHNWDLRRGLITETAFKRMCENIETDTDF
FT LPDSQETPAKRPRLAGELLHPRRKKTKKSRHVSRISSKKIPSKKHHRHKKESSTSSSSS
FT ESSTPLTSQATPPPPSWVRQGGG"
FT CDS 1853. .2389
FT /codon_start=1
FT /product="ORF3"
FT /db_xref="InterPro:IPR008474"
FT /db_xref="UniProtKB/TrEMBL:A8DMP5"
FT /protein_id="ABU55879.1"
FT /translation="MTEIVLGNYLITIFSILSGEDQQHHNKKLATPTPKTSTLFPILSK
FT EQYKYPTHSNKALKAYSTTGILEEGSLQKQLLKECAKTSKLIQISSQIRKRHQQKDQDW
FT QENSYTPEEKKQRNQDMSPGSLQRRYLPRNTTDTRRNHPPHPAAARAAPPLRHRPRPRR
FT HLGCGRAGAKMAGPN"
XX
SQ Sequence 2454 BP; 889 A; 542 C; 446 G; 577 T; 0 other;
ef538876 Length: 2454 23-SEP-2007 Type: N Check: 9667 ..
1 atataagtaa gtggggtggc gaatggctga gtttaccccg ctagacggtg
51 cagggaccgg atcgagcgca gcgaggaggt ccccggctgc ccatgggcgg
101 gagcccgagg tgagtgaaac caccgaggtc taggggcaat tcgggctagg
151 gcagtctagc ggaacgggca agaaacttaa aaatgctttt tgtttttaga
201 tgtcaacctt ctacaaacca agtgtttaca atggacccac aaaagaacaa
251 atgtggatgt ctgtagtaac tgattttcat gacagtttct gcagatgctt
301 tcacagtttt gctcatatcc ttgaccttat cttcccagac ggtcacagag
351 atagagacaa aactataaga gaaattatag agagagattt aaaatgccat
401 tctggtggag acgaagaaga aaggtgtggt ggggtgctgg ccgccgctat
451 agaaaataca gaaaatataa accaagaaga aagccaagat atcgccgacg
501 cagatataag agagctcctc gccgccgcag aaagcgcaga aagaaggtaa
551 gaagaaaaag aaaagccata cctataattc agtggcagcc agacagcatt
601 aaaaactgta aaattaaagg atacaatgcc ttactactag gagctgaagg
651 gactcaatac ttatgctata ccaatgagag atttacattc acacctccac
701 aatatgctgg aggtggagga tttgcagttc aaaccttctc tcttcaatac
751 ttatatgaag aacataaatt taaaaataat atttggactg cctcaaacat
801 atactcagac ttatgcagat atttaagagt taaaatgtat ttttataggc
851 atcctaaaac agacttcata cttaactatg caagacagcc accttttgaa
901 ctaaataaat acacatacac actagctcat ccatacatgc tactacaaag
951 caagcacaaa aaaataatac ccagcaaatt aactaaacct aatggaaagc
1001 tttggaaaaa aattatagtt aaaccaccta aacaaatgct cagtaaatgg
1051 ttttttcaaa aacaatttgc acctcaaagt ttactgcaac tacaagcagc
1101 agcagctagc tttacttatc ccagattagg ctgctgtaat gaaaacagaa
1151 taattacagt ctattattta gacccacagt tcatccagca ttccacctgg
1201 gctagaacta tacaatctcc atacaaacca tatgacagta tatcaagtca
1251 agtaactttt tggtacccca gtaaaggcaa aacagaaaaa tatacaccca
1301 actacctatc agaaacaggc actgaagcct actacaaaag tataaactat
1351 gatacaggtt ttttttcctc aaaagtacta acagctacaa aggtaacaca
1401 aaaccctact tcagaaacag ggtatgcact tccaccaata aatgcagcca
1451 gatacaatcc tgaagaggac aatggtgaag gcaactctgt atacttagta
1501 tcagtagtga atggtttcta tgacaggccc acagaagaca atttaatata
1551 caaaggagct cctctttggt taatttttca tggcttttat agttttattt
1601 caaaaattaa gagtccttca ttcatgtcac ttcacatgtt tgtaataaga
1651 agcccatttt taaagcctag accatcagca ataactaaag actttttccc
1701 aattatagac ctaaacttta ccataggtaa taatccatat aaggcttata
1751 ttactgccaa tcaaaagaaa ctgtggtacc ctacctgtga acaccaaata
1801 gaaacaatta ataactttgt tgaatgtggc ccatacatac ctaaatttgg
1851 aaatgacaga gatagtactt gggaactacc ttatcactat attttctatt
1901 ttaagtgggg aggaccaaca acaccacaac aagaaattag cgaccccaac
1951 tccaaaaaca tctacactgt tcccgatact ctccaaggaa cagtacaagt
2001 atccaaccca ctcaaacaaa gcactgaaag cttactccac aactgggatc
2051 ttagaagagg gctcattaca gaaacagctt ttaaaagaat gtgcgaaaac
2101 atcgaaactg atacagattt cctcccagat tcgcaagaga caccagcaaa
2151 aagaccaaga ctggcaggag aactcctaca cccccgaaga aaaaaaacaa
2201 agaaatcaag acatgtctcc aggatctctt caaagaagat accttccaag
2251 aaacaccaca gacacaagaa ggaatcatcc acctcatcca gcagcagcga
2301 gagcagcacc ccccttacgt cacaggccac gcccccgccg ccatcttggg
2351 tgcggcaggg cgggggctaa aatggcggga cccaattaaa ttgtactttc
2401 actttccaat taaaaactgc cacgtcacac taaaggggtg gagactttaa
2451 aact