C. A. Suttle, Marine viruses - major players in the global ecosystem, Nature Reviews Microbiology, vol.5, issue.10, pp.801-812, 2007.
DOI : 10.1038/nrmicro1750

R. Danovaro et al., Marine viruses and global climate change, FEMS Microbiology Reviews, vol.35, issue.6, pp.993-1034, 2011.

S. F. Altschul, W. Gish, W. Miller, E. W. Myers, and D. J. Lipman, Basic local alignment search tool, Journal of Molecular Biology, vol.215, issue.3, pp.403-410, 1990.
DOI : 10.1016/S0022-2836(05)80360-2

S. Crotty, C. E. Cameron, and R. Andino, RNA virus error catastrophe: Direct molecular test by using ribavirin, Proceedings of the National Academy of Sciences, vol.98, issue.12, pp.6895-6900, 2001.
DOI : 10.1073/pnas.111085598

B. L. Hurwitz and M. B. Sullivan, The Pacific Ocean Virome (POV): A Marine Viral Metagenomic Dataset and Associated Protein Clusters for Quantitative Viral Ecology, PLoS ONE, vol.8, issue.2, p.e57355, 2013.
DOI : 10.1371/journal.pone.0057355

S. Cheng and C. L. Brooks III, Viral Capsid Proteins Are Segregated in Structural Fold Space, PLoS Computational Biology, vol.9, issue.2, p.e1002905, 2013.
DOI : 10.1371/journal.pcbi.1002905

J. Černý, B. Černá Bolfíková, J. J. Valdés, L. Grubhoffer, and D. Růžek, Evolution of Tertiary Structure of Viral RNA Dependent Polymerases, PLoS ONE, vol.9, issue.5, p.e96070, 2014.
DOI : 10.1371/journal.pone.0096070

P. Ahlquist, RNA-Dependent RNA Polymerases, Viruses, and RNA Silencing, Science, vol.296, issue.5571, pp.1270-1273, 2002.
DOI : 10.1126/science.1069132

S. R. Eddy, Profile hidden Markov models, Bioinformatics, vol.14, issue.9, pp.755-763, 1998.
DOI : 10.1093/bioinformatics/14.9.755

G. Kerbellec, Apprentissage d'automates modélisant des familles de séquences protéiques, Ph.D. thesis, Université Rennes 1, 2008.

C. Galiez and F. Coste, Structural conservation for remote homologues: better and further in contact fragments, 3DSIG: Structural Bioinformatics and Computational Biophysics, 2015.

M. Carpentier, S. Brouillet, and J. Pothier, YAKUSA: A fast structural database scanning method, Proteins: Structure, Function, and Bioinformatics, vol.61, issue.1, pp.137-151, 2005.
DOI : 10.1002/prot.20517

R. C. Edgar, MUSCLE: multiple sequence alignment with high accuracy and high throughput, Nucleic Acids Research, vol.32, issue.5, pp.1792-1797, 2004.
DOI : 10.1093/nar/gkh340

W. Li, L. Jaroszewski, and A. Godzik, Clustering of highly homologous sequences to reduce the size of large protein databases, Bioinformatics, vol.17, issue.3, pp.282-283, 2001.
DOI : 10.1093/bioinformatics/17.3.282

O. P. Sharma, A. Jadhav, A. Hussain, and M. S. Kumar, VPDB: Viral Protein Structural Database, Bioinformation, vol.6, issue.8, p.324, 2011.
DOI : 10.6026/97320630006324

R. C. Edgar, Search and clustering orders of magnitude faster than BLAST, Bioinformatics, vol.26, issue.19, pp.2460-2461, 2010.
DOI : 10.1093/bioinformatics/btq461

J. Davis and M. Goadrich, The relationship between Precision-Recall and ROC curves, Proceedings of the 23rd International Conference on Machine Learning, ICML '06, pp.233-240, 2006.
DOI : 10.1145/1143844.1143874

Y. Maida and K. Masutomi, RNA-dependent RNA polymerases in RNA silencing, Biological Chemistry, vol.392, issue.4, pp.299-304, 2011.
DOI : 10.1515/bc.2011.035

C. Galiez and F. Coste, Amplitude spectrum distance: measuring the global shape divergence of protein fragments, BMC Bioinformatics, vol.16, 2015.

URL : https://hal.archives-ouvertes.fr/hal-01214482