J. Jurka, Repbase Update, a database of eukaryotic repetitive elements, Cytogenetic and Genome Research, vol.110, issue.1-4, pp.462-467, 2005.
DOI : 10.1159/000084979

T. Flutre, Considering Transposable Element Diversification in De Novo Annotation Approaches, PLoS ONE, vol.6, issue.1, p.e16526, 2011.
DOI : 10.1371/journal.pone.0016526.s021
URL : https://hal.archives-ouvertes.fr/hal-00568705

G. Reinert, S. Schbath, and M. Waterman, Probabilistic and Statistical Properties of Finite Words in Finite Sequences, Applied Combinatorics on Words, 2005.

A. Lefebvre, T. Lecroq, and J. Alexandre, An improved algorithm for finding longest repeats with a modified factor oracle, Journal of Automata, Languages and Combinatorics, vol.8, pp.347-658, 2003.

A. Lefebvre, FORRepeats: detects repeats on entire chromosomes and between genomes, Bioinformatics, vol.19, issue.3, pp.319-326, 2003.
DOI : 10.1093/bioinformatics/btf843

D. Ussery, T. Wassenaar, and S. Borini, Word Frequencies and Repeats, in Computing for Comparative Microbial Genomics: Bioinformatics for Microbiologists, Computational Biology, vol.8, pp.111-150, 2009.

U. Manber and G. Myers, Suffix Arrays: A New Method for On-Line String Searches, Proceedings of the 1st ACM-SIAM Symposium on Discrete Algorithms, pp.319-327, 1990.
DOI : 10.1137/0222058

S. Puglisi, W. Smyth, and A. Turpin, A taxonomy of suffix array construction algorithms, ACM Computing Surveys, vol.39, issue.2, pp.1-31, 2007.
DOI : 10.1145/1242471.1242472

M. Abouelhoda, S. Kurtz, and E. Ohlebusch, Replacing suffix trees with enhanced suffix arrays, Journal of Discrete Algorithms, vol.2, issue.1, pp.53-86, 2004.
DOI : 10.1016/S1570-8667(03)00065-0

R. Pokrzywa and A. Polanski, BWtrs: A tool for searching for tandem repeats in DNA sequences based on the Burrows-Wheeler transform, Genomics, vol.96, issue.5, pp.316-321, 2010.
DOI : 10.1016/j.ygeno.2010.08.001

G. Nong, S. Zhang, and W. Chan, Linear Suffix Array Construction by Almost Pure Induced-Sorting, 2009 Data Compression Conference, pp.193-202, 2009.
DOI : 10.1109/DCC.2009.42

R. Homann, mkESA: enhanced suffix array construction tool, Bioinformatics, vol.25, issue.8, pp.1084-1085, 2009.
DOI : 10.1093/bioinformatics/btp112

T. Schnattinger, E. Ohlebusch, and S. Gog, Bidirectional Search in a String with Wavelet Trees, Proceedings of the 21st annual conference on Combinatorial pattern matching (CPM'10), pp.40-50, 2010.
DOI : 10.1007/978-3-642-13509-5_5

A. Price, N. Jones, and P. Pevzner, De novo identification of repeat families in large genomes, Proceedings of the 13th Annual International Conference on Intelligent Systems for Molecular Biology (ISMB-05), 2005.
DOI : 10.1093/bioinformatics/bti1018

R. Li, ReAS: Recovery of ancestral sequences for transposable elements from the unassembled reads of a whole genome shotgun, PLoS Computational Biology, vol.1, issue.4, 2005.

M. Li, PatternHunter II: Highly Sensitive and Fast Homology Search, Journal of Bioinformatics and Computational Biology, vol.02, issue.03, pp.417-439, 2004.
DOI : 10.1142/S0219720004000661

L. Noe and G. Kucherov, YASS: enhancing the sensitivity of DNA similarity search, Nucleic Acids Research, vol.33, issue.Web Server, pp.540-543, 2005.
DOI : 10.1093/nar/gki478
URL : https://hal.archives-ouvertes.fr/inria-00100004

M. Weber, Mammalian Small Nucleolar RNAs Are Mobile Genetic Elements, PLoS Genetics, vol.2, issue.12, p.205, 2006.
DOI : 10.1371/journal.pgen.0020205.st001
URL : https://hal.archives-ouvertes.fr/hal-00309036

D. Grzebelus, Diversity and structure of PIF/Harbinger-like elements in the genome of Medicago truncatula, BMC Genomics, vol.8, issue.1, p.409, 2007.
DOI : 10.1186/1471-2164-8-409

G. Kucherov, L. Noe, and M. Roytberg, A Unifying Framework for Seed Sensitivity and Its Application to Subset Seeds, Journal of Bioinformatics and Computational Biology, vol.04, issue.02, pp.553-569, 2006.
DOI : 10.1142/S0219720006001977
URL : https://hal.archives-ouvertes.fr/inria-00001164

M. Roytberg, On Subset Seeds for Protein Alignment, IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol.6, issue.3, pp.483-494, 2009.
DOI : 10.1109/TCBB.2009.4
URL : https://hal.archives-ouvertes.fr/inria-00354773

V. Nguyen and D. Lavenier, PLAST: parallel local alignment search tool for database comparison, BMC Bioinformatics, vol.10, issue.1, p.329, 2009.
DOI : 10.1186/1471-2105-10-329
URL : https://hal.archives-ouvertes.fr/inria-00425301

S. Kiełbasa, Adaptive seeds tame genomic sequence comparison, Genome Research, vol.21, issue.3, pp.487-493, 2011.
DOI : 10.1101/gr.113985.110

J. Hughes, Chimpanzee and human Y chromosomes are remarkably divergent in structure and gene content, Nature, vol.463, issue.7280, pp.536-539, 2010.
DOI : 10.1038/nature08700

J. Krumsiek, Gepard: a rapid and sensitive tool for creating dotplots on genome scale, Bioinformatics, vol.23, issue.8, pp.1026-1028, 2007.
DOI : 10.1093/bioinformatics/btm039

P. Durand, Browsing repeats in genomes: Pygram and an application to non-coding region analysis, BMC Bioinformatics, vol.7, issue.1, p.477, 2006.
DOI : 10.1186/1471-2105-7-477
URL : https://hal.archives-ouvertes.fr/hal-00129773

D. Sokol and F. Atagun, TRedD: A database for tandem repeats over the edit distance, Database, article ID baq003, 2010.
DOI : 10.1093/database/baq003

C. Rousseau, CRISPI: a CRISPR interactive database, Bioinformatics, vol.25, issue.24, pp.3317-3318, 2009.
DOI : 10.1093/bioinformatics/btp586
URL : https://hal.archives-ouvertes.fr/inria-00438512

M. Brudno, Multiple whole genome alignments and novel biomedical applications at the VISTA portal, Nucleic Acids Research, vol.35, issue.Web Server, pp.669-674, 2007.
DOI : 10.1093/nar/gkm279

D. Nix and M. Eisen, GATA: a graphic alignment tool for comparative sequence analysis, BMC Bioinformatics, vol.6, issue.1, p.9, 2005.
DOI : 10.1186/1471-2105-6-9

M. Krzywinski, Circos: An information aesthetic for comparative genomics, Genome Research, vol.19, issue.9, pp.1639-1645, 2009.
DOI : 10.1101/gr.092759.109

N. Darzentas, Circoletto: visualizing sequence similarity with Circos, Bioinformatics, vol.26, issue.20, pp.2620-2621, 2010.
DOI : 10.1093/bioinformatics/btq484

S. Tempel, Domain organization within repeated DNA sequences: application to the study of a family of transposable elements, Bioinformatics, vol.22, issue.16, pp.1948-1954, 2006.
DOI : 10.1093/bioinformatics/btl337
URL : https://hal.archives-ouvertes.fr/hal-00090517

S. Tempel, ModuleOrganizer: detecting modules in families of transposable elements, BMC Bioinformatics, vol.11, issue.1, p.474, 2010.
DOI : 10.1186/1471-2105-11-474
URL : https://hal.archives-ouvertes.fr/inria-00536742

C. Feschotte, Exploring Repetitive DNA Landscapes Using REPCLASS, a Tool That Automates the Classification of Transposable Elements in Eukaryotic Genomes, Genome Biology and Evolution, vol.1, issue.0, pp.205-220, 2009.
DOI : 10.1093/gbe/evp023

J. Estill and J. Bennetzen, The DAWGPAWS pipeline for the annotation of genes and transposable elements in plant genomes, Plant Methods, vol.5, issue.1, p.8, 2009.
DOI : 10.1186/1746-4811-5-8

Y. Han and S. Wessler, MITE-Hunter: a program for discovering miniature inverted-repeat transposable elements from genomic sequences, Nucleic Acids Research, vol.38, issue.22, p.199, 2010.
DOI : 10.1093/nar/gkq862

S. Kurtz, The Vmatch large scale sequence analysis software: A Manual, unpublished report, with two other manuals, "Chaining pairwise matches using the program chain2dim. Manual" and "Clustering Matches using the program matchcluster", 2011.

M. Morgante, Structured Motifs Search, Journal of Computational Biology, vol.12, issue.8, pp.1065-1082, 2005.
DOI : 10.1089/cmb.2005.12.1065

Y. Zhang and M. Zaki, SMOTIF: efficient structured pattern and profile motif search, Algorithms for Molecular Biology, vol.1, p.22, 2006.

D. Ellinghaus, S. Kurtz, and U. Willhoeft, LTRharvest, an efficient and flexible software for de novo detection of LTR retrotransposons, BMC Bioinformatics, vol.9, issue.1, p.18, 2008.
DOI : 10.1186/1471-2105-9-18

D. Searls, String variable grammar: A logic grammar formalism for the biological language of DNA, The Journal of Logic Programming, vol.24, issue.1-2, pp.73-102, 1993.
DOI : 10.1016/0743-1066(95)00034-H

D. Searls, The language of genes, Nature, vol.420, issue.6912, pp.211-217, 2002.
DOI : 10.1038/29667

J. Nicolas, Suffix-tree analyser (STAN): looking for nucleotidic and peptidic patterns in chromosomes, Bioinformatics, vol.21, issue.24, pp.4408-4410, 2005.
DOI : 10.1093/bioinformatics/bti710

C. Belleannée and J. Nicolas, Logol: Modelling evolving sequence families through a dedicated constrained string language, Inria Research Report RR-6350, p.19, 2007.