S. F. Altschul, W. Gish, W. Miller, E. W. Myers, and D. J. Lipman, Basic Local Alignment Search Tool, Journal of Molecular Biology, vol.215, pp.403-410, 1990.

V. Batagelj and M. Bren, Comparing resemblance measures, Journal of Classification, vol.12, issue.1, pp.73-90, 1995.
DOI : 10.1007/BF01202268

J. Bourdon and B. Vallée, Generalized Pattern Matching Statistics, Mathematics and Computer Science II, pp.1-16, 2002.
DOI : 10.1007/978-3-0348-8211-8_15

A. Denise, M. Régnier, and M. Vandenbogaert, Assessing the Statistical Significance of Overrepresented Oligonucleotides, Algorithms in Bioinformatics. Proceedings of the 1st International Workshop on Algorithms in BioInformatics (WABI), pp.85-97, 2001.
DOI : 10.1007/3-540-44696-6_7

I. Eidhammer, I. Jonassen, and W. R. Taylor, Structure Comparison and Structure Patterns, Journal of Computational Biology, vol.7, issue.5, 1999.
DOI : 10.1089/106652701446152

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.34.5869

P. Flajolet and A. Odlyzko, Singularity Analysis of Generating Functions, SIAM Journal on Discrete Mathematics, vol.3, issue.2, pp.216-240, 1990.
DOI : 10.1137/0403019

URL : https://hal.archives-ouvertes.fr/inria-00075725

P. Flajolet and R. Sedgewick, Analytic Combinatorics: Symbolic Combinatorics, Research Report of the INRIA, to appear.

G. Z. Hertz and G. D. Stormo, Identifying DNA and Protein Patterns with Statistically Significant Alignments of Multiple Sequences, Bioinformatics, vol.15, issue.7-8, pp.563-577, 1999.

I. Jonassen, Efficient discovery of conserved patterns using a pattern graph, Bioinformatics, vol.13, issue.5, pp.509-522, 1997.
DOI : 10.1093/bioinformatics/13.5.509

S. Karlin and S. F. Altschul, Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes, Proceedings of the National Academy of Sciences (PNAS), vol.87, issue.6, pp.2264-2268, 1990.
DOI : 10.1073/pnas.87.6.2264

D. J. Lipman and W. R. Pearson, Rapid and Sensitive Protein Similarity Searches, Science, vol.227, issue.4693, pp.1435-1441, 1985.

A. Mancheron and I. Rusu, Pattern discovery allowing gaps, substitution matrices and multiple score functions, Algorithms in Bioinformatics. Proceedings of the 3rd International Workshop on Algorithms in BioInformatics (WABI), volume 2812 of Lecture Notes in Bioinformatics (LNBI), pp.129-145, 2003.
DOI : 10.1007/978-3-540-39763-2_10

URL : https://hal.archives-ouvertes.fr/hal-00487245

S. B. Needleman and C. D. Wunsch, A General Method Applicable to the Search for Similarities in the Amino Acid Sequence of Two Proteins, Journal of Molecular Biology, vol.48, pp.443-453, 1970.

W. R. Pearson and D. J. Lipman, Improved tools for biological sequence comparison, Proceedings of the National Academy of Sciences (PNAS), vol.85, issue.8, pp.2444-2448, 1988.

I. Rigoutsos and A. Floratos, Combinatorial pattern discovery in biological sequences: The TEIRESIAS algorithm [published erratum appears in Bioinformatics 1998;14(2):229], Bioinformatics, vol.14, issue.1, pp.55-67, 1998.
DOI : 10.1093/bioinformatics/14.1.55

C. E. Shannon, A Mathematical Theory of Communication, The Bell System Technical Journal, vol.27, issue.3, pp.379-423, 1948.

T. D. Schneider, G. D. Stormo, L. Gold, and A. Ehrenfeucht, Information content of binding sites on nucleotide sequences, Journal of Molecular Biology, vol.188, issue.3, pp.415-431, 1986.
DOI : 10.1016/0022-2836(86)90165-8

S. Vinga and J. S. Almeida, Alignment-free sequence comparison--a review, Bioinformatics, vol.19, issue.4, pp.513-523, 2003.
DOI : 10.1093/bioinformatics/btg005

B. Vallée, Dynamical sources in information theory: Fundamental intervals and word prefixes, Algorithmica, vol.29, issue.1-2, pp.262-306, 2001.
DOI : 10.1007/BF02679622

W. J. Wilbur and D. J. Lipman, Rapid similarity searches of nucleic acid and protein data banks, Proceedings of the National Academy of Sciences (PNAS), vol.80, issue.3, pp.726-730, 1983.