K. Sma¨?lisma¨?li, A. Brun, I. Zitouni, and J. P. Haton, Automatic and manual clustering for large vocabulary speech re cognition : A comparative study, European Conference on Speech Communication and Technology, pp.1795-1798, 1999.

M. Deviren and K. Daoudi, Structural learning of dynamic Bayesian networks in speech recognition, Eurospeech, pp.1669-1673, 2001.
URL : https://hal.archives-ouvertes.fr/inria-00100526

N. Friedman, K. Murphy, and S. Russell, Learning the structure of dynamic probabilistic networks, UAI'98, pp.139-147, 1998.

F. Jelinek and R. L. Mercer, Interpolated estimation of markov source parameters from sparse data, In Pattern Recognition in Practice, pp.381-397, 1980.

H. Ney, U. Essen, and R. Kneser, On structuring probabilistic dependences in stochastic language modelling, Computer Speech & Language, vol.8, issue.1, pp.1-38, 1994.
DOI : 10.1006/csla.1994.1001

D. Heckerman, A tutorial on learning with bayesian networks, 1995.

R. Rosenfeld, Adaptive Statistical Language Modeling : A Maximum Entropy Approach, p.15213, 1994.
DOI : 10.1006/csla.1996.0011