M. Deviren and K. Daoudi, Structural learning of dynamic Bayesian networks in speech recognition, Eurospeech, 2001.
URL : https://hal.archives-ouvertes.fr/inria-00100526

N. Friedman, K. Murphy, and S. Russell, Learning the structure of dynamic probabilistic networks, UAI'98, 1998.

D. Heckerman, A tutorial on learning with Bayesian networks, 1995.

F. Jelinek and R. L. Mercer, Interpolated estimation of Markov source parameters from sparse data, Pattern Recognition in Practice, 1980.

H. Ney, U. Essen, and R. Kneser, On structuring probabilistic dependences in stochastic language modelling, Computer Speech & Language, vol.8, issue.1, pp.1-38, 1994.
DOI : 10.1006/csla.1994.1001

R. Rosenfeld, Adaptive Statistical Language Modeling: A Maximum Entropy Approach, 1994.

K. Smaïli, A. Brun, I. Zitouni, and J. P. Haton, Automatic and manual clustering for large vocabulary speech recognition: A comparative study, European Conference on Speech Communication and Technology, 1999.