M. Johnson, Why doesn't em find good hmm postaggers?, Proceedings of EMNLP-CoNLL-07, 2007.

R. Lawrence and . Rabiner, A tutorial on hidden markov models and selected applications in speech recognition, Proceedings of the IEEE, pp.257-286, 1989.

M. Mohri, Finite-state transducers in language and speech processing, Computational linguistics, vol.23, issue.2, pp.269-311, 1997.

B. Merialdo, Tagging english text with a probabilistic model, Computational linguistics, vol.20, issue.2, pp.155-171, 1994.

A. Anandkumar, R. Ge, D. Hsu, M. Sham, M. Kakade et al., Tensor decompositions for learning latent variable models, 2012.

R. Bailly, A. Habrard, and F. Denis, A Spectral Approach for Probabilistic Grammatical Inference on Trees, Proc of ALT-10, 2010.
DOI : 10.1007/978-3-642-16108-7_10

URL : https://hal.archives-ouvertes.fr/hal-00607096

M. Thon and H. Jaeger, Links between multiplicity automata, observable operator models and predictive state representations?a unified learning framework, Journal of Machine Learning Research, vol.16, pp.103-147, 2015.

H. Glaude, O. Pietquin, and C. Enderli, Subspace identification for predictive state representation by nuclear norm minimization, 2014 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL), 2014.
DOI : 10.1109/ADPRL.2014.7010609

URL : https://hal.archives-ouvertes.fr/hal-01104423

B. Balle, W. Hamilton, and J. Pineau, Methods of moments for learning stochastic languages: Unified presentation and empirical comparison, Proceedings of ICML-14, 2014.

M. Gybels, F. Denis, and A. Habrard, Some improvements of the spectral learning approach for probabilistic grammatical inference, Proceedings of ICGI-12, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01075979

W. Jack, A. Carlyle, and . Paz, Realizations by stochastic finite automata, Journal of Computer and System Sciences, vol.5, issue.1, pp.26-40, 1971.

B. Balle, Learning finite-state machines: algorithmic and statistical aspects, 2013.

B. Shay, K. Cohen, M. Stratos, . Collins, P. Dean et al., Experiments with spectral learning of latent-variable pcfgs, Proceedings of HLT-NAACL-13, 2013.

F. Denis and Y. Esposito, On rational stochastic languages, Fundamenta Informaticae, vol.86, issue.1, pp.41-77, 2008.

N. Gillis, The Why and How of Nonnegative Matrix Factorization ArXiv e-prints, 2014.

N. Gillis, A. Stephen, and . Vavasis, Semidefinite programming based preconditioning for more robust nearseparable nonnegative matrix factorization, 2013.

Y. Esposito, A. Lemay, F. Denis, and P. Dupont, Learning Probabilistic Residual Finite State Automata, Grammatical Inference: Algorithms and Applications, pp.77-91, 2002.
DOI : 10.1007/3-540-45790-9_7

D. Pfau, N. Bartlett, and F. Wood, Probabilistic deterministic infinite automata, Proceedings of NIPS-10, 2010.

P. Dupont, F. Denis, and Y. Esposito, Links between probabilistic automata and hidden Markov models: probability distributions, learning models and induction algorithms, Pattern Recognition, vol.38, issue.9, pp.1349-1371, 2005.
DOI : 10.1016/j.patcog.2004.03.020

S. Verwer, R. Eyraud, C. De, and L. Higuera, Results of the pautomac probabilistic automaton learning competition, Journal of Machine Learning Research -Proceedings Track, vol.21, pp.243-248, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00833419

B. Balle, A. Quattoni, and X. Carreras, Local loss optimization in operator models: A new insight into spectral learning, Proceedings of ICML-12, 2012.

I. Sutskever, J. Martens, and G. E. Hinton, Generating text with recurrent neural networks, Proceedings of ICML-11, 2011.

O. Lemon and O. Pietquin, Data-Driven Methods for Adaptive Spoken Dialogue Systems: Computational Learning for Conversational Interfaces, 2012.
DOI : 10.1007/978-1-4614-4803-7

URL : https://hal.archives-ouvertes.fr/hal-00756740