Spectral learning with proper probabilities for finite state automation

Hadrien Glaude; Cyrille Enderli; Olivier Pietquin

Conference paper, Year: 2015

Hadrien Glaude

Role: Author
PersonId: 9894
IdHAL: hadrien-glaude
IdRef: 197825966

Thales Airborne Systems

Sequential Learning

Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189

Cyrille Enderli

Role: Author

Thales Airborne Systems

Olivier Pietquin

Role: Author
PersonId: 4024
IdHAL: olivier-pietquin
ORCID: 0000-0002-5386-465X
IdRef: 142821861

Sequential Learning

Institut universitaire de France

Université de Lille, Sciences et Technologies

Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189

Abstract

Probabilistic Finite Automata (PFA), Probabilistic Finite State Transducers (PFST) and Hidden Markov Models (HMM) are widely used for language modeling in Automatic Speech Recognition (ASR), Text-to-Speech (TTS) systems and Part-Of-Speech (POS) tagging. Traditionally, unsupervised learning of these latent variable models is done by Expectation-Maximization (EM)-like algorithms, such as the Baum-Welch algorithm. In a recent alternative line of work, learning algorithms based on spectral properties of low-order moment matrices or tensors were proposed. In comparison to EM, they are orders of magnitude faster and come with theoretical convergence guarantees. However, the returned models are not guaranteed to compute proper distributions. They often return negative values that do not sum to one, which limits their applicability and prevents them from serving as an initialization for EM-like algorithms. In this paper, we propose a new spectral algorithm able to learn a large range of models constrained to return proper distributions. We assess its performance on synthetic problems from the PAutomaC challenge and on real datasets extracted from Wikipedia. Experiments show that it outperforms previous spectral approaches as well as the Baum-Welch algorithm with random restarts, in addition to serving as an efficient initialization step for EM-like algorithms.
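To make the notion of a "proper distribution" concrete, the following is a minimal illustrative sketch and not the algorithm proposed in the paper. It assumes the usual matrix form of an automaton, where a word x1...xn receives the weight alpha0^T A_x1 ... A_xn alpha_inf. The function names (`string_weight`, `is_proper`, `total_mass`) and all numeric values are hypothetical, chosen only for illustration. The second parameter set mimics the kind of output the abstract warns about: one negative entry is enough to produce a negative "probability".

```python
from itertools import product


def string_weight(alpha0, trans, alpha_inf, word):
    """Return alpha0^T A_{x1} ... A_{xn} alpha_inf for the given word."""
    state = list(alpha0)
    for symbol in word:
        A = trans[symbol]
        # Row vector times matrix: propagate the state through A_symbol.
        state = [sum(state[i] * A[i][j] for i in range(len(state)))
                 for j in range(len(state))]
    return sum(s * f for s, f in zip(state, alpha_inf))


def is_proper(alpha0, trans, alpha_inf, tol=1e-9):
    """Check PFA constraints: non-negative weights, normalized per state."""
    values = list(alpha0) + list(alpha_inf)
    values += [w for A in trans.values() for row in A for w in row]
    if any(v < -tol for v in values):
        return False
    if abs(sum(alpha0) - 1.0) > tol:
        return False
    for q in range(len(alpha0)):
        # Stopping weight plus all outgoing transition weights of state q.
        out = alpha_inf[q] + sum(sum(A[q]) for A in trans.values())
        if abs(out - 1.0) > tol:
            return False
    return True


def total_mass(alpha0, trans, alpha_inf, max_len):
    """Sum the weights of all words of length <= max_len."""
    alphabet = sorted(trans)
    return sum(
        string_weight(alpha0, trans, alpha_inf, word)
        for n in range(max_len + 1)
        for word in product(alphabet, repeat=n)
    )


# Hypothetical 2-state automaton over the alphabet {a, b}.
ALPHA0 = [1.0, 0.0]
ALPHA_INF = [0.2, 0.5]
TRANS = {
    "a": [[0.3, 0.2], [0.0, 0.25]],
    "b": [[0.1, 0.2], [0.25, 0.0]],
}

# Same automaton with one negative entry (made-up values, for illustration).
IMPROPER = {
    "a": [[-0.7, 0.2], [0.0, 0.25]],
    "b": TRANS["b"],
}

if __name__ == "__main__":
    print(is_proper(ALPHA0, TRANS, ALPHA_INF))
    print(total_mass(ALPHA0, TRANS, ALPHA_INF, 10))
    print(is_proper(ALPHA0, IMPROPER, ALPHA_INF))
    print(string_weight(ALPHA0, IMPROPER, ALPHA_INF, "a"))
```

With the proper parameters, the mass accumulated over longer and longer words approaches one from below. With the improper parameters, the word "a" receives a negative weight, so the values cannot be used directly as probabilities or as a starting point for EM-like updates.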

Keywords

spectral learning; Baum-Welch; learning automata; non-negative matrix factorization; language models

Domains

Machine Learning [cs.LG]; Human-Computer Interaction [cs.HC]

Main file

ASRU_2015_HGCEOP.pdf (252.33 KB)

Origin: Files produced by the author(s)

Contributor: Olivier Pietquin

https://inria.hal.science/hal-01225810

Submitted on: Monday, November 9, 2015, 15:00:01

Last modified on: Monday, April 15, 2024, 11:25:23

Long-term archiving on: Wednesday, February 10, 2016, 10:09:20

Dates and versions

hal-01225810, version 1 (November 9, 2015)

Identifiers

HAL Id: hal-01225810, version 1

Cite

Hadrien Glaude, Cyrille Enderli, Olivier Pietquin. Spectral learning with proper probabilities for finite state automation. ASRU 2015 - Automatic Speech Recognition and Understanding Workshop, Dec 2015, Scottsdale, United States. ⟨hal-01225810⟩

Collections

CNRS INRIA CRISTAL INRIA2 CRISTAL-SEQUEL UNIV-LILLE

236 views

141 downloads
