Learning vocal tract variables with multi-task kernels

Hachem Kadri; Emmanuel Duflos; Philippe Preux

Communication Dans Un Congrès Année : 2011

Learning vocal tract variables with multi-task kernels

(1) , (2) , (1, 3, 4)

1
2
3
4

Hachem Kadri

Fonction : Auteur
PersonId : 10319
IdHAL : hkadri
ORCID : 0000-0002-8060-5354
IdRef : 223805254

Sequential Learning

Emmanuel Duflos

Fonction : Auteur

LAGIS-SI

Philippe Preux

Fonction : Auteur
PersonId : 5488
IdHAL : preux-philippe
IdRef : 059896353

Sequential Learning

Groupe de Recherche en Apprentissage Automatique

Laboratoire d'Informatique Fondamentale de Lille

Résumé

The problem of acoustic-to-articulatory speech inversion continues to be a challenging research problem which sig- nificantly impacts automatic speech recognition robustness and accuracy. This paper presents a multi-task kernel based method aimed at learning Vocal Tract (VT) variables from the Mel-Frequency Cepstral Coefficients (MFCCs). Unlike usual speech inversion techniques based on individual esti- mation of each tract variable, the key idea here is to consider all the target variables simultaneously to take advantage of the relationships among them and then improve learning per- formance. The proposed method is evaluated using synthetic speech dataset and corresponding tract variables created by the TAsk Dynamics Application (TADA) model and com- pared to the hierarchical ε-SVR speech inversion technique.

Mots clés

vocal tract variables acoustic-to-articulatory inversion. acoustic-to-articulatory inversion Multi-task learning matrix-valued ker- nel

Domaines

Machine Learning [stat.ML]

Fichier principal

ICASSP2011.pdf (94.49 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Preux Philippe : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-00826050

Soumis le : mardi 4 juin 2013-09:18:41

Dernière modification le : vendredi 24 mars 2023-14:52:57

Archivage à long terme le : jeudi 5 septembre 2013-04:19:23

Dates et versions

hal-00826050 , version 1 (04-06-2013)

Identifiants

HAL Id : hal-00826050 , version 1

Citer

Hachem Kadri, Emmanuel Duflos, Philippe Preux. Learning vocal tract variables with multi-task kernels. 36th International Conference on Acoustics, Speech and Signal Processing (ICASSP), May 2011, Prague, Czech Republic. ⟨hal-00826050⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-LILLE3 CNRS INRIA LIFL LAGIS LAGIS-SI INRIA2

214 Consultations

199 Téléchargements

Learning vocal tract variables with multi-task kernels

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager