The second 'CHiME' Speech Separation and Recognition Challenge: Datasets, tasks and baselines

Emmanuel Vincent 1, 2, Jon Barker 3, Shinji Watanabe 4, Jonathan Le Roux 4, Francesco Nesta 5, Marco Matassoni 5
1 METISS - Speech and sound data modeling and processing
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
2 PAROLE - Analysis, perception and recognition of speech
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract: Distant-microphone automatic speech recognition (ASR) remains a challenging goal in everyday environments involving multiple background sources and reverberation. This paper is intended to be a reference on the 2nd 'CHiME' Challenge, an initiative designed to analyze and evaluate the performance of ASR systems in a real-world domestic environment. Two separate tracks have been proposed: a small-vocabulary task with small speaker movements and a medium-vocabulary task without speaker movements. We discuss the rationale for the challenge and provide a detailed description of the datasets, tasks and baseline performance results for each track.
Document type:
Conference paper
ICASSP - 38th International Conference on Acoustics, Speech, and Signal Processing - 2013, May 2013, Vancouver, Canada. pp.126-130, 2013

Cited literature [28 references]

https://hal.inria.fr/hal-00796625
Contributor: Emmanuel Vincent
Submitted on: Monday, March 4, 2013 - 15:57:17
Last modified on: Wednesday, May 16, 2018 - 11:23:03
Document(s) archived on: Wednesday, June 5, 2013 - 03:57:13

File

vincent_ICASSP13.pdf
Files produced by the author(s)

Identifiers

  • HAL Id: hal-00796625, version 1

Citation

Emmanuel Vincent, Jon Barker, Shinji Watanabe, Jonathan Le Roux, Francesco Nesta, et al. The second 'CHiME' Speech Separation and Recognition Challenge: Datasets, tasks and baselines. ICASSP - 38th International Conference on Acoustics, Speech, and Signal Processing - 2013, May 2013, Vancouver, Canada. pp.126-130, 2013. 〈hal-00796625〉

Metrics

Record views: 775

File downloads: 779