The second 'CHiME' Speech Separation and Recognition Challenge: Datasets, tasks and baselines

Emmanuel Vincent; Jon Barker; Shinji Watanabe; Jonathan Le Roux; Francesco Nesta; Marco Matassoni

Conference paper. Year: 2013


Emmanuel Vincent

Role: Author
PersonId: 1256
IdHAL: emmanuelv
ORCID: 0000-0002-0183-7289
IdRef: 089360176

Speech and sound data modeling and processing

Analysis, perception and recognition of speech

Jon Barker

Role: Author
PersonId: 895549

Department of Computer Sciences [Sheffield]

Shinji Watanabe

Role: Author

Mitsubishi Electric Research Laboratories

Jonathan Le Roux

Role: Author

Mitsubishi Electric Research Laboratories

Francesco Nesta

Role: Author

Fondazione Bruno Kessler [Trento, Italy]

Marco Matassoni

Role: Author

Fondazione Bruno Kessler [Trento, Italy]

Abstract

Distant-microphone automatic speech recognition (ASR) remains a challenging goal in everyday environments involving multiple background sources and reverberation. This paper is intended to be a reference on the 2nd 'CHiME' Challenge, an initiative designed to analyze and evaluate the performance of ASR systems in a real-world domestic environment. Two separate tracks have been proposed: a small-vocabulary task with small speaker movements and a medium-vocabulary task without speaker movements. We discuss the rationale for the challenge and provide a detailed description of the datasets, tasks and baseline performance results for each track.

Keywords

noise-robust ASR; CHiME challenge

Domains

Signal and Image Processing [eess.SP]

Main file

vincent_ICASSP13.pdf (85.54 KB)

Origin: Files produced by the author(s)

Contributor: Emmanuel Vincent

https://inria.hal.science/hal-00796625

Submitted on: Monday, March 4, 2013, 15:57:17

Last modified on: Monday, September 11, 2023, 17:41:19

Long-term archiving on: Wednesday, June 5, 2013, 03:57:13

Dates and versions

hal-00796625, version 1 (4 March 2013)

Identifiers

HAL Id: hal-00796625, version 1

Cite

Emmanuel Vincent, Jon Barker, Shinji Watanabe, Jonathan Le Roux, Francesco Nesta, et al. The second 'CHiME' Speech Separation and Recognition Challenge: Datasets, tasks and baselines. ICASSP - 38th International Conference on Acoustics, Speech, and Signal Processing - 2013, May 2013, Vancouver, Canada. pp.126-130. ⟨hal-00796625⟩


Collections

EC-PARIS UNIV-RENNES1 CNRS INRIA INSA-RENNES IRISA IRISA-D5 UNIV-LORRAINE INRIA2 LORIA LORIA-NLPKD UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES INSA-GROUPE UR1-MATH-NUM

581 views

838 downloads
