VoiceHome-2, an extended corpus for multichannel speech processing in real homes

Nancy Bertin 1, Ewen Camberlein 1, Romain Lebarbenchon 1, Emmanuel Vincent 2, Sunit Sivasankaran 2, Irina Illina 2, Frédéric Bimbot 1
1 PANAMA - Parcimonie et Nouveaux Algorithmes pour le Signal et la Modélisation Audio
Inria Rennes – Bretagne Atlantique, IRISA-D5 - SIGNAUX ET IMAGES NUMÉRIQUES, ROBOTIQUE
2 MULTISPEECH - Speech Modeling for Facilitating Oral-Based Communication
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract : We present a new, extended version of the voiceHome corpus for distant-microphone speech processing in domestic environments. This 5-hour corpus includes short reverberated, noisy utterances (smart home commands) spoken in French by 12 native French talkers in diverse realistic acoustic conditions. The utterances were recorded by an 8-microphone device at various angles and distances and in various noise conditions. Noise-only segments before and after each utterance are included in the recordings. Clean speech and spontaneous speech recorded in 12 real rooms distributed across 4 different homes are also available. All data have been fully annotated. Finally, we provide baseline software for speaker and noise localization, enhancement by source separation, and automatic speech recognition. This corpus stands apart from other corpora in the field by the number of rooms and homes considered and by the fact that it is publicly available at no cost. We describe the corpus specifications and annotations and the data recorded so far, and we report baseline results.
Document type :
Journal articles

https://hal.inria.fr/hal-01923108
Contributor : Emmanuel Vincent
Submitted on : Thursday, November 15, 2018 - 12:16:35 AM
Last modification on : Friday, September 13, 2019 - 9:50:02 AM
Long-term archiving on : Saturday, February 16, 2019 - 12:31:21 PM

File

bertin_SpeechCom18.pdf
Files produced by the author(s)

Citation

Nancy Bertin, Ewen Camberlein, Romain Lebarbenchon, Emmanuel Vincent, Sunit Sivasankaran, et al. VoiceHome-2, an extended corpus for multichannel speech processing in real homes. Speech Communication, Elsevier : North-Holland, 2019, 106, pp. 68-78. ⟨10.1016/j.specom.2018.11.002⟩. ⟨hal-01923108⟩
