Conference papers

Using the FASST source separation toolbox for noise robust speech recognition

Alexey Ozerov (1), Emmanuel Vincent (1)
(1) METISS - Speech and sound data modeling and processing
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
Abstract : We describe our submission to the 2011 CHiME Speech Separation and Recognition Challenge. Our speech separation algorithm was built with the Flexible Audio Source Separation Toolbox (FASST), which we developed recently. The toolbox implements a general, flexible framework based on a library of structured source models that allow prior knowledge about a source separation problem to be incorporated through user-specifiable constraints. We show how to use FASST to develop an efficient speech separation algorithm for the CHiME dataset. We also describe the acoustic model training and adaptation strategies used for this submission. Compared with the baseline system, we obtain improved keyword recognition accuracy in all conditions. The largest improvement, about 40%, is achieved in the worst condition of -6 dB signal-to-noise ratio (SNR), where 18% of this improvement is due to the speech separation. The improvement decreases as the SNR increases. These results indicate that audio source separation can be very helpful for improving speech recognition in noisy or multi-source environments.
Contributor : Alexey Ozerov
Submitted on : Tuesday, June 7, 2011 - 2:26:46 PM
Last modification on : Tuesday, June 15, 2021 - 4:23:07 PM
Long-term archiving on : Friday, September 9, 2011 - 3:17:12 PM




  • HAL Id : inria-00598734, version 1


Alexey Ozerov, Emmanuel Vincent. Using the FASST source separation toolbox for noise robust speech recognition. International Workshop on Machine Listening in Multisource Environments (CHiME 2011), Sep 2011, Florence, Italy. ⟨inria-00598734⟩


