Skip to Main content Skip to Navigation
Conference papers

Consistent Wiener filtering: generalized time-frequency masking respecting spectrogram consistency

Abstract : Wiener filtering is one of the most widely used methods in audio source separation. It is often applied on time-frequency representations of signals, such as the short-time Fourier transform (STFT), to exploit their short-term stationarity, but so far the design of the Wiener time-frequency mask did not take into account the necessity for the output spectrograms to be consistent, i.e., to correspond to the STFT of a time-domain signal. In this paper, we generalize the concept of Wiener filtering to time-frequency masks which can involve manipulation of the phase as well by formulating the problem as a consistency-constrained Maximum-Likelihood one. We present two methods to solve the problem, one looking for the optimal time-domain signal, the other promoting consistency through a penalty function directly in the time-frequency domain. We show through experimental evaluation that, both in oracle conditions and combined with spectral subtraction, our method outperforms classical Wiener filtering.
Complete list of metadata

Cited literature [11 references]  Display  Hide  Download
Contributor : Emmanuel Vincent Connect in order to contact the contributor
Submitted on : Tuesday, December 7, 2010 - 11:18:39 AM
Last modification on : Thursday, January 20, 2022 - 4:18:08 PM
Long-term archiving on: : Tuesday, March 8, 2011 - 3:29:24 AM


Publisher files allowed on an open archive


  • HAL Id : inria-00544072, version 1


Jonathan Le Roux, Emmanuel Vincent, Yuu Mizuno, Hirokazu Kameoka, Nobutaka Ono, et al.. Consistent Wiener filtering: generalized time-frequency masking respecting spectrogram consistency. 9th Int. Conf. on Latent Variable Analysis and Signal Separation (LVA/ICA), Sep 2010, Saint-Malo, France. pp.89--96. ⟨inria-00544072⟩



Les métriques sont temporairement indisponibles