Kernel additive modeling for interference reduction in multi-channel music recordings

Thomas Prätzlich 1 Rachel Bittner 2 Antoine Liutkus 3, 4 Meinard Müller 1
3 PAROLE - Analysis, perception and recognition of speech
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
4 MULTISPEECH - Speech Modeling for Facilitating Oral-Based Communication
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract : When recording a live musical performance, the different voices, such as the instrument groups or soloists of an orchestra, are typically recorded in the same room simultaneously, with at least one microphone assigned to each voice. However, it is difficult to acoustically shield the microphones. In practice, each one contains interference from every other voice. In this paper, we aim to reduce these interferences in multi-channel recordings to recover only the isolated voices. Following the recently proposed Kernel Additive Modeling framework, we present a method that iteratively estimates both the power spectral density of each voice and the corresponding strength in each microphone signal. With this information, we build an optimal Wiener filter, strongly reducing interferences. The trade-off between distortion and separation can be controlled by the user through the number of iterations of the algorithm. Furthermore, we present a computationally effective approximation of the iterative procedure. Listening tests demonstrate the effectiveness of the method.
Type de document :
Communication dans un congrès
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Apr 2015, Brisbane, Australia. 2015
Liste complète des métadonnées

Littérature citée [20 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-01116686
Contributeur : Antoine Liutkus <>
Soumis le : mercredi 18 février 2015 - 13:05:37
Dernière modification le : jeudi 11 janvier 2018 - 06:27:31
Document(s) archivé(s) le : samedi 12 septembre 2015 - 17:25:19

Fichier

2015_PraetzlichBittnerLiutkusM...
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-01116686, version 2

Collections

Citation

Thomas Prätzlich, Rachel Bittner, Antoine Liutkus, Meinard Müller. Kernel additive modeling for interference reduction in multi-channel music recordings. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Apr 2015, Brisbane, Australia. 2015. 〈hal-01116686v2〉

Partager

Métriques

Consultations de la notice

281

Téléchargements de fichiers

490