A multi-resolution approach to common fate-based audio separation

Fatemeh Pishdadian 1, Bryan Pardo 1, Antoine Liutkus 2
2 MULTISPEECH - Speech Modeling for Facilitating Oral-Based Communication
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract: We propose the Multi-resolution Common Fate Transform (MCFT), a signal representation that increases the separability of audio sources with significant energy overlap in the time-frequency domain. The MCFT combines the desirable features of two existing representations: the invertibility of the recently proposed Common Fate Transform (CFT) and the multi-resolution property of the cortical stage output of an auditory model. We compare the utility of the MCFT to the CFT by measuring the quality of source separation performed via ideal binary masking using each representation. Experiments on harmonic sounds with overlapping fundamental frequencies and different spectro-temporal modulation patterns show that ideal masks based on the MCFT yield better separation than those based on the CFT.
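The ideal binary masking step mentioned in the abstract can be illustrated with a minimal sketch. It assumes the representation of each isolated source is already available as an array (an STFT, a CFT, or an MCFT would all fit); the transforms themselves are not implemented here, and the function names `ideal_binary_masks` and `apply_masks` are hypothetical, not taken from the paper. Each mask assigns a coefficient of the mixture to whichever source has the largest magnitude at that point.

```python
import numpy as np

def ideal_binary_masks(source_reps):
    """Build one binary mask per source from the known source representations.

    A mask entry is 1 where that source has the largest magnitude among all
    sources, and 0 elsewhere (ties go to the source listed first).
    """
    mags = np.abs(np.stack(source_reps))   # shape: (n_sources, ...)
    winner = np.argmax(mags, axis=0)       # index of the dominant source
    return [(winner == k).astype(float) for k in range(len(source_reps))]

def apply_masks(mixture_rep, masks):
    """Estimate each source by masking the mixture representation."""
    return [mask * mixture_rep for mask in masks]

# Toy 2x2 "representations" standing in for real transform coefficients.
s1 = np.array([[3.0, 0.1], [0.2, 2.0]])
s2 = np.array([[0.5, 1.0], [4.0, 0.3]])
mixture = s1 + s2                          # assumes a linear (additive) transform

masks = ideal_binary_masks([s1, s2])
est1, est2 = apply_masks(mixture, masks)
```

Because the masks partition the coefficients, the masked estimates sum back to the mixture. With an invertible representation, each estimate could then be mapped back to a time-domain signal; the less the sources overlap in the chosen representation, the closer each estimate is to its true source.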
Document type:
Conference paper
42nd International Conference on Acoustics, Speech and Signal Processing (ICASSP), Mar 2017, New Orleans, United States. 2017

https://hal.inria.fr/hal-01515951
Contributor: Antoine Liutkus
Submitted on: Friday, April 28, 2017 - 12:36:15
Last modified on: Wednesday, February 21, 2018 - 07:50:03
Document(s) archived on: Saturday, July 29, 2017 - 13:20:13

File

pishdadian.pdf
Files produced by the author(s)

Identifiers

  • HAL Id: hal-01515951, version 1

Citation

Fatemeh Pishdadian, Bryan Pardo, Antoine Liutkus. A multi-resolution approach to common fate-based audio separation. 42nd International Conference on Acoustics, Speech and Signal Processing (ICASSP), Mar 2017, New Orleans, United States. 2017. 〈hal-01515951〉
