A multi-resolution approach to common fate-based audio separation - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2017

A multi-resolution approach to common fate-based audio separation

Résumé

We propose the Multi-resolution Common Fate Transform (MCFT), a signal representation that increases the separabil-ity of audio sources with significant energy overlap in the time-frequency domain. The MCFT combines the desirable features of two existing representations: the invertibility of the recently proposed Common Fate Transform (CFT) and the multi-resolution property of the cortical stage output of an auditory model. We compare the utility of the MCFT to the CFT by measuring the quality of source separation performed via ideal binary masking using each representation. Experiments on harmonic sounds with overlapping fundamental frequencies and different spectro-temporal modulation patterns show that ideal masks based on the MCFT yield better separation than those based on the CFT.
Fichier principal
Vignette du fichier
pishdadian.pdf (408.12 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01515951 , version 1 (28-04-2017)

Identifiants

  • HAL Id : hal-01515951 , version 1

Citer

Fatemeh Pishdadian, Bryan Pardo, Antoine Liutkus. A multi-resolution approach to common fate-based audio separation. 42nd International Conference on Acoustics, Speech and Signal Processing (ICASSP), Mar 2017, New Orleans, United States. ⟨hal-01515951⟩
710 Consultations
356 Téléchargements

Partager

Gmail Facebook X LinkedIn More