A multi-resolution approach to common fate-based audio separation

Fatemeh Pishdadian; Bryan Pardo; Antoine Liutkus

Conference paper, Year: 2017

Fatemeh Pishdadian

Role: Author
PersonId : 1007193

Northwestern University [Evanston]

Bryan Pardo

Role: Author

Northwestern University [Evanston]

Antoine Liutkus

Role: Author
PersonId : 2740
IdHAL : antoine-liutkus
ORCID : 0000-0002-3458-6498
IdRef : 167600419

Speech Modeling for Facilitating Oral-Based Communication

Abstract

We propose the Multi-resolution Common Fate Transform (MCFT), a signal representation that increases the separability of audio sources with significant energy overlap in the time-frequency domain. The MCFT combines the desirable features of two existing representations: the invertibility of the recently proposed Common Fate Transform (CFT) and the multi-resolution property of the cortical stage output of an auditory model. We compare the utility of the MCFT to the CFT by measuring the quality of source separation performed via ideal binary masking using each representation. Experiments on harmonic sounds with overlapping fundamental frequencies and different spectro-temporal modulation patterns show that ideal masks based on the MCFT yield better separation than those based on the CFT.
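To illustrate the evaluation idea mentioned in the abstract, here is a minimal sketch of ideal binary masking in Python. It assumes one common formulation: each coefficient of the mixture representation is assigned entirely to the source with the largest magnitude at that coefficient. The record above does not give the paper's exact mask definition, so this rule is an assumption, as are the function names `ideal_binary_masks` and `apply_masks`. The sketch works on any array of complex coefficients and does not implement the CFT or the MCFT; computing those representations and inverting them back to audio are left out.

```python
import numpy as np


def ideal_binary_masks(source_reps):
    """Build one binary mask per source (hypothetical helper).

    source_reps: complex array of shape (n_sources, ...) holding the
    representation of each isolated source (known only in an oracle setting).
    Returns a boolean array of the same shape; at every coefficient exactly
    one source (the one with the largest magnitude) gets True.
    """
    mags = np.abs(np.asarray(source_reps))
    winner = np.argmax(mags, axis=0)  # index of the dominant source
    n_sources = mags.shape[0]
    # Shape (n_sources, 1, ..., 1) so it broadcasts against `winner`.
    idx = np.arange(n_sources).reshape((n_sources,) + (1,) * (mags.ndim - 1))
    return winner[np.newaxis, ...] == idx


def apply_masks(mixture_rep, masks):
    """Multiply the mixture representation by each mask (hypothetical helper)."""
    return masks * np.asarray(mixture_rep)[np.newaxis, ...]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Two synthetic "sources" in an arbitrary 2-D representation (toy data).
    sources = rng.standard_normal((2, 4, 5)) + 1j * rng.standard_normal((2, 4, 5))
    mixture = sources.sum(axis=0)

    masks = ideal_binary_masks(sources)
    estimates = apply_masks(mixture, masks)
    print(masks.shape, estimates.shape)
```

Because the masks partition the coefficients, the masked estimates always sum back to the mixture. How well each estimate matches its true source depends on how little the sources overlap in the chosen representation, which is the property the abstract compares between the MCFT and the CFT.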

Keywords

audio source separation; multiresolution

Domains

Signal and Image Processing [eess.SP]

Main file

pishdadian.pdf (408.12 KB)

Origin: Files produced by the author(s)

https://inria.hal.science/hal-01515951

Submitted on: Friday, April 28, 2017, 12:36:15

Last modified on: Thursday, February 1, 2024, 10:05:36

Long-term archiving on: Saturday, July 29, 2017, 13:20:13

Dates and versions

hal-01515951 , version 1 (28-04-2017)

Identifiers

HAL Id : hal-01515951 , version 1

Cite

Fatemeh Pishdadian, Bryan Pardo, Antoine Liutkus. A multi-resolution approach to common fate-based audio separation. 42nd International Conference on Acoustics, Speech and Signal Processing (ICASSP), Mar 2017, New Orleans, United States. ⟨hal-01515951⟩

Collections

UNIV-RENNES1 CNRS INRIA IRISA UNIV-LORRAINE INRIA2 LORIA LORIA-NLPKD UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES ANR UR1-MATH-NUM

710 views

358 downloads
