Saliency-based modeling of acoustic scenes using sparse non-negative matrix factorization

Benjamin Cauchi; Mathieu Lagrange; Nicolas Misdariis; Arshia Cont

Conference paper, Year: 2013

Benjamin Cauchi

Role: Corresponding author
PersonId : 951925

Division Hearing, Speech and Audio Technology

Mathieu Lagrange

Role: Author
PersonId : 4329
IdHAL : mathieu-lagrange

Institut de Recherche et Coordination Acoustique/Musique

Nicolas Misdariis

Role: Author
PersonId : 939311

Sciences et Technologies de la Musique et du Son

Arshia Cont

Role: Author
PersonId : 6067
IdHAL : arshiacont
ORCID : 0000-0002-7352-7212
IdRef : 131109758

Sciences et Technologies de la Musique et du Son

Synchronous Realtime Processing and Programming of Music Signals

Abstract

Modelling auditory scenes is a challenging task in Computational Auditory Scene Analysis. This paper proposes a method based on sparse Non-negative Matrix Factorization that establishes the similarity between scenes without prior knowledge of the audio content. The method is evaluated on a corpus of train-station soundscapes drawn from a perceptual study, and the results are compared with human perception. Because it can focus on salient events within the scene, the proposed method outperforms a state-of-the-art Bag-of-Frames approach, although it does not reach human performance.
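As an illustration of the general technique named in the abstract, the following is a minimal sketch of sparse Non-negative Matrix Factorization, not the authors' implementation. It assumes a Euclidean cost with an L1 penalty on the activations and multiplicative updates; the function name `sparse_nmf`, its parameters, and the random initialization are hypothetical choices made for this example. The abstract does not specify the cost function, the sparsity constraint, or how scene similarity is derived from the factorization, so none of that is shown here.

```python
import numpy as np

def sparse_nmf(V, rank, sparsity=0.1, n_iter=200, seed=0, eps=1e-9):
    """Approximate a non-negative matrix V (features x frames) as W @ H.

    Illustrative assumption: Euclidean cost plus an L1 penalty
    (weight `sparsity`) on the activations H.
    """
    rng = np.random.default_rng(seed)
    n_features, n_frames = V.shape
    W = rng.random((n_features, rank)) + eps  # dictionary of spectral templates
    H = rng.random((rank, n_frames)) + eps    # activations over time

    for _ in range(n_iter):
        # The sparsity weight in the denominator shrinks small activations.
        H *= (W.T @ V) / (W.T @ W @ H + sparsity + eps)
        W *= (V @ H.T) / (W @ (H @ H.T) + eps)

        # Rescale so each column of W has unit L2 norm. The product W @ H
        # is unchanged, and the penalty on H cannot be dodged by inflating W.
        norms = np.linalg.norm(W, axis=0) + eps
        W /= norms
        H *= norms[:, None]

    return W, H

# Usage on a synthetic non-negative matrix standing in for a spectrogram.
rng = np.random.default_rng(1)
V = rng.random((20, 3)) @ rng.random((3, 30))
W, H = sparse_nmf(V, rank=3, sparsity=0.01)
relative_error = np.linalg.norm(V - W @ H) / np.linalg.norm(V)
```

Raising `sparsity` trades reconstruction accuracy for activations concentrated on fewer frames, which is one way a factorization can emphasize prominent events over background texture.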

Domains

Sound [cs.SD]

Main file

Cauchi13-SparseSliencyNMG.pdf (396.88 KB)

Origin: Publisher files allowed on an open archive

Contributor: Arshia Cont

https://inria.hal.science/hal-00940075

Submitted on: Friday, January 31, 2014, 12:07:08

Last modified on: Wednesday, April 19, 2023, 04:20:35

Long-term archiving on: Sunday, April 9, 2017, 04:18:32

Dates and versions

hal-00940075, version 1 (31-01-2014)

Identifiers

HAL Id: hal-00940075, version 1

Cite

Benjamin Cauchi, Mathieu Lagrange, Nicolas Misdariis, Arshia Cont. Saliency-based modeling of acoustic scenes using sparse non-negative matrix factorization. Workshop on Image and Audio Analysis for Multimedia Interactive, Jul 2013, Paris, France. ⟨hal-00940075⟩

Collections

UPMC CNRS INRIA IRCAM STMS INRIA2 SORBONNE-UNIVERSITE SU-SCIENCES

226 views

271 downloads
