Mid-level sparse representations for timbre identification: design of an instrument-specific harmonic dictionary

Abstract : Several studies have pointed out the need of mid-level representations of music signals for information retrieval and signal processing applications. In this paper, we investigate a new representation based on sparse decomposition of the signal into a collection of instrument-specific harmonic atoms modelling notes of various pitches played by different instruments. Each atom is composed of windowed harmonic sinusoidal partials whose amplitudes are learned on a training database. An efficient Matching Pursuit algorithm was designed to extract the best atoms and to estimate the phases of their partials. Then we explain how the resulting representation can be exploited for automatic instrument recognition. Preliminary experiments on a test database of solo excerpts show promising results.
Type de document :
Communication dans un congrès
1st Workshop on Learning the Semantics of Audio Signals (LSAS), Dec 2006, Athens, Greece. 2006
Liste complète des métadonnées

Littérature citée [14 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/inria-00544284
Contributeur : Emmanuel Vincent <>
Soumis le : mardi 7 décembre 2010 - 15:54:40
Dernière modification le : mercredi 21 mars 2018 - 18:57:32
Document(s) archivé(s) le : mardi 8 mars 2011 - 04:42:09

Fichier

leveau_LSAS06.pdf
Fichiers éditeurs autorisés sur une archive ouverte

Identifiants

  • HAL Id : inria-00544284, version 1

Collections

Citation

Pierre Leveau, Emmanuel Vincent, Gaël Richard, Laurent Daudet. Mid-level sparse representations for timbre identification: design of an instrument-specific harmonic dictionary. 1st Workshop on Learning the Semantics of Audio Signals (LSAS), Dec 2006, Athens, Greece. 2006. 〈inria-00544284〉

Partager

Métriques

Consultations de la notice

129

Téléchargements de fichiers

131