Mid-level sparse representations for timbre identification: design of an instrument-specific harmonic dictionary - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2006

Mid-level sparse representations for timbre identification: design of an instrument-specific harmonic dictionary

Emmanuel Vincent
Gael Richard

Résumé

Several studies have pointed out the need of mid-level representations of music signals for information retrieval and signal processing applications. In this paper, we investigate a new representation based on sparse decomposition of the signal into a collection of instrument-specific harmonic atoms modelling notes of various pitches played by different instruments. Each atom is composed of windowed harmonic sinusoidal partials whose amplitudes are learned on a training database. An efficient Matching Pursuit algorithm was designed to extract the best atoms and to estimate the phases of their partials. Then we explain how the resulting representation can be exploited for automatic instrument recognition. Preliminary experiments on a test database of solo excerpts show promising results.
Fichier principal
Vignette du fichier
leveau_LSAS06.pdf (184.54 Ko) Télécharger le fichier
Origine : Fichiers éditeurs autorisés sur une archive ouverte
Loading...

Dates et versions

inria-00544284 , version 1 (07-12-2010)

Identifiants

  • HAL Id : inria-00544284 , version 1

Citer

Pierre Leveau, Emmanuel Vincent, Gael Richard, Laurent Daudet. Mid-level sparse representations for timbre identification: design of an instrument-specific harmonic dictionary. 1st Workshop on Learning the Semantics of Audio Signals (LSAS), Dec 2006, Athens, Greece. ⟨inria-00544284⟩
111 Consultations
131 Téléchargements

Partager

Gmail Facebook X LinkedIn More