Mid-level sparse representations for timbre identification: design of an instrument-specific harmonic dictionary

Abstract : Several studies have pointed out the need of mid-level representations of music signals for information retrieval and signal processing applications. In this paper, we investigate a new representation based on sparse decomposition of the signal into a collection of instrument-specific harmonic atoms modelling notes of various pitches played by different instruments. Each atom is composed of windowed harmonic sinusoidal partials whose amplitudes are learned on a training database. An efficient Matching Pursuit algorithm was designed to extract the best atoms and to estimate the phases of their partials. Then we explain how the resulting representation can be exploited for automatic instrument recognition. Preliminary experiments on a test database of solo excerpts show promising results.
Complete list of metadatas

Cited literature [14 references]  Display  Hide  Download

https://hal.inria.fr/inria-00544284
Contributor : Emmanuel Vincent <>
Submitted on : Tuesday, December 7, 2010 - 3:54:40 PM
Last modification on : Monday, March 25, 2019 - 4:16:07 PM
Long-term archiving on : Tuesday, March 8, 2011 - 4:42:09 AM

File

leveau_LSAS06.pdf
Publisher files allowed on an open archive

Identifiers

  • HAL Id : inria-00544284, version 1

Citation

Pierre Leveau, Emmanuel Vincent, Gaël Richard, Laurent Daudet. Mid-level sparse representations for timbre identification: design of an instrument-specific harmonic dictionary. 1st Workshop on Learning the Semantics of Audio Signals (LSAS), Dec 2006, Athens, Greece. ⟨inria-00544284⟩

Share

Metrics

Record views

152

Files downloads

172