Skip to Main content Skip to Navigation
Journal articles

Temporal Integration for Audio Classification With Application to Musical Instrument Classification

Cyril Joder 1 Slim Essid 2 Gael Richard 3, 4
TSI - Département Traitement du Signal et des Images, LTCI - Laboratoire Traitement et Communication de l'Information
3 S2A - Signal, Statistique et Apprentissage
LTCI - Laboratoire Traitement et Communication de l'Information
Abstract : Nowadays, it appears essential to design automatic indexing tools which provide meaningful and efficient means to describe the musical audio content. There is in fact a growing interest for music information retrieval (MIR) applications amongst which the most popular are related to music similarity retrieval, artist identification, musical genre or instrument recognition. Current MIR-related classification systems usually do not take into account the mid-term temporal properties of the signal (over several frames) and lie on the assumption that the observations of the features in different frames are statistically independent. The aim of this paper is to demonstrate the usefulness of the information carried by the evolution of these characteristics over time. To that purpose , we propose a number of methods for early and late temporal integration and provide an in-depth experimental study on their interest for the task of musical instrument recognition on solo musical phrases. In particular, the impact of the time horizon over which the temporal integration is performed will be assessed both for fixed and variable frame length analysis. Also, a number of recently proposed alignment kernels will be used for late temporal integration. For all experiments, the results are compared to a state of the art musical instrument recognition system. Index Terms-Alignment kernels, audio classification, music information retrieval (MIR), musical instrument recognition, support vector machine (SVM), temporal feature integration.
Document type :
Journal articles
Complete list of metadata
Contributor : Gaël Richard Connect in order to contact the contributor
Submitted on : Friday, May 29, 2020 - 8:13:47 PM
Last modification on : Tuesday, October 19, 2021 - 11:16:16 AM

Links full text




Cyril Joder, Slim Essid, Gael Richard. Temporal Integration for Audio Classification With Application to Musical Instrument Classification. IEEE Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2009, 17, ⟨10.1109/TASL.2008.2007613⟩. ⟨hal-02652782⟩



Les métriques sont temporairement indisponibles