Benchmarking Methods for Audio-Visual Recognition Using Tiny Training Sets

Xavier Alameda-Pineda 1 Jordi Sanchez-Riera 1 Radu Horaud 1, *
* Corresponding author
1 PERCEPTION - Interpretation and Modelling of Images and Videos
Inria Grenoble - Rhône-Alpes, LJK - Laboratoire Jean Kuntzmann, INPG - Institut National Polytechnique de Grenoble
Abstract : The problem of choosing a classifier for audio-visual command recognition is addressed. Because such commands are culture- and user-dependant, methods need to learn new commands from a few examples. We benchmark three state-of-the-art discriminative classifiers based on bag of words and SVM. The comparison is made on monocular and monaural recordings of a publicly available dataset. We seek for the best trade off between speed, robustness and size of the training set. In the light of over 150,000 experiments, we conclude that this is a promising direction of work towards a flexible methodology that must be easily adaptable to a large variety of users.
Document type :
Conference papers
Complete list of metadatas

Cited literature [10 references]  Display  Hide  Download

https://hal.inria.fr/hal-00861645
Contributor : Team Perception <>
Submitted on : Friday, September 13, 2013 - 11:27:03 AM
Last modification on : Wednesday, April 11, 2018 - 1:59:39 AM
Long-term archiving on : Thursday, April 6, 2017 - 7:24:19 PM

File

main.pdf
Files produced by the author(s)

Identifiers

Collections

Citation

Xavier Alameda-Pineda, Jordi Sanchez-Riera, Radu Horaud. Benchmarking Methods for Audio-Visual Recognition Using Tiny Training Sets. ICASSP 2013 - IEEE International Conference on Acoustics, Speech, and Signal Processing, IEEE Signal Processing Society, May 2013, Vancouver, Canada. pp.3662-3666, ⟨10.1109/ICASSP.2013.6638341⟩. ⟨hal-00861645⟩

Share

Metrics

Record views

778

Files downloads

319