The LEAR submission at Thumos 2014

Dan Oneata 1 Jakob Verbeek 1 Cordelia Schmid 1
1 LEAR - Learning and recognition in vision
Inria Grenoble - Rhône-Alpes, LJK - Laboratoire Jean Kuntzmann, INPG - Institut National Polytechnique de Grenoble
Abstract : We describe the submission of the INRIA LEAR team to the THU-MOS workshop in conjunction with ECCV 2014. Our system is based on Fisher vector (FV) encoding of dense trajectory features (DTF), which we also used in our 2013 submission. This year's submission additionally incorporates static-image features (SIFT, Color, and CNN) and audio features (ASR and MFCC) for the classification task. For the detection task, we combine scores from the clas-sification task with FV-DTF features extracted from video slices. We found that these additional visual and audio feature significantly improve the classification results. For localization we found that using the classification scores as a contex-tual feature besides local motion features leads to significant improvements.
Type de document :
Autre publication
Liste complète des métadonnées
Contributeur : Thoth Team <>
Soumis le : lundi 3 novembre 2014 - 10:55:33
Dernière modification le : mardi 11 août 2015 - 01:05:12
Document(s) archivé(s) le : mercredi 4 février 2015 - 10:05:39


Fichiers produits par l'(les) auteur(s)


  • HAL Id : hal-01074442, version 1



Dan Oneata, Jakob Verbeek, Cordelia Schmid. The LEAR submission at Thumos 2014. 2014. <hal-01074442>



Consultations de
la notice


Téléchargements du document