The LEAR submission at Thumos 2014

Dan Oneata 1 Jakob Verbeek 1 Cordelia Schmid 1
1 LEAR - Learning and recognition in vision
Inria Grenoble - Rhône-Alpes, LJK - Laboratoire Jean Kuntzmann, INPG - Institut National Polytechnique de Grenoble
Abstract : We describe the submission of the INRIA LEAR team to the THU-MOS workshop in conjunction with ECCV 2014. Our system is based on Fisher vector (FV) encoding of dense trajectory features (DTF), which we also used in our 2013 submission. This year's submission additionally incorporates static-image features (SIFT, Color, and CNN) and audio features (ASR and MFCC) for the classification task. For the detection task, we combine scores from the clas-sification task with FV-DTF features extracted from video slices. We found that these additional visual and audio feature significantly improve the classification results. For localization we found that using the classification scores as a contex-tual feature besides local motion features leads to significant improvements.
Type de document :
Autre publication
2014
Liste complète des métadonnées

Littérature citée [9 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-01074442
Contributeur : Thoth Team <>
Soumis le : lundi 3 novembre 2014 - 10:55:33
Dernière modification le : vendredi 24 novembre 2017 - 13:25:52
Document(s) archivé(s) le : mercredi 4 février 2015 - 10:05:39

Fichier

thumos14inria.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-01074442, version 1

Collections

Citation

Dan Oneata, Jakob Verbeek, Cordelia Schmid. The LEAR submission at Thumos 2014. 2014. 〈hal-01074442〉

Partager

Métriques

Consultations de la notice

719

Téléchargements de fichiers

439