Label-Embedding for Attribute-Based Classification, 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013. ,
DOI : 10.1109/CVPR.2013.111
URL : https://hal.archives-ouvertes.fr/hal-00815747
The AXES submissions at TrecVid 2013, TRECVID Workshop, 2013. ,
URL : https://hal.archives-ouvertes.fr/hal-00904404
Multiscale scattering for audio classification, ISMIR, 2011. ,
ScatNet (v0.2), 2013. ,
LIBSVM, ACM Transactions on Intelligent Systems and Technology, vol.2, issue.3, pp.27-28, 2011. ,
DOI : 10.1145/1961189.1961199
Transmedia pseudo-relevance feedback methods in multimedia retrieval, Advances in Multilingual and Multimodal Information Retrieval, 2008. ,
Human Detection Using Oriented Histograms of Flow and Appearance, ECCV, 2006. ,
DOI : 10.1023/A:1008162616689
URL : https://hal.archives-ouvertes.fr/inria-00548587
On the Use of MLP Features for??Broadcast??News??Transcription, Text, Speech and Dialogue, pp.303-310, 2008. ,
DOI : 10.1007/978-3-540-87391-4_39
Partitioning and transcription of broadcast news data, ICSLP, vol.98, issue.5, pp.1335-1338, 1998. ,
Perceptual linear predictive (PLP) analysis of speech, The Journal of the Acoustical Society of America, vol.87, issue.4, pp.1738-1752, 1990. ,
DOI : 10.1121/1.399423
Caffe, Proceedings of the ACM International Conference on Multimedia, MM '14, 2014. ,
DOI : 10.1145/2647868.2654889
Modeling spatial layout with fisher vectors for image categorization, 2011 International Conference on Computer Vision, 2011. ,
DOI : 10.1109/ICCV.2011.6126406
URL : https://hal.archives-ouvertes.fr/inria-00612277
HMDB: A large video database for human motion recognition, 2011 International Conference on Computer Vision, 2011. ,
DOI : 10.1109/ICCV.2011.6126543
Speech Processing for Audio Indexing, Advances in Natural Language Processing, 2008. ,
DOI : 10.1109/TSA.1996.481450
Speech Processing for Audio Indexing, Proceedings of the 6th International Conference on Natural Language Processing, GoTAL 2008 -Advances in Natural Language Processing, pp.4-15, 2008. ,
DOI : 10.1109/TSA.1996.481450
Distinctive Image Features from Scale-Invariant Keypoints, International Journal of Computer Vision, vol.60, issue.2, pp.91-110, 2004. ,
DOI : 10.1023/B:VISI.0000029664.99615.94
Robust wide baseline stereo from maximally stable extremal regions, BMVC, 2002. ,
Action and Event Recognition with Fisher Vectors on a Compact Feature Set, 2013 IEEE International Conference on Computer Vision, 2013. ,
DOI : 10.1109/ICCV.2013.228
URL : https://hal.archives-ouvertes.fr/hal-00873662
Rapid development of a Latvian speech-to-text system, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, 2013. ,
DOI : 10.1109/ICASSP.2013.6639082
Trecvid 2014 ? an overview of the goals, tasks, data, evaluation mechanisms and metrics, Proceedings of TRECVID 2014, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-01230444
The Kaldi Speech Recognition Toolkit, Proc. Workshop on Automatic Speech Recognition & Understanding (ASRU), pp.1-4, 2011. ,
ImageNet Large Scale Visual Recognition Challenge, International Journal of Computer Vision, vol.1010, issue.1, 2014. ,
DOI : 10.1007/s11263-015-0816-y
Image Classification with the Fisher Vector: Theory and Practice, International Journal of Computer Vision, vol.73, issue.2, pp.222-245, 2013. ,
DOI : 10.1007/s11263-013-0636-x
Towards Lower Error Rates in Phoneme Recognition, Text, Speech and Dialogue, pp.465-472, 2004. ,
DOI : 10.1007/978-3-540-30120-2_59
Action Recognition with Improved Trajectories, 2013 IEEE International Conference on Computer Vision, 2013. ,
DOI : 10.1109/ICCV.2013.441
URL : https://hal.archives-ouvertes.fr/hal-00873267
Using MLP features in SRI's conversational speech recognition system, Interspeech, pp.2141-2144, 2005. ,