Hal will be stopped for maintenance from friday on june 10 at 4pm until monday june 13 at 9am. More information
Skip to Main content Skip to Navigation
Conference papers

Latent Max-margin Metric Learning for Comparing Video Face Tubes

Abstract : Comparing " face tubes " is a key component of modern systems for face biometrics based video analysis and annotation. We present a novel algorithm to learn a distance metric between such spatio-temporal face tubes in videos. The main novelty in the algorithm is based on incorporation of latent variables in a max-margin metric learning framework. The latent formulation allows us to model, and learn metrics to compare faces under different challenging variations in pose, expressions and lighting We propose a novel dataset named TV Series Face Tubes (TSFT) for evaluating the task. The dataset is collected from 12 different episodes of 8 popular TV series and has 94 subjects with 569 manually annotated face tracks in total. We show quantitatively how incorporating latent variables in max-margin metric learning leads to improvement of current state-of-the-art metric learning methods for the two cases when the testing is done with subjects that were seen during training and when the test subjects were not seen at all during training. We also give results on a challenging benchmark dataset: YouTube faces, and place our algorithm in context w.r.t. existing methods.
Document type :
Conference papers
Complete list of metadata

Cited literature [46 references]  Display  Hide  Download

https://hal.inria.fr/hal-01219829
Contributor : Gaurav Sharma Connect in order to contact the contributor
Submitted on : Friday, October 23, 2015 - 1:25:33 PM
Last modification on : Thursday, September 20, 2018 - 7:54:02 AM
Long-term archiving on: : Friday, April 28, 2017 - 5:38:10 AM

File

sharma_cvprw15.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01219829, version 1

Citation

Gaurav Sharma, Patrick Pérez. Latent Max-margin Metric Learning for Comparing Video Face Tubes. Biometrics Workshop, Computer Vision and Pattern Recognition (CVPR), Jun 2015, Boston, United States. ⟨hal-01219829⟩

Share

Metrics

Record views

173

Files downloads

76