Skip to Main content Skip to Navigation
Conference papers

Thin-Slicing for Pose: Learning to Understand Pose without Explicit Pose Estimation

Suha Kwak 1, 2 Minsu Cho 1, 2 Ivan Laptev 1
1 WILLOW - Models of visual object recognition and scene understanding
CNRS - Centre National de la Recherche Scientifique : UMR8548, Inria Paris-Rocquencourt, DI-ENS - Département d'informatique de l'École normale supérieure
Abstract : We address the problem of learning a pose-aware, compact embedding that projects images with similar human poses to be placed close-by in the embedding space. The embedding function is built on a deep convolutional network, and trained with triplet-based rank constraints on real image data. This architecture allows us to learn a robust representation that captures differences in human poses by effectively factoring out variations in clothing, background, and imaging conditions in the wild. For a variety of pose-related tasks, the proposed pose embedding provides a cost-efficient and natural alternative to explicit pose estimation, circumventing challenges of localizing body joints. We demonstrate the efficacy of the embedding on pose-based image retrieval and action recognition problems.
Document type :
Conference papers
Complete list of metadatas

https://hal.inria.fr/hal-01242724
Contributor : Suha Kwak <>
Submitted on : Thursday, January 5, 2017 - 12:14:18 AM
Last modification on : Tuesday, September 22, 2020 - 3:53:08 AM
Long-term archiving on: : Thursday, April 6, 2017 - 12:16:43 PM

File

kwak_cvpr16.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01242724, version 2

Collections

Citation

Suha Kwak, Minsu Cho, Ivan Laptev. Thin-Slicing for Pose: Learning to Understand Pose without Explicit Pose Estimation. CVPR 2016 - IEEE Conference on Computer Vision and Pattern Recognition, Jun 2016, Las Vegas, United States. ⟨hal-01242724v2⟩

Share

Metrics

Record views

280

Files downloads

913