Conference papers

Toyota Smarthome: Real-World Activities of Daily Living

Abstract : The performance of deep neural networks is strongly influenced by the quantity and quality of annotated data. Most large activity recognition datasets consist of data sourced from the web, which does not reflect the challenges that exist in activities of daily living. In this paper, we introduce a large real-world video dataset for activities of daily living: Toyota Smarthome. The dataset consists of 16K RGB+D clips of 31 activity classes, performed by seniors in a smarthome. Unlike in previous datasets, the videos were fully unscripted. As a result, the dataset poses several challenges: high intra-class variation, high class imbalance, simple and composite activities, and activities with similar motion and variable duration. Activities were annotated with both coarse and fine-grained labels. These characteristics differentiate Toyota Smarthome from other datasets for activity recognition. As recent activity recognition approaches fail to address the challenges posed by Toyota Smarthome, we present a novel activity recognition method with an attention mechanism. We propose a pose-driven spatio-temporal attention mechanism through 3D ConvNets. We show that our novel method outperforms state-of-the-art methods on benchmark datasets, as well as on the Toyota Smarthome dataset. We release the dataset for research use.
Document type : Conference papers

Cited literature [51 references]
Contributor : Srijan Das
Submitted on : Saturday, November 16, 2019 - 1:35:52 PM
Last modification on : Thursday, August 4, 2022 - 5:19:02 PM
Long-term archiving on : Monday, February 17, 2020 - 1:43:48 PM


Files produced by the author(s)


  • HAL Id : hal-02366687, version 1


Srijan Das, Rui Dai, Michal Koperski, Luca Minciullo, Lorenzo Garattoni, et al. Toyota Smarthome: Real-World Activities of Daily Living. ICCV 2019 - 17th International Conference on Computer Vision, Oct 2019, Seoul, South Korea. ⟨hal-02366687⟩


