
Reinforcement Learning of Context Models for a Ubiquitous Personal Assistant

Sofia Zaidenberg (1), Patrick Reignier (1), James L. Crowley (1)
(1) PRIMA - Perception, recognition and integration for observation of activity
Inria Grenoble - Rhône-Alpes, UJF - Université Joseph Fourier - Grenoble 1, INPG - Institut National Polytechnique de Grenoble, CNRS - Centre National de la Recherche Scientifique: UMR5217
Abstract: Ubiquitous environments may become a reality in the foreseeable future, and research aims to make them increasingly adapted and comfortable for users. Our work consists in applying reinforcement learning techniques to adapt the services provided by a ubiquitous assistant to the user. The learning produces a context model that associates actions with perceived situations of the user. Associations are based on feedback given by the user in reaction to the assistant's behavior. Our method addresses some of the problems encountered when applying reinforcement learning to systems where the user is in the loop. For instance, the behavior of the system is completely incoherent at the beginning and needs time to converge, yet the user is not willing to wait that long to train the system. The user's habits may also change over time, and the assistant needs to integrate these changes quickly. We study methods to accelerate the reinforcement learning process.
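
The idea of a context model built from user feedback can be illustrated with a minimal sketch. It is not the authors' algorithm: it assumes a simplified, one-step (bandit-style) value update rather than full reinforcement learning, and the class name `FeedbackLearner`, the situation and action labels, and the learning rate are all hypothetical.

```python
from collections import defaultdict


class FeedbackLearner:
    """Hypothetical sketch: value estimates for (situation, action) pairs,
    updated from numeric user feedback (positive = approval)."""

    def __init__(self, actions, learning_rate=0.5):
        self.actions = list(actions)
        self.learning_rate = learning_rate  # assumed value, not from the paper
        self.values = defaultdict(float)    # (situation, action) -> estimate

    def update(self, situation, action, feedback):
        # Move the estimate a fraction of the way toward the feedback signal.
        key = (situation, action)
        self.values[key] += self.learning_rate * (feedback - self.values[key])

    def best_action(self, situation):
        # Pick the action with the highest estimate; ties go to the first listed.
        return max(self.actions, key=lambda a: self.values[(situation, a)])

    def context_model(self, situations):
        # The "context model" here is simply a situation -> action mapping.
        return {s: self.best_action(s) for s in situations}


learner = FeedbackLearner(["play_music", "do_nothing"])

# The user initially dislikes music when entering the office.
learner.update("user_enters_office", "play_music", -1.0)
learner.update("user_enters_office", "do_nothing", 1.0)
model_before = learner.context_model(["user_enters_office"])

# The user's habits change: one round of opposite feedback flips the choice.
learner.update("user_enters_office", "play_music", 1.0)
learner.update("user_enters_office", "do_nothing", -1.0)
model_after = learner.context_model(["user_enters_office"])
```

In this sketch a large learning rate lets the mapping follow a change of habit after a single round of feedback, at the cost of noisier estimates; the abstract's concern with accelerating learning reflects the same trade-off between speed of adaptation and stability.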
Document type: Conference paper
Contributor: Sofia Zaidenberg
Submitted on: Wednesday, February 13, 2013 - 3:52:37 PM
Last modification on: Tuesday, February 9, 2021 - 3:24:36 PM
Long-term archiving on: Tuesday, May 14, 2013 - 4:03:01 AM






Sofia Zaidenberg, Patrick Reignier, James L. Crowley. Reinforcement Learning of Context Models for a Ubiquitous Personal Assistant. UCAmI - 3rd Symposium of Ubiquitous Computing and Ambient Intelligence 2008, Oct 2008, Salamanca, Spain. pp.254-264, ⟨10.1007/978-3-540-85867-6_30⟩. ⟨hal-00788055⟩


