Bisimulation Metrics are Optimal Value Functions

Norman Ferns; Doina Precup

Conference paper. Year: 2014


Norman Ferns

Role: Author

Analyse Statique par Interprétation Abstraite

Doina Precup

Role: Author

McGill University = Université McGill [Montréal, Canada]

Abstract

Bisimulation is a notion of behavioural equivalence on the states of a transition system. Its definition has been extended to Markov decision processes, where it can be used to aggregate states. A bisimulation metric is a quantitative analog of bisimulation that measures how similar states are from the perspective of long-term behaviour. Bisimulation metrics have been used to establish approximation bounds for state aggregation and other forms of value function approximation. In this paper, we prove that a bisimulation metric defined on the state space of a Markov decision process is the optimal value function of an optimal coupling of two copies of the original model. We prove the result in the general case of continuous state spaces. This result has important implications for understanding the complexity of computing such metrics, and opens up the possibility of more efficient computational methods.
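The claim in the abstract can be illustrated on a toy case. The sketch below is not taken from the paper: it assumes a finite MDP with deterministic transitions, and it assumes the commonly used recursive form of the metric, d(s, t) = max over actions of (1 - c) |r(s, a) - r(t, a)| + c d(s', t'). With deterministic transitions the only coupling of the two successor distributions is the pair of successors itself, so this recursion is exactly the Bellman optimality equation of a model built from two copies of the original one. The names `bisimulation_metric_deterministic`, `NEXT_STATE`, and `REWARD`, and the example MDP, are hypothetical.

```python
# Sketch under stated assumptions: deterministic finite MDP, metric weights
# (1 - c) on reward differences and c on successor distances. In the general
# stochastic case the successor term is a Kantorovich distance, which this
# sketch deliberately avoids.

def bisimulation_metric_deterministic(next_state, reward, c=0.9,
                                      tol=1e-10, max_iter=10_000):
    """Fixed-point iteration for the metric on pairs of states.

    next_state[s][a] is the successor of state s under action a.
    reward[s][a] is the immediate reward for taking action a in state s.
    Returns a matrix d with d[s][t] approximating the distance.
    """
    n = len(next_state)
    n_actions = len(next_state[0])
    d = [[0.0] * n for _ in range(n)]
    for _ in range(max_iter):
        # Bellman optimality backup on the "two copies" model: the state is
        # a pair (s, t), the same action is applied to both copies, and the
        # reward is the scaled absolute reward difference.
        new_d = [
            [
                max(
                    (1 - c) * abs(reward[s][a] - reward[t][a])
                    + c * d[next_state[s][a]][next_state[t][a]]
                    for a in range(n_actions)
                )
                for t in range(n)
            ]
            for s in range(n)
        ]
        delta = max(abs(new_d[s][t] - d[s][t])
                    for s in range(n) for t in range(n))
        d = new_d
        if delta < tol:
            break
    return d


# Hypothetical 3-state, 2-action MDP. States 0 and 1 behave identically
# (action 0: stay, reward 1; action 1: move to state 2, reward 0).
# State 2 is absorbing with reward 0.
NEXT_STATE = [[0, 2], [1, 2], [2, 2]]
REWARD = [[1.0, 0.0], [1.0, 0.0], [0.0, 0.0]]

if __name__ == "__main__":
    d = bisimulation_metric_deterministic(NEXT_STATE, REWARD, c=0.9)
    for row in d:
        print(["%.4f" % x for x in row])
```

In this example the distance between states 0 and 1 is zero, since they are bisimilar, while the distance from either of them to state 2 converges to 1, the fixed point of x = (1 - c) + c x. Handling stochastic transitions would require solving an optimal transport problem for each pair of states and each action, which is the computational cost the abstract alludes to.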

Domains

Computer Science [cs]

Contributor: Jérôme Feret

https://inria.hal.science/hal-01101180

Submitted on: Thursday, January 8, 2015, 09:23:32

Last modified on: Friday, April 19, 2024, 16:18:58

Dates and versions

hal-01101180, version 1 (08-01-2015)

Identifiers

HAL Id: hal-01101180, version 1

Cite

Norman Ferns, Doina Precup. Bisimulation Metrics are Optimal Value Functions. The 30th Conference on Uncertainty in Artificial Intelligence, Ann Nicholson, Jul 2014, Quebec City, Canada. pp.10. ⟨hal-01101180⟩


Collections

ENS-PARIS CNRS INRIA INRIA2 PSL

