Reoptimization Nearly Solves Weakly Coupled Markov Decision Processes - Inria - Institut national de recherche en sciences et technologies du numérique
Preprints, Working Papers, ... Year : 2024

Reoptimization Nearly Solves Weakly Coupled Markov Decision Processes

Abstract

We propose a new policy, called the LP-update policy, to solve finite-horizon weakly coupled Markov decision processes. The latter can be seen as multi-constraint, multi-action bandits, and generalize the classical restless bandit problems. Our solution is based on periodically re-solving a relaxed version of the original problem, which can be cast as a linear program (LP). When the problem consists of $N$ statistically identical sub-components, we show that the LP-update policy becomes asymptotically optimal at rate $O(T^2/\sqrt{N})$. This rate improves to $O(T/\sqrt{N})$ if the problem satisfies an ergodicity property, and to $O(1/N)$ if the problem is non-degenerate. Our definition of non-degeneracy extends the corresponding notion for restless bandits. Using this property, we also improve the computational efficiency of the LP-update policy. We illustrate the performance of our policy on randomly generated examples, as well as on a generalized applicant screening problem, and show that it outperforms existing heuristics.
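The relaxation that the LP-update policy re-solves can be illustrated on a toy restless-bandit-style instance. The sketch below is not the paper's exact formulation; it is a standard occupation-measure LP for a finite-horizon problem with one coupling (budget) constraint per time step, and every piece of instance data (`S`, `A`, `T`, `P`, `r`, `alpha`, `mu0`) is hypothetical. The variables `y[t, s, a]` represent the expected fraction of arms in state `s` taking action `a` at time `t`; the LP's optimal value upper-bounds the per-arm value of the original coupled problem.

```python
import numpy as np
from scipy.optimize import linprog

# Hypothetical toy instance (all data illustrative, not from the paper).
S, A, T = 2, 2, 3                # states, actions, horizon
alpha = 0.4                      # budget: at most a fraction alpha of arms activated per step
mu0 = np.array([0.5, 0.5])       # initial state distribution
# P[a] is the transition matrix under action a (rows sum to 1)
P = np.array([[[0.9, 0.1], [0.2, 0.8]],    # a = 0 (rest)
              [[0.5, 0.5], [0.7, 0.3]]])   # a = 1 (activate)
r = np.array([[0.0, 1.0], [0.0, 0.5]])     # r[s, a]: reward for action a in state s

def idx(t, s, a):
    # Flat index of variable y[t, s, a]
    return (t * S + s) * A + a

n = T * S * A
c = np.zeros(n)                  # linprog minimizes, so negate rewards
for t in range(T):
    for s in range(S):
        for a in range(A):
            c[idx(t, s, a)] = -r[s, a]

# Equality constraints: initial distribution, then flow conservation
A_eq, b_eq = [], []
for s in range(S):
    row = np.zeros(n)
    for a in range(A):
        row[idx(0, s, a)] = 1.0
    A_eq.append(row); b_eq.append(mu0[s])
for t in range(1, T):
    for s2 in range(S):
        row = np.zeros(n)
        for a in range(A):
            row[idx(t, s2, a)] = 1.0
        for s in range(S):
            for a in range(A):
                row[idx(t - 1, s, a)] -= P[a][s, s2]
        A_eq.append(row); b_eq.append(0.0)

# Coupling (budget) constraint: expected fraction activated <= alpha at each step
A_ub, b_ub = [], []
for t in range(T):
    row = np.zeros(n)
    for s in range(S):
        row[idx(t, s, 1)] = 1.0
    A_ub.append(row); b_ub.append(alpha)

res = linprog(c, A_ub=np.array(A_ub), b_ub=np.array(b_ub),
              A_eq=np.array(A_eq), b_eq=np.array(b_eq),
              bounds=[(0, None)] * n, method="highs")
lp_value = -res.fun              # upper bound on the optimal per-arm value
```

In the reoptimization scheme described in the abstract, an LP of this shape would be re-solved along the trajectory, each time with `mu0` replaced by the current empirical state distribution of the $N$ sub-components, and the first-step solution used to choose actions.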
Main file: LP_update_for_weakly_coupled_MDP.pdf (946.02 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-04570177, version 1 (06-05-2024)

Licence

Attribution

Identifiers

  • HAL Id: hal-04570177, version 1

Cite

Nicolas Gast, Bruno Gaujal, Chen Yan. Reoptimization Nearly Solves Weakly Coupled Markov Decision Processes. 2024. ⟨hal-04570177⟩