Non linear programming for stochastic dynamic programming

Olivier Teytaud; Sylvain Gelly

Conference Papers

Year : 2007


Olivier Teytaud

Function : Author
PersonId : 581
IdHAL : olivier-teytaud
IdRef : 05971008X

Algorithmic number theory for cryptology

Sylvain Gelly

Function : Author

Algorithmic number theory for cryptology

Abstract

Many stochastic dynamic programming tasks in continuous action spaces are tackled through discretization. Here we avoid discretization; approximate dynamic programming (ADP) then involves (i) many learning tasks, performed here by Support Vector Machines, for Bellman-function regression, and (ii) many non-linear optimization tasks for action selection, for which we compare many algorithms. We include discretizations of the domain as particular non-linear programming tools in our experiments, so that we also compare optimization approaches with discretization methods. We conclude that robustness is strongly required in the non-linear optimizations in ADP, and experimental results show that (i) discretization is sometimes inefficient, but some specific discretization is very efficient for "bang-bang" problems; (ii) simple evolutionary tools outperform quasi-random methods in a stable manner; (iii) gradient-based techniques are much less stable; (iv) for most high-dimensional "less unsmooth" problems, Covariance-Matrix-Adaptation is ranked first.
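To make the action-selection step concrete, here is a minimal Python sketch of one Bellman backup in which the action is chosen either from a regular grid (discretization) or by a (1+1) evolution strategy, used here as a stand-in for the "simple evolutionary tools" mentioned in the abstract. Everything in it is an assumption made for illustration: the toy dynamics, the quadratic cost, the `value_fn` placeholder (the paper uses Support Vector Machine regression for the Bellman function) and all function names are hypothetical, and this is not the authors' implementation or benchmark.

```python
import random


def expected_cost(state, action, value_fn, n_samples=200, seed=0):
    """Monte Carlo estimate of instantaneous cost plus value of the next state."""
    # Fixed seed: the same noise draws are reused for every action
    # (common random numbers), so the objective is deterministic.
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_samples):
        noise = rng.gauss(0.0, 0.1)
        next_state = state + action + noise          # hypothetical toy dynamics
        total += action ** 2 + value_fn(next_state)  # hypothetical cost
    return total / n_samples


def select_action_grid(objective, low, high, n_points):
    """Discretization: evaluate the objective on a regular grid of actions."""
    candidates = [low + (high - low) * i / (n_points - 1) for i in range(n_points)]
    return min(candidates, key=objective)


def select_action_es(objective, low, high, n_evals, seed=0):
    """(1+1) evolution strategy with a simple multiplicative step-size rule."""
    rng = random.Random(seed)
    best = 0.5 * (low + high)
    best_val = objective(best)
    sigma = 0.25 * (high - low)
    for _ in range(n_evals - 1):
        # Mutate the current best action and clamp it to the action bounds.
        cand = min(high, max(low, best + sigma * rng.gauss(0.0, 1.0)))
        val = objective(cand)
        if val <= best_val:
            best, best_val = cand, val
            sigma *= 1.5   # success: widen the search
        else:
            sigma *= 0.9   # failure: narrow the search
    return best


def value_fn(x):
    """Stand-in for a learned Bellman-function regressor (assumption)."""
    return x ** 2


state = 1.0
objective = lambda a: expected_cost(state, a, value_fn)
a_grid = select_action_grid(objective, -2.0, 2.0, 9)
a_es = select_action_es(objective, -2.0, 2.0, 60)
print("grid action:", a_grid, "ES action:", a_es)
```

Both selectors receive the same `objective`, which mirrors the abstract's framing of discretization as one particular non-linear programming tool among others. On this smooth one-dimensional toy problem either approach is adequate; the paper's comparison concerns harder, higher-dimensional and less smooth cases.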

Keywords

Control; Dynamic programming; Non linear programming

Domains

Optimization and Control [math.OC]

Main file

sefordp.pdf (84.11 KB)

Origin : Files produced by the author(s)

Contributor : Olivier Teytaud

https://inria.hal.science/inria-00173202

Submitted on : Wednesday, September 19, 2007, 2:15:45 PM

Last modification on : Thursday, April 18, 2024, 3:52:14 PM

Long-term archiving on : Friday, April 9, 2010, 2:28:16 AM

Dates and versions

inria-00173202, version 1 (19-09-2007)

Identifiers

HAL Id : inria-00173202, version 1

Cite

Olivier Teytaud, Sylvain Gelly. Non linear programming for stochastic dynamic programming. Icinco 2007, 2007, Angers, France. ⟨inria-00173202⟩


Collections

X CNRS INRIA LIX X-LIX X-DEP-INFO PARISTECH INRIA2 TDS-MACS

