Dynamic programming for variable discounted Markov decision problems

Della Vecchia, Eugenio; Di Marco, Silvia; Vidal, Fernando

Objeto de conferencia

Registro en:

http://sedici.unlp.edu.ar/handle/10915/41704

http://43jaiio.sadio.org.ar/proceedings/SIO/17.pdf

issn:1850-2865

Autor

Della Vecchia, Eugenio

Di Marco, Silvia

Vidal, Fernando

Institución

Universidad Nacional de La Plata (Argentina)

Resumen

We study the existence of optimal strategies and value function of non stationary Markov decision processes under variable discounted criteria, when the action space is assumed to be Borel and the action space to be compact. With this new way of defining the value of a policy, we show existence of Markov deterministic optimal policies in the finite-horizon case, and a recursive method to obtain such ones. For the infinite horizon problem we characterize the value function and show existence of stationary deterministic policies. The approach presented is based on the use of adequate dynamic programming operators.

Sociedad Argentina de Informática e Investigación Operativa (SADIO)

Materias

Ciencias Informáticas

Markov decision processes

variable discount factor

Programming Environments

Decision problems

dynamic programming

Mostrar el registro completo del ítem