Dynamic programming for variable discounted Markov decision problems

dc.creator	Della Vecchia, Eugenio
dc.creator	Di Marco, Silvia
dc.creator	Vidal, Fernando
dc.date	2014-09
dc.date	2014
dc.date	2014-10-22T13:23:33Z
dc.identifier	http://sedici.unlp.edu.ar/handle/10915/41704
dc.identifier	http://43jaiio.sadio.org.ar/proceedings/SIO/17.pdf
dc.identifier	issn:1850-2865
dc.description	We study the existence of optimal strategies and the value function of nonstationary Markov decision processes under variable discounted criteria, when the state space is assumed to be Borel and the action space to be compact. With this new way of defining the value of a policy, we show the existence of Markov deterministic optimal policies in the finite-horizon case and give a recursive method to obtain them. For the infinite-horizon problem, we characterize the value function and show the existence of stationary deterministic optimal policies. The approach is based on the use of suitable dynamic programming operators.
dc.description	Sociedad Argentina de Informática e Investigación Operativa (SADIO)
dc.format	application/pdf
dc.format	50-62
dc.language	en
dc.rights	http://creativecommons.org/licenses/by/3.0/
dc.rights	Creative Commons Attribution 3.0 Unported (CC BY 3.0)
dc.subject	Computer Science
dc.subject	Markov decision processes
dc.subject	variable discount factor
dc.subject	Programming Environments
dc.subject	Decision problems
dc.subject	dynamic programming
dc.title	Dynamic programming for variable discounted Markov decision problems
dc.type	Conference object