dc.creatorDella Vecchia, Eugenio
dc.creatorDi Marco, Silvia
dc.creatorVidal, Fernando
dc.date2014-09
dc.date2014
dc.date2014-10-22T13:23:33Z
dc.identifierhttp://sedici.unlp.edu.ar/handle/10915/41704
dc.identifierhttp://43jaiio.sadio.org.ar/proceedings/SIO/17.pdf
dc.identifierissn:1850-2865
dc.descriptionWe study the existence of optimal strategies and value function of non stationary Markov decision processes under variable discounted criteria, when the action space is assumed to be Borel and the action space to be compact. With this new way of defining the value of a policy, we show existence of Markov deterministic optimal policies in the finite-horizon case, and a recursive method to obtain such ones. For the infinite horizon problem we characterize the value function and show existence of stationary deterministic policies. The approach presented is based on the use of adequate dynamic programming operators.
dc.descriptionSociedad Argentina de Informática e Investigación Operativa (SADIO)
dc.formatapplication/pdf
dc.format50-62
dc.languageen
dc.rightshttp://creativecommons.org/licenses/by/3.0/
dc.rightsCreative Commons Attribution 3.0 Unported (CC BY 3.0)
dc.subjectCiencias Informáticas
dc.subjectMarkov decision processes
dc.subjectvariable discount factor
dc.subjectProgramming Environments
dc.subjectDecision problems
dc.subjectdynamic programming
dc.titleDynamic programming for variable discounted Markov decision problems
dc.typeObjeto de conferencia
dc.typeObjeto de conferencia


Este ítem pertenece a la siguiente institución