dc.creator | Karanik, Marcelo J. | |
dc.creator | Gramajo, Sergio D. | |
dc.date.accessioned | 2009-09-17T19:26:43Z | |
dc.date.accessioned | 2019-05-28T15:16:21Z | |
dc.date.available | 2009-09-17T19:26:43Z | |
dc.date.available | 2019-05-28T15:16:21Z | |
dc.date.created | 2009-09-17T19:26:43Z | |
dc.date.issued | 2009-09-17T19:26:43Z | |
dc.identifier | 978-987-24967-3-9 | |
dc.identifier | http://hdl.handle.net/10226/473 | |
dc.identifier.uri | http://repositorioslatinoamericanos.uchile.cl/handle/2250/2877333 | |
dc.description.abstract | The software agents are programs that can perceive from their environment and they act to reach their design goals.
In most cases the selected agent architecture determines its behaviour in response to different problem states.
However, there are some problem domains in which it is desirable that the agent learns a good action execution
policy by interacting with its environment. This kind of learning is called Reinforcement Learning (RL) and is
useful in the process control area. Given a problem state, the agent selects the adequate action to do and receives an
immediate reward. Then it actualizes its estimations about every action and, after a certain period of time, the agent
learns which the best action to execute is. Most RL algorithms execute simple actions even if two o more can be
executed. This work involves the use of RL algorithms to find an optimal policy in a gridworld problem and
proposes a mechanism to combine actions of different types. | |
dc.language | en | |
dc.relation | Karanik, M. y Gramajo, S., (2009, julio). Actions Combination Method for Reinforcement Learning. Trabajo presentado en el Congreso de Inteligencia Computacional Aplicada (CICA), realizado en Buenos Aires del 23 al 24 de julio de 2009. | |
dc.subject | Reinforcement Learning | |
dc.subject | Actions Combination | |
dc.subject | SARSA | |
dc.subject | Optimal Policy | |
dc.title | Actions Combination Method for Reinforcement Learning | |
dc.type | Artículos de revistas | |