dc.creator | Koga, Marcelo Li | |
dc.creator | Silva, Valdinei Freire da | |
dc.creator | Cozman, Fabio Gagliardi | |
dc.creator | Costa, Anna Helena Reali | |
dc.date.accessioned | 2014-10-17T14:59:30Z | |
dc.date.accessioned | 2018-07-04T16:55:04Z | |
dc.date.available | 2014-10-17T14:59:30Z | |
dc.date.available | 2018-07-04T16:55:04Z | |
dc.date.created | 2014-10-17T14:59:30Z | |
dc.date.issued | 2013-05-10 | |
dc.identifier | http://www.producao.usp.br/handle/BDPI/46393 | |
dc.identifier | http://delivery.acm.org/10.1145/2490000/2484942/p119-koga.pdf?ip=143.107.107.100&id=2484942&acc=ACTIVE SERVICE&key=344E943C9DC262BB.0DBCED839AA5AFE8.4D4702B0C3E38B35.4D4702B0C3E38B35&CFID=585715597&CFTO | |
dc.identifier.uri | http://repositorioslatinoamericanos.uchile.cl/handle/2250/1642143 | |
dc.description.abstract | We are interested in the following general question: is it pos-
sible to abstract knowledge that is generated while learning
the solution of a problem, so that this abstraction can ac-
celerate the learning process? Moreover, is it possible to
transfer and reuse the acquired abstract knowledge to ac-
celerate the learning process for future similar tasks? We
propose a framework for conducting simultaneously two lev-
els of reinforcement learning, where an abstract policy is
learned while learning of a concrete policy for the problem,
such that both policies are refined through exploration and
interaction of the agent with the environment. We explore
abstraction both to accelerate the learning process for an op-
timal concrete policy for the current problem, and to allow
the application of the generated abstract policy in learning
solutions for new problems. We report experiments in a
robot navigation environment that show our framework to
be effective in speeding up policy construction for practical
problems and in generating abstractions that can be used to
accelerate learning in new similar problems. | |
dc.language | eng | |
dc.publisher | Saint Paul, Minnesota | |
dc.relation | International Conference on Autonomous Agents and Multiagent Systems, 11 | |
dc.rights | ACM | |
dc.rights | openAccess | |
dc.subject | Machine learning for robotics | |
dc.subject | Single agent learning | |
dc.subject | Evolu- tion and adaptation | |
dc.subject | Transfer learning | |
dc.title | Speeding-up reinforcement learning through abstraction and transfer learning | |
dc.type | Actas de congresos | |