Towards Autonomous Reinforcement Learning: Automatic Setting of Hyper-parameters using Bayesian Optimization

Barsce, Juan Cruz; Palombarini, Jorge Andrés; Martínez, Ernesto Carlos

info:eu-repo/semantics/article

Fecha

2018

Registro en:

Barsce, Juan Cruz; Palombarini, Jorge Andrés; Martínez, Ernesto Carlos; Towards Autonomous Reinforcement Learning: Automatic Setting of Hyper-parameters using Bayesian Optimization; CLEI (Latin-american Center for Informatics Studies); CLEI Electronic Journal; 21; 2; 2018; 1-22

0717-5000

http://hdl.handle.net/11336/86940

CONICET Digital

CONICET

https://repositorioslatinoamericanos.uchile.cl/handle/2250/4373002

Autor

Barsce, Juan Cruz

Palombarini, Jorge Andrés

Martínez, Ernesto Carlos

Institución

Consejo Nacional de Investigaciones Científicas y Tecnológicas (Argentina)

Resumen

With the increase of machine learning usage by industries and scientific communities in a variety of tasks such as text mining, image recognition and self-driving cars, automatic setting of hyper-parameter in learning algorithms is a key factor for obtaining good performances regardless of user expertise in the inner workings of the techniques and methodologies. In particular, for a reinforcement learning algorithm, the efficiency of an agent learning a control policy in an uncertain environment is heavily dependent on the hyper-parameters used to balance exploration with exploitation. In this work, an autonomous learning framework that integrates Bayesian optimization with Gaussian process regression to optimize the hyper-parameters of a reinforcement learning algorithm, is proposed. Also, a bandits-based approach to achieve a balance between computational costs and decreasing uncertainty about the extit{Q}-values, is presented. A gridworld example is used to highlight how hyper-parameter configurations of a learning algorithm (SARSA) are iteratively improved based on two performance functions.

Materias

REINFORCEMENT LEARNING

AUTONOMOUS SYSTEMS

BAYESIAN OPTIMIZATION

HYPER-PARAMETERS SETTING

Mostrar el registro completo del ítem