Reinforcement learning control of robot manipulator

COTRIM, LUCAS P.; JOSE, MARCOS M.; CABRAL, EDUARDO L.L.

dc.creator	COTRIM, LUCAS P.
dc.creator	JOSE, MARCOS M.
dc.creator	CABRAL, EDUARDO L.L.
dc.date	2021
dc.date	2022-03-15T15:41:33Z
dc.date	2022-03-15T15:41:33Z
dc.date.accessioned	2023-09-28T14:21:22Z
dc.date.available	2023-09-28T14:21:22Z
dc.identifier	2176-6649
dc.identifier	http://repositorio.ipen.br/handle/123456789/32793
dc.identifier	3
dc.identifier	13
dc.identifier	10.5335/rbca.v13i3.12091
dc.identifier	0000-0001-6632-2692
dc.identifier	Sem Percentil
dc.identifier	Sem Percentil CiteScore
dc.identifier.uri	https://repositorioslatinoamericanos.uchile.cl/handle/2250/9003012
dc.description	Since the establishment of robotics in industrial applications, industrial robot programming involves the repetitive and time-consuming process of manually specifying a fixed trajectory, resulting in machine idle time in production and the necessity of completely reprogramming the robot for different tasks. The increasing number of robotics applications in unstructured environments requires not only intelligent but also reactive controllers due to the unpredictability of the environment and safety measures, respectively. This paper presents a comparative analysis of two classes of Reinforcement Learning algorithms, value iteration (Q-Learning/DQN) and policy iteration (REINFORCE), applied to the discretized task of positioning a robotic manipulator in an obstacle-filled simulated environment, with no previous knowledge of the obstacles??? positions or of the robot arm dynamics. The agent???s performance and algorithm convergence are analyzed under different reward functions and on four increasingly complex test projects: 1-Degree of Freedom (DOF) robot, 2-DOF robot, Kuka KR16 Industrial robot, Kuka KR16 Industrial robot with random setpoint/obstacle placement. The DQN algorithm presented significantly better performance and reduced training time across all test projects, and the third reward function generated better agents for both algorithms.
dc.format	42-53
dc.relation	Revista Brasileira de Computa????o Aplicada
dc.rights	openAccess
dc.subject	control equipment
dc.subject	robots
dc.subject	manipulators
dc.subject	learning
dc.subject	artificial intelligence
dc.subject	neural networks
dc.title	Reinforcement learning control of robot manipulator
dc.type	Artigo de peri??dico
dc.coverage	I

Este ítem pertenece a la siguiente institución

Instituto de Pesquisas Energéticas e Nucleares (Brasil)