Aprendizaje de programas teleo-reactivos para robótica móvil

BLANCA ALICIA VARGAS GOVEA

Tesis

Registro en:

http://inaoe.repositorioinstitucional.mx/jspui/handle/1009/466

http://repositorioslatinoamericanos.uchile.cl/handle/2250/2258608

Autor

BLANCA ALICIA VARGAS GOVEA

Institución

Conacyt (México)

Resumen

In their beginnings, robots performed repetitive tasks in industrial environments or extremely controlled laboratories. At present, the scope of robotic applications has been extended and now robots have reached home and office environments developing service or assistant tasks. Their integration to society involves challenges and increasing difficulties because the robots require skills in non-controlled and populated environments. An autonomous robot needs skills to perform tasks and to react accordingly to unexpected events in dynamic environments. Programming a robot to have these skills is a complex and time consuming process. These difficulties have motivated to look for methods to ease the programming effort. In this thesis a learning methodology to simplify the programming effort of mobile robots is proposed. The methodology uses two techniques: (i) Teleo-Reactive Programs (PTRs for its name in Spanish) which have proved to be an effective framework to continuously perform a set of actions to achieve particular goals and to react in the presence of unexpected events, and (ii) behavioural cloning, a technique to learn skills by example inducing a model that can be used as a system controller. In this thesis, it is shown how a robot can learn PTRs from human guided traces. A user guides a robot to perform a task and the robot learns how to perform such task in similar dynamic environments. Our approach follows three steps: (i) it transforms traces with low-level sensor information into high-level traces based on natural landmarks, (ii) it learns PTRs that express when to perform an action to achieve simple tasks using an Inductive Logic Programming (ILP) system, and (iii) it learns hierarchical PTRs that express how to achieve goals by following particular sequences of actions using FOSeq, a proposed grammar induction algorithm. We test the approach in a robotics scenario with both simulated and real environments and show that the robot is able to accomplish several navigation tasks with the learned TRPs in different dynamic and unknown environments with good precision. It is also shown a comparison in simulation against a non-reactive approach. FOSeq was also used to learn gesture grammars with competitive results when compare with a recent state-of-the-art system. Classification accuracy with FOSeq is 97.34% and with HMM is 97.56%.

En sus inicios, los robots desempeñaban tareas repetitivas en ambientes industriales muy controlados. Al paso del tiempo las aplicaciones de los robots se han ido diversificando hasta el punto de empezar a integrarse a la sociedad desempeñando tareas en casas y oficinas. Esta integración implica nuevas dificultades y retos pues las habilidades que se necesitan en ambientes no controlados son mayores. El robot está expuesto a personas en movimiento, mobiliario que cambia de posición por lo que el robot necesita, por ejemplo, habilidades de desplazamiento, evasión de obstáculos, entre otras que le permitan integrarse al ambiente. Dos de las características que un robot debe tener para lograr cierta autonomía son: i) contar con habilidades para realizar tareas y ii) responder adecuadamente a eventos en ambientes dinámicos. La programación de un robot no es fácil y la incorporación de este tipo de habilidades y comportamientos puede ser un proceso largo y tedioso. Estos aspectos han motivado la búsqueda de métodos que permitan que la programación de robots sea más rápida y menos compleja. En esta tesis se propone una metodología de aprendizaje que simplifica la programación de robots móviles en ambientes interiores. Se utilizan dos técnicas, la primera es el aprendizaje de Programas Teleo-Reactivos (PTRs) [Nilsson, 1994], que son programas que están orientados a metas y al comportamiento en ambientes dinámicos. La segunda técnica, conocida como clonación [Michie y Sammut, 1995] ha sido utilizada en el aprendizaje de habilidades a partir de ejemplos y su objetivo es la inducción de modelos o conjuntos de reglas que puedan integrarse como controladores en un sistema. Se muestra cómo un robot aprende PTRs a partir de ejemplos o trazas generados por una persona. El usuario guía al robot a ejecutar la tarea y el robot aprende a realizarla. Se propone un aprendizaje en tres fases: (i) transformación de trazas con información de bajo nivel de los sensores a trazas de alto nivel mediante un proceso de identificación de marcas naturales.