Tesis de Maestría / master Thesis
Detection of suspicious attitudes on video using neuroevolved shallow and deep neural networks models
Fecha
2021-11Registro en:
1048740
Autor
Flores Munguía, Carlos
Institución
Resumen
The analysis of surveillance cameras is a critical task usually limited by the people involved in the video supervision devoted to such a task, their knowledge, and their judgment. Security guards protect other people from different events that can compromise their security, like robbery, extortion, fraud, vehicle theft, and more, converting them to an essential part of this type of protection system. If they are not paying attention, crimes may be overlooked.
Nonetheless, different approaches have arisen to automate this task. The methods are mainly based on machine learning and benefit from developing neural networks that extract underlying information from input videos. However, despite how competent those networks have proved to be, developers must face the challenging task of defining the architecture and hyperparameters that allow the network to work adequately and optimize the use of computational resources. Furthermore, selecting the architecture and hyperparameters may significantly impact the neural networks’ performance if it is not carried out adequately.
No matter the type of neural network used, shallow, dense, convolutional, 3D convolutional, or recurrent; hyperparameter selection must be performed using empirical knowledge thanks to the expertise of the designer, or even with the help of automated approaches like Random Search or Bayesian Optimization. However, such methods suffer from problems like not covering the solution space well, especially if the space is made up of large dimensions. Alternatively, the requirement to evaluate the models many times to get more information about the evaluation of the objective function, employing a diverse set of hyperparameters.
This work proposes a model that generates, through a genetic algorithm, neural networks for behavior classification within videos. The application of genetic algorithms allows the exploration in the hyperparameters solution space in different directions simultaneously.
Two types of neural networks are evolved as part of the thesis work: shallow and deep networks, the latter based on dense layers and 3D convolutions. Each sort of network takes distinct input data types: the evolution of people’s pose and videos’ sequences, respectively. Shallow neural networks are generated by NeuroEvolution of Augmented Topologies (NEAT), while CoDeepNEAT generates deep networks. NEAT uses a direct encoding, meaning that each node and connection in the network is directly represented in the chromosome. In contrast, CoDeepNEAT uses indirect encoding, making use of cooperative coevolution of blueprints and modules. This work trains networks and tests them using the Kranok-NV dataset, which exhibited better results than their competitors on various standard metrics.