Clasificación de palabras no pronunciadas presentes en electroencefalogramas (EEG)

ALEJANDRO ANTONIO TORRES GARCIA

info:eu-repo/semantics/masterThesis

Registro en:

http://inaoe.repositorioinstitucional.mx/jspui/handle/1009/732

https://repositorioslatinoamericanos.uchile.cl/handle/2250/7805949

Autor

ALEJANDRO ANTONIO TORRES GARCIA

Institución

Instituto Nacional de Astrofísica, Óptica y Electrónica (México)

Resumen

The present work aims to the development of a method for the processing and classification of electroencephalographic signals (EEG), belonging of a single subject, to recognize unspoken words belonging to a reduced vocabulary. Specifically, a precise recognition would allow to mentally control a computer screen cursor, thus, with this purpose, the vocabulary is composed by the Spanish language words: “arriba”,“abajo”, “izquierda”, “derecha”, and “seleccionar”. To start the process, the EEG signals were acquired using a protocol based on markers, to know a priori in what part of the EEG signal a subject imagines the pronunciation of an indicated word. With the markers were formed windows, between which the samples could be found. To accomplish the objective, only the signals from channels F7, FC5, T7 y P7 (which are the nearest to the Wernicke and Broca areas) were taken in count. After that, the signals coming from those channels were filtered using a band-pass FIR filter at 4-25 Hz., and the windows were adjusted to 256 samples. Furthermore, features were obtained using discrete wavelet transform (DWT) to train and assess four classifiers: Naive Bayes (NB), Random Forests (RF), Support Vector Machines (SVM) and Bagging-RF. The evaluation measures are accuracy and kappa index obtained using 10-folds cross validation. EEG signals belonging to 21 subjects were recorded, for each subject around 30 windows for each word were obtained. In the case of the best classifier, Bagging- RF, the accuracy rate and kappa index average were 40.48% and 0.24, respectively. Furthermore, an analysis at the unspoken words level classification for Bagging- RF and Random Forests was performed, where the classifiers were assessed with recall, specificity, precision, and f-measure. The results are still far of the precision needed to effectively control a BCI, but they are encouraging because accuracy rates are above 20 %, i.e. above chance for five classes, and kappa values are greater than 0, which are values comparable to state of the art research. Based on them, it can be concluded that EEG signals actually, contain information to allow the classification of the unspoken words belonging to a reduced vocabulary by the proposed method. That is why, unspoken speech turns out to be a promising and intuitive electrophysiologic source to control BCIs.

La presente investigación tiene como objetivo desarrollar un método de procesamiento y clasificación de señales electroencefalográficas (EEG) de un individuo para reconocer palabras no pronunciadas de un vocabulario reducido. Específicamente, un reconocimiento preciso permitirá controlar mentalmente el cursor de una computadora, ya que, con ese fin, el vocabulario consta de las palabras del lenguaje español: “arriba”, “abajo”, “izquierda”, “derecha”, y “seleccionar”. Para iniciar el proceso, las señales EEG se adquieren utilizando un protocolo básico basado en marcadores, para saber a priori en qué parte de la señal la persona imagina la dicción (habla no pronunciada) de la palabra indicada, con los marcadores se forman ventanas. Para cumplir el objetivo, únicamente son de interés las señales de los canales F7, FC5, T7 y P7 (los más cercanos a las áreas cerebrales de Broca y Wernicke). Posteriormente, éstas son filtradas utilizando un FIR pasa-bandas en el rango de 4-25 Hz, y las ventanas son ajustadas a 256 muestras. Además, se extraen características mediante la transformada wavelet discreta que son utilizadas para entrenar y evaluar a cuatro clasificadores: Naive Bayes (NB), Random Forests (RF), Máquina de vectores de soporte (SVM), y Bagging de Random Forest (Bagging-RF). Las medidas de evaluación son la exactitud y el coeficiente kappa obtenidas mediante validación cruzada con 10 pliegues. Se grabaron señales EEG de 21 individuos, para cada uno de los cuales se tienen alrededor de 30 ventanas correspondientes a cada una de las palabras. Para el caso del mejor clasificador Bagging-RF, los valores promedio tanto del porcentaje de exactitud como para el coeficiente kappa son 40.48% y 0.24, respectivamente. Además, se presenta un análisis de Bagging-RF y RF a nivel de la clasificación de las palabras no pronunciadas, donde se evalúa a los clasificadores con el recuerdo, la especificidad, la precisión y la medida-F. Los resultados aún distan de lo necesario para controlar eficazmente a una BCI sin embargo, debido a que los porcentajes de exactitud están arriba del 20 %, es decir, arriba del azar para cinco clases, y los valores de kappa mayores a 0, se puede concluir que las señales EEG efectivamente contienen información que hace posible la clasificación de las palabras no pronunciadas del vocabulario reducido mediante el método propuesto.

Materias

info:eu-repo/classification/Electroencefalografía/Electroencephalography

info:eu-repo/classification/Clasificación/Classification

info:eu-repo/classification/Cerebro/Brain

info:eu-repo/classification/cti/1

info:eu-repo/classification/cti/12

info:eu-repo/classification/cti/1203

Mostrar el registro completo del ítem