dc.contributor | Cerri, Ricardo | |
dc.contributor | http://lattes.cnpq.br/6266519868438512 | |
dc.contributor | http://lattes.cnpq.br/2590190539349649 | |
dc.contributor | https://orcid.org/0000-0002-2582-1695 | |
dc.creator | Silva, Luan Vinicius Moraes da | |
dc.date.accessioned | 2023-06-13T12:11:40Z | |
dc.date.accessioned | 2023-09-04T20:27:50Z | |
dc.date.available | 2023-06-13T12:11:40Z | |
dc.date.available | 2023-09-04T20:27:50Z | |
dc.date.created | 2023-06-13T12:11:40Z | |
dc.date.issued | 2023-04-06 | |
dc.identifier | SILVA, Luan Vinicius Moraes da. Investigação de métodos de seleção de atributos para problemas de classificação hierárquica multirrótulo. 2023. Trabalho de Conclusão de Curso (Graduação em Engenharia de Computação) – Universidade Federal de São Carlos, São Carlos, 2023. Disponível em: https://repositorio.ufscar.br/handle/ufscar/18137. | |
dc.identifier | https://repositorio.ufscar.br/handle/ufscar/18137 | |
dc.identifier.uri | https://repositorioslatinoamericanos.uchile.cl/handle/2250/8630683 | |
dc.description.abstract | Classification is the task of assigning data instances to classes. In Hierarchical Multi- label Classification, instances may belong to two or more classes (labels) simultaneously, where the classes are hierarchically structured. Feature Selection is part of the data pre- processing step and plays an important role in classification tasks for Machine Learning, as it can effectively reduce the size of the dataset, removing irrelevant/redundant attributes and improving prediction performance of the classifier. Although many real-world prob- lems are from multi-label hierarchical domain, most related research addresses the feature selection task focusing on single-label problems. In many works, even when the proposal addresses multiple labels, the associated class structure is not hierarchical. Therefore, in this work, we study how feature selection can be used in the context of Hierarchical Multi- Label Classification. For this purpose, we compare global feature selectors known in the literature with flat feature selectors adapted for hierarchical structures. The global fea- ture selectors used were Relief, Genie3 and Symbolic, and the flat feature selectors were ReliefF and Information Gain. For flat selectors, strategies were adopted to transform the Hierarchical Multi-label problem into a non-hierarchical multi-label problem, using the Label Powerset and Binary Relevance transformations. As main results, the global evaluators produced subsets of relevant features, improving the predictive performance while reducing the original dataset by up to 75% of the original dimensionality, with emphasis on the evaluators based on the Genie3 and Symbolic set. Despite the improvement, the flat evaluators were proportionally better compared to the global evaluators. | |
dc.language | por | |
dc.publisher | Universidade Federal de São Carlos | |
dc.publisher | UFSCar | |
dc.publisher | Câmpus São Carlos | |
dc.publisher | Engenharia de Computação - EC | |
dc.rights | http://creativecommons.org/licenses/by-nc-nd/3.0/br/ | |
dc.rights | Attribution-NonCommercial-NoDerivs 3.0 Brazil | |
dc.subject | Seleção de atributos | |
dc.subject | Classificação hierárquica multirrótulo | |
dc.subject | Aprendizado de máquina | |
dc.title | Investigação de métodos de seleção de atributos para problemas de classificação hierárquica multirrótulo | |
dc.type | TCC | |