Artículo de revista
Feature selection for Support Vector Machines via Mixed Integer Linear Programming
Fecha
2014Registro en:
Information Sciences 279 (2014) 163–175
DOI: 10.1016/j.ins.2014.03.110
Autor
Maldonado, Sebastián
Pérez, Juan
Weber, Richard
Labbé, Martine
Institución
Resumen
The performance of classification methods, such as Support Vector Machines, depends
heavily on the proper choice of the feature set used to construct the classifier. Feature
selection is an NP-hard problem that has been studied extensively in the literature. Most
strategies propose the elimination of features independently of classifier construction by
exploiting statistical properties of each of the variables, or via greedy search. All such strategies
are heuristic by nature. In this work we propose two different Mixed Integer Linear
Programming formulations based on extensions of Support Vector Machines to overcome
these shortcomings. The proposed approaches perform variable selection simultaneously
with classifier construction using optimization models. We ran experiments on real-world
benchmark datasets, comparing our approaches with well-known feature selection techniques
and obtained better predictions with consistently fewer relevant features.