Artículos de revistas
Empirical Comparison Of Cross-validation And Internal Metrics For Tuning Svm Hyperparameters
Registro en:
Pattern Recognition Letters. Elsevier Science Bv, v. 88, p. 6 - 11, 2017.
0167-8655
1872-7344
WOS:000396957800002
10.1016/j.patrec.2017.01.007
Autor
Duarte
Edson; Wainer
Jacques
Institución
Resumen
Hyperparameter tuning is a mandatory step for building a support vector machine classifier. In this work, we study some methods based on metrics of the training set itself, and not the performance of the classifier on a different test set - the usual cross-validation approach. We compare cross-validation (5-fold) with Xi-alpha, radius-margin bound, generalized approximate cross validation, maximum discrepancy and distance between two classes on 110 public binary data sets. Cross validation is the method that resulted in the best selection of the hyper-parameters, but it is also the method with one of the highest execution time. Distance between two classes (DBTC) is the fastest and the second best ranked method. We discuss that DBTC is a reasonable alternative to cross validation when training/hyperparameter-selection times are an issue and that the loss in accuracy when using DBTC is reasonably small. (C) 2017 Published by Elsevier B.V. 88 6 11