dc.contributorMonroy Borja, Raúl
dc.contributorSchool of Engineering and Sciences
dc.contributorMorales Manzanares, Eduardo
dc.contributorGutiérrez Rodríguez, Andrés Eduardo
dc.contributorCantú Ortiz, Francisco
dc.contributorConant Pablos, Santiago
dc.contributorMedina Pérez, Miguel Angel
dc.contributorCampus Estado de México
dc.contributorpuemcuervo, emipsanchez
dc.creatorCAÑETE SIFUENTES, LEONARDO MAURICIO; 787723
dc.creatorCañete Sifuentes, Leonardo Mauricio
dc.date.accessioned2023-06-01T22:21:51Z
dc.date.accessioned2023-07-19T19:26:42Z
dc.date.available2023-06-01T22:21:51Z
dc.date.available2023-07-19T19:26:42Z
dc.date.created2023-06-01T22:21:51Z
dc.date.issued2022-11
dc.identifierCañete Sifuentes, L. M. (2022). A novel functional tree for class imbalance problems [Tesis Doctorado]. Instituto Tecnológico y de Estudios Superiores de Monterrey. Recuperado de: https://hdl.handle.net/11285/650793
dc.identifierhttps://hdl.handle.net/11285/650793
dc.identifierhttps://orcid.org/0000-0003-3175-8917
dc.identifier787723
dc.identifier.urihttps://repositorioslatinoamericanos.uchile.cl/handle/2250/7716166
dc.description.abstractDecision trees (DTs) are popular classifiers partly because they provide models that are easy to explain and because they show remarkable performance. To improve the classification performance of individual DTs, researchers have used linear combinations of features in inner nodes (Multivariate Decision Trees), leaf nodes (Model Trees), or both (Functional Trees). Our general objective is to develop a DT using linear feature combinations that outperforms the rest of such DTs in terms of classification performance as measured by the Area Under the ROC Curve (AUC), particularly in class imbalance problems, where one of the classes in the database has few objects compared to another class. We establish that, in terms of classification performance, there exists a hierarchy, where Functional Trees (FTs) surpass Model Trees, that in turn surpass Multivariate Decision Trees. Having shown that Gama's FT, the only FT to date, has the best classification performance, we identify limitations that hinder its classification performance. To improve the classification performance of FTs, we introduce the Functional Tree for class imbalance problems (FT4cip), which takes care in each design decision to improve AUC. The decision of what pruning method to use led us to the design of the AUC-optimizing Cost-Complexity pruning algorithm, a novel pruning algorithm that does not degrade classification performance in class imbalance problems because it optimizes AUC. We show how each design decision taken when building FT4cip contributes to classification performance or to simple tree models. We demonstrate through a set of tests that FT4cip outperforms Gama's FT and excels in class imbalance problems. All our results are supported by a thorough experimental comparison in 110 databases using Bayesian statistical tests.
dc.languageeng
dc.publisherInstituto Tecnológico y de Estudios Superiores de Monterrey
dc.relationpublishedVersion
dc.relationREPOSITORIO NACIONAL CONACYT
dc.rightshttp://creativecommons.org/licenses/by/4.0
dc.rightsopenAccess
dc.titleA novel functional tree for class imbalance problems
dc.typeTesis Doctorado / doctoral Thesis


Este ítem pertenece a la siguiente institución