dc.creator: Anilú Franco Arcega
dc.creator: Jesús Ariel Carrasco Ochoa
dc.creator: José Francisco Martínez Trinidad
dc.date: 2011
dc.date.accessioned: 2023-07-25T16:23:57Z
dc.date.available: 2023-07-25T16:23:57Z
dc.identifier: http://inaoe.repositorioinstitucional.mx/jspui/handle/1009/1592
dc.identifier.uri: https://repositorioslatinoamericanos.uchile.cl/handle/2250/7806787
dc.description: Several algorithms have been proposed in the literature for building decision trees (DT) for large datasets. However, almost all of them have memory restrictions because they need to keep the whole training set, or a large part of it, in main memory. Algorithms that avoid these memory restrictions by choosing a subset of the training set need extra time to make this selection, or have parameters that can be very difficult to determine. In this paper, we introduce a new algorithm that builds decision trees using a fast splitting attribute selection (DTFS) for large datasets. The proposed algorithm builds a DT without storing the whole training set in main memory and has only one parameter, with respect to which it is very stable. Experimental results on both real and synthetic datasets show that our algorithm is faster than three of the most recent algorithms for building decision trees for large datasets, while achieving competitive accuracy.
dc.format: application/pdf
dc.language: eng
dc.publisher: Elsevier Ltd.
dc.relation: citation: Franco-Arcega, A., et al. (2011). Decision tree induction using a fast splitting attribute selection for large datasets, Expert Systems with Applications, (38): 14290–14300
dc.rights: info:eu-repo/semantics/openAccess
dc.rights: http://creativecommons.org/licenses/by-nc-nd/4.0
dc.subject: info:eu-repo/classification/Decision trees/Decision trees
dc.subject: info:eu-repo/classification/Large datasets/Large datasets
dc.subject: info:eu-repo/classification/Gain-ratio criterion/Gain-ratio criterion
dc.subject: info:eu-repo/classification/cti/1
dc.subject: info:eu-repo/classification/cti/12
dc.subject: info:eu-repo/classification/cti/1203
dc.title: Decision tree induction using a fast splitting attribute selection for large datasets
dc.type: info:eu-repo/semantics/article
dc.type: info:eu-repo/semantics/acceptedVersion
dc.audience: students
dc.audience: researchers
dc.audience: generalPublic

