Um estudo exploratório sobre o uso de diferentes algoritmos de classificação, de seleção de métricas, e de agrupamento na construção de modelos de predição cruzada de defeitos entre projetos

Satin, Ricardo Francisco de Pierre

dc.contributor	Ré, Reginaldo
dc.contributor	http://lattes.cnpq.br/5545891505433768
dc.contributor	Ré, Reginaldo
dc.contributor	Fabri, José Augusto
dc.contributor	Garcia, Rogério Eduardo
dc.creator	Satin, Ricardo Francisco de Pierre
dc.date.accessioned	2017-10-27T17:21:01Z
dc.date.accessioned	2022-12-06T15:40:10Z
dc.date.available	2017-10-27T17:21:01Z
dc.date.available	2022-12-06T15:40:10Z
dc.date.created	2017-10-27T17:21:01Z
dc.date.issued	2015-08-18
dc.identifier	SATIN, Ricardo Francisco de Pierre. Um estudo exploratório sobre o uso de diferentes algoritmos de classificação, de seleção de métricas, e de agrupamento na construção de modelos de predição cruzada de defeitos entre projetos. 2015. 102 f. Dissertação (Mestrado em Informática) - Universidade Tecnológica Federal do Paraná, Cornélio Procópio, 2016.
dc.identifier	http://repositorio.utfpr.edu.br/jspui/handle/1/2552
dc.identifier.uri	https://repositorioslatinoamericanos.uchile.cl/handle/2250/5269011
dc.description.abstract	To predict defects in software projects is a complex task, especially for those projects that are in early stages of development by, often, providing few data for prediction models. The use of cross-project defect prediction is indicated in such a situation because it allows reuse data of similar projects. This work proposes an exploratory study on the use of different classification algorithms, of selection metrics, and grouping to build cross-project defect predictions models. This model was built using a performance measure, obtained by applying classification algorithms aim to find and group similar projects. Therefore, it was studied the application of 8 classification algorithms, 6 feature selection, and a cluster in a data set with 1283 projects, resulting in the construction of 61584 different prediction models. The classification algorithms and feature selection had their performance evaluated through different statistical tests showed that: the Naive Bayes was the best performance classifier, as compared with other 7 algorithms; the pair of feature selection algorithms that performed better was formed by CFS attribute evaluator and search method Genetic Search, compared with 6 other pairs. Considering the clustering algorithm, this proposal seems to be promising, since the results shows evidence that the predictions were best grouping using the predictions performed without any similarity clustering, and shows the decrease in training cost and testing during the prediction process.
dc.publisher	Universidade Tecnológica Federal do Paraná
dc.publisher	Cornelio Procopio
dc.publisher	Brasil
dc.publisher	Programa de Pós-Graduação em Informática
dc.publisher	UTFPR
dc.rights	openAccess
dc.subject	Teoria da previsão
dc.subject	Software - Desenvolvimento
dc.subject	Falhas de sistemas de computação
dc.subject	Prediction theory
dc.subject	Computer software - Development
dc.subject	Computer system failures
dc.title	Um estudo exploratório sobre o uso de diferentes algoritmos de classificação, de seleção de métricas, e de agrupamento na construção de modelos de predição cruzada de defeitos entre projetos
dc.type	masterThesis

Este ítem pertenece a la siguiente institución

Universidade Tecnológica Federal do Paraná (Brasil)