Actas de congresos
ClusterOSS: a new undersampling method for imbalanced learning
Fecha
2014-10Registro en:
Brazilian Conference on Intelligent Systems, 3th; Encontro Nacional de Inteligência Artificial e Computacional, 11th, 2014, São Carlos.
Autor
Barella, Victor Hugo
Costa, Eduardo de Paula
Carvalho, André Carlos Ponce de Leon Ferreira de
Institución
Resumen
A dataset is said to be imbalanced when its classes are disproportionately represented in terms of the number of instances they contain. This problem is common in applications such as medical diagnosis of rare diseases, detection of fraudulent calls, signature recognition. In this paper we propose an alternative method for imbalanced learning, which balances the dataset using an undersampling strategy. We show that ClusterOSS outperforms OSS, which is the method ClusterOSS is based on. Moreover, we show that the results can be further improved by combining ClusterOSS with random oversampling.