Actas de congresos
Density-based clustering validation
Fecha
2014-04Registro en:
SIAM International Conference on Data Mining, 2014, Philadelphia.
9781611973440
Autor
Moulavi, Davoud
Jaskowiak, Pablo Andretta
Campello, Ricardo José Gabrielli Barreto
Zimek, Arthur
Sander, Jörg
Institución
Resumen
One of the most challenging aspects of clustering is validation, which is the objective and quantitative assessment of clustering results. A number of different relative validity criteria have been proposed for the validation of globular, clusters. Not all data, however, are composed of globular clusters. Density-based clustering algorithms seek partitions with high density areas of points (clusters, not necessarily globular) separated by low density areas, possibly containing noise objects. In these cases relative validity indices proposed for globular cluster validation may fail. In this paper we propose a relative validation index for density-based, arbitrarily shaped clusters. The index assesses clustering quality based on the relative density connection between pairs of objects. Our index is formulated on the basis of a new kernel density function, which is used to compute the density of objects and to evaluate the within- and between-cluster density connectedness of clustering results. Experiments on synthetic and real world data show the effectiveness of our approach for the evaluation and selection of clustering algorithms and their respective appropriate parameters.