Artículos de revistas
A new index for clustering validation with overlapped clusters
Fecha
2016-08Registro en:
Campo, David Nazareno; Stegmayer, Georgina; Milone, Diego Humberto; A new index for clustering validation with overlapped clusters; Pergamon-Elsevier Science Ltd; Expert Systems with Applications; 64; 8-2016; 549-556
0957-4174
CONICET Digital
CONICET
Autor
Campo, David Nazareno
Stegmayer, Georgina
Milone, Diego Humberto
Resumen
External validation indexes allow similarities between two clustering solutions to be quantified. With classical external indexes, it is possible to quantify how similar two disjoint clustering solutions are, where each object can only belong to a single cluster. However, in practical applications, it is common for an object to have more than one label, thereby belonging to overlapped clusters; for example, subjects that belong to multiple communities in social networks. In this study, we propose a new index based on an intuitive probabilistic approach that is applicable to overlapped clusters. Given that recently there has been a remarkable increase in the analysis of data with naturally overlapped clusters, this new index allows to comparing clustering algorithms correctly. After presenting the new index, experiments with artificial and real datasets are shown and analyzed. Results over a real social network are also presented and discussed. The results indicate that the new index can correctly measure the similarity between two partitions of the dataset when there are different levels of overlap in the analyzed clusters.