dc.creator | Souza, Jessica Andressa de | |
dc.creator | Cazzolato, Mirela Teixeira | |
dc.creator | Traina, Agma Juci Machado | |
dc.date.accessioned | 2016-10-19T21:23:03Z | |
dc.date.accessioned | 2018-07-04T17:10:46Z | |
dc.date.available | 2016-10-19T21:23:03Z | |
dc.date.available | 2018-07-04T17:10:46Z | |
dc.date.created | 2016-10-19T21:23:03Z | |
dc.date.issued | 2016-04 | |
dc.identifier | Symposium on Applied Computing, 31st, 2016, Pisa. | |
dc.identifier | 9781450337397 | |
dc.identifier | http://www.producao.usp.br/handle/BDPI/50999 | |
dc.identifier | http://dx.doi.org/10.1145/2851613.2851661 | |
dc.identifier.uri | http://repositorioslatinoamericanos.uchile.cl/handle/2250/1645726 | |
dc.description.abstract | An efficient and effective clustering process is a core task of data mining analysis, and has become more important in the nowadays scenario of big data, where scalability is an issue. In this paper we present the ClusMAM method, which proposes a new strategy for clustering large complex datasets through metric access methods. ClusMAM aims at accelerating the process of relational partitional clustering by taking advantage of the inherent node separations of metric access methods. In comparison with other methods from the literature, ClusMAM is up to four orders of magnitude faster than the competitors maintaining clustering quality. Additionally, ClusMAM exploits the datasets to find compact and coherent clusters, suggesting the number of clusters k found in the data. The method was evaluated employing synthetic and real datasets, and the behavior of the method was consistent regarding the number of distance calculations and time required for the clustering process as well. | |
dc.language | eng | |
dc.publisher | Association for Computing Machinery - ACM | |
dc.publisher | University of Pisa | |
dc.publisher | Scuola Superiore Sant’Anna | |
dc.publisher | Pisa | |
dc.relation | Symposium on Applied Computing, 31st | |
dc.rights | Copyright ACM | |
dc.rights | closedAccess | |
dc.subject | Metric access methods | |
dc.subject | unsupervised clustering | |
dc.subject | complex datasets | |
dc.subject | multimedia indexing | |
dc.title | ClusMAM: fast and effective unsupervised clustering of large complex datasets using metric access methods | |
dc.type | Actas de congresos | |