dc.creator | Errecalde, Marcelo L. | |
dc.creator | Cagnina, Leticia Cecilia | |
dc.creator | Rosso, Paolo | |
dc.date.accessioned | 2016-08-12T17:02:51Z | |
dc.date.accessioned | 2018-11-06T15:53:17Z | |
dc.date.available | 2016-08-12T17:02:51Z | |
dc.date.available | 2018-11-06T15:53:17Z | |
dc.date.created | 2016-08-12T17:02:51Z | |
dc.date.issued | 2015-08-14 | |
dc.identifier | Errecalde, Marcelo L.; Cagnina, Leticia Cecilia; Rosso, Paolo ; Silhouette + Attraction: A Simple and Effective Method for Text Clustering; Cambridge University Press; Natural Language Engineering; 1; 14-8-2015; 1-40 | |
dc.identifier | 1351-3249 | |
dc.identifier | http://hdl.handle.net/11336/7135 | |
dc.identifier.uri | http://repositorioslatinoamericanos.uchile.cl/handle/2250/1902178 | |
dc.description.abstract | This article presents Sil-Att, a simple and effective method for text clustering, which is based on two main concepts: the silhouette coefficient and the idea of attraction. The combination of both principles allows to obtain a general technique that can be used either as a boosting method, which improves results of other clustering algorithms, or as an independent clustering algorithm. The experimental work shows that Sil-Att is able to obtain high quality results on text corpora with very different characteristics. Furthermore, its stable performance on all the considered corpora is indicative that it is a very robust method. This is a very interesting positive aspect of Sil-Att with respect to the other algorithms used in the experiments, whose performances heavily depend on specific characteristics of the corpora being considered. | |
dc.language | eng | |
dc.publisher | Cambridge University Press | |
dc.relation | info:eu-repo/semantics/altIdentifier/doi/http://dx.doi.org/10.1017/S1351324915000273 | |
dc.relation | info:eu-repo/semantics/altIdentifier/url/http://journals.cambridge.org/action/displayAbstract?fromPage=online&aid=9910907 | |
dc.rights | https://creativecommons.org/licenses/by-nc-sa/2.5/ar/ | |
dc.rights | info:eu-repo/semantics/restrictedAccess | |
dc.subject | CLUSTERING | |
dc.subject | SHORT TEXTS CORPORA | |
dc.subject | ATTRACTION | |
dc.subject | SILHOUETTE | |
dc.title | Silhouette + Attraction: A Simple and Effective Method for Text Clustering | |
dc.type | Artículos de revistas | |
dc.type | Artículos de revistas | |
dc.type | Artículos de revistas | |