Comparative Study of Clustering Algorithms in Text Mining Context

JALIL, Abdennour Mohamed; HAFIDI, Imad; ALAMI, Lamiae; ENSA, Khouribga

Articulo Revista Indexada

Registro en:

1989-1660

https://reunir.unir.net/handle/123456789/11227

http://doi.org/ 10.9781/ijimai.2016.376

https://repositorioslatinoamericanos.uchile.cl/handle/2250/5905550

Autor

JALIL, Abdennour Mohamed

HAFIDI, Imad

ALAMI, Lamiae

ENSA, Khouribga

Institución

Universidad internacional de la Rioja (Colombia)

Resumen

The spectacular increasing of Data is due to the appearance of networks and smartphones. Amount 42% of world population using internet [1]; have created a problem related of the processing of the data exchanged, which is rising exponentially and that should be automatically treated. This paper presents a classical process of knowledge discovery databases, in order to treat textual data. This process is divided into three parts: preprocessing, processing and postprocessing. In the processing step, we present a comparative study between several clustering algorithms such as KMeans, Global KMeans, Fast Global KMeans, Two Level KMeans and FWKmeans. The comparison between these algorithms is made on real textual data from the web using RSS feeds. Experimental results identified two problems: the first one quality results which remain for algorithms, which rapidly converge. The second problem is due to the execution time that needs to decrease for some algorithms.

Materias

algorithms

clustering

data

text mining

IJIMAI

Mostrar el registro completo del ítem