Articulo Revista Indexada
Comparative Study of Clustering Algorithms in Text Mining Context
Autor
JALIL, Abdennour Mohamed
HAFIDI, Imad
ALAMI, Lamiae
ENSA, Khouribga
Institución
Resumen
The spectacular increasing of Data is due to
the appearance of networks and smartphones. Amount 42% of
world population using internet [1]; have created a problem
related of the processing of the data exchanged, which is rising
exponentially and that should be automatically treated. This
paper presents a classical process of knowledge discovery
databases, in order to treat textual data. This process is
divided into three parts: preprocessing, processing and postprocessing. In the processing step, we present a comparative
study between several clustering algorithms such as KMeans,
Global KMeans, Fast Global KMeans, Two Level KMeans and
FWKmeans. The comparison between these algorithms is made
on real textual data from the web using RSS feeds. Experimental
results identified two problems: the first one quality results
which remain for algorithms, which rapidly converge. The
second problem is due to the execution time that needs to
decrease for some algorithms.