Articulo
Detecting trends on the web: a multidisciplinary approach
INFORMATION FUSION;
Inf. Fusion
Registro en:
1872-6305
D10I1198
D10I1198
WOS:000337863500013
1566-2535
Autor
Duenas-Fernandez, Rodrigo
Velasquez, Juan D.
L'Huillier, Gaston
Institución
Resumen
This paper introduces a framework for trend modeling and detection on the Web through the usage of Opinion Mining and Topic Modeling tools based on the fusion of freely available information. This framework consists of a four step model that runs periodically: crawl a set of predefined sources of documents; search for potential sources and extract topics from the retrieved documents; retrieve opinionated documents from social networks for each detected topic and extract sentiment information from them. The proposed framework was applied to a set of 20 sources of documents over a period of 8 months. After the analysis period and that the proposed experiments were run, an F-Measure of 0.56 was obtained for the detection of significant events, implying that the proposed framework is a feasible model of how trends could be represented through the analysis of documents freely available on the Web. (C) 2014 Elsevier B.V. All rights reserved. This work was partially supported by FONDEF project D10I-1198, entitled WHALE: Web Hypermedia Analysis Latent Environment and the Millennium Institute on Complex Engineering Systems (ICM: P-05-004-F, CONICYT: FB016). 3 FONDEF rduenas@ing.uchile.cl; jvelasqu@dii.uchile.cl; gaston@groupon.com FONDEF [D10I-1198]; WHALE: Web Hypermedia Analysis Latent Environment; Millennium Institute on Complex Engineering Systems [ICM: P-05-004-F, CONICYT: FB016] FONDEF