Article
Reducing the Number of Canonical Form Tests for Frequent Subgraph Mining
Fecha
2011-12-13Registro en:
Revista Computación y Sistemas; Vol. 15 No. 2
1405-5546
Autor
Gago Alonso, Andrés
Carrasco Ochoa, Jesús A.
Medina Pagola, José E.
Martínez Trinidad, José F.
Institución
Resumen
Abstract. Frequent connected subgraph (FCS) mining is an interesting problem with wide applications in real life. Most of the FCS mining algorithms have been focused on detecting duplicate candidates using canonical form tests. Canonical form tests have high computational complexity, and therefore, they affect the efficiency of graph miners. In this paper, we introduce novel properties to reduce the number of canonical form tests in FCS mining. Based on these properties, a new algorithm for FCS mining called gRed is presented. The experimentation on real world datasets shows the impact of the proposed properties on the efficiency of gRed reducing the number of canonical form tests regarding gSpan. Besides, the performance of our algorithm is compared against gSpan and other state-of-the-art algorithms.