Actas de congresos
Applying Biclustering To Text Mining: An Immune-inspired Approach
Registro en:
3540739211; 9783540739210
Lecture Notes In Computer Science (including Subseries Lecture Notes In Artificial Intelligence And Lecture Notes In Bioinformatics). , v. 4628 LNCS, n. , p. 83 - 94, 2007.
3029743
2-s2.0-38149079776
Autor
De Castro P.A.D.
De Franca F.O.
Ferreira H.M.
Von Zuben F.J.
Institución
Resumen
With the rapid development of information technology, computers are proving to be a fundamental tool for the organization and classification of electronic texts, given the huge amount of available information. The existent methodologies for text mining apply standard clustering algorithms to group similar texts. However, these algorithms generally take into account only the global similarities between the texts and assign each one to only one cluster, limiting the amount of information that can be extracted from the texts. An alternative proposal capable of solving these drawbacks is the biclustering technique. The biclustering is able to perform clustering of rows and columns simultaneously, allowing a more comprehensive analysis of the texts. The main contribution of this paper is the development of an immune-inspired biclustering algorithm to carry out text mining, denoted BIC-aiNet. BIC-aiNet interprets the biclustering problem as several two-way bipartition problems, instead of considering a single two-way permutation framework. The experimental results indicate that our proposal is able to group similar texts efficiently and extract implicit useful information from groups of texts. © Springer-Verlag Berlin Heidelberg 2007. 4628 LNCS
83 94 Agrawal, R., Gehrke, J., Gunopulus, D., Raghavan, P., Automatic subspace clustering of high dimensional data for data mining applications (1998) Proc. of the ACM/SIGMOD Int. Conference on Management of Data, pp. 94-105 Cheng, Y., Church, G.M., Biclustering of expression data (2000) Proc. of the 8th Int. Conf. on Inteligentt Systems for Molecular Biology, pp. 93-103 de Castro, L.N, Von Zuben, F.J.: aiNet: An Artificial Immune Network for Data Analysis. In: Data Mining: A Heuristic Approach, pp. 231-259 (2001)de França, F.O., Bezerra, G., Von Zuben, F.J., New Perspectives for the Biclustering Problem (2006) IEEE Congress on Evolutionary Computation, pp. 2768-2775 Dhillon, I.S., Co-clustering documents and words using bipartite spectral graph partitioning (2001) Proc. of the 7th Int. Con. on Knowledge Discovery and Data Mining, pp. 269-274 Feldman, R., Sanger, J., (2006) The Text Mining Handbook, , Cambridge University Press, Cambridge Goldberg, D., Nichols, D., Brian, M., Terry, D., Using collaborative filtering to weave an information tapestry (1992) ACM Communications, 35 (12), pp. 61-70 Haixun, W., Wei, W., Jiong, Y., Yu, P.S., Clustering by pattern similarity in large data sets (2002) Proc. of the 2002 ACM SIGMOD Int. Conf. on Manag, pp. 394-405. , Data, pp Hartigan, J.A., Direct clustering of a data matrix (1972) Journal of the American Statistical Association (JASA), 67 (337), pp. 123-129 Madeira, S.C., Oliveira, A.L., Biclustering algorithms for biological data analysis: A survey (2004), pp. 24-25. , Trans. on Computational Biology and Bioinformatics 1Sheng, Q., Moreau, Y., De Moor, B., Biclustering micrarray data by Gibbs sampling (2003) Bioinformatics, 19 (SUPPL. 2), pp. 196-205 Symeonidis, P., Nanopoulos, A., Papadopoulos, A., Manolopoulos, Y., Nearest-Biclusters Collaborative Filtering (2006) Proc. of the WebKDD Tang, C., Zhang, L., Zhang, I., Ramanathan, M., Interrelated two-way clustering: An unsupervised approach for gene expression data analysis (2001) Proc. of the 2nd IEEE Int. Symposium on Bioinformatics and Bioengineering, pp. 41-48 Tanay, A., Sharan, R., Shamir, R., Biclustering Algorithms: A Survey (2005) Series, , Alum, S, ed, Handbook of Computational Molecular Biology. Chapman & Hall/CRC Computer and Information Science