Welcome!
To use the personalized features of this site, please log in or register.
If you have forgotten your username or password, we can help.
My Menu
Saved Items

Applying Biclustering to Text Mining: An Immune-Inspired Approach

Pablo A. D. de CastroContact Information, Fabrício O. de FrançaContact Information, Hamilton M. FerreiraContact Information and Fernando J. Von ZubenContact Information

(1)  Laboratory of Bioinformatics and Bio-Inspired Computing - LBIC, School of Electrical and Computer Engineer – FEEC, University of Campinas – UNICAMP, Campinas-SP, Brazil
Abstract
With the rapid development of information technology, computers are proving to be a fundamental tool for the organization and classification of electronic texts, given the huge amount of available information. The existent methodologies for text mining apply standard clustering algorithms to group similar texts. However, these algorithms generally take into account only the global similarities between the texts and assign each one to only one cluster, limiting the amount of information that can be extracted from the texts. An alternative proposal capable of solving these drawbacks is the biclustering technique. The biclustering is able to perform clustering of rows and columns simultaneously, allowing a more comprehensive analysis of the texts. The main contribution of this paper is the development of an immune-inspired biclustering algorithm to carry out text mining, denoted BIC-aiNet. BIC-aiNet interprets the biclustering problem as several two-way bipartition problems, instead of considering a single two-way permutation framework. The experimental results indicate that our proposal is able to group similar texts efficiently and extract implicit useful information from groups of texts.

Keywords  Artificial Immune System - Biclustering - Two-way Bipartition - Text mining


Contact Information Pablo A. D. de Castro
Email: pablo@dca.fee.unicamp.br

Contact Information Fabrício O. de França
Email: olivetti@dca.fee.unicamp.br

Contact Information Hamilton M. Ferreira
Email: hmf@dca.fee.unicamp.br

Contact Information Fernando J. Von Zuben
Email: vonzuben@dca.fee.unicamp.br
Fulltext Preview (Small, Large)
Image of the first page of the fulltext

References secured to subscribers.



Export this chapter
Export this chapter as RIS | Text
 
Remote Address: 38.107.191.114 • Server: mpweb22
HTTP User Agent: CCBot/1.0 (+http://www.commoncrawl.org/bot.html)