Text Mining at Detail Level Using Conceptual Graphs
Manuel Montes-y-Gómez4
, Alexander Gelbukh5
and Aurelio López-López4 
| (4) |
Óptica y Electrónica (INAOE), Instituto Nacional de Astrofísica, Mexico |
| (5) |
Centro de Investigación en Computación (CIC-IPN), Mexico |
Abstract
Text mining is defined as knowledge discovery in large text collections. It detects interesting patterns such as clusters,
associations, deviations, similarities, and differences in sets of texts. Current text mining methods use simplistic representations
of text contents, such as keyword vectors, which imply serious limitations on the kind and meaningfulness of possible discoveries.
We show how to do some typical mining tasks using conceptual graphs as formal but meaningful representation of texts. Our
methods involve qualitative and quantitative comparison of conceptual graphs, conceptual clustering, building a conceptual
hierarchy, and application of data mining techniques to this hierarchy in order to detect interesting associations and deviations.
Our experiments show that, despite widespread misbelief, detailed meaningful mining with conceptual graphs is computationally
affordable.
Keywords text mining - conceptual graphs - conceptual clustering - association discovery - deviation detection
Work done under partial support of CONACyT, CGEPI-IPN, and SNI, Mexico.
References secured to subscribers.