Lecture Notes in Computer Science, 2002, Volume 2534/2002, 323-338, DOI: 10.1007/3-540-36182-0_44

Mining Patterns from Structured Data by Beam-Wise Graph-Based Induction

Takashi Matsuda, Hiroshi Motoda, Tetsuya Yoshida and Takashi Washio

View Related Documents

Abstract

A machine learning technique called Graph-Based Induction (GBI) extracts typical patterns from graph data by stepwise pair expansion (pairwise chunking). Because of its greedy search strategy, it is very efficient but suffers from incompleteness of search. Improvement is made on its search capability without imposing much computational complexity by 1) incorporating a beam search, 2) using a different evaluation function to extract patterns that are more discriminatory than those simply occurring frequently, and 3) adopting canonical labeling to enumerate identical patterns accurately. This new algorithm, now called Beam-wise GBI, B-GBI for short, was tested against a small DNA dataset from UCI repository and shown successful in extracting discriminatory substructures.

Fulltext Preview

Image of the first page of the fulltext document