View Related Documents

Abstract

To facilitate effective search on the World Wide Web, meta search engines have been developed which do not search the Web themselves, but use available search engines to find the required information. By means of wrappers, meta search engines retrieve information from the pages returned by search engines. We present an approach to automatically create such wrappers by means of an incremental grammar induction algorithm. The algorithm uses an adaptation of the string edit distance. Our method performs well; it is quick, can be used for several types of result pages and requires a minimal amount of user interaction.

Keywords  inductive learning - information retrieval and learning - web navigation and mining - grammatical inference - wrapper generation - meta search engines

Supported by the Logic and Language Links project funded by Elsevier Science.
Supported by the Spinoza project ‘Logic in Action.’

Fulltext Preview

Image of the first page of the fulltext document