Optimization of Restricted Searches in Web Directories Using Hybrid Data Structures
Fidel Cacheda, Victor Carneiro, Carmen Guerrero and Angel Viña
Department of Information and Communications Technologies, Facultad de Informática, Campus de Elviña s/n, 15.071 A Coruña, Spain
Abstract
Efficient tools are clearly needed to manage, retrieve and filter information on the WWW. Web directories are
taxonomies for the classification of Web documents. This kind of information retrieval system supports a specific type of
search, the restricted search, in which the document collection is limited to one area of the category graph. This paper
introduces a data architecture for Web directories that improves the performance of restricted searches. The architecture is
based on a hybrid data structure composed of an inverted file with multiple embedded signature files. Two variants are
presented: a hybrid architecture with total information and one with partial information. Both variants were implemented
and compared with a basic model. The performance of restricted queries clearly improved, especially with the hybrid model
with partial information, which yielded a positive response under any load of the search system.
This work has been partially sponsored by the Spanish CICYT (TIC2001-0547).
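As an illustration only, the idea of an inverted file with embedded signatures can be sketched as follows. This is a minimal reading of the abstract, not the authors' implementation: the class name `HybridIndex`, the 64-bit signature width, the hashing scheme, the tree-shaped category graph and the final verification step are all assumptions made for the example, and the sketch does not distinguish the total-information and partial-information variants.

```python
import hashlib
from collections import defaultdict

SIG_BITS = 64           # assumed signature width
BITS_PER_CATEGORY = 3   # assumed number of bits set per category


def category_signature(category: str) -> int:
    """Map a category name to a bit mask with up to BITS_PER_CATEGORY bits set."""
    digest = hashlib.sha256(category.encode("utf-8")).digest()
    sig = 0
    for i in range(BITS_PER_CATEGORY):
        sig |= 1 << (digest[i] % SIG_BITS)
    return sig


class HybridIndex:
    """Hypothetical inverted file whose postings embed a category signature."""

    def __init__(self, parent):
        # parent maps each category to its parent (None for a root);
        # a tree is assumed here for simplicity.
        self.parent = parent
        self.postings = defaultdict(list)   # term -> [(doc_id, signature)]
        self.doc_category = {}              # doc_id -> category

    def _chain(self, category):
        """Return the category followed by all of its ancestors."""
        chain = []
        while category is not None:
            chain.append(category)
            category = self.parent.get(category)
        return chain

    def add(self, doc_id, category, text):
        # Superimpose the signatures of the category and its ancestors.
        sig = 0
        for c in self._chain(category):
            sig |= category_signature(c)
        self.doc_category[doc_id] = category
        for term in set(text.lower().split()):
            self.postings[term].append((doc_id, sig))

    def restricted_search(self, term, category):
        """Return documents containing term that lie under category."""
        query_sig = category_signature(category)
        hits = []
        for doc_id, sig in self.postings.get(term.lower(), []):
            if (sig & query_sig) != query_sig:
                continue  # signature rules the document out
            # Signatures can produce false matches, so confirm exactly.
            if category in self._chain(self.doc_category[doc_id]):
                hits.append(doc_id)
        return hits


parent = {"Science": None, "Physics": "Science", "Arts": None, "Music": "Arts"}
index = HybridIndex(parent)
index.add(1, "Physics", "quantum search")
index.add(2, "Music", "search for scores")
index.add(3, "Science", "search engines")
print(index.restricted_search("search", "Science"))  # [1, 3]
print(index.restricted_search("search", "Arts"))     # [2]
```

In this sketch the signature test discards most postings outside the requested area of the category graph with a single bitwise operation, and the exact check afterwards removes any false matches caused by bit collisions.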