Volume 17, Number 5, 1231-1251, DOI: 10.1007/s00778-007-0062-1

Reference-based indexing for metric spaces with costly distance measures

Jayendra Venkateswaran, Tamer Kahveci, Christopher Jermaine and Deepak Lachwani

View Related Documents

Abstract

We consider the problem of similarity search in databases with costly metric distance measures. Given limited main memory, our goal is to develop a reference-based index that reduces the number of comparisons in order to answer a query. The idea in reference-based indexing is to select a small set of reference objects that serve as a surrogate for the other objects in the database. We consider novel strategies for selection of references and assigning references to database objects. For dynamic databases with frequent updates, we propose two incremental versions of the selection algorithm. Our experimental results show that our selection and assignment methods far outperform competing methods.

Keywords  Reference-indexing - Metric measures - Edit distance - Earth mover’s distance

This work is partially supported by the National Science Foundation under Grant No. 0347408.

Fulltext Preview

Image of the first page of the fulltext document