Lecture Notes in Computer Science, 2004, Volume 3109/2004, 400-408, DOI: 10.1007/978-3-540-27801-6_30

A Fast Set Intersection Algorithm for Sorted Sequences

Ricardo Baeza-Yates

View Related Documents

Abstract

This paper introduces a simple intersection algorithm for two sorted sequences that is fast on average. It is related to the multiple searching problem and to merging. We present the worst and average case analysis, showing that in the former, the complexity nicely adapts to the smallest list size. In the later case, it performs less comparisons than the total number of elements on both inputs when n = αm (α > 1). Finally, we show its application to fast query processing in Web search engines, where large intersections, or differences, must be performed fast.

Keywords  Set operations - merging - multiple search - Web search engines - inverted indices

Fulltext Preview

Image of the first page of the fulltext document