Welcome!
To use the personalized features of this site, please log in or register.
If you have forgotten your username or password, we can help.
My Menu
Saved Items

Nearest Neighbor Queries in Metric Spaces

K. L. Clarkson1

(1)  Bell Laboratories, Lucent Technologies, Murray Hill, NJ 07974, USA clarkson@research.bell-labs.com, US
Abstract.    Given a set S of n sites (points), and a distance measure d , the nearest neighbor searching problem is to build a data structure so that given a query point q , the site nearest to q can be found quickly. This paper gives data structures for this problem when the sites and queries are in a metric space. One data structure, D(S) , uses a divide-and-conquer recursion. The other data structure, M(S,Q) , is somewhat like a skiplist. Both are simple and implementable. The data structures are analyzed when the metric space obeys a certain sphere-packing bound, and when the sites and query points are random and have distributions with an exchangeability property. This property implies, for example, that query point q is a random element of . Under these conditions, the preprocessing and space bounds for the algorithms are close to linear in n . They depend also on the sphere-packing bound, and on the logarithm of the distance ratio of S , the ratio of the distance between the farthest pair of points in S to the distance between the closest pair. The data structure M(S,Q) requires as input data an additional set Q , taken to be representative of the query points. The resource bounds of M(S,Q) have a dependence on the distance ratio of S Q . While M(S,Q) can return wrong answers, its failure probability can be bounded, and is decreasing in a parameter K . Here K≤ |Q|/n is chosen when building M(S,Q) . The expected query time for M(S,Q) is O(Klog  n)log  , and the resource bounds increase linearly in K . The data structure D(S) has expected O( log n) O(1) query time, for fixed distance ratio. The preprocessing algorithm for M(S,Q) can be used to solve the all nearest neighbor problem for S in O(n(log  n) 2 (log ϒ(S)) 2 ) expected time.
Received September 17, 1996, and in revised form November 1, 1998.

Fulltext Preview (Small, Large)
Image of the first page of the fulltext


Export this article
Export this article as RIS | Text
 
Referenced by
8 newer articles

  1. Gonzalez, E.C. (2008) . IEEE Transactions on Pattern Analysis and Machine Intelligence 30(9)
    [CrossRef]
  2. Gorti, U. (2009) TIME EVOLUTION OF VISCOUS CIRCUMSTELLAR DISKS DUE TO PHOTOEVAPORATION BY FAR-ULTRAVIOLET, EXTREME-ULTRAVIOLET, AND X-RAY RADIATION FROM THE CENTRAL STAR. The Astrophysical Journal 705(2)
    [CrossRef]
  3. Yue, Feng (2009) Competitive code–based fast palmprint identification using a set of cover trees. Optical Engineering 48(6)
    [CrossRef]
  4. Ailon, Nir (2009) The Fast Johnson–Lindenstrauss Transform and Approximate Nearest Neighbors. SIAM Journal on Computing 39(1)
    [CrossRef]
  5. Chan, T.-H. Hubert (2008) Small Hop-diameter Sparse Spanners for Doubling Metrics. Discrete & Computational Geometry
    [CrossRef]
  6. Har-Peled, Sariel (2006) Fast Construction of Nets in Low-Dimensional Metrics and Their Applications. SIAM Journal on Computing 35(5)
    [CrossRef]
  7. Yershova, Anna (2007) . IEEE Transactions on Robotics 23(1)
    [CrossRef]
  8. Giles, Kendall E. (2007) Iterative Denoising. Computational Statistics
    [CrossRef]
Remote Address: 38.107.191.109 • Server: MPWEB26
HTTP User Agent: CCBot/1.0 (+http://www.commoncrawl.org/bot.html)