Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

The Black-Box Complexity of Nearest Neighbor Search

  • Conference paper
Automata, Languages and Programming (ICALP 2004)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3142))

Included in the following conference series:

Abstract

We define a natural notion of efficiency for approximate nearest-neighbor (ANN) search in general n-point metric spaces, namely the existence of a randomized algorithm which answers (1+ε)-approximate nearest neighbor queries in polylog(n) time using only polynomial space. We then study which families of metric spaces admit efficient ANN schemes in the black-box model, where only oracle access to the distance function is given, and any query consistent with the triangle inequality may be asked.

For \(\varepsilon < \frac{2}{5}\), we offer a complete answer to this problem. Using the notion of metric dimension defined in [GKL03] (à la [Ass83]), we show that a metric space X admits an efficient (1+ε)-ANN scheme for any \(\varepsilon < \frac{2}{5}\) if and only if \(\dim(X) = O(\log \log n)\). For coarser approximations, clearly the upper bound continues to hold, but there is a threshold at which our lower bound breaks down—this is precisely when points in the “ambient space” may begin to affect the complexity of “hard” subspaces SX. Indeed, we give examples which show that \(\dim(X)\) does not characterize the black-box complexity of ANN above the threshold.

Our scheme for ANN in low-dimensional metric spaces is the first to yield efficient algorithms without relying on any additional assumptions on the input. In previous approaches (e.g., [Cla99,KR02,KL04]), even spaces with \(\dim(X) = O(1)\) sometimes required Ω(n) query times.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 189.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 239.00
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Assouad, P.: Plongements lipschitziens dans Rn. Bull. Soc. Math. France 111(4), 429–448 (1983)

    MATH  MathSciNet  Google Scholar 

  2. Clarkson, K.L.: Nearest neighbor queries in metric spaces. Discrete Comput. Geom. 22(1), 63–93 (1999)

    Article  MATH  MathSciNet  Google Scholar 

  3. Gupta, A., Krauthgamer, R., Lee, J.R.: Bounded geometries, fractals, and low-distortion embeddings. In: Proceedings of the 44th annual Symposium on the Foundations of Computer Science (2003)

    Google Scholar 

  4. Gromov, M.: Metric structures for Riemannian and non-Riemannian spaces. Birkhäuser, Boston (1999)

    MATH  Google Scholar 

  5. Hildrum, K., Kubiatowicz, J., Ma, S., Rao, S.: A note on finding nearest neighbors in growth-restricted metrics. In: Proceedings of the 15th annual ACM-SIAM Symposium on Discrete Algorithms (2004)

    Google Scholar 

  6. Har-Peled, S.: A replacement for Voronoi diagrams of near linear size. In: 42nd IEEE Symposium on Foundations of Computer Science, Las Vegas, NV, pp. 94–103. IEEE Computer Soc., Los Alamitos (2001)

    Google Scholar 

  7. Indyk, P., Motwani, R.: Approximate nearest neighbors: towards removing the curse of dimensionality. In: 30th Annual ACM Symposium on Theory of Computing, May 1998, pp. 604–613 (1998)

    Google Scholar 

  8. Kakade, S., Kearns, M., Langford, J.: Exploration in metric state spaces. In: Proc. of the 20th International Conference on Machine Learning (2003)

    Google Scholar 

  9. Krauthgamer, R., Lee, J.R.: Navigating nets: Simple algorithms for proximity search. In: Proceedings of the 15th annual ACM-SIAM Symposium on Discrete Algorithms (2004)

    Google Scholar 

  10. Kushilevitz, E., Ostrovsky, R., Rabani, Y.: Efficient search for approximate nearest neighbor in high dimensional spaces. In: 30th Annual ACM Symposium on the Theory of Computing, pp. 614–623 (1998)

    Google Scholar 

  11. Karger, D., Ruhl, M.: Finding nearest neighbors in growth-restricted metrics. In: 34th Annual ACM Symposium on the Theory of Computing, pp. 63–66 (2002)

    Google Scholar 

  12. Talwar, K.: Bypassing the embedding: Approximation schemes and distance labeling schemes for growth restricted metrics. To appear in the procedings of the 36th annual Symposium on the Theory of Computing (2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Krauthgamer, R., Lee, J.R. (2004). The Black-Box Complexity of Nearest Neighbor Search. In: Díaz, J., Karhumäki, J., Lepistö, A., Sannella, D. (eds) Automata, Languages and Programming. ICALP 2004. Lecture Notes in Computer Science, vol 3142. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-27836-8_72

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-27836-8_72

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-22849-3

  • Online ISBN: 978-3-540-27836-8

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics