pith. sign in

arxiv: 2607.02338 · v1 · pith:QM27Q5K6new · submitted 2026-07-02 · 💻 cs.DB · cs.CL· cs.IR· cs.LG

HNSW with Accuracy Guarantees Using Graph Spanners -- A Technical Report

Pith reviewed 2026-07-03 02:53 UTC · model grok-4.3

classification 💻 cs.DB cs.CLcs.IRcs.LG
keywords HNSWnearest neighbor searchgraph spannersaccuracy guaranteesextreme value theoryapproximate nearest neighborscertification
0
0 comments X

The pith

A Certify-then-Rectify framework adds worst-case correctness to HNSW searches by certifying heuristic results and escalating to exact recovery when needed.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes a tiered retrieval method that begins with standard HNSW graph traversal for speed but inserts a lightweight statistical check to verify result quality. When the check signals insufficient quality, the method switches to an exact algorithm whose cost is controlled by reinterpreting the HNSW structure as a geometric spanner whose largest stretch factor is estimated via Extreme Value Theory. This produces distance bounds that make the exact step practical. Evaluations show the combined system matches ordinary HNSW speed on most queries while delivering the correctness of brute-force search. A reader would care because the approach removes the reliability gap that has kept heuristic nearest-neighbor indexes from high-stakes use.

Core claim

By treating the HNSW graph as a geometric spanner and applying Extreme Value Theory to bound its maximum empirical stretch factor, the framework obtains a distribution-free statistical certificate for the quality of a greedy HNSW traversal; when the certificate fails, the same bound makes an exact recovery step computationally feasible, yielding a method whose average-case cost matches HNSW while its worst-case output is always correct.

What carries the argument

The geometric-spanner reinterpretation of HNSW together with Extreme Value Theory estimation of its maximum stretch factor, which supplies the distance bounds used by the certifier.

If this is right

  • Most queries finish at HNSW speed because the certifier passes in the common case.
  • The exact-recovery step is invoked only when the stretch bound indicates the heuristic may have failed.
  • The same certification logic can be applied to any HNSW index without rebuilding it.
  • The resulting system outperforms prior attempts to add guarantees to HNSW on standard datasets.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The spanner-plus-EVT technique could be tested on other greedy graph indexes such as NSW or NSG to see whether similar bounds hold.
  • Dynamic updates to the HNSW graph would require periodic re-estimation of the stretch factor to keep the certificate valid.
  • Tighter stretch-factor estimators might reduce the frequency of escalation to the exact step.

Load-bearing premise

That the largest stretch factor observed in the HNSW graph can be estimated reliably enough by Extreme Value Theory to produce a valid upper bound on true nearest-neighbor distances.

What would settle it

A collection of queries on a benchmark graph where the certified HNSW output still misses a true nearest neighbor after the rectification step, or where measured distances exceed the EVT-derived bound.

Figures

Figures reproduced from arXiv: 2607.02338 by Gautam Das, Minghao Li, Nick Koudas, Raghav Mittal, Sanjivni Rana, Suraj Shetiya.

Figure 1
Figure 1. Figure 1: Certify-then-Rectify (CTR) search framework [PITH_FULL_IMAGE:figures/full_fig_p001_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Stretch Bound Expansion. On the left, SBE-NN ex [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Metric Bounded Verification. On the left, Recursive [PITH_FULL_IMAGE:figures/full_fig_p006_3.png] view at source ↗
Figure 5
Figure 5. Figure 5: Effect of varying M and 𝑒 𝑓𝑐 on stretch 7.4 System Ablation We compare the number of distance computations (NDC) required to return the true 𝑘 nearest neighbors across system variants. For ease of display, NDC is reported as a percentage of the total num￾ber of base nodes in the 1M-size datasets we use (NDC % of 1M), eg. 1% means we did 10k distance computations. Experiments fix 𝑒 𝑓𝑠=100 and vary 𝑘 ∈ {10, … view at source ↗
Figure 4
Figure 4. Figure 4: Stretch estimation ablating 𝛽 and block count 𝑚. 5Both DEEP and T2I from: https://research.yandex.com 6Vanilla HNSW has no guarantee in connectivity (see also [41]). Our stretch calcula￾tion is therefore conducted only for reachable nodes, despite unreachable nodes being rare (3 𝑖𝑛 𝑑𝑒𝑔𝑟𝑒𝑒=0 nodes in our SIFT1M 𝑀=16, 𝑒 𝑓 𝑐=200 graph). 7.3 Stretch and HNSW Construction Parameters To further investigate how H… view at source ↗
Figure 6
Figure 6. Figure 6: 𝛼-sweeps on three datasets. Blue line = compliance rate as 𝛼 is varied; x-axis is in multiples of HNSW runtime. 0.80 0.85 0.90 0.95 0.99 0.00 0.50 1.00 HNSW Comp. rate 0.94 0.32 SIFT 0.80 0.85 0.90 0.95 0.99 0.00 0.50 1.00 0.49 0.03 GIST 0.80 0.85 0.90 0.95 0.99 0.00 0.50 1.00 0.93 0.38 DEEP 0.80 0.85 0.90 0.95 0.99 0.50 0.75 1.00 CRC Comp. rate 0.94 0.98 0.80 0.85 0.90 0.95 0.99 0.50 0.75 1.00 0.93 0.68 0… view at source ↗
Figure 7
Figure 7. Figure 7: 𝜏-sweeps on three datasets. Primary scale = compliance rate; secondary scale = time in multiples of HNSW runtime [PITH_FULL_IMAGE:figures/full_fig_p012_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: LTT threshold surface. Higher ˆ𝜃 increases the frac￾tion of queries sent to exact rectification. Black contours show fixed 𝜖 slices, highlighting how 𝛼 controls the latency– compliance trade-off. bounds stated in Theorem 6.3, across all values of 𝜏, 𝛼, 𝜖, and 𝛽 (for stretch estimation as experiment setup) used in these experiments, validating our theorem empirically. Notably, the small 𝛼 values that appear… view at source ↗
Figure 9
Figure 9. Figure 9: HNSW-parameter ablation on three datasets. X-axis reports runtime as a multiple of HNSW runtime. [PITH_FULL_IMAGE:figures/full_fig_p013_9.png] view at source ↗
Figure 10
Figure 10. Figure 10: Repeated-certification policy on three datasets. Top row: plain HNSW compliance rate as [PITH_FULL_IMAGE:figures/full_fig_p014_10.png] view at source ↗
Figure 11
Figure 11. Figure 11: 𝜏 ablation on T2I-10M and T2I-100M. We next conduct a full 𝜏-sweep (𝑒 𝑓𝑠=100, 𝑘=100) to evaluate end-to-end overhead. Absolute latency increases with dataset size, as expected: HNSW grows by 8–9.4× and CTR by 7.5–8.4×. The key question is whether CTR adds disproportionate overhead be￾yond this baseline HNSW growth [PITH_FULL_IMAGE:figures/full_fig_p014_11.png] view at source ↗
Figure 12
Figure 12. Figure 12: ConANN vs. CTR (upper-right is better). 7.9 DARTH vs. CRC/LTT Recently, DARTH [12] was proposed solely as a predictor of re￾call. We therefore compare it directly against CRC and LTT in a certifier-only setting, i.e., without true neighbor rectification (as DARTH does not have this capability) [PITH_FULL_IMAGE:figures/full_fig_p015_12.png] view at source ↗
Figure 14
Figure 14. Figure 14: 𝜏-sweep, 𝑘=20: disk-based DiskANN (8GB + NVMe) [PITH_FULL_IMAGE:figures/full_fig_p016_14.png] view at source ↗
Figure 13
Figure 13. Figure 13: 𝑘=10: in-memory Vamana vs. DiskANN. Results. As shown in [PITH_FULL_IMAGE:figures/full_fig_p016_13.png] view at source ↗
Figure 15
Figure 15. Figure 15: 𝛼-sweep on SIFT1M, GIST1M, DEEP1M (𝜎=5%) 2× 5× 30× 200× 500× 42% 60% 80% 100% CRC comp. rate 0.95 0.9 0.8 0.7 0.6 0.4 SIFT 0.2 500× 1000× 21% 40% 60% 80% 100% 0.8 0.7 0.6 0.5 GIST 480× 770× 39% 40% 60% 80% 100% 0.995 0.95 0.9 0.8 0.7 0.6 0.5 0.2 DEEP 200× 300× Runtime / ACORN Runtime 42% 60% 80% 100% LTT comp. rate 0.5 0.2 5e-2 1e-2 1e-3 1e-5 1e-8 1e-12 1e-15 1e-20 410× 990× Runtime / ACORN Runtime 21% 40… view at source ↗
Figure 16
Figure 16. Figure 16: 𝛼-sweep on SIFT1M, GIST1M, DEEP1M (𝜎=10%) 2× 5× 20× 100× 300× 40% 46% 60% 80% 100% CRC comp. rate 0.97 0.8 0.7 0.6 SIFT 0.5 500× 980× 23% 40% 60% 80% 100% 0.8 0.7 0.6 0.5 0.4 GIST 300× 500× 42% 60% 80% 100% 0.995 0.97 0.95 0.9 0.8 0.7 0.6 0.3 DEEP 200× 300× Runtime / ACORN Runtime 40% 46% 60% 80% 100% LTT comp. rate 0.5 0.2 5e-2 1e-2 1e-3 1e-5 1e-6 1e-10 1e-12 1e-15 1e-18 370× 800× Runtime / ACORN Runtime… view at source ↗
Figure 17
Figure 17. Figure 17: 𝛼-sweep on SIFT1M, GIST1M, DEEP1M (𝜎=25%) 640× 1200× 40% 56% 80% 100% CRC comp. rate 0.5 0.3 0.2 0.1 Paper 1000× 2000× 40% 51% 60% 80% 100% 0.5 0.3 0.2 0.1 LAION1M 3500× 9600× 36% 40% 60% 80% 100% 0.5 0.4 Arxiv 0.3 200× 300× 500× Runtime / ACORN Runtime 40% 56% 80% 100% LTT comp. rate 0.5 0.2 0.1 1e-2 1e-3 1e-4 1e-6 1e-8 1e-10 560× 1700× Runtime / ACORN Runtime 40% 51% 60% 80% 100% 0.5 0.2 0.1 1e-2 1e-3 1… view at source ↗
Figure 18
Figure 18. Figure 18: 𝛼-sweep on PAPER, LAION1M, and arXiv Datasets [PITH_FULL_IMAGE:figures/full_fig_p020_18.png] view at source ↗
Figure 19
Figure 19. Figure 19: 𝜏-sweep on SIFT1M, GIST1M, DEEP1M (𝜎=5%) 0.85 0.88 0.90 0.93 0.95 0.00 0.25 0.50 0.75 1.00 ACORN Comp. rate SIFT 0.53 0.24 0.65 0.68 0.70 0.72 0.75 0.00 0.25 0.50 0.75 1.00 GIST 0.34 0.21 0.82 0.84 0.86 0.88 0.90 0.00 0.25 0.50 0.75 1.00 Deep1M 0.57 0.39 0.85 0.88 0.90 0.93 0.95 0.00 0.25 0.50 0.75 1.00 CRC Comp. rate 0.53 0.98 0.65 0.68 0.70 0.72 0.75 0.00 0.25 0.50 0.75 1.00 0.34 0.82 0.82 0.84 0.86 0.8… view at source ↗
Figure 20
Figure 20. Figure 20: 𝜏-sweep on SIFT1M, GIST1M, DEEP1M (𝜎=10%) 0.85 0.88 0.90 0.93 0.95 0.00 0.25 0.50 0.75 1.00 ACORN Comp. rate SIFT 0.57 0.27 0.71 0.72 0.73 0.74 0.75 0.00 0.25 0.50 0.75 1.00 GIST 0.27 0.23 0.82 0.84 0.86 0.88 0.90 0.00 0.25 0.50 0.75 1.00 Deep1M 0.59 0.41 0.85 0.88 0.90 0.93 0.95 0.00 0.25 0.50 0.75 1.00 CRC Comp. rate 0.57 0.98 0.71 0.72 0.73 0.74 0.75 0.00 0.25 0.50 0.75 1.00 0.80 0.81 0.82 0.84 0.86 0.… view at source ↗
Figure 21
Figure 21. Figure 21: 𝜏-sweep on SIFT1M, GIST1M, DEEP1M (𝜎=25%) [PITH_FULL_IMAGE:figures/full_fig_p021_21.png] view at source ↗
Figure 22
Figure 22. Figure 22: 𝜏-sweep on PAPER, LAION1M, and arXiv Datasets candidates and return a result that already meets the recall target. Consequently, fewer queries require rectification, and the average per-query overhead shrinks. Conversely, at 𝜎 = 5, the predicate is highly restrictive; ACORN must traverse deeper into the graph to find 𝑘 qualifying neighbors, increasing the chance of a suboptimal result and triggering recti… view at source ↗
read the original abstract

Hierarchical Navigable Small World (HNSW) graphs serve as the industry standard due to their logarithmic complexity and strong empirical performance. However, HNSW relies on greedy graph traversal, a heuristic that provides no theoretical guarantees of correctness. In this paper, we propose a novel "Certify-then-Rectify" framework that bridges the gap between the speed of heuristic search and the rigor of exact retrieval. Rather than discarding HNSW, our approach first employs a distribution-free statistical certifier to dynamically evaluate the quality of a standard HNSW search with minimal overhead. If certification indicates that the retrieved neighbors are of low quality, the framework safely escalates to a rigorous exact recovery algorithm. To make this exact recovery computationally feasible, we reinterpret the HNSW graph as a geometric spanner and utilize Extreme Value Theory to stochastically estimate its maximum empirical stretch factor. This allows us to mathematically bound the maximum distance of true nearest neighbors. Extensive evaluations on benchmark datasets demonstrate that our tiered framework delivers the average-case speed of HNSW while ensuring the worst-case correctness of exact search and outperforming other applicable approaches.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The paper proposes a 'Certify-then-Rectify' framework for HNSW graphs in approximate nearest neighbor search. A distribution-free statistical certifier evaluates the quality of standard HNSW greedy traversal results with low overhead; if quality is insufficient, the system escalates to an exact recovery procedure. To keep exact recovery tractable, the HNSW graph is reinterpreted as a geometric spanner whose maximum empirical stretch factor is estimated via Extreme Value Theory, yielding a mathematical bound on the distance to true nearest neighbors. Benchmark evaluations are claimed to show that the tiered approach retains HNSW average-case speed while delivering the worst-case correctness of exact search and outperforming alternatives.

Significance. If the derivations and empirical claims hold, the work would be significant for database and retrieval systems: it offers a practical route to attach correctness guarantees to the dominant heuristic ANN index without discarding its performance. The spanner reinterpretation combined with EVT for stretch estimation is a distinctive technical move that, if made rigorous, could generalize to other graph-based indexes.

major comments (2)
  1. [Abstract] Abstract: the central guarantee is stated as 'ensuring the worst-case correctness of exact search' via a 'mathematically bound' on true nearest-neighbor distances obtained by EVT estimation of the spanner stretch factor. EVT supplies high-probability tail bounds (statements of the form 'with probability 1-δ the stretch ≤ s') whose validity rests on domain-of-attraction assumptions; the supplied text contains no derivation converting this into a distribution-free, δ=0 deterministic bound that holds for every query and dataset. Consequently the escalation trigger remains probabilistic and the worst-case claim is not yet supported.
  2. [Abstract] Abstract (and implied § on Certify-then-Rectify): the certifier is described as 'distribution-free' yet the stretch estimation step invokes EVT, which is parametric in its tail model. No equation or section is visible that shows how the two components compose to a guarantee that is simultaneously distribution-free and worst-case; this composition is load-bearing for the tiered correctness claim.
minor comments (2)
  1. [Abstract] Abstract is overloaded; the framework, certifier, spanner reinterpretation, and EVT application are asserted without any equation or complexity statement, making it difficult to assess overhead or correctness.
  2. [Abstract] The phrase 'maximum empirical stretch factor' is used without a preceding definition or reference to the geometric-spanner literature; a short background paragraph would help readers outside the immediate subfield.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the insightful comments highlighting the distinction between high-probability and deterministic guarantees. We agree that the abstract wording requires clarification to avoid overstating the nature of the bounds. We will revise the manuscript to precisely describe the probabilistic character of the EVT-based stretch estimation while preserving the distribution-free property of the certifier.

read point-by-point responses
  1. Referee: [Abstract] Abstract: the central guarantee is stated as 'ensuring the worst-case correctness of exact search' via a 'mathematically bound' on true nearest-neighbor distances obtained by EVT estimation of the spanner stretch factor. EVT supplies high-probability tail bounds (statements of the form 'with probability 1-δ the stretch ≤ s') whose validity rests on domain-of-attraction assumptions; the supplied text contains no derivation converting this into a distribution-free, δ=0 deterministic bound that holds for every query and dataset. Consequently the escalation trigger remains probabilistic and the worst-case claim is not yet supported.

    Authors: We concur that the current abstract phrasing implies a deterministic worst-case guarantee, whereas the EVT step produces high-probability bounds on the empirical stretch factor. No derivation to a δ=0 deterministic bound exists in the manuscript because the method relies on tail estimation. We will revise the abstract and add explicit language in the Certify-then-Rectify section stating that the rectification bound holds with probability at least 1-δ (under the EVT domain-of-attraction assumption) rather than for every query. This will also clarify that the overall escalation decision is probabilistic. revision: yes

  2. Referee: [Abstract] Abstract (and implied § on Certify-then-Rectify): the certifier is described as 'distribution-free' yet the stretch estimation step invokes EVT, which is parametric in its tail model. No equation or section is visible that shows how the two components compose to a guarantee that is simultaneously distribution-free and worst-case; this composition is load-bearing for the tiered correctness claim.

    Authors: The certifier operates without distributional assumptions on the data or queries. The EVT component for stretch estimation is parametric in the tail model and yields probabilistic bounds. We will insert a dedicated paragraph (and, if space permits, an equation) in the framework section that composes the two: the distribution-free certification test decides whether to accept the HNSW result or invoke the spanner-based rectification whose distance bound holds with high probability. The revised text will not claim a fully deterministic, distribution-free worst-case guarantee but will accurately describe the hybrid probabilistic guarantee that results. revision: yes

Circularity Check

0 steps flagged

No circularity: framework uses external statistical tools without self-referential reduction

full rationale

The derivation chain reinterprets HNSW as a spanner and applies EVT for stretch estimation to support a certify-then-rectify procedure. No equations, fitted parameters, or self-citations are shown that reduce the claimed bounds or guarantees to the inputs by construction. The statistical certifier and EVT tail modeling are presented as independent methods whose validity does not depend on the target result itself. This is the common case of a self-contained proposal whose correctness may be debated on other grounds but exhibits no definitional or fitted-input circularity.

Axiom & Free-Parameter Ledger

1 free parameters · 2 axioms · 0 invented entities

Ledger is necessarily incomplete because only the abstract is available; full paper would likely list additional parameters for the certifier and stretch estimator.

free parameters (1)
  • certifier thresholds or parameters
    Statistical certifier must have decision thresholds whose values are not stated in abstract.
axioms (2)
  • domain assumption HNSW graph admits reinterpretation as a geometric spanner
    Abstract invokes this reinterpretation to enable stretch bounding.
  • domain assumption Extreme Value Theory can be applied to estimate the maximum empirical stretch factor of the spanner
    Abstract states this estimation step without further justification.

pith-pipeline@v0.9.1-grok · 5748 in / 1367 out tokens · 42304 ms · 2026-07-03T02:53:42.393576+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

64 extracted references · 14 canonical work pages · 3 internal anchors

  1. [1]

    Anas Ait Aomar, Karima Echihabi, Marco Arnaboldi, Ioannis Alagiannis, Damien Hilloulin, and Manal Cherkaoui. 2025. RWalks: Random Walks as At- tribute Diffusers for Filtered Vector Search.PACMMOD 3, 3 (2025)

  2. [2]

    Yousef Al-Jazzazi, Haya Diwan, Jinrui Gou, Cameron Musco, Christopher Musco, and Torsten Suel. 2025. Distance Adaptive Beam Search for Provably Accurate Graph-Based Nearest Neighbor Search.arXiv preprint arXiv:2505.15636 (2025)

  3. [3]

    Anastasios N Angelopoulos and Stephen Bates. 2021. A gentle introduction to conformal prediction and distribution-free uncertainty quantification.arXiv preprint arXiv:2107.07511 (2021)

  4. [4]

    Angelopoulos, Stephen Bates, Emmanuel J

    Anastasios N. Angelopoulos, Stephen Bates, Emmanuel J. Candès, and Michael I. Jordan. 2024. Conformal Risk Control. J. Amer. Statist. Assoc. 119, 548 (2024), 2386–2408. https://doi.org/10.1080/01621459.2024.2302500

  5. [5]

    Angelopoulos, Stephen Bates, Michael I

    Anastasios N. Angelopoulos, Stephen Bates, Michael I. Jordan, and Jitendra Ma- lik. 2022. Learn then Test: Calibrating Predictive Algorithms to Achieve Risk Control. InICLR. OpenReview

  6. [6]

    Jees Augustine, Suraj Shetiya, Mohammadreza Esfandiari, Senjuti Basu Roy, and Gautam Das. 2021. A Generalized Approach for Reducing Expensive Distance Calls for A Broad Class of Proximity Problems. InACM SIGMOD. 142–154

  7. [7]

    Stephen Bates, Anastasios Angelopoulos, Lihua Lei, Jitendra Malik, and Michael Jordan. 2021. Distribution-free, Risk-controlling Prediction Sets.J. ACM 68, 6, Article 43 (Sept. 2021), 34 pages. https://doi.org/10.1145/3478535

  8. [8]

    Francesco Busolin, Claudio Lucchese, Franco Maria Nardini, Salvatore Orlando, Raffaele Perego, and Salvatore Trani. 2024. Early exit strategies for approximate k-NN search in dense retrieval. InCIKM. 3647–3652

  9. [9]

    Yuzheng Cai, Jiayang Shi, Yizhuo Chen, and Weiguo Zheng. 2024. Navigating La- bels and Vectors: A Unified Approach to Filtered Approximate Nearest Neighbor Search.PACMMOD 2, 6 (2024)

  10. [10]

    Matteo Ceccarello, Alexandra Levchenko, Ioana Ileana, and Themis Palpanas

  11. [11]

    InACM SIGKDD

    Evaluating and Generating Query Workloads for High Dimensional Vector Similarity Search. InACM SIGKDD

  12. [12]

    Charikar

    Moses S. Charikar. 2002. Similarity estimation techniques from rounding algo- rithms. InSTOC. 380–388

  13. [13]

    Manos Chatzakis, Yannis Papakonstantinou, and Themis Palpanas. 2025. Darth: Declarative recall through early termination for approximate nearest neighbor search.PACMMOD 3, 4 (2025), 1–26

  14. [14]

    Qi Chen, Bing Zhao, Haidong Wang, Mingqin Li, Chuanjie Liu, Zengzhong Li, Mao Yang, and Jingdong Wang. 2021. SPANN: Highly-Efficient Billion-Scale Approximate Nearest Neighborhood Search. InAdvances in Neural Information Processing Systems, Vol. 34

  15. [15]

    Yilun Chen, Zhiding Yu, Yukang Chen, Shiyi Lan, Anima Anandkumar, Jiaya Jia, and Jose M. Alvarez. 2023. FocalFormer3D: Focusing on Hard Instance for 3D Object Detection. InProceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)

  16. [16]

    Yannis Chronis, Helena Caminal, Yannis Papakonstantinou, Fatma Özcan, and Anastasia Ailamaki. 2025. Filtered Vector Search: State-of-the-Art and Research Opportunities.PVLDB 18, 12 (2025), 5488–5492

  17. [17]

    2001.An introduc- tion to statistical modeling of extreme values

    Stuart Coles, Joanna Bawa, Lesley Trenner, and Pat Dorazio. 2001.An introduc- tion to statistical modeling of extreme values . Vol. 208. Springer

  18. [18]

    Mirrokni

    Mayur Datar, Nicole Immorlica, Piotr Indyk, and Vahab S. Mirrokni. 2004. Locality-sensitive hashing scheme based on p-stable distributions. InSCG. 253– 262

  19. [19]

    Liwei Deng, Penghao Chen, Ximu Zeng, Tianfu Wang, Yan Zhao, and Kai Zheng. 2024. Efficient Data-Aware Distance Comparison Operations for High- Dimensional Approximate Nearest Neighbor Search.PVLDB 18, 3 (2024), 812– 821

  20. [20]

    Matthijs Douze, Alexandr Guzhva, Chengqi Deng, Jeff Johnson, Gergely Szilvasy, Pierre-Emmanuel Mazaré, Maria Lomeli, Lucas Hosseini, and Hervé Jégou. 2024. The Faiss library. (2024). arXiv:2401.08281 [cs.LG]

  21. [21]

    Hao Duan, Yitong Song, Bin Yao, and Anqi Liang. 2025. PGTuner: An Efficient Framework for Automatic and Transferable Configuration Tuning of Proximity Graphs. arXiv:2508.17886 [cs.DB] https://arxiv.org/abs/2508.17886

  22. [22]

    R. A. Fisher and L. H. C. Tippett. 1928. Limiting forms of the frequency distribu- tion of the largest or smallest member of a sample.Mathematical Proceedings of the Cambridge Philosophical Society 24, 2 (1928), 180–190

  23. [23]

    Cong Fu, Chao Xiang, Changxu Wang, and Deng Cai. 2019. Fast approximate nearest neighbor search with the navigating spreading-out graph.PVLDB 12, 5 (2019), 461–474

  24. [24]

    Jianyang Gao and Cheng Long. 2023. High-Dimensional Approximate Nearest Neighbor Search: with Reliable and Efficient Distance Comparison Operations. PACMMOD 1, 2 (2023)

  25. [25]

    E. J. Gumbel. 1958. Statistics of Extremes. Columbia University Press, New York, Chichester, West Sussex.https://doi.org/10.7312/gumb92958

  26. [26]

    Sonia Horchidan, Fabian Zeiher, Henrik Boström, and Paris Carbone. 2026. Co- nANN: Conformal Approximate Nearest Neighbor Search.PVLDB 19, 1 (2026), 29–42

  27. [27]

    Physical Intelligence, Ali Amin, Raichelle Aniceto, Ashwin Balakrishna, Kevin Black, Ken Conley, Grace Connors, James Darpinian, Karan Dhabalia, Jared Di- Carlo, Danny Driess, Michael Equi, Adnan Esmail, Yunhao Fang, Chelsea Finn, Catherine Glossop, Thomas Godden, Ivan Goryachev, Lachy Groom, Hunter Hancock, Karol Hausman, Gashon Hussein, Brian Ichter, Sz...

  28. [28]

    Suhas Jayaram Subramanya, Fnu Devvrit, Harsha Vardhan Simhadri, Ravis- hankar Krishnawamy, and Rohan Kadekodi. 2019. DiskANN: Fast Accurate Billion-point Nearest Neighbor Search on a Single Node. InNeurIPS, Vol. 32

  29. [29]

    Jeff Johnson, Matthijs Douze, and Hervé Jégou. 2019. Billion-scale similarity search with GPUs.IEEE Transactions on Big Data 7, 3 (2019), 535–547

  30. [30]

    Herve Jégou, Matthijs Douze, and Cordelia Schmid. 2011. Product Quantization for Nearest Neighbor Search.IEEE Transactions on Pattern Analysis and Machine Intelligence 33, 1 (2011), 117–128

  31. [31]

    Conglong Li, Minjia Zhang, David G Andersen, and Yuxiong He. 2020. Improv- ing approximate nearest neighbor search through learned adaptive early termi- nation. InACM SIGMOD. 2539–2554

  32. [32]

    Minghao Li, Raghav Mittal, Sanjivni Rana, Suraj Shetiya, Gautam Das, and Nick Koudas. 2026. Technical Report for Recall Certification for HNSW. https:// github.com/ming-afk/recall_certify_hnsw/blob/main/tr.pdfTechnical report

  33. [33]

    Zhenxin Li, Shuibing He, Jiahao Guo, Xuechen Zhang, Xian-He Sun, and Gang Chen. 2025. CRouting: Reducing Expensive Distance Calls in Graph-Based Ap- proximate Nearest Neighbor Search.arXiv preprint arXiv:2509.00365 (2025)

  34. [34]

    Zhaoheng Li, Silu Huang, Wei Ding, Yongjoo Park, and Jianjun Chen. 2025. SIEVE: Effective Filtered Vector Search with Collection of Indexes.PVLDB 18, 11 (2025), 4723–4736

  35. [35]

    Anqi Liang, Pengcheng Zhang, Bin Yao, Zhongpu Chen, Yitong Song, and Guangxu Cheng. 2024. UNIFY: Unified Index for Range Filtered Approximate Nearest Neighbors Search.PVLDB 18, 4 (2024), 1118–1130

  36. [36]

    Zhenxi Lin, Ziheng Zhang, Xian Wu, and Yefeng Zheng. 2023. Improving Biomedical Entity Linking with Retrieval-Enhanced Learning.arXiv preprint arXiv:2312.09806 (2023). https://doi.org/10.48550/arXiv.2312.09806

  37. [37]

    Qin Lv, William Josephson, Zhe Wang, Moses Charikar, and Kai Li. 2007. Multi- probe LSH: efficient indexing for high-dimensional similarity search. InVLDB. 950–961

  38. [38]

    Yury Malkov, Alexander Ponomarenko, Andrey Logvinov, and Vladimir Krylov

  39. [39]

    Information Systems 45 (2014), 61–68

    Approximate nearest neighbor algorithm based on navigable small world graphs. Information Systems 45 (2014), 61–68

  40. [40]

    Yu A Malkov and Dmitry A Yashunin. 2018. Efficient and robust approximate nearest neighbor search using hierarchical navigable small world graphs.IEEE Transactions On Pattern Analysis And Machine Intelligence 42, 4 (2018), 824–836

  41. [41]

    Ilyas, Theodoros Rekatsinas, and Shivaram Venkataraman

    Jason Mohoney, Devesh Sarda, Mengze Tang, Shihabur Rahman Chowdhury, Anil Pacaci, Ihab F. Ilyas, Theodoros Rekatsinas, and Shivaram Venkataraman

  42. [42]

    InProceedings of the 19th USENIX Symposium on Operating Systems Design and Implementation (OSDI 25)

    Quake: Adaptive Indexing for Vector Search. InProceedings of the 19th USENIX Symposium on Operating Systems Design and Implementation (OSDI 25) . USENIX Association

  43. [43]

    James Jie Pan, Jianguo Wang, and Guoliang Li. 2024. Survey of vector database management systems.VLDB 33, 5 (2024), 1591–1615

  44. [44]

    Liana Patel, Peter Kraft, Carlos Guestrin, and Matei Zaharia. 2024. ACORN: Per- formant and Predicate-Agnostic Search Over Vector Embeddings and Structured Data.PACMMOD 2, 3 (2024), 1–27

  45. [45]

    Zhencan Peng, Miao Qiao, Wenchao Zhou, Feifei Li, and Dong Deng. 2025. Dy- namic Range-Filtering Approximate Nearest Neighbor Search.PVLDB 18, 10 (2025)

  46. [46]

    2012.Computational Geometry: An Introduction

    Franco P Preparata and Michael I Shamos. 2012.Computational Geometry: An Introduction. Springer Science & Business Media

  47. [47]

    Christoph Schuhmann, Romain Beaumont, Richard Vencu, Cade Gordon, Ross Wightman, Mehdi Cherti, Theo Coombes, Aarush Katta, Clayton Mullis, Mitchell Wortsman, Patrick Schramowski, Srivatsa Kundurthy, Katherine Crowson, Lud- wig Schmidt, Robert Kaczmarczyk, and Jenia Jitsev. 2022. LAION-5B: An open large-scale dataset for training next generation image-text...

  48. [48]

    Yitong Song, Pengcheng Zhang, Chao Gao, Bin Yao, Kai Wang, Zongyuan Wu, and Lin Qu. 2025. TRIM: Accelerating High-Dimensional Vector Similarity Search with Enhanced Triangle-Inequality-Based Pruning.PACMMOD 3, 6 (2025), 1–26

  49. [49]

    Ziwen Song, Bin Wang, and Xiaochun Yang. 2025. Accelerating High- Dimensional ANN Search via Skipping Redundant Distance Computations. PACMMOD 3, 6 (2025), 1–29

  50. [50]

    Tommaso Teofili and Jimmy Lin. 2025. Patience In Proximity: A Simple Early Ter- mination Strategy for HNSW Graph Traversal in Approximate k-Nearest Neigh- bor Search. InECIR. 401–407

  51. [51]

    Toussaint

    Godfried T. Toussaint. 1980. The relative neighbourhood graph of a finite planar set.Pattern Recognition 12, 4 (1980), 261–268

  52. [52]

    Mengzhao Wang, Lingwei Lv, Xiaoliang Xu, Yuxiang Wang, Qiang Yue, and Jiongkang Ni. 2022. Navigable Proximity Graph-Driven Native Hybrid Queries with Structured and Unstructured Constraints. arXiv:2203.13601 [cs.DB] https: //arxiv.org/abs/2203.13601

  53. [53]

    Zeyu Wang, Qitong Wang, Xiaoxing Cheng, Peng Wang, Themis Palpanas, and Wei Wang. 2024. Steiner-Hardness: A Query Hardness Measure for Graph-Based ANN Indexes. PVLDB 17, 13 (2024), 4668–4682

  54. [54]

    Maciej Wiatrak, Eirini Arvaniti, Angus Brayne, Jonas Vetterle, and Aaron Sim

  55. [55]

    arXiv preprint arXiv:2301.13318 (2023)

    Proxy-based Zero-Shot Entity Linking by Effective Candidate Retrieval. arXiv preprint arXiv:2301.13318 (2023). https://doi.org/10.48550/arXiv.2301. 13318

  56. [56]

    Qian Xu, Juan Yang, Feng Zhang, Junda Pan, Kang Chen, Youren Shen, Amelie Chi Zhou, and Xiaoyong Du. 2025. Tribase: A Vector Data Query En- gine for Reliable and Lossless Pruning Compression using Triangle Inequalities. PACMMOD 3, 1 (2025), 1–28

  57. [57]

    Yuexuan Xu, Jianyang Gao, Yutong Gou, Cheng Long, and Christian S. Jensen

  58. [58]

    iRangeGraph: Improvising Range-dedicated Graphs for Range-filtering Nearest Neighbor Search.PACMMOD 2, 6 (2024)

  59. [59]

    Kaixiang Yang, Hongya Wang, Bo Xu, Wei Wang, Yingyuan Xiao, Ming Du, and Junfeng Zhou. 2021. Tao: A learning framework for adaptive nearest neighbor search using static features only.arXiv preprint arXiv:2110.00696 (2021)

  60. [60]

    Mingyu Yang, Wentao Li, Jiabao Jin, Xiaoyao Zhong, Xiangyu Wang, Zhitao Shen, Wei Jia, and Wei Wang. 2025. Effective and general distance computation for approximate nearest neighbor search. InIEEE ICDE. 1098–1110

  61. [61]

    Chao Zhang and Renée J. Miller. 2025. Distribution-Aware Exploration for Adap- tive HNSW Search. arXiv:2512.06636 [cs.DB] https://arxiv.org/abs/2512.06636

  62. [62]

    Zili Zhang, Chao Jin, Linpeng Tang, Xuanzhe Liu, and Xin Jin. 2023. Fast, Ap- proximate Vector Queries on Very Large Unstructured Datasets. In20th USENIX Symposium on Networked Systems Design and Implementation (NSDI 23). USENIX Association, 995–1011

  63. [63]

    Xiaoyao Zhong, Haotian Li, Jiabao Jin, Mingyu Yang, Deming Chu, Xiangyu Wang, Zhitao Shen, Wei Jia, George Gu, Yi Xie, Xuemin Lin, Heng Tao Shen, Jingkuan Song, and Peng Cheng. 2025. VSAG: An Optimized Search Framework for Graph-Based Approximate Nearest Neighbor Search.PVLDB 18, 12 (2025), 5017–5030

  64. [64]

    Chaoji Zuo, Miao Qiao, Wenchao Zhou, Feifei Li, and Dong Deng. 2024. SeRF: Segment Graph for Range-Filtering Approximate Nearest Neighbor Search. PACMMOD 2, 1 (2024)