arxiv: 1704.04684 · v1 · pith:N4X5GCSTnew · submitted 2017-04-15 · cs.DS · cs.AI · cs.IR

Generic LSH Families for the Angular Distance Based on Johnson-Lindenstrauss Projections and Feature Hashing LSH

classification cs.DS cs.AI cs.IR
keywords families distance angular feature hashing datasets euclidean generic

In this paper we propose generic LSH families for the angular distance based on Johnson-Lindenstrauss (J-L) projections. We show that feature hashing is a valid J-L projection and propose two new LSH families based on feature hashing. These new LSH families are tested on both synthetic and real datasets, with very good results and a considerable performance improvement over other LSH families. While the theoretical analysis is done for the angular distance, these families can also be used in practice for the Euclidean distance with excellent results [2]; our tests on real datasets show that the proposed LSH functions work well for the Euclidean distance.
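The abstract does not describe the two proposed families in detail, so the following is only an illustrative sketch of the general idea it names: feature hashing used as a J-L-style projection, followed by a sign step to obtain an angular LSH signature. The sign step (borrowed from sign random projections), the helper names `feature_hash` and `sign_signature`, the `seed` parameter, and the use of BLAKE2 as the hash function are all assumptions made for illustration, not the paper's construction.

```python
import hashlib


def _hash(seed, index):
    """Deterministic 64-bit hash of (seed, index); BLAKE2 is an arbitrary choice."""
    digest = hashlib.blake2b(f"{seed}:{index}".encode(), digest_size=8).digest()
    return int.from_bytes(digest, "big")


def feature_hash(vec, k, seed=0):
    """Project a vector to k dimensions with feature hashing.

    Each input coordinate is added to one of k buckets with a
    pseudo-random sign, both derived from a hash of its index.
    """
    out = [0.0] * k
    for i, x in enumerate(vec):
        h = _hash(seed, i)
        bucket = (h >> 1) % k             # which output coordinate receives x
        sign = 1.0 if h & 1 else -1.0     # pseudo-random +/-1 sign
        out[bucket] += sign * x
    return out


def sign_signature(vec, k, seed=0):
    """Hypothetical angular signature: one bit per projected coordinate."""
    return tuple(1 if v >= 0 else 0 for v in feature_hash(vec, k, seed))


def hamming(a, b):
    """Number of positions at which two signatures differ."""
    return sum(x != y for x, y in zip(a, b))


if __name__ == "__main__":
    u = [0.5, -1.0, 2.0, 0.0, 3.5, -0.25, 1.0, 4.0]
    v = [2.0 * x for x in u]              # same direction, different length
    print(hamming(sign_signature(u, 4), sign_signature(v, 4)))
```

Because the signature depends only on the signs of the projected coordinates, rescaling a vector leaves it unchanged, which is the behavior one would want from a hash aimed at the angular distance. Vectors at a small angle should tend to agree on more bits than vectors at a large angle, but this sketch makes no claim about the collision probabilities analyzed in the paper.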



Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. RaBitQCache: Rotated Binary Quantization for KVCache in Long Context LLM Inference

    cs.LG 2026-06 unverdicted novelty 5.0

    RaBitQCache proposes rotated binary quantization with binary-INT4 arithmetic for unbiased attention weight estimation in long-context LLMs, enabling adaptive Top-p retrieval and hardware optimizations.