Random Indexing K-tree

Christopher M. De Vries; Lance De Vine; Shlomo Geva

arxiv: 1001.0833 · v2 · submitted 2010-01-06 · 💻 cs.IR · cs.AI· cs.DS

Random Indexing K-tree

Christopher M. De Vries , Lance De Vine , Shlomo Geva This is my paper

classification 💻 cs.IR cs.AIcs.DS

keywords k-treedocumentalgorithmsclusteringindexinglargequalityrandom

0 comments

read the original abstract

Random Indexing (RI) K-tree is the combination of two algorithms for clustering. Many large scale problems exist in document clustering. RI K-tree scales well with large inputs due to its low complexity. It also exhibits features that are useful for managing a changing collection. Furthermore, it solves previous issues with sparse document vectors when using K-tree. The algorithms and data structures are defined, explained and motivated. Specific modifications to K-tree are made for use with RI. Experiments have been executed to measure quality. The results indicate that RI K-tree improves document cluster quality over the original K-tree algorithm.

This paper has not been read by Pith yet.

Random Indexing K-tree

discussion (0)