StreamSampling.jl implements efficient one-pass sampling algorithms for data streams in Julia with constant memory footprint and performance gains over traditional methods.
StreamSampling.jl: Efficient Sampling from Data Streams in Julia
1 Pith paper cite this work. Polarity classification is still indexing.
abstract
StreamSampling$.$jl is a Julia library designed to provide general and efficient methods for sampling from data streams in a single pass, even when the total number of items is unknown. In this paper, we describe the capabilities of the library and its advantages over traditional sampling procedures, such as maintaining a small, constant memory footprint and avoiding the need to fully materialize the stream in memory. Furthermore, we provide empirical benchmarks comparing online sampling methods against standard approaches, demonstrating performance and memory improvements.
fields
cs.SE 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
StreamSampling.jl: Efficient Sampling from Data Streams in Julia
StreamSampling.jl implements efficient one-pass sampling algorithms for data streams in Julia with constant memory footprint and performance gains over traditional methods.