pith. machine review for the scientific record. sign in

arxiv: 1611.07555 · v1 · submitted 2016-11-22 · 💻 cs.DC · math.NA· stat.ML

Recognition: unknown

Randomized Distributed Mean Estimation: Accuracy vs Communication

Jakub Kone\v{c}n\'y, Peter Richt\'arik

Authors on Pith no claims yet
classification 💻 cs.DC math.NAstat.ML
keywords communicationdistributederroralgorithmsepsilonestimationextremefamily
0
0 comments X
read the original abstract

We consider the problem of estimating the arithmetic average of a finite collection of real vectors stored in a distributed fashion across several compute nodes subject to a communication budget constraint. Our analysis does not rely on any statistical assumptions about the source of the vectors. This problem arises as a subproblem in many applications, including reduce-all operations within algorithms for distributed and federated optimization and learning. We propose a flexible family of randomized algorithms exploring the trade-off between expected communication cost and estimation error. Our family contains the full-communication and zero-error method on one extreme, and an $\epsilon$-bit communication and ${\cal O}\left(1/(\epsilon n)\right)$ error method on the opposite extreme. In the special case where we communicate, in expectation, a single bit per coordinate of each vector, we improve upon existing results by obtaining $\mathcal{O}(r/n)$ error, where $r$ is the number of bits used to represent a floating point value.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Federated Learning: Strategies for Improving Communication Efficiency

    cs.LG 2016-10 conditional novelty 8.0

    Structured updates (low-rank or masked) and sketched updates (quantized, rotated, subsampled) reduce uplink communication in federated learning by up to two orders of magnitude on convolutional and recurrent networks.