Distributed learning with compressed gradients

Hamid Reza Feyzmahdavian; Mikael Johansson; Sarit Khirirat

arxiv: 1806.06573 · v2 · pith:CHWFBCIVnew · submitted 2018-06-18 · 🧮 math.OC · stat.ML

Sarit Khirirat, Hamid Reza Feyzmahdavian, Mikael Johansson

keywords: compression, convergence, distributed, gradient, algorithms, bounds, compressed, exchange

Asynchronous computation and gradient compression have emerged as two key techniques for achieving scalability in distributed optimization for large-scale machine learning. This paper presents a unified analysis framework for distributed gradient methods operating with stale and compressed gradients. Non-asymptotic bounds on convergence rates and information exchange are derived for several optimization algorithms. These bounds give explicit expressions for step-sizes and characterize how the amount of asynchrony and the compression accuracy affect iteration and communication complexity guarantees. Numerical results highlight convergence properties of different gradient compression algorithms and confirm that fast convergence under limited information exchange is indeed possible.
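To make the idea of a compressed gradient step concrete, here is a minimal sketch. It is not the paper's algorithm: it assumes top-k sparsification as the compressor (one common choice, not named in the abstract), a toy separable quadratic objective, and a single worker with no staleness. The names top_k, gradient and compressed_gd are hypothetical. In a distributed setting, each worker would presumably apply such a compressor to its local gradient before sending it.

```python
def top_k(vec, k):
    """Assumed compressor: keep the k largest-magnitude entries, zero the rest."""
    keep = sorted(range(len(vec)), key=lambda i: abs(vec[i]), reverse=True)[:k]
    out = [0.0] * len(vec)
    for i in keep:
        out[i] = vec[i]
    return out


def gradient(x, a, b):
    """Gradient of the toy objective f(x) = 0.5 * sum_i a_i * (x_i - b_i)^2."""
    return [ai * (xi - bi) for ai, xi, bi in zip(a, x, b)]


def compressed_gd(a, b, k, step, iters):
    """Gradient descent in which only a top-k compressed gradient is applied."""
    x = [0.0] * len(a)
    for _ in range(iters):
        g = top_k(gradient(x, a, b), k)  # only k entries would be transmitted
        x = [xi - step * gi for xi, gi in zip(x, g)]
    return x


# Toy problem: the minimizer is x = b. The step size 0.25 equals 1 / max(a).
a = [1.0, 2.0, 4.0]
b = [1.0, -2.0, 3.0]
x = compressed_gd(a, b, k=1, step=0.25, iters=200)
```

With k=1 only one coordinate of the gradient is used per iteration, yet on this toy problem the iterate still approaches b. How step size, compression accuracy and delay interact in general is what the paper's bounds characterize.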
