Federated Optimization: Distributed Optimization Beyond the Datacenter

Jakub Konečný, Brendan McMahan, Daniel Ramage · 2015 · cs.LG · arXiv 1511.03575

4 Pith papers cite this work. Polarity classification is still indexing.

abstract

We introduce a new and increasingly relevant setting for distributed optimization in machine learning, where the data defining the optimization are distributed (unevenly) over an extremely large number of nodes, but the goal remains to train a high-quality centralized model. We refer to this setting as Federated Optimization. In this setting, communication efficiency is of utmost importance. A motivating example for federated optimization arises when we keep the training data locally on users' mobile devices rather than logging it to a data center for training. Instead, the mobile devices are used as nodes performing computation on their local data in order to update a global model. We suppose that we have an extremely large number of devices in our network, each of which has only a tiny fraction of data available totally; in particular, we expect the number of data points available locally to be much smaller than the number of devices. Additionally, since different users generate data with different patterns, we assume that no device has a representative sample of the overall distribution. We show that existing algorithms are not suitable for this setting, and propose a new algorithm which shows encouraging experimental results. This work also sets a path for future research needed in the context of federated optimization.

representative citing papers

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

cs.LG · 2019-10-23 · unverdicted · novelty 7.0

T5 casts all NLP tasks as text-to-text generation, systematically explores pre-training choices, and reaches strong performance on summarization, QA, classification and other tasks via large-scale training on the Colossal Clean Crawled Corpus.

Federated Learning with Non-IID Data

cs.LG · 2018-06-02 · conditional · novelty 6.0

Non-IID data causes up to 55% accuracy loss in federated learning due to weight divergence measured by earth mover's distance; 5% globally shared data recovers 30% accuracy on CIFAR-10.

FED-FSTQ: Fisher-Guided Token Quantization for Communication-Efficient Federated Fine-Tuning of LLMs on Edge Devices

cs.LG · 2026-04-28 · unverdicted · novelty 5.0 · 3 refs

Fed-FSTQ uses Fisher-guided token quantization to cut uplink traffic 46-fold and improve straggler-limited time-to-accuracy by 52% versus Fed-LoRA in non-IID multilingual and medical QA tasks.

An AI-Based Solution for Secure Service Provisioning in IoT

cs.CR · 2026-06-29 · unverdicted · novelty 3.0

A DRL agent selects IoT service providers under security constraints while an FL-based behavioral fingerprinting model computes reliability scores incorporated into the selection process.
