TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems
Abstract
TensorFlow is an interface for expressing machine learning algorithms, and an implementation for executing such algorithms. A computation expressed using TensorFlow can be executed with little or no change on a wide variety of heterogeneous systems, ranging from mobile devices such as phones and tablets up to large-scale distributed systems of hundreds of machines and thousands of computational devices such as GPU cards. The system is flexible and can be used to express a wide variety of algorithms, including training and inference algorithms for deep neural network models, and it has been used for conducting research and for deploying machine learning systems into production across more than a dozen areas of computer science and other fields, including speech recognition, computer vision, robotics, information retrieval, natural language processing, geographic information extraction, and computational drug discovery. This paper describes the TensorFlow interface and an implementation of that interface that we have built at Google. The TensorFlow API and a reference implementation were released as an open-source package under the Apache 2.0 license in November, 2015 and are available at www.tensorflow.org.
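To make the abstract's dataflow model concrete, here is a minimal sketch in the TensorFlow 1.x graph style the paper describes (written against tf.compat.v1 in current releases). The softmax model and the device string are illustrative choices, not taken from the paper.

```python
# Minimal sketch of the dataflow-graph style the abstract describes.
# Uses the TensorFlow 1.x graph API (tf.compat.v1 in modern releases);
# the softmax model and device placement below are illustrative only.
import numpy as np
import tensorflow.compat.v1 as tf

tf.disable_eager_execution()

# Build the graph once; device placement is a deployment detail, so the
# same graph can target a phone, a GPU workstation, or a cluster.
with tf.device("/cpu:0"):  # swap in "/gpu:0" on a GPU host
    x = tf.placeholder(tf.float32, shape=[None, 784], name="input")
    W = tf.Variable(tf.zeros([784, 10]), name="weights")
    b = tf.Variable(tf.zeros([10]), name="bias")
    y = tf.nn.softmax(tf.matmul(x, W) + b, name="probs")

# A Session maps the graph onto available devices and executes it.
with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    probs = sess.run(y, feed_dict={x: np.zeros((1, 784), np.float32)})
    print(probs.shape)  # (1, 10)
```

This separation of graph construction from execution is what lets the same program run "with little or no change" across heterogeneous devices, as the abstract claims.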
This paper has not been read by Pith yet.
Forward citations
Cited by 17 Pith papers
- Floating-Point Networks with Automatic Differentiation Can Represent Almost All Floating-Point Functions and Their Gradients
  Floating-point neural networks with automatic differentiation can represent arbitrary floating-point functions and their gradients under mild conditions.
- Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer
  A noisy top-k gated mixture-of-experts layer between LSTMs scales neural networks to 137B parameters with sub-linear compute, beating the state of the art on language modeling and machine translation.
- Density estimation using Real NVP
  Real NVP uses affine coupling layers to create invertible transformations that support exact density estimation, sampling, and latent inference without approximations (a minimal coupling-layer sketch follows this list).
- OAM-Induced Lattice Rotation Reveals a Fractional Optimum in Fault-Tolerant GKP Quantum Sensing
  A fractional OAM charge of ℓ=1.5 optimizes twisted GKP lattices, cutting error probability by 23.9× versus square lattices at fixed Fisher information.
- SMART: A Spectral Transfer Approach to Multi-Task Learning
  SMART transfers knowledge in multi-task linear regression via spectral subspace similarity assumptions, achieving near-minimax Frobenius error rates while requiring only a fitted source model.
- The Kinetics Human Action Video Dataset
  Kinetics is a new video dataset of 400 human actions with over 160,000 ten-second clips collected from YouTube, accompanied by baseline action-classification results from neural networks.
- HyperNetworks
  Hypernetworks generate weights for a main network, allowing LSTMs to use non-shared weights and achieve near state-of-the-art results on sequence modeling tasks while using fewer parameters overall.
- Real-time Surface-Code Error Correction Using an FPGA-based Neural-Network Decoder
  An FPGA-based neural-network decoder achieves 550 ns deterministic closed-loop latency for real-time distance-3 surface-code error correction on a superconducting processor, matching offline decoding performance.
- Amortized Variational Inference for Joint Posterior and Predictive Distributions in Bayesian Uncertainty Quantification
  An amortized variational framework jointly targets the posterior and posterior-predictive distributions via a KL upper bound and moment regularization, yielding more accurate predictions at lower online cost than two-...
- On Model-Based Clustering With Entropic Optimal Transport
  Entropic optimal transport yields a clustering loss with the same global optimum as the log-likelihood but a better-behaved optimization surface, outperforming standard EM in experiments.
- Alikhanov-XfPINNs: Adaptive Physics-Informed Learning for Nonlinear Fractional PDEs on Nonuniform Meshes
  Alikhanov-XfPINNs integrates accelerated Alikhanov discretization on nonuniform time grids with physics-informed neural networks to solve general nonlinear fractional PDEs for both forward and inverse problems with im...
- TCL: Enabling Fast and Efficient Cross-Hardware Tensor Program Optimization via Continual Learning
  TCL delivers 16.8× faster tuning on CPU and 12.48× on GPU with modestly lower inference latency by combining RDU active sampling, a lightweight Mamba cost model, and cross-platform continual knowledge distillation.
- MONAI: An open-source framework for deep learning in healthcare
  MONAI is a community-supported PyTorch framework that extends deep learning to medical data with domain-specific architectures, transforms, and deployment tools.
- Rethinking Atrous Convolution for Semantic Image Segmentation
  DeepLabv3 improves semantic segmentation by capturing multi-scale context with cascaded or parallel atrous convolutions and adding global context to ASPP, achieving better results on PASCAL VOC 2012 without DenseCRF post-processing.
- Lit2Vec: A Reproducible Workflow for Building a Legally Screened Chemistry Corpus from S2ORC for Downstream Retrieval and Text Mining
  Lit2Vec delivers a documented, reproducible pipeline that extracts and annotates a large licensed chemistry paper corpus from S2ORC with paragraph embeddings and subfield labels.
- SecureAFL: Secure Asynchronous Federated Learning
  SecureAFL secures asynchronous federated learning against poisoning attacks by detecting anomalous updates, estimating missing client contributions, and using Byzantine-robust aggregation.
- Optimizing Yukawa couplings to suppress Dimension-five Proton Decay in SU(5) GUT
  Machine-learning optimization of Yukawa couplings in a 45-extended SUSY SU(5) model identifies parameter regions where the proton lifetime exceeds the Super-Kamiokande bound of 5.9×10^33 years.
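As referenced in the Real NVP entry above, here is a minimal sketch of one affine coupling layer. It is an illustrative reconstruction from the one-line summary, not code from either paper; scale_net and shift_net stand in for arbitrary learned networks.

```python
# Minimal sketch (not the papers' code) of one Real NVP affine coupling
# layer. scale_net/shift_net are placeholders for learned networks.
import numpy as np

def coupling_forward(x, scale_net, shift_net, d):
    """Transform x -> y, keeping the first d dimensions fixed."""
    x1, x2 = x[..., :d], x[..., d:]
    s, t = scale_net(x1), shift_net(x1)     # s, t depend only on x1
    y2 = x2 * np.exp(s) + t                 # elementwise affine transform
    log_det = s.sum(axis=-1)                # exact log|det Jacobian|
    return np.concatenate([x1, y2], axis=-1), log_det

def coupling_inverse(y, scale_net, shift_net, d):
    """Exact inverse: recover x from y without any approximation."""
    y1, y2 = y[..., :d], y[..., d:]
    s, t = scale_net(y1), shift_net(y1)
    x2 = (y2 - t) * np.exp(-s)
    return np.concatenate([y1, x2], axis=-1)

# Demo with toy affine maps standing in for learned networks.
rng = np.random.default_rng(0)
A, B = rng.standard_normal((2, 2)), rng.standard_normal((2, 2))
scale = lambda h: np.tanh(h @ A)            # bounded log-scales for stability
shift = lambda h: h @ B
x = rng.standard_normal((5, 4))
y, log_det = coupling_forward(x, scale, shift, d=2)
assert np.allclose(coupling_inverse(y, scale, shift, d=2), x)
```

Stacking such layers with alternating fixed partitions yields an invertible map whose exact log-density follows from summing the per-layer log-determinants, which is what enables the exact density estimation, sampling, and latent inference the summary mentions.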