Tune: A Research Platform for Distributed Model Selection and Training

Richard Liaw , Eric Liang , Robert Nishihara , Philipp Moritz , Joseph E. Gonzalez , Ion Stoica

classification cs.LG cs.DC stat.ML

keywords algorithms search tune distributed hyperparameter model selection training

Modern machine learning algorithms are increasingly computationally demanding, requiring specialized hardware and distributed computation to achieve high performance in a reasonable time frame. Many hyperparameter search algorithms have been proposed to improve the efficiency of model selection; however, their adaptation to the distributed compute environment is often ad hoc. We propose Tune, a unified framework for model selection and training that provides a narrow-waist interface between training scripts and search algorithms. We show that this interface meets the requirements of a broad range of hyperparameter search algorithms, allows straightforward scaling of search to large clusters, and simplifies algorithm implementation. We demonstrate the implementation of several state-of-the-art hyperparameter search algorithms in Tune. Tune is available at http://ray.readthedocs.io/en/latest/tune.html.
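To make the idea of a narrow-waist interface concrete, here is a minimal sketch in plain Python. It is not Tune's API and does not use Ray; every name in it (`trainable`, `report`, `RandomSearch`, `ThresholdStopper`, `run_trials`) is hypothetical and chosen for illustration only. The sketch assumes the training script talks to the search side through a single callback that reports intermediate results and receives a continue-or-stop decision. Trials run sequentially here, whereas the abstract describes scaling the search across a cluster.

```python
import random


def trainable(config, report):
    """Toy training script (hypothetical): reports a score after each step."""
    score = 0.0
    for step in range(1, 11):
        # Made-up objective: learning rates near 0.1 accumulate score fastest.
        score += 1.0 / (1.0 + abs(config["lr"] - 0.1) * 10)
        # The only contact point with the search side is this callback.
        if not report(step=step, score=score):
            return  # the search side asked this trial to stop early


class RandomSearch:
    """Search-algorithm side: proposes configurations (log-uniform lr)."""

    def __init__(self, seed=0):
        self.rng = random.Random(seed)

    def suggest(self):
        return {"lr": 10 ** self.rng.uniform(-3, 0)}


class ThresholdStopper:
    """Scheduler side: stops a trial whose score falls below a fraction
    of the best score seen so far at the same step."""

    def __init__(self, grace_steps=3, fraction=0.5):
        self.grace_steps = grace_steps
        self.fraction = fraction
        self.best_at_step = {}

    def on_result(self, step, score):
        best = max(self.best_at_step.get(step, score), score)
        self.best_at_step[step] = best
        if step < self.grace_steps:
            return True  # always allow a few warm-up steps
        return score >= self.fraction * best


def run_trials(num_trials=8, seed=0):
    """Sequential driver: wires the training script to search and scheduler."""
    search = RandomSearch(seed)
    scheduler = ThresholdStopper()
    results = []
    for _ in range(num_trials):
        config = search.suggest()
        last = {}

        def report(step, score):
            last.update(step=step, score=score)
            return scheduler.on_result(step, score)

        trainable(config, report)
        results.append((config, dict(last)))
    return results


if __name__ == "__main__":
    for config, last in run_trials():
        print(f"lr={config['lr']:.4f} stopped at step {last['step']} "
              f"with score {last['score']:.3f}")
```

Because the training script only sees `config` and `report`, the search algorithm or the stopping rule can be swapped without touching the training code, which is the property the abstract attributes to the narrow-waist design.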

Forward citations

Cited by 6 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

CDS4RAG: Cyclic Dual-Sequential Hyperparameter Optimization for RAG
cs.LG 2026-05 unverdicted novelty 7.0

CDS4RAG cyclically optimizes full RAG hyperparameters by distinguishing and alternating between retriever and generator components, boosting performance up to 1.54x over prior methods on benchmarks.

PEML: Parameter-efficient Multi-Task Learning with Optimized Continuous Prompts
cs.CL 2026-05 unverdicted novelty 6.0

PEML co-optimizes continuous prompts and low-rank adaptations to deliver up to 6.67% average accuracy gains over existing multi-task PEFT methods on GLUE, SuperGLUE, and other benchmarks.

Deep Researcher Agent: An Autonomous Framework for 24/7 Deep Learning Experimentation with Zero-Cost Monitoring
cs.AI 2026-04 unverdicted novelty 5.0

Deep Researcher Agent is a framework for autonomous 24/7 deep learning experimentation by LLM agents using zero-cost monitoring, constant-size memory, and a minimal-toolset multi-agent design.

Prediction of Magnetic Flux Evolution During Solar Active Region Emergence using Long Short-Term Memory Networks
astro-ph.SR 2026-04 unverdicted novelty 5.0

Standard LSTM networks predict solar active region magnetic flux evolution 3-10 hours ahead from intensity and oscillation maps, outperforming encoder-decoder variants on held-out test regions.

Chrono::Ray: A Distributed Framework for High-Throughput Simulation-Based Analysis of Multibody Systems
cs.CE 2026-05 accept novelty 4.0

Chrono::Ray integrates Chrono and Ray into an open-source framework that enables scalable, user-friendly orchestration of large ensembles of multibody dynamics simulations.

Optimization with SpotOptim
cs.LG 2026-04 unverdicted novelty 3.0

spotoptim is an open-source Python package that implements a Kriging-based optimization loop with Expected Improvement, mixed-variable support, noise handling via OCBA, parallelization, and restart mechanisms for blac...