Recognition: unknown
Tune: A Research Platform for Distributed Model Selection and Training
read the original abstract
Modern machine learning algorithms are increasingly computationally demanding, requiring specialized hardware and distributed computation to achieve high performance in a reasonable time frame. Many hyperparameter search algorithms have been proposed for improving the efficiency of model selection, however their adaptation to the distributed compute environment is often ad-hoc. We propose Tune, a unified framework for model selection and training that provides a narrow-waist interface between training scripts and search algorithms. We show that this interface meets the requirements for a broad range of hyperparameter search algorithms, allows straightforward scaling of search to large clusters, and simplifies algorithm implementation. We demonstrate the implementation of several state-of-the-art hyperparameter search algorithms in Tune. Tune is available at http://ray.readthedocs.io/en/latest/tune.html.
This paper has not been read by Pith yet.
Forward citations
Cited by 6 Pith papers
-
CDS4RAG: Cyclic Dual-Sequential Hyperparameter Optimization for RAG
CDS4RAG cyclically optimizes full RAG hyperparameters by distinguishing and alternating between retriever and generator components, boosting performance up to 1.54x over prior methods on benchmarks.
-
PEML: Parameter-efficient Multi-Task Learning with Optimized Continuous Prompts
PEML co-optimizes continuous prompts and low-rank adaptations to deliver up to 6.67% average accuracy gains over existing multi-task PEFT methods on GLUE, SuperGLUE, and other benchmarks.
-
Deep Researcher Agent: An Autonomous Framework for 24/7 Deep Learning Experimentation with Zero-Cost Monitoring
Deep Researcher Agent is a framework for autonomous 24/7 deep learning experimentation by LLM agents using zero-cost monitoring, constant-size memory, and a minimal-toolset multi-agent design.
-
Prediction of Magnetic Flux Evolution During Solar Active Region Emergence using Long Short-Term Memory Networks
Standard LSTM networks predict solar active region magnetic flux evolution 3-10 hours ahead from intensity and oscillation maps, outperforming encoder-decoder variants on held-out test regions.
-
Chrono::Ray: A Distributed Framework for High-Throughput Simulation-Based Analysis of Multibody Systems
Chrono::Ray integrates Chrono and Ray into an open-source framework that enables scalable, user-friendly orchestration of large ensembles of multibody dynamics simulations.
-
Optimization with SpotOptim
spotoptim is an open-source Python package that implements a Kriging-based optimization loop with Expected Improvement, mixed-variable support, noise handling via OCBA, parallelization, and restart mechanisms for blac...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.