MLEvolve: A Self-Evolving Framework for Automated Machine Learning Algorithm Discovery

· 2026 · cs.AI · arXiv 2606.06473

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open full Pith review browse 2 citing papers arXiv PDF

abstract

Large language model (LLM) agents are increasingly applied to long-horizon tasks such as scientific discovery and machine learning engineering (MLE), where sustained self-evolution becomes a key capability. However, existing MLE agents suffer from inter-branch information isolation, memoryless search, and lack of hierarchical control, which together hinder long-horizon optimization. We present MLEvolve, an LLM-based self-evolving multi-agent framework for end-to-end machine learning algorithm discovery. By extending tree search to Progressive MCGS, MLEvolve enables cross-branch information flow through graph-based reference edges and gradually shifts the search from broad exploration to focused exploitation with an entropy-inspired progressive schedule. To allow the agent to evolve with accumulated experience, we introduce Retrospective Memory, which combines a cold-start domain knowledge base with a dynamic global memory for task-specific experience retrieval and reuse. For stable long-horizon iteration, we further decouple strategic planning from code generation with adaptive coding modes. Evaluation on MLE-Bench shows that MLEvolve achieves state-of-the-art performance across multiple dimensions including average medal rate and valid submission rate under a 12-hour budget (half the standard runtime). Moreover, MLEvolve also outperforms specialized algorithm discovery methods including AlphaEvolve on mathematical algorithm optimization tasks, demonstrating strong cross-domain generalization. Our code is available at https://github.com/InternScience/MLEvolve.

representative citing papers

Agents-K1: Towards Agent-native Knowledge Orchestration

cs.AI · 2026-06-11 · unverdicted · novelty 6.0

Agents-K1 is an end-to-end pipeline with a multimodal parser, 4B GRPO-trained extractor, and agent CLI that builds scientific knowledge graphs from full papers and was run on 2.46 million documents to produce Scholar-KG.

Scaling the Horizon, Not the Parameters: Reaching Trillion-Parameter Performance with a 35B Agent

cs.CL · 2026-06-29 · unverdicted · novelty 5.0

A 35B MoE agent model trained on 45K-token trajectories via three-stage SFT and domain-routed distillation achieves leading or competitive scores against 1T models on SEAL-0, IFBench, HiPhO, FrontierScience-Olympiad and MolBench-Bind.

citing papers explorer

Showing 2 of 2 citing papers.

Agents-K1: Towards Agent-native Knowledge Orchestration cs.AI · 2026-06-11 · unverdicted · none · ref 19 · internal anchor
Agents-K1 is an end-to-end pipeline with a multimodal parser, 4B GRPO-trained extractor, and agent CLI that builds scientific knowledge graphs from full papers and was run on 2.46 million documents to produce Scholar-KG.
Scaling the Horizon, Not the Parameters: Reaching Trillion-Parameter Performance with a 35B Agent cs.CL · 2026-06-29 · unverdicted · none · ref 26 · internal anchor
A 35B MoE agent model trained on 45K-token trajectories via three-stage SFT and domain-routed distillation achieves leading or competitive scores against 1T models on SEAL-0, IFBench, HiPhO, FrontierScience-Olympiad and MolBench-Bind.

MLEvolve: A Self-Evolving Framework for Automated Machine Learning Algorithm Discovery

fields

years

verdicts

representative citing papers

citing papers explorer