The CMA Evolution Strategy: A Tutorial

Nikolaus Hansen (TAO) · 2016 · cs.LG · arXiv 1604.00772

Mixed citation behavior. Most common role is background (50%).

64 Pith papers citing it

Background: 50% of classified citations

abstract

This tutorial introduces the CMA Evolution Strategy (ES), where CMA stands for Covariance Matrix Adaptation. The CMA-ES is a stochastic, or randomized, method for real-parameter (continuous domain) optimization of non-linear, non-convex functions. We try to motivate and derive the algorithm from intuitive concepts and from requirements of non-linear, non-convex search in continuous domain.

citation-role summary

background: 4 · method: 2

citation-polarity summary

background: 3 · use method: 2 · unclear: 1

representative citing papers

Certified Gradient-Based Contact-Rich Manipulation via Smoothing-Error Reachable Tubes

cs.RO · 2026-02-10 · unverdicted · novelty 8.0

A certified gradient-based method for contact-rich manipulation that quantifies smoothing-induced errors via set-valued discrepancies and incorporates them into analytical reachable sets for robust affine feedback policies.

Promptbreeder: Self-Referential Self-Improvement Via Prompt Evolution

cs.CL · 2023-09-28 · unverdicted · novelty 8.0

Promptbreeder evolves both task prompts and the mutation prompts that improve them using LLMs, outperforming Chain-of-Thought and Plan-and-Solve on arithmetic and commonsense reasoning benchmarks.

Runtime Analysis of the $(\mu + 1)$-ES in a Homogenous Progress Model

cs.NE · 2026-06-11 · unverdicted · novelty 7.0

Introduces homogeneous progress model and proves that for Z = N(-δ,1) with μ ≤ e^δ the growth rate R_μ equals (log^{1+o(1)} μ / μ) R_1.

Latent Heuristic Search: Continuous Optimization for Automated Algorithm Design

cs.AI · 2026-05-16 · unverdicted · novelty 7.0

Latent Heuristic Search performs continuous optimization over learned embeddings of heuristics, using normalizing flows and LLM prompting to discover competitive solvers for TSP, CVRP, KSP, and OBP.

Low Stage High Order Explicit Runge--Kutta Methods via Q- and D-Conditions: General Theory and Efficient Recursive Construction

math.NA · 2026-05-16 · unverdicted · novelty 7.0 · 2 refs

A Q/D-space framework supplies sufficient order conditions for explicit Runge-Kutta methods and supports a recursive construction of even-order methods with stage count (p²-2p+8)/4.

EVA-0: Test-Time Model Evolution with Only Two Forward Passes per Sample

cs.LG · 2026-05-15 · unverdicted · novelty 7.0

EVA-0 is a zeroth-order test-time adaptation method that uses scale-invariant loss, anchor-guided optimization, and symmetric two-sided perturbations to enable inference and adaptation in two forward passes, outperforming prior methods on ImageNet-C with ViT-Base.

Gradient-Free Training of Spiking Neural Networks via Low-Rank Evolution Strategies

cs.NE · 2026-05-14 · unverdicted · novelty 7.0

EGGROLL applies low-rank evolution strategies to train leaky integrate-and-fire spiking neural networks, reaching 79.21% accuracy on N-MNIST with 2.23 times lower per-generation time than full-rank ES.

EvoPref: Multi-Objective Evolutionary Optimization Discovers Diverse LLM Alignments Beyond Gradient Descent

cs.NE · 2026-05-10 · unverdicted · novelty 7.0

EvoPref applies NSGA-II evolutionary optimization with archive-based diversity to populations of LoRA adapters, yielding 18% higher preference coverage and 47% lower collapse than gradient descent baselines while matching alignment quality.

Evolutionary Negative Module Pruning for Better LoRA Merging

cs.AI · 2026-04-20 · conditional · novelty 7.0

ENMP prunes negative LoRA modules via evolutionary search to boost merging performance to new state-of-the-art levels across language and vision tasks.

Simulator Adaptation for Sim-to-Real Learning of Legged Locomotion via Proprioceptive Distribution Matching

cs.RO · 2026-04-13 · unverdicted · novelty 7.0

Proprioceptive distribution matching adapts simulators for legged robot policies by comparing observation and action distributions, reducing sim-to-real gaps with minimal real data and no external sensing.

Bootstrapping non-unitary CFTs

hep-th · 2025-12-08 · unverdicted · novelty 7.0

A bootstrap strategy for non-unitary CFTs uses statistical stability of OPE data across cross-ratios to optimize spectra, reproducing A-series minimal models and yielding candidate solutions for c>1.

Exploring Exploration in Bayesian Optimization

cs.LG · 2025-02-12 · unverdicted · novelty 7.0

Introduces observation traveling salesman distance and observation entropy to quantify exploration in Bayesian optimization acquisition functions and links them to empirical performance.

Physics-Informed Eikonal Caging for Whole-Arm Manipulation Planning

cs.RO · 2026-06-20 · unverdicted · novelty 6.0

Reformulates caging as an eikonal minimum-time escape field, approximated via physics-informed neural network and embedded into whole-arm manipulation planning for improved robustness to contact-model mismatch.

Evolutionary Discovery of Developmental Reward Schedules in Deep Reinforcement Learning

cs.LG · 2026-06-18 · unverdicted · novelty 6.0

Evolutionary optimization discovers developmental reward schedules that improve performance over extrinsic-only baselines on some MiniGrid tasks, with novelty emerging as the dominant early signal.

Tuning Agent-Based Predator-Prey Models Toward Lotka-Volterra Dynamics

cs.MA · 2026-06-11 · unverdicted · novelty 6.0

Optimizing environmental and demographic parameters in a JAX-based agent-based model with RNN controllers produces population dynamics resembling Lotka-Volterra cycles using a feature-based loss.

Learning What to Remember: A Cognitively Grounded Multi-Factor Value Model for Agentic Memory

cs.AI · 2026-06-11 · unverdicted · novelty 6.0

A learned linear multi-factor value model over seven cognitive psychology factors retains 0.770 gold evidence on LongMemEval blind regime versus 0.368 for recency and 0.518 for best single factor.

Active Perception for Radio Map Reconstruction in Uncharted 3D Air-Ground Environments

eess.SP · 2026-06-11 · unverdicted · novelty 6.0

3D-URAM decouples radio map reconstruction into Bayesian UNet recovery with uncertainty and transformer-based PPO waypoint selection, reporting over 50% error reduction in simulations and real 300x200x100m field tests.

Consolidating Rewarded Perturbations for LLM Post-Training

cs.CL · 2026-05-29 · unverdicted · novelty 6.0

CoRP consolidates reward-weighted perturbations into a single model via low-rank structure, improving base LLMs by 8.1 points on average while using one-tenth the budget of prior ensembles and one forward pass.

Rethinking Literature Search Evaluation: Deep Research Helps, and Human Citation Lists Are Not a Ground Truth

cs.AI · 2026-05-28 · unverdicted · novelty 6.0

Deep bibliography expansion in literature search achieves high recall while human citations are found to have only 51% moderate relevance compared to 86-88% for AI methods.

ECo-MoE: Embodiment-Conditioned Mixture of Experts Increases the Evolvability of Robots

cs.RO · 2026-05-22 · unverdicted · novelty 6.0

ECo-MoE co-optimizes latent robot genotypes and a gated mixture of control experts to improve evolvability in robot body-controller co-design.

KSOS-BO: Improving Sampling in Bayesian Optimization via Kernel Sum of Squares

cs.CE · 2026-05-20 · unverdicted · novelty 6.0

KSOS-BO improves acquisition function optimization in Bayesian optimization by casting it as a kernel sum of squares semidefinite program, outperforming Sobol, DE, and CMA-ES baselines on 10/15 benchmarks with 81% average regret reduction.

Global Convergence of Sampling-Based Nonconvex Optimization through Diffusion-Style Smoothing

cs.LG · 2026-05-15 · unverdicted · novelty 6.0

Recasts sampling-based nonconvex optimization as smoothed gradient descent to obtain non-asymptotic convergence guarantees and introduces the DIDA annealed algorithm that converges to the global optimum.

Solving Classical and Quantum Spin Glasses with Deep Boltzmann Quantum States

cond-mat.dis-nn · 2026-05-15 · unverdicted · novelty 6.0

Deep Boltzmann Quantum States with natural-gradient optimization and annealing-like training match exact or best-known solutions for large infinite-range Ising spin glasses and solve job shop scheduling instances.

Parameter-Efficient Neuroevolution for Diverse LLM Generation: Quality-Diversity Optimization via Prompt Embedding Evolution

cs.NE · 2026-05-10 · unverdicted · novelty 6.0 · 2 refs

QD-LLM applies neuroevolution to prompt embeddings within a quality-diversity framework, producing 46% higher coverage and 41% higher QD-score than QDAIF on HumanEval, MBPP, and creative writing benchmarks.

citing papers explorer

Showing 12 of 12 citing papers after filters.

Runtime Analysis of the $(\mu + 1)$-ES in a Homogenous Progress Model

cs.NE · 2026-06-11 · unverdicted · none · ref 11 · internal anchor

Introduces homogeneous progress model and proves that for Z = N(-δ,1) with μ ≤ e^δ the growth rate R_μ equals (log^{1+o(1)} μ / μ) R_1.

Gradient-Free Training of Spiking Neural Networks via Low-Rank Evolution Strategies

cs.NE · 2026-05-14 · unverdicted · none · ref 1 · internal anchor

EGGROLL applies low-rank evolution strategies to train leaky integrate-and-fire spiking neural networks, reaching 79.21% accuracy on N-MNIST with 2.23 times lower per-generation time than full-rank ES.

EvoPref: Multi-Objective Evolutionary Optimization Discovers Diverse LLM Alignments Beyond Gradient Descent

cs.NE · 2026-05-10 · unverdicted · none · ref 24 · internal anchor

EvoPref applies NSGA-II evolutionary optimization with archive-based diversity to populations of LoRA adapters, yielding 18% higher preference coverage and 47% lower collapse than gradient descent baselines while matching alignment quality.

Parameter-Efficient Neuroevolution for Diverse LLM Generation: Quality-Diversity Optimization via Prompt Embedding Evolution

cs.NE · 2026-05-10 · unverdicted · none · ref 29 · 2 links · internal anchor

QD-LLM applies neuroevolution to prompt embeddings within a quality-diversity framework, producing 46% higher coverage and 41% higher QD-score than QDAIF on HumanEval, MBPP, and creative writing benchmarks.

Benchmarking Stopping Criteria for Evolutionary Multi-objective Optimization

cs.NE · 2026-04-28 · unverdicted · none · ref 19 · internal anchor

Introduces a single-number performance measure, file-based benchmarking, and efficient text-file storage to evaluate and compare stopping criteria for EMO algorithms.

Similarity-based Portfolio Construction for Black-box Optimization

cs.NE · 2026-04-20 · unverdicted · none · ref 10 · internal anchor

A k-nearest-neighbor approach constructs problem-specific algorithm portfolios that outperform both single solvers and the virtual best solver in fixed-budget black-box optimization.

Evolution With Purpose: Hierarchy-Informed Optimization of Whole-Brain Models

cs.NE · 2026-02-11 · unverdicted · none · ref 13 · internal anchor

Hierarchy-informed curricular optimization of heterogeneous whole-brain models enables generalization to new subjects and prediction of behavioral abilities from parameters.

Learning Evolution via Optimization Knowledge Adaptation

cs.NE · 2025-01-04 · unverdicted · none · ref 25 · internal anchor

OKAEM is a unified learnable evolutionary framework that uses attention-based operators for pre-training on prior knowledge and real-time self-tuning adaptation.

Model Merging to Evolution: Parameter Space Exploration for Expert Models

cs.NE · 2026-06-17 · unverdicted · none · ref 10 · internal anchor

MERGEvolve unifies model merging with evolutionary strategies to explore outside convex parameter space and achieves competitive benchmark performance.

Black-Box Optimization of Mixed Binary-Continuous Variables: Challenges and Opportunities in Evolutionary Model Merging

cs.NE · 2026-05-12 · unverdicted · none · ref 5 · internal anchor

Data flow space model merging is formalized as a mixed binary-continuous black-box optimization problem, where a structured approach respecting variable dependencies achieves 6.7% higher accuracy and 51.4% smaller search space than unstructured methods on real language models.

Diffusion Models are Evolutionary Algorithms

cs.NE · 2024-10-03 · unverdicted · none · ref 4 · internal anchor

Diffusion models are evolutionary algorithms via a denoising-evolution equivalence, yielding Diffusion Evolution that outperforms mainstream EAs on multi-optima tasks.

Quantitative Performance Analysis of Stopping Criteria for CMA-ES

cs.NE · 2026-06-08 · unverdicted · none · ref 7 · internal anchor

Empirical benchmarking shows tolfunhist and the full portfolio stop CMA-ES closest to the optimal evaluation count on BBOB, while tolfun and tolfunhist often trigger before full stagnation.
