hub Mixed citations

The CMA Evolution Strategy: A Tutorial

Nikolaus Hansen (TAO) · 2016 · cs.LG · arXiv 1604.00772

Mixed citation behavior. Most common role is background (50%).

47 Pith papers citing it

Background 50% of classified citations

open full Pith review browse 47 citing papers arXiv PDF

abstract

This tutorial introduces the CMA Evolution Strategy (ES), where CMA stands for Covariance Matrix Adaptation. The CMA-ES is a stochastic, or randomized, method for real-parameter (continuous domain) optimization of non-linear, non-convex functions. We try to motivate and derive the algorithm from intuitive concepts and from requirements of non-linear, non-convex search in continuous domain.

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 4 method 2

citation-polarity summary

background 3 use method 2 unclear 1

representative citing papers

Certified Gradient-Based Contact-Rich Manipulation via Smoothing-Error Reachable Tubes

cs.RO · 2026-02-10 · unverdicted · novelty 8.0

A certified gradient-based method for contact-rich manipulation that quantifies smoothing-induced errors via set-valued discrepancies and incorporates them into analytical reachable sets for robust affine feedback policies.

Promptbreeder: Self-Referential Self-Improvement Via Prompt Evolution

cs.CL · 2023-09-28 · unverdicted · novelty 8.0

Promptbreeder evolves both task prompts and the mutation prompts that improve them using LLMs, outperforming Chain-of-Thought and Plan-and-Solve on arithmetic and commonsense reasoning benchmarks.

Latent Heuristic Search: Continuous Optimization for Automated Algorithm Design

cs.AI · 2026-05-16 · unverdicted · novelty 7.0

Latent Heuristic Search performs continuous optimization over learned embeddings of heuristics, using normalizing flows and LLM prompting to discover competitive solvers for TSP, CVRP, KSP, and OBP.

EVA-0: Test-Time Model Evolution with Only Two Forward Passes per Sample

cs.LG · 2026-05-15 · unverdicted · novelty 7.0

EVA-0 is a zeroth-order test-time adaptation method that uses scale-invariant loss, anchor-guided optimization, and symmetric two-sided perturbations to enable inference and adaptation in two forward passes, outperforming prior methods on ImageNet-C with ViT-Base.

Evolutionary Negative Module Pruning for Better LoRA Merging

cs.AI · 2026-04-20 · conditional · novelty 7.0

ENMP prunes negative LoRA modules via evolutionary search to boost merging performance to new state-of-the-art levels across language and vision tasks.

Simulator Adaptation for Sim-to-Real Learning of Legged Locomotion via Proprioceptive Distribution Matching

cs.RO · 2026-04-13 · unverdicted · novelty 7.0

Proprioceptive distribution matching adapts simulators for legged robot policies by comparing observation and action distributions, reducing sim-to-real gaps with minimal real data and no external sensing.

Bootstrapping non-unitary CFTs

hep-th · 2025-12-08 · unverdicted · novelty 7.0

A bootstrap strategy for non-unitary CFTs uses statistical stability of OPE data across cross-ratios to optimize spectra, reproducing A-series minimal models and yielding candidate solutions for c>1.

Exploring Exploration in Bayesian Optimization

cs.LG · 2025-02-12 · unverdicted · novelty 7.0

Introduces observation traveling salesman distance and observation entropy to quantify exploration in Bayesian optimization acquisition functions and links them to empirical performance.

Consolidating Rewarded Perturbations for LLM Post-Training

cs.CL · 2026-05-29 · unverdicted · novelty 6.0

CoRP consolidates reward-weighted perturbations into a single model via low-rank structure, improving base LLMs by 8.1 points on average while using one-tenth the budget of prior ensembles and one forward pass.

KSOS-BO: Improving Sampling in Bayesian Optimization via Kernel Sum of Squares

cs.CE · 2026-05-20 · unverdicted · novelty 6.0

KSOS-BO improves acquisition function optimization in Bayesian optimization by casting it as a kernel sum of squares semidefinite program, outperforming Sobol, DE, and CMA-ES baselines on 10/15 benchmarks with 81% average regret reduction.

Global Convergence of Sampling-Based Nonconvex Optimization through Diffusion-Style Smoothing

cs.LG · 2026-05-15 · unverdicted · novelty 6.0

Recasts sampling-based nonconvex optimization as smoothed gradient descent to obtain non-asymptotic convergence guarantees and introduces the DIDA annealed algorithm that converges to the global optimum.

Solving Classical and Quantum Spin Glasses with Deep Boltzmann Quantum States

cond-mat.dis-nn · 2026-05-15 · unverdicted · novelty 6.0

Deep Boltzmann Quantum States with natural-gradient optimization and annealing-like training match exact or best-known solutions for large infinite-range Ising spin glasses and solve job shop scheduling instances.

Reward-Guided Semantic Evolution for Test-time Adaptive Object Detection

cs.CV · 2026-05-06 · unverdicted · novelty 6.0

RGSE adapts text embeddings at test time via evolutionary search, using cosine similarity rewards from high-confidence visual proposals to improve open-vocabulary object detection under distribution shifts.

Global Sampling-Based Trajectory Optimization for Contact-Rich Manipulation via KernelSOS

cs.RO · 2026-04-29 · unverdicted · novelty 6.0

Global-MPPI integrates kernel SOS global search with MPPI local refinement and graduated non-convexity smoothing to achieve faster convergence and lower costs on high-dimensional contact-rich manipulation tasks.

Benchmarking Stopping Criteria for Evolutionary Multi-objective Optimization

cs.NE · 2026-04-28 · unverdicted · novelty 6.0

Introduces a single-number performance measure, file-based benchmarking, and efficient text-file storage to evaluate and compare stopping criteria for EMO algorithms.

A Complex-Valued Continuous-Variable Quantum Approximation Optimization Algorithm (CCV-QAOA)

quant-ph · 2026-04-23 · unverdicted · novelty 6.0 · 2 refs

CCV-QAOA is a new complex-valued continuous-variable variant of QAOA that solves real and complex multivariate optimization problems via a variational framework.

Stability-Driven Motion Generation for Object-Guided Human-Human Co-Manipulation

cs.CV · 2026-04-22 · unverdicted · novelty 6.0

A flow-matching model derives manipulation strategies from object affordance, adds an adversarial interaction prior, and uses stability simulation to generate natural, effective human-human co-manipulation motions.

cs.NE · 2026-04-20 · unverdicted · novelty 6.0

A k-nearest-neighbor approach constructs problem-specific algorithm portfolios that outperform both single solvers and the virtual best solver in fixed-budget black-box optimization.

On the Generalization Bounds of Symbolic Regression with Genetic Programming

cs.LG · 2026-04-19 · unverdicted · novelty 6.0

Derives a generalization bound for GP-based symbolic regression that decomposes the gap into structure-selection complexity and constant-fitting complexity under tree constraints.

Optimal Majoranas in Mesoscopic Kitaev Chains

cond-mat.mes-hall · 2026-04-15 · unverdicted · novelty 6.0

Microscopic treatment of the hybrid segment in mesoscopic Kitaev chains shows that Andreev bound state parity crossings define optimal sweet spots for localized Majoranas with large gaps.

Trajectory-based actuator identification via differentiable simulation

cs.RO · 2026-04-11 · unverdicted · novelty 6.0

Differentiable simulation enables torque-sensor-free actuator model identification from trajectory data, achieving 1.88x better position tracking than a stand-trained baseline and 46% longer travel in downstream locomotion policies.

GeoPAS: Geometric Probing for Algorithm Selection in Continuous Black-Box Optimization

cs.LG · 2026-04-10 · unverdicted · novelty 6.0 · 2 refs

GeoPAS uses multi-scale 2D geometric slices of optimization landscapes with validity-mask pooling and a learned-plus-prior composite score to select from 12 solvers, cutting mean relative expected running time from 30.37 to around 3.1-3.6 on within-suite benchmarks.

Sumo: Dynamic and Generalizable Whole-Body Loco-Manipulation

cs.RO · 2026-04-09 · unverdicted · novelty 6.0

Test-time steering of pre-trained whole-body policies via sample-based planning lets legged robots generalize dynamic loco-manipulation to varied heavy objects and tasks without additional training or tuning.

PhDLspec: physical-prior embedded deep learning method for spectroscopic determination of stellar labels in high-dimensional parameter space

astro-ph.GA · 2026-04-03 · unverdicted · novelty 6.0

PhDLspec combines differential spectra from physical stellar models with a transformer to derive approximately 30 stellar parameters from low-resolution spectra hundreds of times faster than traditional calculations.

citing papers explorer

Showing 47 of 47 citing papers.

Certified Gradient-Based Contact-Rich Manipulation via Smoothing-Error Reachable Tubes cs.RO · 2026-02-10 · unverdicted · none · ref 16 · internal anchor
A certified gradient-based method for contact-rich manipulation that quantifies smoothing-induced errors via set-valued discrepancies and incorporates them into analytical reachable sets for robust affine feedback policies.
Promptbreeder: Self-Referential Self-Improvement Via Prompt Evolution cs.CL · 2023-09-28 · unverdicted · none · ref 138 · internal anchor
Promptbreeder evolves both task prompts and the mutation prompts that improve them using LLMs, outperforming Chain-of-Thought and Plan-and-Solve on arithmetic and commonsense reasoning benchmarks.
Latent Heuristic Search: Continuous Optimization for Automated Algorithm Design cs.AI · 2026-05-16 · unverdicted · none · ref 6 · internal anchor
Latent Heuristic Search performs continuous optimization over learned embeddings of heuristics, using normalizing flows and LLM prompting to discover competitive solvers for TSP, CVRP, KSP, and OBP.
EVA-0: Test-Time Model Evolution with Only Two Forward Passes per Sample cs.LG · 2026-05-15 · unverdicted · none · ref 22 · internal anchor
EVA-0 is a zeroth-order test-time adaptation method that uses scale-invariant loss, anchor-guided optimization, and symmetric two-sided perturbations to enable inference and adaptation in two forward passes, outperforming prior methods on ImageNet-C with ViT-Base.
Evolutionary Negative Module Pruning for Better LoRA Merging cs.AI · 2026-04-20 · conditional · none · ref 13 · internal anchor
ENMP prunes negative LoRA modules via evolutionary search to boost merging performance to new state-of-the-art levels across language and vision tasks.
Simulator Adaptation for Sim-to-Real Learning of Legged Locomotion via Proprioceptive Distribution Matching cs.RO · 2026-04-13 · unverdicted · none · ref 34 · internal anchor
Proprioceptive distribution matching adapts simulators for legged robot policies by comparing observation and action distributions, reducing sim-to-real gaps with minimal real data and no external sensing.
Bootstrapping non-unitary CFTs hep-th · 2025-12-08 · unverdicted · none · ref 13 · internal anchor
A bootstrap strategy for non-unitary CFTs uses statistical stability of OPE data across cross-ratios to optimize spectra, reproducing A-series minimal models and yielding candidate solutions for c>1.
Exploring Exploration in Bayesian Optimization cs.LG · 2025-02-12 · unverdicted · none · ref 8 · internal anchor
Introduces observation traveling salesman distance and observation entropy to quantify exploration in Bayesian optimization acquisition functions and links them to empirical performance.
Consolidating Rewarded Perturbations for LLM Post-Training cs.CL · 2026-05-29 · unverdicted · none · ref 14 · internal anchor
CoRP consolidates reward-weighted perturbations into a single model via low-rank structure, improving base LLMs by 8.1 points on average while using one-tenth the budget of prior ensembles and one forward pass.
KSOS-BO: Improving Sampling in Bayesian Optimization via Kernel Sum of Squares cs.CE · 2026-05-20 · unverdicted · none · ref 7 · internal anchor
KSOS-BO improves acquisition function optimization in Bayesian optimization by casting it as a kernel sum of squares semidefinite program, outperforming Sobol, DE, and CMA-ES baselines on 10/15 benchmarks with 81% average regret reduction.
Global Convergence of Sampling-Based Nonconvex Optimization through Diffusion-Style Smoothing cs.LG · 2026-05-15 · unverdicted · none · ref 88 · internal anchor
Recasts sampling-based nonconvex optimization as smoothed gradient descent to obtain non-asymptotic convergence guarantees and introduces the DIDA annealed algorithm that converges to the global optimum.
Solving Classical and Quantum Spin Glasses with Deep Boltzmann Quantum States cond-mat.dis-nn · 2026-05-15 · unverdicted · none · ref 145 · internal anchor
Deep Boltzmann Quantum States with natural-gradient optimization and annealing-like training match exact or best-known solutions for large infinite-range Ising spin glasses and solve job shop scheduling instances.
Reward-Guided Semantic Evolution for Test-time Adaptive Object Detection cs.CV · 2026-05-06 · unverdicted · none · ref 24 · internal anchor
RGSE adapts text embeddings at test time via evolutionary search, using cosine similarity rewards from high-confidence visual proposals to improve open-vocabulary object detection under distribution shifts.
Global Sampling-Based Trajectory Optimization for Contact-Rich Manipulation via KernelSOS cs.RO · 2026-04-29 · unverdicted · none · ref 7 · internal anchor
Global-MPPI integrates kernel SOS global search with MPPI local refinement and graduated non-convexity smoothing to achieve faster convergence and lower costs on high-dimensional contact-rich manipulation tasks.
Benchmarking Stopping Criteria for Evolutionary Multi-objective Optimization cs.NE · 2026-04-28 · unverdicted · none · ref 19 · internal anchor
Introduces a single-number performance measure, file-based benchmarking, and efficient text-file storage to evaluate and compare stopping criteria for EMO algorithms.
A Complex-Valued Continuous-Variable Quantum Approximation Optimization Algorithm (CCV-QAOA) quant-ph · 2026-04-23 · unverdicted · none · ref 22 · 2 links · internal anchor
CCV-QAOA is a new complex-valued continuous-variable variant of QAOA that solves real and complex multivariate optimization problems via a variational framework.
Stability-Driven Motion Generation for Object-Guided Human-Human Co-Manipulation cs.CV · 2026-04-22 · unverdicted · none · ref 7 · internal anchor
A flow-matching model derives manipulation strategies from object affordance, adds an adversarial interaction prior, and uses stability simulation to generate natural, effective human-human co-manipulation motions.
Similarity-based Portfolio Construction for Black-box Optimization cs.NE · 2026-04-20 · unverdicted · none · ref 10 · internal anchor
A k-nearest-neighbor approach constructs problem-specific algorithm portfolios that outperform both single solvers and the virtual best solver in fixed-budget black-box optimization.
On the Generalization Bounds of Symbolic Regression with Genetic Programming cs.LG · 2026-04-19 · unverdicted · none · ref 10 · internal anchor
Derives a generalization bound for GP-based symbolic regression that decomposes the gap into structure-selection complexity and constant-fitting complexity under tree constraints.
Optimal Majoranas in Mesoscopic Kitaev Chains cond-mat.mes-hall · 2026-04-15 · unverdicted · none · ref 63 · internal anchor
Microscopic treatment of the hybrid segment in mesoscopic Kitaev chains shows that Andreev bound state parity crossings define optimal sweet spots for localized Majoranas with large gaps.
Trajectory-based actuator identification via differentiable simulation cs.RO · 2026-04-11 · unverdicted · none · ref 40 · internal anchor
Differentiable simulation enables torque-sensor-free actuator model identification from trajectory data, achieving 1.88x better position tracking than a stand-trained baseline and 46% longer travel in downstream locomotion policies.
GeoPAS: Geometric Probing for Algorithm Selection in Continuous Black-Box Optimization cs.LG · 2026-04-10 · unverdicted · none · ref 14 · 2 links · internal anchor
GeoPAS uses multi-scale 2D geometric slices of optimization landscapes with validity-mask pooling and a learned-plus-prior composite score to select from 12 solvers, cutting mean relative expected running time from 30.37 to around 3.1-3.6 on within-suite benchmarks.
Sumo: Dynamic and Generalizable Whole-Body Loco-Manipulation cs.RO · 2026-04-09 · unverdicted · none · ref 14 · internal anchor
Test-time steering of pre-trained whole-body policies via sample-based planning lets legged robots generalize dynamic loco-manipulation to varied heavy objects and tasks without additional training or tuning.
PhDLspec: physical-prior embedded deep learning method for spectroscopic determination of stellar labels in high-dimensional parameter space astro-ph.GA · 2026-04-03 · unverdicted · none · ref 12 · internal anchor
PhDLspec combines differential spectra from physical stellar models with a transformer to derive approximately 30 stellar parameters from low-resolution spectra hundreds of times faster than traditional calculations.
Depth Augmented and FE Free 3D/2D Liver Registration for Laparoscopic Liver AR cs.CV · 2026-02-19 · unverdicted · none · ref 30 · internal anchor
A depth-augmented rigid pose refinement combined with a patient-specific statistical deformation model from NICP correspondences achieves 14.73 mm mean TRE for 3D-2D liver registration in controlled laparoscopic settings.
Evolution With Purpose: Hierarchy-Informed Optimization of Whole-Brain Models cs.NE · 2026-02-11 · unverdicted · none · ref 13 · internal anchor
Hierarchy-informed curricular optimization of heterogeneous whole-brain models enables generalization to new subjects and prediction of behavioral abilities from parameters.
Sample-Efficient Optimisation over the Outputs of Generative Models stat.ML · 2025-09-28 · unverdicted · none · ref 8 · internal anchor
O3 uses surrogate latent spaces extracted from generative models to perform sample-efficient black-box optimization over their outputs, outperforming direct sampling and original-latent optimization on image and protein tasks.
Learning Evolution via Optimization Knowledge Adaptation cs.NE · 2025-01-04 · unverdicted · none · ref 25 · internal anchor
OKAEM is a unified learnable evolutionary framework that uses attention-based operators for pre-training on prior knowledge and real-time self-tuning adaptation.
Bounding the Effect of HOD Assumptions on Small-Scale Clustering Constraints astro-ph.CO · 2026-06-10 · unverdicted · none · ref 89 · internal anchor
The fraction of AbacusSummit cosmologies excluded at 3σ by small-scale clustering multipoles drops from 81% to 25% when moving from fixed HOD parameters to broad marginalization over the five-parameter HOD model.
Black-Box Optimization of Mixed Binary-Continuous Variables: Challenges and Opportunities in Evolutionary Model Merging cs.NE · 2026-05-12 · unverdicted · none · ref 5 · internal anchor
Data flow space model merging is formalized as a mixed binary-continuous black-box optimization problem, where a structured approach respecting variable dependencies achieves 6.7% higher accuracy and 51.4% smaller search space than unstructured methods on real language models.
Distributed Quantum-Enhanced Optimization: A Topographical Preconditioning Approach for High-Dimensional Search quant-ph · 2026-04-22 · unverdicted · none · ref 49 · internal anchor
D-QEO framework uses quantum topographical preconditioning on separable functions via small parallel subcircuits to generate seeds that accelerate classical global optimization and avoid exponential failure rates.
Rapid LoRA Aggregation for Wireless Channel Adaptation in Open-Set Radio Frequency Fingerprinting eess.SP · 2026-04-14 · unverdicted · none · ref 20 · internal anchor
LoRA pretraining per environment plus weighted aggregation at inference cuts EER by 15% and training time by 83% for open-set RFF authentication under varying channels.
What Drives Success in Physical Planning with Joint-Embedding Predictive World Models? cs.AI · 2025-12-30 · unverdicted · none · ref 35 · internal anchor
An empirical study of JEPA world models identifies architecture, training objective, and planning choices that yield a model outperforming DINO-WM and V-JEPA-2-AC on navigation and manipulation tasks.
Emergence of Internal State-Modulated Swarming in Multi-Agent Patch Foraging System nlin.AO · 2025-10-14 · unverdicted · none · ref 21 · internal anchor
In a simulated multi-agent foraging environment, evolved neural controllers lead to swarming that is modulated by the agents' internal resource levels, with hidden states encoding resource information.
Machine Learning in the 2HDM2S model for Dark Matter hep-ph · 2025-09-01 · unverdicted · none · ref 57 · internal anchor
A 2HDM extended by two real scalar singlets is scanned with evolutionary strategies to locate regions satisfying vacuum, unitarity, oblique-parameter, collider and dark-matter constraints.
Sampling-Based Global Optimal Control and Estimation via Semidefinite Programming cs.RO · 2025-07-23 · unverdicted · none · ref 12 · internal anchor
KernelSOS is shown to be competitive on robot localization and to improve solution quality in high-dimensional trajectory optimization when paired with local solvers.
Diffusion Models are Evolutionary Algorithms cs.NE · 2024-10-03 · unverdicted · none · ref 4 · internal anchor
Diffusion models are evolutionary algorithms via a denoising-evolution equivalence, yielding Diffusion Evolution that outperforms mainstream EAs on multi-optima tasks.
CC-VPSTO: Chance-Constrained Via-Point-Based Stochastic Trajectory Optimisation for Online Robot Motion Planning under Uncertainty cs.RO · 2024-02-02 · unverdicted · none · ref 27 · internal anchor
CC-VPSTO formulates stochastic trajectory optimization as a chance-constrained problem, approximates it with Monte Carlo sampling and padding, and integrates it into MPC for online robot motion planning under uncertainty.
Local Online Motor Babbling: Learning Motor Abundance of A Musculoskeletal Robot Arm cs.RO · 2019-06-21 · unverdicted · none · ref 16 · internal anchor
A method combining goal babbling with CMA-ES-based local online motor babbling is used to learn inverse kinematics and explore motor abundance on a 10-DoF musculoskeletal robot arm.
Artificial Adaptive Intelligence: The Missing Stage Between Narrow and General Intelligence cs.AI · 2026-05-16 · unverdicted · none · ref 7 · internal anchor
Proposes Artificial Adaptive Intelligence as the regime between narrow and general AI, defined by elimination of human-specified hyperparameters, and introduces an adaptivity index plus parametric minimality principle grounded in minimum description length.
Nested Control Co-design of a Spar Buoy Horizontal-axis Floating Offshore Wind Turbine eess.SY · 2023-10-24 · unverdicted · none · ref 25 · internal anchor
Nested CCD optimization on a reduced-order spar-buoy FOWT model yields over 11% AEP improvement versus baseline.
Convolutional Reservoir Computing for World Models cs.LG · 2019-07-18 · unverdicted · none · ref 17 · internal anchor
RCRC uses untrained random CNNs and reservoir computing plus evolution strategies to reach claimed state-of-the-art scores in reinforcement learning tasks while avoiding data storage and heavy training.
Acoustic Model Optimization Based On Evolutionary Stochastic Gradient Descent with Anchors for Automatic Speech Recognition cs.CL · 2019-07-10 · unverdicted · none · ref 14 · internal anchor
ESGD with anchors guarantees no degradation from the anchor model and reports improved loss and ASR performance on BN50 and SWB300 datasets.
Benchmarking Optimization Algorithms for Automated Calibration of Quantum Devices quant-ph · 2025-09-10 · unverdicted · none · ref 38 · internal anchor
Simulations show CMA-ES outperforms Nelder-Mead and other algorithms for quantum device calibration across low- and high-dimensional regimes.
Evolution Attack On Neural Networks cs.CV · 2019-06-21 · unverdicted · none · ref 20 · internal anchor
Covariance matrix adaptive evolution strategy generates effective black-box adversarial examples for neural networks, outperforming other evolution methods tested.
Low Stage High Order Explicit Runge--Kutta Methods via Q- and D-Conditions: General Theory and Efficient Recursive Construction math.NA · 2026-05-16 · unreviewed · ref 23 · internal anchor
Parameter-Efficient Neuroevolution for Diverse LLM Generation: Quality-Diversity Optimization via Prompt Embedding Evolution cs.NE · 2026-05-10 · unreviewed · ref 29 · internal anchor

The CMA Evolution Strategy: A Tutorial

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer