hub Mixed citations

The CMA Evolution Strategy: A Tutorial

Nikolaus Hansen (TAO) · 2016 · cs.LG · arXiv 1604.00772

Mixed citation behavior. Most common role is background (50%).

65 Pith papers citing it

Background 50% of classified citations

open full Pith review browse 65 citing papers arXiv PDF

abstract

This tutorial introduces the CMA Evolution Strategy (ES), where CMA stands for Covariance Matrix Adaptation. The CMA-ES is a stochastic, or randomized, method for real-parameter (continuous domain) optimization of non-linear, non-convex functions. We try to motivate and derive the algorithm from intuitive concepts and from requirements of non-linear, non-convex search in continuous domain.

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 4 method 2

citation-polarity summary

background 3 use method 2 unclear 1

representative citing papers

Certified Gradient-Based Contact-Rich Manipulation via Smoothing-Error Reachable Tubes

cs.RO · 2026-02-10 · unverdicted · novelty 8.0

A certified gradient-based method for contact-rich manipulation that quantifies smoothing-induced errors via set-valued discrepancies and incorporates them into analytical reachable sets for robust affine feedback policies.

Promptbreeder: Self-Referential Self-Improvement Via Prompt Evolution

cs.CL · 2023-09-28 · unverdicted · novelty 8.0

Promptbreeder evolves both task prompts and the mutation prompts that improve them using LLMs, outperforming Chain-of-Thought and Plan-and-Solve on arithmetic and commonsense reasoning benchmarks.

Runtime Analysis of the $(\mu + 1)$-ES in a Homogenous Progress Model

cs.NE · 2026-06-11 · unverdicted · novelty 7.0

Introduces homogeneous progress model and proves that for Z = N(-δ,1) with μ ≤ e^δ the growth rate R_μ equals (log^{1+o(1)} μ / μ) R_1.

Latent Heuristic Search: Continuous Optimization for Automated Algorithm Design

cs.AI · 2026-05-16 · unverdicted · novelty 7.0

Latent Heuristic Search performs continuous optimization over learned embeddings of heuristics, using normalizing flows and LLM prompting to discover competitive solvers for TSP, CVRP, KSP, and OBP.

Low Stage High Order Explicit Runge--Kutta Methods via Q- and D-Conditions: General Theory and Efficient Recursive Construction

math.NA · 2026-05-16 · unverdicted · novelty 7.0 · 2 refs

A Q/D-space framework supplies sufficient order conditions for explicit Runge-Kutta methods and supports a recursive construction of even-order methods with stage count (p²-2p+8)/4.

EVA-0: Test-Time Model Evolution with Only Two Forward Passes per Sample

cs.LG · 2026-05-15 · unverdicted · novelty 7.0

EVA-0 is a zeroth-order test-time adaptation method that uses scale-invariant loss, anchor-guided optimization, and symmetric two-sided perturbations to enable inference and adaptation in two forward passes, outperforming prior methods on ImageNet-C with ViT-Base.

Gradient-Free Training of Spiking Neural Networks via Low-Rank Evolution Strategies

cs.NE · 2026-05-14 · unverdicted · novelty 7.0

EGGROLL applies low-rank evolution strategies to train leaky integrate-and-fire spiking neural networks, reaching 79.21% accuracy on N-MNIST with 2.23 times lower per-generation time than full-rank ES.

EvoPref: Multi-Objective Evolutionary Optimization Discovers Diverse LLM Alignments Beyond Gradient Descent

cs.NE · 2026-05-10 · unverdicted · novelty 7.0

EvoPref applies NSGA-II evolutionary optimization with archive-based diversity to populations of LoRA adapters, yielding 18% higher preference coverage and 47% lower collapse than gradient descent baselines while matching alignment quality.

Evolutionary Negative Module Pruning for Better LoRA Merging

cs.AI · 2026-04-20 · conditional · novelty 7.0

ENMP prunes negative LoRA modules via evolutionary search to boost merging performance to new state-of-the-art levels across language and vision tasks.

Simulator Adaptation for Sim-to-Real Learning of Legged Locomotion via Proprioceptive Distribution Matching

cs.RO · 2026-04-13 · unverdicted · novelty 7.0

Proprioceptive distribution matching adapts simulators for legged robot policies by comparing observation and action distributions, reducing sim-to-real gaps with minimal real data and no external sensing.

Bootstrapping non-unitary CFTs

hep-th · 2025-12-08 · unverdicted · novelty 7.0

A bootstrap strategy for non-unitary CFTs uses statistical stability of OPE data across cross-ratios to optimize spectra, reproducing A-series minimal models and yielding candidate solutions for c>1.

Exploring Exploration in Bayesian Optimization

cs.LG · 2025-02-12 · unverdicted · novelty 7.0

Introduces observation traveling salesman distance and observation entropy to quantify exploration in Bayesian optimization acquisition functions and links them to empirical performance.

Bridging Spherical Black-Box Optimizers

cs.LG · 2026-06-24 · unverdicted · novelty 6.0

Unifies ES, CBO, and OVI black-box optimizers via two design axes and proposes hybrid methods that outperform base algorithms on benchmarks.

Physics-Informed Eikonal Caging for Whole-Arm Manipulation Planning

cs.RO · 2026-06-20 · unverdicted · novelty 6.0

Reformulates caging as an eikonal minimum-time escape field, approximated via physics-informed neural network and embedded into whole-arm manipulation planning for improved robustness to contact-model mismatch.

Evolutionary Discovery of Developmental Reward Schedules in Deep Reinforcement Learning

cs.LG · 2026-06-18 · unverdicted · novelty 6.0

Evolutionary optimization discovers developmental reward schedules that improve performance over extrinsic-only baselines on some MiniGrid tasks, with novelty emerging as the dominant early signal.

Tuning Agent-Based Predator-Prey Models Toward Lotka-Volterra Dynamics

cs.MA · 2026-06-11 · unverdicted · novelty 6.0

Optimizing environmental and demographic parameters in a JAX-based agent-based model with RNN controllers produces population dynamics resembling Lotka-Volterra cycles using a feature-based loss.

Learning What to Remember: A Cognitively Grounded Multi-Factor Value Model for Agentic Memory

cs.AI · 2026-06-11 · unverdicted · novelty 6.0

A learned linear multi-factor value model over seven cognitive psychology factors retains 0.770 gold evidence on LongMemEval blind regime versus 0.368 for recency and 0.518 for best single factor.

Active Perception for Radio Map Reconstruction in Uncharted 3D Air-Ground Environments

eess.SP · 2026-06-11 · unverdicted · novelty 6.0

3D-URAM decouples radio map reconstruction into Bayesian UNet recovery with uncertainty and transformer-based PPO waypoint selection, reporting over 50% error reduction in simulations and real 300x200x100m field tests.

Consolidating Rewarded Perturbations for LLM Post-Training

cs.CL · 2026-05-29 · unverdicted · novelty 6.0

CoRP consolidates reward-weighted perturbations into a single model via low-rank structure, improving base LLMs by 8.1 points on average while using one-tenth the budget of prior ensembles and one forward pass.

Rethinking Literature Search Evaluation: Deep Research Helps, and Human Citation Lists Are Not a Ground Truth

cs.AI · 2026-05-28 · unverdicted · novelty 6.0

Deep bibliography expansion in literature search achieves high recall while human citations are found to have only 51% moderate relevance compared to 86-88% for AI methods.

ECo-MoE: Embodiment-Conditioned Mixture of Experts Increases the Evolvability of Robots

cs.RO · 2026-05-22 · unverdicted · novelty 6.0

ECo-MoE co-optimizes latent robot genotypes and a gated mixture of control experts to improve evolvability in robot body-controller co-design.

KSOS-BO: Improving Sampling in Bayesian Optimization via Kernel Sum of Squares

cs.CE · 2026-05-20 · unverdicted · novelty 6.0

KSOS-BO improves acquisition function optimization in Bayesian optimization by casting it as a kernel sum of squares semidefinite program, outperforming Sobol, DE, and CMA-ES baselines on 10/15 benchmarks with 81% average regret reduction.

Global Convergence of Sampling-Based Nonconvex Optimization through Diffusion-Style Smoothing

cs.LG · 2026-05-15 · unverdicted · novelty 6.0

Recasts sampling-based nonconvex optimization as smoothed gradient descent to obtain non-asymptotic convergence guarantees and introduces the DIDA annealed algorithm that converges to the global optimum.

Solving Classical and Quantum Spin Glasses with Deep Boltzmann Quantum States

cond-mat.dis-nn · 2026-05-15 · unverdicted · novelty 6.0

Deep Boltzmann Quantum States with natural-gradient optimization and annealing-like training match exact or best-known solutions for large infinite-range Ising spin glasses and solve job shop scheduling instances.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Evolutionary Negative Module Pruning for Better LoRA Merging cs.AI · 2026-04-20 · conditional · none · ref 13 · internal anchor
ENMP prunes negative LoRA modules via evolutionary search to boost merging performance to new state-of-the-art levels across language and vision tasks.

The CMA Evolution Strategy: A Tutorial

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer