hub Mixed citations

The CMA Evolution Strategy: A Tutorial

Nikolaus Hansen (TAO) · 2016 · cs.LG · arXiv 1604.00772

Mixed citation behavior. Most common role is background (50%).

50 Pith papers citing it

Background 50% of classified citations

open full Pith review browse 50 citing papers arXiv PDF

abstract

This tutorial introduces the CMA Evolution Strategy (ES), where CMA stands for Covariance Matrix Adaptation. The CMA-ES is a stochastic, or randomized, method for real-parameter (continuous domain) optimization of non-linear, non-convex functions. We try to motivate and derive the algorithm from intuitive concepts and from requirements of non-linear, non-convex search in continuous domain.

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 4 method 2

citation-polarity summary

background 3 use method 2 unclear 1

representative citing papers

Certified Gradient-Based Contact-Rich Manipulation via Smoothing-Error Reachable Tubes

cs.RO · 2026-02-10 · unverdicted · novelty 8.0

A certified gradient-based method for contact-rich manipulation that quantifies smoothing-induced errors via set-valued discrepancies and incorporates them into analytical reachable sets for robust affine feedback policies.

Promptbreeder: Self-Referential Self-Improvement Via Prompt Evolution

cs.CL · 2023-09-28 · unverdicted · novelty 8.0

Promptbreeder evolves both task prompts and the mutation prompts that improve them using LLMs, outperforming Chain-of-Thought and Plan-and-Solve on arithmetic and commonsense reasoning benchmarks.

Latent Heuristic Search: Continuous Optimization for Automated Algorithm Design

cs.AI · 2026-05-16 · unverdicted · novelty 7.0

Latent Heuristic Search performs continuous optimization over learned embeddings of heuristics, using normalizing flows and LLM prompting to discover competitive solvers for TSP, CVRP, KSP, and OBP.

EVA-0: Test-Time Model Evolution with Only Two Forward Passes per Sample

cs.LG · 2026-05-15 · unverdicted · novelty 7.0

EVA-0 is a zeroth-order test-time adaptation method that uses scale-invariant loss, anchor-guided optimization, and symmetric two-sided perturbations to enable inference and adaptation in two forward passes, outperforming prior methods on ImageNet-C with ViT-Base.

Evolutionary Negative Module Pruning for Better LoRA Merging

cs.AI · 2026-04-20 · conditional · novelty 7.0

ENMP prunes negative LoRA modules via evolutionary search to boost merging performance to new state-of-the-art levels across language and vision tasks.

Simulator Adaptation for Sim-to-Real Learning of Legged Locomotion via Proprioceptive Distribution Matching

cs.RO · 2026-04-13 · unverdicted · novelty 7.0

Proprioceptive distribution matching adapts simulators for legged robot policies by comparing observation and action distributions, reducing sim-to-real gaps with minimal real data and no external sensing.

Bootstrapping non-unitary CFTs

hep-th · 2025-12-08 · unverdicted · novelty 7.0

A bootstrap strategy for non-unitary CFTs uses statistical stability of OPE data across cross-ratios to optimize spectra, reproducing A-series minimal models and yielding candidate solutions for c>1.

Exploring Exploration in Bayesian Optimization

cs.LG · 2025-02-12 · unverdicted · novelty 7.0

Introduces observation traveling salesman distance and observation entropy to quantify exploration in Bayesian optimization acquisition functions and links them to empirical performance.

Consolidating Rewarded Perturbations for LLM Post-Training

cs.CL · 2026-05-29 · unverdicted · novelty 6.0

CoRP consolidates reward-weighted perturbations into a single model via low-rank structure, improving base LLMs by 8.1 points on average while using one-tenth the budget of prior ensembles and one forward pass.

Rethinking Literature Search Evaluation: Deep Research Helps, and Human Citation Lists Are Not a Ground Truth

cs.AI · 2026-05-28 · unverdicted · novelty 6.0

Deep bibliography expansion in literature search achieves high recall while human citations are found to have only 51% moderate relevance compared to 86-88% for AI methods.

KSOS-BO: Improving Sampling in Bayesian Optimization via Kernel Sum of Squares

cs.CE · 2026-05-20 · unverdicted · novelty 6.0

KSOS-BO improves acquisition function optimization in Bayesian optimization by casting it as a kernel sum of squares semidefinite program, outperforming Sobol, DE, and CMA-ES baselines on 10/15 benchmarks with 81% average regret reduction.

Global Convergence of Sampling-Based Nonconvex Optimization through Diffusion-Style Smoothing

cs.LG · 2026-05-15 · unverdicted · novelty 6.0

Recasts sampling-based nonconvex optimization as smoothed gradient descent to obtain non-asymptotic convergence guarantees and introduces the DIDA annealed algorithm that converges to the global optimum.

Solving Classical and Quantum Spin Glasses with Deep Boltzmann Quantum States

cond-mat.dis-nn · 2026-05-15 · unverdicted · novelty 6.0

Deep Boltzmann Quantum States with natural-gradient optimization and annealing-like training match exact or best-known solutions for large infinite-range Ising spin glasses and solve job shop scheduling instances.

Reward-Guided Semantic Evolution for Test-time Adaptive Object Detection

cs.CV · 2026-05-06 · unverdicted · novelty 6.0

RGSE adapts text embeddings at test time via evolutionary search, using cosine similarity rewards from high-confidence visual proposals to improve open-vocabulary object detection under distribution shifts.

Global Sampling-Based Trajectory Optimization for Contact-Rich Manipulation via KernelSOS

cs.RO · 2026-04-29 · unverdicted · novelty 6.0

Global-MPPI integrates kernel SOS global search with MPPI local refinement and graduated non-convexity smoothing to achieve faster convergence and lower costs on high-dimensional contact-rich manipulation tasks.

Benchmarking Stopping Criteria for Evolutionary Multi-objective Optimization

cs.NE · 2026-04-28 · unverdicted · novelty 6.0

Introduces a single-number performance measure, file-based benchmarking, and efficient text-file storage to evaluate and compare stopping criteria for EMO algorithms.

A Complex-Valued Continuous-Variable Quantum Approximation Optimization Algorithm (CCV-QAOA)

quant-ph · 2026-04-23 · unverdicted · novelty 6.0 · 2 refs

CCV-QAOA is a new complex-valued continuous-variable variant of QAOA that solves real and complex multivariate optimization problems via a variational framework.

Stability-Driven Motion Generation for Object-Guided Human-Human Co-Manipulation

cs.CV · 2026-04-22 · unverdicted · novelty 6.0

A flow-matching model derives manipulation strategies from object affordance, adds an adversarial interaction prior, and uses stability simulation to generate natural, effective human-human co-manipulation motions.

cs.NE · 2026-04-20 · unverdicted · novelty 6.0

A k-nearest-neighbor approach constructs problem-specific algorithm portfolios that outperform both single solvers and the virtual best solver in fixed-budget black-box optimization.

On the Generalization Bounds of Symbolic Regression with Genetic Programming

cs.LG · 2026-04-19 · unverdicted · novelty 6.0

Derives a generalization bound for GP-based symbolic regression that decomposes the gap into structure-selection complexity and constant-fitting complexity under tree constraints.

Optimal Majoranas in Mesoscopic Kitaev Chains

cond-mat.mes-hall · 2026-04-15 · unverdicted · novelty 6.0

Microscopic treatment of the hybrid segment in mesoscopic Kitaev chains shows that Andreev bound state parity crossings define optimal sweet spots for localized Majoranas with large gaps.

Trajectory-based actuator identification via differentiable simulation

cs.RO · 2026-04-11 · unverdicted · novelty 6.0

Differentiable simulation enables torque-sensor-free actuator model identification from trajectory data, achieving 1.88x better position tracking than a stand-trained baseline and 46% longer travel in downstream locomotion policies.

GeoPAS: Geometric Probing for Algorithm Selection in Continuous Black-Box Optimization

cs.LG · 2026-04-10 · unverdicted · novelty 6.0 · 2 refs

GeoPAS uses multi-scale 2D geometric slices of optimization landscapes with validity-mask pooling and a learned-plus-prior composite score to select from 12 solvers, cutting mean relative expected running time from 30.37 to around 3.1-3.6 on within-suite benchmarks.

Sumo: Dynamic and Generalizable Whole-Body Loco-Manipulation

cs.RO · 2026-04-09 · unverdicted · novelty 6.0

Test-time steering of pre-trained whole-body policies via sample-based planning lets legged robots generalize dynamic loco-manipulation to varied heavy objects and tasks without additional training or tuning.

citing papers explorer

Showing 2 of 2 citing papers after filters.

A Complex-Valued Continuous-Variable Quantum Approximation Optimization Algorithm (CCV-QAOA) quant-ph · 2026-04-23 · unverdicted · none · ref 22 · 2 links · internal anchor
CCV-QAOA is a new complex-valued continuous-variable variant of QAOA that solves real and complex multivariate optimization problems via a variational framework.
PhDLspec: physical-prior embedded deep learning method for spectroscopic determination of stellar labels in high-dimensional parameter space astro-ph.GA · 2026-04-03 · unverdicted · none · ref 12 · internal anchor
PhDLspec combines differential spectra from physical stellar models with a transformer to derive approximately 30 stellar parameters from low-resolution spectra hundreds of times faster than traditional calculations.

The CMA Evolution Strategy: A Tutorial

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer