hub Mixed citations

Illuminating search spaces by mapping elites

Mouret, J · 2015 · cs.AI · arXiv 1504.04909

Mixed citation behavior. Most common role is background (64%).

67 Pith papers citing it

Background 64% of classified citations

open full Pith review browse 67 citing papers arXiv PDF

abstract

Many fields use search algorithms, which automatically explore a search space to find high-performing solutions: chemists search through the space of molecules to discover new drugs; engineers search for stronger, cheaper, safer designs, scientists search for models that best explain data, etc. The goal of search algorithms has traditionally been to return the single highest-performing solution in a search space. Here we describe a new, fundamentally different type of algorithm that is more useful because it provides a holistic view of how high-performing solutions are distributed throughout a search space. It creates a map of high-performing solutions at each point in a space defined by dimensions of variation that a user gets to choose. This Multi-dimensional Archive of Phenotypic Elites (MAP-Elites) algorithm illuminates search spaces, allowing researchers to understand how interesting attributes of solutions combine to affect performance, either positively or, equally of interest, negatively. For example, a drug company may wish to understand how performance changes as the size of molecules and their cost-to-produce vary. MAP-Elites produces a large diversity of high-performing, yet qualitatively different solutions, which can be more helpful than a single, high-performing solution. Interestingly, because MAP-Elites explores more of the search space, it also tends to find a better overall solution than state-of-the-art search algorithms. We demonstrate the benefits of this new algorithm in three different problem domains ranging from producing modular neural networks to designing simulated and real soft robots. Because MAP- Elites (1) illuminates the relationship between performance and dimensions of interest in solutions, (2) returns a set of high-performing, yet diverse solutions, and (3) improves finding a single, best solution, it will advance science and engineering.

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 7 method 4

citation-polarity summary

background 7 use method 4

representative citing papers

LLM-Evolved Domain-Independent Heuristics for Symbolic AI Planning

cs.AI · 2026-05-28 · unverdicted · novelty 8.0

LLM-guided evolutionary search yields the first domain-independent C++ planning heuristics that exceed the strongest hand-engineered baselines on coverage and speed trade-offs across unseen domains.

Promptbreeder: Self-Referential Self-Improvement Via Prompt Evolution

cs.CL · 2023-09-28 · unverdicted · novelty 8.0

Promptbreeder evolves both task prompts and the mutation prompts that improve them using LLMs, outperforming Chain-of-Thought and Plan-and-Solve on arithmetic and commonsense reasoning benchmarks.

The Defense Trilemma: Why Prompt Injection Defense Wrappers Fail?

cs.CR · 2026-04-07 · unverdicted · novelty 8.0

No continuous utility-preserving input wrapper can eliminate all prompt injection risks in connected prompt spaces for language models.

Explaining Attention with Program Synthesis

cs.LG · 2026-06-17 · unverdicted · novelty 7.0

Language-model-guided program synthesis can approximate transformer attention heads with over 75% IoU fidelity on held-out data and allow replacing 25% of heads with only 16% average perplexity increase.

FML-bench: A Controlled Study of AI Research Agent Strategies from the Perspective of Search Dynamics

cs.LG · 2026-05-17 · unverdicted · novelty 7.0 · 2 refs

FML-Bench shows a simple greedy hill-climber nearly matches tree search on dense-opportunity tasks while an adaptive agent that broadens search on stagnation outperforms six baselines across 18 tasks.

Diversified Residual Symbolic Regression

cs.NE · 2026-05-15 · unverdicted · novelty 7.0

DRSR uses Quality-Diversity to produce diverse symbolic regression expressions differing in residual distributions, enabling post-search selection on synthetic and astronomical data.

FrontierSmith: Synthesizing Open-Ended Coding Problems at Scale

cs.LG · 2026-05-14 · conditional · novelty 7.0

FrontierSmith automates synthesis of open-ended coding problems from closed-ended seeds and shows measurable gains on two open-ended LLM coding benchmarks.

Beyond Cooperative Simulators: Generating Realistic User Personas for Robust Evaluation of LLM Agents

cs.AI · 2026-05-13 · unverdicted · novelty 7.0

PPol uses LLM-driven evolutionary program search to create diverse human-like user personas for simulators, yielding 33-62% fitness gains and +17% agent task success on retail and airline domains.

EvoPref: Multi-Objective Evolutionary Optimization Discovers Diverse LLM Alignments Beyond Gradient Descent

cs.NE · 2026-05-10 · unverdicted · novelty 7.0

EvoPref applies NSGA-II evolutionary optimization with archive-based diversity to populations of LoRA adapters, yielding 18% higher preference coverage and 47% lower collapse than gradient descent baselines while matching alignment quality.

EvolveSignal: A Large Language Model Powered Coding Agent for Discovering Traffic Signal Control Strategies

cs.LG · 2025-09-03 · unverdicted · novelty 7.0

EvolveSignal applies LLM-driven evolutionary program synthesis to discover heuristic variations of traffic signal control logic that reduce delay and stops compared to Webster's method in simulation.

Automated Design of Agentic Systems

cs.AI · 2024-08-15 · conditional · novelty 7.0

Meta Agent Search uses a meta-agent to iteratively program novel agentic systems in code, producing agents that outperform state-of-the-art hand-designed ones across coding, science, and math while transferring across domains and models.

EvoPrompt: Connecting LLMs with Evolutionary Algorithms Yields Powerful Prompt Optimizers

cs.CL · 2023-09-15 · unverdicted · novelty 7.0

EvoPrompt uses LLMs to run evolutionary operators on populations of prompts, outperforming human-engineered prompts by up to 25% on BIG-Bench Hard tasks across 31 datasets.

Prediction of neural network performance by phenotypic modeling

cs.NE · 2019-07-16 · unverdicted · novelty 7.0

Phenotypic distance from output differences on fixed inputs enables surrogate models that predict performance of variable-topology neural networks as well as or better than weight-based models on fixed topologies in a robotic navigation task.

Internal vs. External: Comparing Deliberation and Evolution for Multi-Agent Constitutional Design

cs.MA · 2026-05-09 · unverdicted · novelty 7.0

External evolution beats internal deliberation in collective-action tasks with statistical significance but neither helps in trading, and deliberation never discovers punishment while evolution does.

Autopoiesis: A Self-Evolving System Paradigm for LLM Serving Under Runtime Dynamics

cs.DC · 2026-04-08 · unverdicted · novelty 7.0

Autopoiesis uses LLM-driven program synthesis to evolve serving policies online during deployment, delivering up to 53% and average 34% gains over prior LLM serving systems under runtime dynamics.

LensAgent: A Self Evolving Agent for Autonomous Physical Inference of Sub-galactic Structure

astro-ph.GA · 2026-04-04 · unverdicted · novelty 7.0

LensAgent is a training-free LLM agent framework that reconstructs mass distributions in SLACS strong lensing systems to extract sub-galactic substructures.

AlphaEvolve: A coding agent for scientific and algorithmic discovery

cs.AI · 2025-06-16 · unverdicted · novelty 7.0

AlphaEvolve is an LLM-orchestrated evolutionary coding agent that discovered a 4x4 complex matrix multiplication algorithm using 48 scalar multiplications, the first improvement over Strassen's algorithm in 56 years, plus optimizations for Google data centers and hardware.

Heuresis: Search Strategies for Autonomous AI Research Agents Across Quality, Diversity and Novelty

cs.AI · 2026-06-23 · unverdicted · novelty 6.0

Heuresis evaluates six search strategies for autonomous ML research agents and finds that novel ideas are rare, none rated original, and only one reaches top-10 quality while strategies steer axes but do not expand the quality-novelty frontier.

Structure-Conditioned Actor-Critic Branches for Quality-Diversity Reinforcement Learning

cs.AI · 2026-06-07 · unverdicted · novelty 6.0

SV-QD-RL couples actor structure with branch-specific value learning via structure-conditioned actor-critic branches to generate diverse high-quality policy repertoires in QD-RL.

U-Net-Accelerated Quality-Diversity Optimization for Climate-Adaptive Urban Layouts

cs.NE · 2026-06-03 · unverdicted · novelty 6.0

U-Net surrogate enables offline MAP-Elites to achieve R²=0.996 on climate physics and ρ=0.994 fitness ranking using only random samples, while GP surrogates fail without active QD data.

Cross-Generational Transfer of Adversarial Attacks Reveals Non-Monotonic Safety Alignment in LLMs

cs.CR · 2026-05-30 · unverdicted · novelty 6.0

Non-monotonic safety alignment appears in Gemma models, with Gemma 3 at 68.7% ASR versus 45.5% in Gemma 2 and 33.9% in Gemma 4 via MAP-Elites red-teaming and cross-generational attack transfer.

Quality-Diversity Evolution for Discovering Diverse Vulnerabilities in LLM Safety

cs.CR · 2026-05-30 · conditional · novelty 6.0

Applies MAP-Elites quality-diversity optimization to evolve semantic attack strategies across dimensions like strategy type, encoding, and length, uncovering distinct vulnerability profiles in four LLMs including GPT-4o-mini and Claude 3.5 Sonnet.

Procedural Generation of First Person Shooter Maps using Map-Elites

cs.AI · 2026-05-28 · unverdicted · novelty 6.0

New Point-Line and Spatial-Layout map representations enable MAP-Elites to produce FPS maps with higher diversity and quality than prior All-Black and Grid-Graph methods.

DEI: Diversity in Evolutionary Inference for Quality-Diversity Search

cs.LG · 2026-05-26 · unverdicted · novelty 6.0

DEI shows a heterogeneous four-LLM ensemble achieving 124% higher QD-Score and 28% higher coverage than single-model baselines on Core War at equal compute budget.

citing papers explorer

Showing 50 of 67 citing papers.

LLM-Evolved Domain-Independent Heuristics for Symbolic AI Planning cs.AI · 2026-05-28 · unverdicted · none · ref 30 · internal anchor
LLM-guided evolutionary search yields the first domain-independent C++ planning heuristics that exceed the strongest hand-engineered baselines on coverage and speed trade-offs across unseen domains.
Promptbreeder: Self-Referential Self-Improvement Via Prompt Evolution cs.CL · 2023-09-28 · unverdicted · none · ref 239 · internal anchor
Promptbreeder evolves both task prompts and the mutation prompts that improve them using LLMs, outperforming Chain-of-Thought and Plan-and-Solve on arithmetic and commonsense reasoning benchmarks.
The Defense Trilemma: Why Prompt Injection Defense Wrappers Fail? cs.CR · 2026-04-07 · unverdicted · full · ref 15
No continuous utility-preserving input wrapper can eliminate all prompt injection risks in connected prompt spaces for language models.
Explaining Attention with Program Synthesis cs.LG · 2026-06-17 · unverdicted · none · ref 22 · internal anchor
Language-model-guided program synthesis can approximate transformer attention heads with over 75% IoU fidelity on held-out data and allow replacing 25% of heads with only 16% average perplexity increase.
FML-bench: A Controlled Study of AI Research Agent Strategies from the Perspective of Search Dynamics cs.LG · 2026-05-17 · unverdicted · none · ref 35 · 2 links · internal anchor
FML-Bench shows a simple greedy hill-climber nearly matches tree search on dense-opportunity tasks while an adaptive agent that broadens search on stagnation outperforms six baselines across 18 tasks.
Diversified Residual Symbolic Regression cs.NE · 2026-05-15 · unverdicted · none · ref 22 · internal anchor
DRSR uses Quality-Diversity to produce diverse symbolic regression expressions differing in residual distributions, enabling post-search selection on synthetic and astronomical data.
FrontierSmith: Synthesizing Open-Ended Coding Problems at Scale cs.LG · 2026-05-14 · conditional · none · ref 25 · internal anchor
FrontierSmith automates synthesis of open-ended coding problems from closed-ended seeds and shows measurable gains on two open-ended LLM coding benchmarks.
Beyond Cooperative Simulators: Generating Realistic User Personas for Robust Evaluation of LLM Agents cs.AI · 2026-05-13 · unverdicted · none · ref 13 · internal anchor
PPol uses LLM-driven evolutionary program search to create diverse human-like user personas for simulators, yielding 33-62% fitness gains and +17% agent task success on retail and airline domains.
EvoPref: Multi-Objective Evolutionary Optimization Discovers Diverse LLM Alignments Beyond Gradient Descent cs.NE · 2026-05-10 · unverdicted · none · ref 41 · internal anchor
EvoPref applies NSGA-II evolutionary optimization with archive-based diversity to populations of LoRA adapters, yielding 18% higher preference coverage and 47% lower collapse than gradient descent baselines while matching alignment quality.
EvolveSignal: A Large Language Model Powered Coding Agent for Discovering Traffic Signal Control Strategies cs.LG · 2025-09-03 · unverdicted · none · ref 25 · internal anchor
EvolveSignal applies LLM-driven evolutionary program synthesis to discover heuristic variations of traffic signal control logic that reduce delay and stops compared to Webster's method in simulation.
Automated Design of Agentic Systems cs.AI · 2024-08-15 · conditional · none · ref 185 · internal anchor
Meta Agent Search uses a meta-agent to iteratively program novel agentic systems in code, producing agents that outperform state-of-the-art hand-designed ones across coding, science, and math while transferring across domains and models.
EvoPrompt: Connecting LLMs with Evolutionary Algorithms Yields Powerful Prompt Optimizers cs.CL · 2023-09-15 · unverdicted · none · ref 180 · internal anchor
EvoPrompt uses LLMs to run evolutionary operators on populations of prompts, outperforming human-engineered prompts by up to 25% on BIG-Bench Hard tasks across 31 datasets.
Prediction of neural network performance by phenotypic modeling cs.NE · 2019-07-16 · unverdicted · none · ref 19 · internal anchor
Phenotypic distance from output differences on fixed inputs enables surrogate models that predict performance of variable-topology neural networks as well as or better than weight-based models on fixed topologies in a robotic navigation task.
Internal vs. External: Comparing Deliberation and Evolution for Multi-Agent Constitutional Design cs.MA · 2026-05-09 · unverdicted · none · ref 26
External evolution beats internal deliberation in collective-action tasks with statistical significance but neither helps in trading, and deliberation never discovers punishment while evolution does.
Autopoiesis: A Self-Evolving System Paradigm for LLM Serving Under Runtime Dynamics cs.DC · 2026-04-08 · unverdicted · none · ref 33
Autopoiesis uses LLM-driven program synthesis to evolve serving policies online during deployment, delivering up to 53% and average 34% gains over prior LLM serving systems under runtime dynamics.
LensAgent: A Self Evolving Agent for Autonomous Physical Inference of Sub-galactic Structure astro-ph.GA · 2026-04-04 · unverdicted · none · ref 40
LensAgent is a training-free LLM agent framework that reconstructs mass distributions in SLACS strong lensing systems to extract sub-galactic substructures.
AlphaEvolve: A coding agent for scientific and algorithmic discovery cs.AI · 2025-06-16 · unverdicted · none · ref 75
AlphaEvolve is an LLM-orchestrated evolutionary coding agent that discovered a 4x4 complex matrix multiplication algorithm using 48 scalar multiplications, the first improvement over Strassen's algorithm in 56 years, plus optimizations for Google data centers and hardware.
Heuresis: Search Strategies for Autonomous AI Research Agents Across Quality, Diversity and Novelty cs.AI · 2026-06-23 · unverdicted · none · ref 42 · internal anchor
Heuresis evaluates six search strategies for autonomous ML research agents and finds that novel ideas are rare, none rated original, and only one reaches top-10 quality while strategies steer axes but do not expand the quality-novelty frontier.
Structure-Conditioned Actor-Critic Branches for Quality-Diversity Reinforcement Learning cs.AI · 2026-06-07 · unverdicted · none · ref 3 · internal anchor
SV-QD-RL couples actor structure with branch-specific value learning via structure-conditioned actor-critic branches to generate diverse high-quality policy repertoires in QD-RL.
U-Net-Accelerated Quality-Diversity Optimization for Climate-Adaptive Urban Layouts cs.NE · 2026-06-03 · unverdicted · none · ref 2 · internal anchor
U-Net surrogate enables offline MAP-Elites to achieve R²=0.996 on climate physics and ρ=0.994 fitness ranking using only random samples, while GP surrogates fail without active QD data.
Cross-Generational Transfer of Adversarial Attacks Reveals Non-Monotonic Safety Alignment in LLMs cs.CR · 2026-05-30 · unverdicted · none · ref 6 · internal anchor
Non-monotonic safety alignment appears in Gemma models, with Gemma 3 at 68.7% ASR versus 45.5% in Gemma 2 and 33.9% in Gemma 4 via MAP-Elites red-teaming and cross-generational attack transfer.
Quality-Diversity Evolution for Discovering Diverse Vulnerabilities in LLM Safety cs.CR · 2026-05-30 · conditional · none · ref 6 · internal anchor
Applies MAP-Elites quality-diversity optimization to evolve semantic attack strategies across dimensions like strategy type, encoding, and length, uncovering distinct vulnerability profiles in four LLMs including GPT-4o-mini and Claude 3.5 Sonnet.
Procedural Generation of First Person Shooter Maps using Map-Elites cs.AI · 2026-05-28 · unverdicted · none · ref 6 · internal anchor
New Point-Line and Spatial-Layout map representations enable MAP-Elites to produce FPS maps with higher diversity and quality than prior All-Black and Grid-Graph methods.
DEI: Diversity in Evolutionary Inference for Quality-Diversity Search cs.LG · 2026-05-26 · unverdicted · none · ref 19 · internal anchor
DEI shows a heterogeneous four-LLM ensemble achieving 124% higher QD-Score and 28% higher coverage than single-model baselines on Core War at equal compute budget.
Constitutional Arms Races in the Public Goods Game: Co-Evolving LLM Constitutions Under Cooperation-Defection Pressure cs.MA · 2026-05-26 · unverdicted · none · ref 22 · internal anchor
Adversarial co-evolution of LLM constitutions in public goods games reaches near-parity equilibrium only when fitness is coupled across factions and evaluation uses at least five seeds per generation.
optimize_anything: A Universal API for Optimizing any Text Parameter cs.CL · 2026-05-19 · unverdicted · none · ref 19 · internal anchor
A universal LLM optimizer for text artifacts achieves SOTA results on six tasks including tripling ARC-AGI accuracy and cutting cloud costs by 40% via cross-task transfer and side information.
PopuLoRA: Co-Evolving LLM Populations for Reasoning Self-Play cs.AI · 2026-05-16 · unverdicted · none · ref 8 · internal anchor
PopuLoRA shows that co-evolving populations of LoRA adapters through cross-evaluated self-play can outperform compute-matched single-agent baselines on multiple code and math reasoning benchmarks.
ToolMol: Evolutionary Agentic Framework for Multi-objective Drug Discovery cs.LG · 2026-05-12 · unverdicted · none · ref 29 · 2 links · internal anchor
ToolMol integrates evolutionary algorithms with agentic LLMs and precise RDKit tools to optimize multi-objective drug properties, yielding ligands with over 10% better predicted binding affinity and 35% gains in absolute binding free energy on three protein targets.
Parameter-Efficient Neuroevolution for Diverse LLM Generation: Quality-Diversity Optimization via Prompt Embedding Evolution cs.NE · 2026-05-10 · unverdicted · none · ref 49 · 2 links · internal anchor
QD-LLM applies neuroevolution to prompt embeddings within a quality-diversity framework, producing 46% higher coverage and 41% higher QD-score than QDAIF on HumanEval, MBPP, and creative writing benchmarks.
Kernel-Smith: A Unified Recipe for Evolutionary Kernel Optimization cs.CL · 2026-03-30 · unverdicted · none · ref 17 · internal anchor
Kernel-Smith combines evolutionary search with RL post-training to generate optimized GPU kernels, achieving SOTA speedups on KernelBench that beat Gemini-3.0-pro and Claude-4.6-opus on NVIDIA Triton and generalize to MetaX MACA.
Red-Teaming Vision-Language-Action Models via Quality Diversity Prompt Generation for Robust Robot Policies cs.RO · 2026-03-12 · unverdicted · none · ref 14 · internal anchor
Q-DIG applies quality diversity optimization with vision-language models to generate diverse adversarial instructions that reveal VLA robot failures and enable robustness improvements via fine-tuning.
Space Syntax-guided Post-training for Residential Floor Plan Generation cs.LG · 2026-02-26 · unverdicted · none · ref 31 · internal anchor
SSPT turns space-syntax integration metrics into post-training feedback signals that improve public-space dominance and functional hierarchy in AI-generated residential floor plans.
Diversifying Toxicity Search in Large Language Models Through Speciation cs.NE · 2026-01-28 · unverdicted · none · ref 20 · internal anchor
ToxSearch-S applies unsupervised speciation to evolutionary prompt search, maintaining capacity-limited species with exemplar leaders and species-aware selection to achieve higher peak toxicity and broader semantic coverage than standard methods.
Tournament Informed Adversarial Quality Diversity cs.NE · 2026-01-27 · unverdicted · none · ref 27 · internal anchor
Tournament-informed task selection in adversarial QD produces higher quality and diversity in coevolved solutions across Pong, cat-and-mouse, and pursuers-evaders games.
Motif Diversity in Human Liver ChIP-seq Data Using MAP-Elites cs.NE · 2026-01-25 · unverdicted · none · ref 9 · internal anchor
MAP-Elites recovers multiple high-quality motif variants from CTCF ChIP-seq data with fitness comparable to MEME while revealing structured diversity.
JSON-Bag: A generic game trajectory representation cs.LG · 2025-08-01 · conditional · none · ref 6 · internal anchor
JSON-Bag tokenizes JSON game trajectories, applies Jensen-Shannon distance and prototype nearest-neighbor search to classify agents/parameters/seeds across six tabletop games, outperforming hand-crafted features and correlating with policy distances.
Adversarial Coevolutionary Illumination with Generational Adversarial MAP-Elites cs.NE · 2025-05-10 · unverdicted · none · ref 37 · internal anchor
GAME is a new adversarial coevolutionary QD algorithm using generational alternation and vision embeddings that outperforms one-sided baselines across battle, wrestling, and deck-building tasks while revealing arms-race dynamics and the role of neutral mutations.
Automatic Calibration of Artificial Neural Networks for Zebrafish Collective Behaviours using a Quality Diversity Algorithm cs.NE · 2019-07-22 · unverdicted · none · ref 26 · internal anchor
CVT-MAP-Elites quality diversity search calibrates ANN-based agent models of zebrafish collective motion to outperform standard evolutionary methods on both macroscopic group metrics and microscopic individual realism.
Diverse Agents for Ad-Hoc Cooperation in Hanabi cs.AI · 2019-07-08 · unverdicted · none · ref 9 · internal anchor
Quality Diversity algorithms are proposed to generate diverse agent populations for ad-hoc cooperation evaluation in Hanabi, with discussion of metrics and adaptive agent building.
HMACE: Heterogeneous Multi-Agent Collaborative Evolution for Combinatorial Optimization cs.AI · 2026-05-08 · unverdicted · none · ref 33
HMACE deploys Proposer, Generator, Evaluator, and Reflector agents in an evolutionary loop to generate and refine heuristics for NP-hard problems, reporting lower optimality gaps and token costs than baselines on TSP and Online BPP.
Stable-GFlowNet: Toward Diverse and Robust LLM Red-Teaming via Contrastive Trajectory Balance cs.LG · 2026-05-01 · unverdicted · none · ref 38
Stable-GFlowNet stabilizes GFN training for LLM red-teaming by eliminating Z estimation via pairwise comparisons and robust masking against noisy rewards while adding a fluency stabilizer.
Agentic Architect: An Agentic AI Framework for Architecture Design Exploration and Optimization cs.AI · 2026-04-28 · accept · none · ref 32
An LLM-driven agentic system evolves microarchitectural policies for cache replacement, data prefetching, and branch prediction, producing designs that match or exceed prior state-of-the-art in IPC on standard benchmarks.
QDTraj: Exploration of Diverse Trajectory Primitives for Articulated Objects Robotic Manipulation cs.RO · 2026-04-24 · unverdicted · none · ref 26
QDTraj uses Quality-Diversity algorithms with sparse rewards to produce at least five times more diverse high-performing trajectories for articulated object manipulation than compared methods, validated across 30 objects with hundreds of trajectories per task.
LLM-Guided Prompt Evolution for Password Guessing cs.CR · 2026-04-14 · unverdicted · none · ref 15
LLM-guided evolutionary prompt optimization using MAP-Elites and island models raises password cracking rates from 2.02% to 8.48% on a RockYou-derived test set across local, cloud, and ensemble LLM setups.
TurboEvolve: Towards Fast and Robust LLM-Driven Program Evolution cs.NE · 2026-04-12 · unverdicted · none · ref 15
TurboEvolve improves LLM program evolution by running parallel islands with LLM-generated diverse candidates that carry self-assigned weights, an adaptive scheduler, and clustered seed injection to reach stronger solutions at lower evaluation budgets.
Uncovering Linguistic Fragility in Vision-Language-Action Models via Diversity-Aware Red Teaming cs.RO · 2026-04-07 · unverdicted · none · ref 25
DAERT generates diverse adversarial instructions via a uniform policy in RL to drop VLA task success rates from 93.33% to 5.85% on benchmarks with models like π0 and OpenVLA.
Self-Evolving Agents with Anytime-Valid Certificates cs.AI · 2026-07-01 · unverdicted · none · ref 30 · internal anchor
SEA architecture gates self-modifications via anytime-valid certificates on a frozen base model plus five verifier mechanisms, yielding +4 to +5 gains on a SWE-bench subset for two strong bases.
TacEvo: Self-Evolving Architecture Discovery for Robotic Tactile Perception via LLM-Driven Quality-Diversity Search cs.RO · 2026-06-29 · unverdicted · none · ref 26 · internal anchor
TacEvo is an LLM-driven self-evolving search method that discovers neural architectures for robotic tactile force regression and grating classification, reporting fitness gains of 56.1% and 96.1% over 20 generations.
The Red Queen G\"odel Machine: Co-Evolving Agents and Their Evaluators cs.LG · 2026-06-24 · unverdicted · none · ref 47 · internal anchor
RQGM enables co-evolution of agents and evaluators across epochs with non-stationary utilities, reporting gains in coding pass rates, paper acceptance, and proof grading over prior self-improving agents.
Quality-Diversity Search in Sound Generation: Investigating Innovation Engines for Audio Exploration cs.SD · 2026-06-08 · unverdicted · none · ref 6 · internal anchor
MAP-Elites with CPPNs, DSP graphs, and a deep classifier produces diverse synthetic sounds across durations and musical/non-musical contexts.

Illuminating search spaces by mapping elites

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer