Automated skill discovery for language agents through exploration and iterative feedback

Automated Skill Discovery for Language Agents through Exploration · 2025 · arXiv 2506.04287

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

representative citing papers

Co-Evolving Skill Generation and Policy Optimization

cs.CL · 2026-06-07 · unverdicted · novelty 7.0

Framework estimates context-dependent marginal utility of candidate skills via reward gaps in matched base vs. skill-augmented rollouts to filter skills and co-train policy as generator.

Generate, Filter, Control, Replay: A Comprehensive Survey of Rollout Strategies for LLM Reinforcement Learning

cs.LG · 2026-04-08 · unverdicted · novelty 7.0

This survey introduces the Generate-Filter-Control-Replay (GFCR) taxonomy to structure rollout pipelines for RL-based post-training of reasoning LLMs.

SKILL-DISCO: Distilling and Compiling Agent Traces into Reusable Procedural Skills

cs.AI · 2026-06-25 · unverdicted · novelty 5.0

SkillDisCo distills reusable PFSM subgraphs from successful agent traces and compiles them into callable procedural skills, improving success rates and reducing turns on ALFWorld and WebArena.

SkillHone: A Harness for Continual Agent Skill Evolution Through Persistent Decision History

cs.LG · 2026-06-07 · unverdicted · novelty 5.0

SkillHone introduces a harness that maintains persistent decision histories to support continual evolution of language-model agent skills, reporting 15.8-point gains on GAIA over a commercial deep-research agent.

citing papers explorer

Showing 1 of 1 citing paper after filters.

SKILL-DISCO: Distilling and Compiling Agent Traces into Reusable Procedural Skills cs.AI · 2026-06-25 · unverdicted · none · ref 25
SkillDisCo distills reusable PFSM subgraphs from successful agent traces and compiles them into callable procedural skills, improving success rates and reducing turns on ALFWorld and WebArena.

Automated skill discovery for language agents through exploration and iterative feedback

fields

years

verdicts

representative citing papers

citing papers explorer