Towards Enterprise-Ready Computer Using Generalist Agent

Towards Enterprise-Ready Computer Using Generalist Agent · 2025 · arXiv 2503.01861

5 Pith papers cite this work. Polarity classification is still indexing.


citation-role summary

baseline 1

citation-polarity summary

baseline 1

representative citing papers

X-SYNTH: Beyond Retrieval -- Enterprise Context Synthesis from Observed Digital Human Attention

cs.AI · 2026-05-15 · unverdicted · novelty 7.0 · 2 refs

X-SYNTH synthesizes enterprise context from digital human attention using Digital Twin Signatures and seven attention filters, raising true lead rate from 9.5% to 61.9% while cutting false lead rate to 18.8%.

Learning and Reusing Policy Decompositions for Hierarchical Generalized Planning with LLM Agents

cs.AI · 2026-05-07 · unverdicted · novelty 6.0

HCL-GP learns parameterized policies and reuses extracted components to achieve 98% accuracy on AppWorld benchmark tasks for LLM agents, outperforming static synthesis by 15.8 points on challenges.

Does The Way You Plan Matter? An Empirical Study of Planning Representations for LLM Web Agents

cs.CL · 2026-05-28 · unverdicted · novelty 5.0

Empirical evaluation of four natural language plan representations in a static planner-executor framework shows that plan formulation and the underlying LLM both affect LLM web-agent robustness and task success on hard WebArena tasks.

Governance by Construction for Generalist Agents

cs.AI · 2026-05-20 · unverdicted · novelty 5.0

CUGA introduces a runtime governance architecture that enforces policies at five checkpoints in generalist agent execution pipelines for predictable and compliant behavior.

Agent Mentor: Framing Agent Knowledge through Semantic Trajectory Analysis

cs.AI · 2026-04-12 · unverdicted · novelty 5.0

Agent Mentor analyzes semantic trajectories in agent logs to identify undesired behaviors and derives corrective prompt instructions, yielding measurable accuracy gains on benchmark tasks across three agent setups.

citing papers explorer


X-SYNTH: Beyond Retrieval -- Enterprise Context Synthesis from Observed Digital Human Attention
cs.AI · 2026-05-15 · unverdicted · none · ref 48 · 2 links

Learning and Reusing Policy Decompositions for Hierarchical Generalized Planning with LLM Agents
cs.AI · 2026-05-07 · unverdicted · none · ref 13

Does The Way You Plan Matter? An Empirical Study of Planning Representations for LLM Web Agents
cs.CL · 2026-05-28 · unverdicted · none · ref 2

Governance by Construction for Generalist Agents
cs.AI · 2026-05-20 · unverdicted · none · ref 13

Agent Mentor: Framing Agent Knowledge through Semantic Trajectory Analysis
cs.AI · 2026-04-12 · unverdicted · none · ref 9
