FreshBrew: A benchmark for evaluating AI agents on Java code migration
2 Pith papers cite this work. Polarity classification is still indexing.
Fields: cs.SE (2)
Years: 2026 (2)
Verdicts: UNVERDICTED (2)
Citing papers
- RepoRescue: An Empirical Study of LLM Agents on Whole-Repository Compatibility Rescue
  RepoRescue creates a benchmark of 315 repositories and shows LLM agents rescue up to 41.5% with runtime enforcement and 62.7% when combining systems, with hardest cases requiring cross-file changes.
- Towards More Empathic Programming Environments: An Experimental Empathic AI-Enhanced IDE
  Pilot study of an empathic AI IDE found no significant gains in learning or workload over standard AI tools, with only greater perceived help in error correction.