FreshBrew: A benchmark for evaluating AI agents on Java code migration

2025 · arXiv 2510.04852

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

RepoRescue: An Empirical Study of LLM Agents on Whole-Repository Compatibility Rescue

cs.SE · 2026-07-01 · unverdicted · novelty 6.0

RepoRescue creates a benchmark of 315 repositories and shows that LLM agents rescue up to 41.5% with runtime enforcement and 62.7% when systems are combined; the hardest cases require cross-file changes.

Towards More Empathic Programming Environments: An Experimental Empathic AI-Enhanced IDE

cs.SE · 2026-04-21 · unverdicted · novelty 4.0

A pilot study of an empathic AI IDE found no significant gains in learning or workload over standard AI tools, only greater perceived help with error correction.

citing papers explorer

Showing 2 of 2 citing papers.

RepoRescue: An Empirical Study of LLM Agents on Whole-Repository Compatibility Rescue

cs.SE · 2026-07-01 · unverdicted · none · ref 38

RepoRescue creates a benchmark of 315 repositories and shows that LLM agents rescue up to 41.5% with runtime enforcement and 62.7% when systems are combined; the hardest cases require cross-file changes.

Towards More Empathic Programming Environments: An Experimental Empathic AI-Enhanced IDE

cs.SE · 2026-04-21 · unverdicted · none · ref 30

A pilot study of an empathic AI IDE found no significant gains in learning or workload over standard AI tools, only greater perceived help with error correction.
