Journal of Educational and Behavioral Statistics 25, 101–132 (2000)

[Online] · 2000 · DOI 10.3102/10769986025002101

4 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Do AI Models Dream of Faster Code? An Empirical Study on LLM-Proposed Performance Improvements in Real-World Software

cs.SE · 2025-10-17 · unverdicted · novelty 7.0

LLMs propose volatile performance improvements on real-world Java tasks that lag human developers on average, showing algorithmic benchmarks overestimate capabilities.

ConcoLixir: Reactive LLM Discovery Oracles for Python Concolic Testing

cs.SE · 2026-06-25 · unverdicted · novelty 6.0

ConcoLixir uses a reactive LLM oracle to improve line coverage in Python concolic testing by 8.6 to 17 percentage points on synthetic, real-world, and library targets.

Agentic Persona Generation with Critique-Refinement: An Industrial Evaluation

cs.SE · 2026-06-08 · unverdicted · novelty 6.0

PerGent, an agentic critique-refinement system for persona generation, reaches 96.9% expert approval in an industrial evaluation at Kinaxis and reproduces more pre-LLM expert content than single-shot baselines.

Genetic Programming for Self-Adaptive Auto-Scaling of Microservices

cs.SE · 2026-05-02 · unverdicted · novelty 4.0

AutoSLO applies genetic programming inside a monitoring loop to evolve scaling policies that cut resource use in microservices while keeping SLO violations low and short-lived.

citing papers explorer

Do AI Models Dream of Faster Code? An Empirical Study on LLM-Proposed Performance Improvements in Real-World Software · cs.SE · 2025-10-17 · unverdicted · none · ref 41
ConcoLixir: Reactive LLM Discovery Oracles for Python Concolic Testing · cs.SE · 2026-06-25 · unverdicted · none · ref 30
Agentic Persona Generation with Critique-Refinement: An Industrial Evaluation · cs.SE · 2026-06-08 · unverdicted · none · ref 30
Genetic Programming for Self-Adaptive Auto-Scaling of Microservices · cs.SE · 2026-05-02 · unverdicted · none · ref 37
