• Learning rate:1×10 −5 • Batch size: 64 • Epochs: 5 • Optimizer: AdamW (β 1 = 0.9,β 2 = 0.999) • Hybrid loss weight:λ reg = 0.5 B.3

Pass through a two-layer MLP (hidden dim 512, ReLU activation) to produce a scalar score Training Details · 2024

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

BoostAPR: Boosting Automated Program Repair via Execution-Grounded Reinforcement Learning with Dual Reward Models

cs.AI · 2026-05-09 · unverdicted · novelty 6.0

BoostAPR boosts automated program repair by training a sequence-level assessor and line-level credit allocator from execution outcomes, then applying them in PPO to reach 40.7% on SWE-bench Verified.

citing papers explorer

Showing 1 of 1 citing paper.

BoostAPR: Boosting Automated Program Repair via Execution-Grounded Reinforcement Learning with Dual Reward Models cs.AI · 2026-05-09 · unverdicted · none · ref 7
BoostAPR boosts automated program repair by training a sequence-level assessor and line-level credit allocator from execution outcomes, then applying them in PPO to reach 40.7% on SWE-bench Verified.

• Learning rate:1×10 −5 • Batch size: 64 • Epochs: 5 • Optimizer: AdamW (β 1 = 0.9,β 2 = 0.999) • Hybrid loss weight:λ reg = 0.5 B.3

fields

years

verdicts

representative citing papers

citing papers explorer