CoAct: Co-Active LLM Preference Learning with Human-AI Synergy

· 2026 · cs.CL · arXiv 2604.17501

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

Learning from preference-based feedback has become an effective approach for aligning LLMs across diverse tasks. However, high-quality human-annotated preference data remains expensive and scarce. Existing methods address this challenge through either self-rewarding, which scales by using purely AI-generated labels but risks unreliability, or active learning, which ensures quality through oracle annotation but cannot fully leverage unlabeled data. In this paper, we present CoAct, a novel framework that synergistically combines self-rewarding and active learning through strategic human-AI collaboration. CoAct leverages self-consistency to identify both reliable self-labeled data and samples that require oracle verification. Additionally, oracle feedback guides the model to generate new instructions within its solvable capability. Evaluated on three reasoning benchmarks across two model families, CoAct achieves average improvements of +13.25% on GSM8K, +8.19% on MATH, and +13.16% on WebInstruct, consistently outperforming all baselines.

representative citing papers

DOG-DPO:Dynamic Optimization in Geometry for Safety Alignment

cs.LG · 2026-06-04 · unverdicted · novelty 5.0

DOG-DPO selects 11% of preference pairs via geometric subspace decomposition to recover most safety gains of full-data DPO training across six benchmarks.

citing papers explorer

Showing 1 of 1 citing paper after filters.

DOG-DPO:Dynamic Optimization in Geometry for Safety Alignment cs.LG · 2026-06-04 · unverdicted · none · ref 9 · internal anchor
DOG-DPO selects 11% of preference pairs via geometric subspace decomposition to recover most safety gains of full-data DPO training across six benchmarks.

CoAct: Co-Active LLM Preference Learning with Human-AI Synergy

fields

years

verdicts

representative citing papers

citing papers explorer