One possible way is to partition the positive integers into three subsets: • A_1 contains numbers of the form 3k + 1

Further Partitioning: We need to further partition the subsets to ensure that for each n ≥ 15, there exist two distinct elements in the same subset that sum to n
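The condition as worded above can be checked by brute force. The following is a minimal sketch, not the method of the cited work. The excerpt only defines A_1 (numbers of the form 3k + 1); the assignment of 3k + 2 to A_2 and of multiples of 3 to A_3 is an assumption made here for illustration, and the names `subset_index` and `same_subset_pair` are hypothetical. It tests only whether some subset contains two distinct elements summing to n, not any stronger requirement on every subset.

```python
def subset_index(m: int) -> int:
    """Return 1, 2 or 3: the subset assumed to hold the positive integer m."""
    r = m % 3
    # 3k+1 -> A_1 (from the text); 3k+2 -> A_2 and 3k -> A_3 are assumptions.
    return 3 if r == 0 else r


def same_subset_pair(n: int):
    """Return (a, b) with a < b, a + b == n, both in one subset, else None."""
    # a runs over values strictly below n/2, so a and b are always distinct.
    for a in range(1, (n + 1) // 2):
        b = n - a
        if subset_index(a) == subset_index(b):
            return (a, b)
    return None


if __name__ == "__main__":
    for n in range(15, 21):
        print(n, same_subset_pair(n))
```

Under these assumptions a pair is found for every n from 15 upward in the tested range, since two elements of the same residue class sum to residue 2, 1, or 0 modulo 3 respectively.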

1 Pith paper cites this work. Polarity classification is still being indexed.


representative citing papers

AGPO: Asymmetric Group Policy Optimization for Verifiable Reasoning and Search Ads Relevance at JD

cs.AI · 2026-05-07 · unverdicted · novelty 5.0

AGPO applies asymmetric negative-dominant reinforcement and variance-scaled group advantages in RLVR to preserve base model exploration while boosting accuracy and pass@k on math benchmarks and industrial ad relevance data.

