- Accepting the current proposal would result in Player 1 receiving $0 .01 , which is far below their required $1 .60

** Player 1's Secret Instructions :** - Player 1's instructions are clear : they must get at least $1

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Breaking the Impasse: Dual-Scale Evolutionary Policy Training for Social Language Agents

cs.CL · 2026-05-09 · unverdicted · novelty 5.0

DEPT detects training impasses in social language agents via dual-scale divergence and entropy, then uses asymmetric reshaping to restore exploration gradients and prevent policy homogenization.

citing papers explorer

Showing 1 of 1 citing paper.

Breaking the Impasse: Dual-Scale Evolutionary Policy Training for Social Language Agents cs.CL · 2026-05-09 · unverdicted · none · ref 19
DEPT detects training impasses in social language agents via dual-scale divergence and entropy, then uses asymmetric reshaping to restore exploration gradients and prevent policy homogenization.

- Accepting the current proposal would result in Player 1 receiving $0 .01 , which is far below their required $1 .60

fields

years

verdicts

representative citing papers

citing papers explorer