CoRR , volume =

Leonardo Ranaldi, Fabio Massimo Zanzotto , title = · 2023 · arXiv 2311.08097

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

Language as a Latent Variable for Reasoning Optimization

cs.CL · 2026-04-23 · unverdicted · novelty 5.0

Treating language as a latent variable via polyGRPO RL improves Qwen2.5-7B-Instruct by 6.72% on English reasoning benchmarks and 6.89% on multilingual ones, with cross-task gains on commonsense reasoning from math-only training.

citing papers explorer

Showing 1 of 1 citing paper.

Language as a Latent Variable for Reasoning Optimization cs.CL · 2026-04-23 · unverdicted · none · ref 16
Treating language as a latent variable via polyGRPO RL improves Qwen2.5-7B-Instruct by 6.72% on English reasoning benchmarks and 6.89% on multilingual ones, with cross-task gains on commonsense reasoning from math-only training.

CoRR , volume =

fields

years

verdicts

representative citing papers

citing papers explorer