G-core: A simple, scalable and balanced rlhf trainer

Junyu Wu, Weiming Chang, Xiaotao Liu, Guanyou He, Haoqiang Hong, Boqi Liu, Hongtao Tian, Tao Yang, Yunsheng Shi, Feng Lin, et al · 2025 · arXiv 2507.22789

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

read on arXiv browse 1 citing papers

representative citing papers

Next-Generation Agentic Reinforcement Learning Systems Enable Self-Evolving Agents

cs.DC · 2026-07-01 · unverdicted · novelty 4.0 · 2 refs

Current agentic RL systems lack three key components needed for self-evolving agents at scale, requiring new co-designed architectures such as AReaL2.0 to enable policy updates from deployed workloads.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Next-Generation Agentic Reinforcement Learning Systems Enable Self-Evolving Agents cs.DC · 2026-07-01 · unverdicted · none · ref 21 · 2 links
Current agentic RL systems lack three key components needed for self-evolving agents at scale, requiring new co-designed architectures such as AReaL2.0 to enable policy updates from deployed workloads.

G-core: A simple, scalable and balanced rlhf trainer

fields

years

verdicts

representative citing papers

citing papers explorer