G-core: A simple, scalable and balanced rlhf trainer

Junyu Wu, Weiming Chang, Xiaotao Liu, Guanyou He, Haoqiang Hong, Boqi Liu, Hongtao Tian, Tao Yang, Yunsheng Shi, Feng Lin, et al · 2025 · arXiv 2507.22789

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

read on arXiv browse 1 citing papers

representative citing papers

Next-Generation Agentic Reinforcement Learning Systems Enable Self-Evolving Agents

cs.DC · 2026-07-01 · unverdicted · novelty 3.0

Enterprise self-evolving agents require new agentic RL systems built around standardized trajectory data protocols, workload-to-learning data proxies, and automatic policy evolution control planes.

citing papers explorer

Showing 1 of 1 citing paper.

Next-Generation Agentic Reinforcement Learning Systems Enable Self-Evolving Agents cs.DC · 2026-07-01 · unverdicted · none · ref 21
Enterprise self-evolving agents require new agentic RL systems built around standardized trajectory data protocols, workload-to-learning data proxies, and automatic policy evolution control planes.

G-core: A simple, scalable and balanced rlhf trainer

fields

years

verdicts

representative citing papers

citing papers explorer