sGPO uses an initial-policy success-rate profiling pass to adaptively set rollout group sizes, filter data, and build a curriculum, cutting total RLVR training compute by 3x while matching baseline performance.
Regression Transformer enables concurrent sequence regres- sion and generation for molecular language modelling
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
verdicts
UNVERDICTED 2representative citing papers
Contextually-enhanced transformers integrating timetable and occupancy data achieve 26.6% and 56.3% average MAE reductions in railway and building energy forecasting respectively, outperforming prior methods.
citing papers explorer
-
sGPO: Trading Inference FLOPs for Training Efficiency in RLVR
sGPO uses an initial-policy success-rate profiling pass to adaptively set rollout group sizes, filter data, and build a curriculum, cutting total RLVR training compute by 3x while matching baseline performance.
-
Integrating the Expected Future in Load Forecasts with Contextually Enhanced Transformer Models
Contextually-enhanced transformers integrating timetable and occupancy data achieve 26.6% and 56.3% average MAE reductions in railway and building energy forecasting respectively, outperforming prior methods.