In Findings of the Association for Computational Linguistics: NAACL 2025 , pages 1496–1524

BOSE: A Systematic Evaluation Method Optimized for Base Models , author= · 2025 · arXiv 2503.00812

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Revealing the Learning Dynamics of Long-Context Continual Pre-training

cs.CL · 2026-04-03 · unverdicted · novelty 6.0

Industrial-scale LLMs require over 150B tokens for long-context continual pre-training to reach intrinsic saturation, with perplexity and retrieval-head attention providing stronger signals than needle-in-a-haystack tests.

Ling and Ring 2.6 Technical Report: Efficient and Instant Agentic Intelligence at Trillion-Parameter Scale

cs.CL · 2026-06-13 · unverdicted · novelty 4.0

Technical report announcing Ling-2.6 and Ring-2.6 models with hybrid linear attention, evolutionary CoT, and KPop RL for efficient agentic intelligence at scale.

citing papers explorer

Showing 2 of 2 citing papers.

Revealing the Learning Dynamics of Long-Context Continual Pre-training cs.CL · 2026-04-03 · unverdicted · none · ref 2
Industrial-scale LLMs require over 150B tokens for long-context continual pre-training to reach intrinsic saturation, with perplexity and retrieval-head attention providing stronger signals than needle-in-a-haystack tests.
Ling and Ring 2.6 Technical Report: Efficient and Instant Agentic Intelligence at Trillion-Parameter Scale cs.CL · 2026-06-13 · unverdicted · none · ref 52
Technical report announcing Ling-2.6 and Ring-2.6 models with hybrid linear attention, evolutionary CoT, and KPop RL for efficient agentic intelligence at scale.

In Findings of the Association for Computational Linguistics: NAACL 2025 , pages 1496–1524

fields

years

verdicts

representative citing papers

citing papers explorer