Predictable scale: Part ii, farseer: A refined scaling law in large language models.arXiv preprint arXiv:2506.10972, 2025

Houyi Li, Wenzhen Zheng, Qiufeng Wang, Zhenyu Ding, Haoying Wang, Zili Wang, Shijie Xuyang, Ning Ding, Shuigeng Zhou, Xiangyu Zhang, et al · 2025 · arXiv 2506.10972

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

read on arXiv browse 2 citing papers

representative citing papers

Scaling Laws for Neural-Network Quantum States

cond-mat.dis-nn · 2026-06-01 · unverdicted · novelty 6.0

Transformer wave functions for the J1-J2 Heisenberg model exhibit size-independent power-law decay of V-score with compute, with the exponent decreasing as frustration increases.

Neural Scaling Universality: If Exponents Are Fixed, Time to Understand Coefficients

cs.LG · 2026-06-23 · unverdicted · novelty 4.0

Position paper claims fixed exponents in scaling laws arise from generic mechanisms while coefficients vary with data and architecture, making the latter the focus for improvements.

citing papers explorer

Showing 2 of 2 citing papers.

Scaling Laws for Neural-Network Quantum States cond-mat.dis-nn · 2026-06-01 · unverdicted · none · ref 49
Transformer wave functions for the J1-J2 Heisenberg model exhibit size-independent power-law decay of V-score with compute, with the exponent decreasing as frustration increases.
Neural Scaling Universality: If Exponents Are Fixed, Time to Understand Coefficients cs.LG · 2026-06-23 · unverdicted · none · ref 31
Position paper claims fixed exponents in scaling laws arise from generic mechanisms while coefficients vary with data and architecture, making the latter the focus for improvements.

Predictable scale: Part ii, farseer: A refined scaling law in large language models.arXiv preprint arXiv:2506.10972, 2025

fields

years

verdicts

representative citing papers

citing papers explorer