arXiv preprint arXiv:2506.07998 , year=

Generative Modeling of Weights: Generalization or Memorization? , author= · arXiv 2506.07998

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories

cs.LG · 2026-05-20 · unverdicted · novelty 6.0

RELEX extrapolates LLM checkpoints from short RLVR prefixes by projecting deltas onto a rank-1 subspace and fitting a linear trend, matching full training performance at 15% of the steps.

What Linear Probes Miss: Multi-View Probing for Weight-Space Learning

cs.LG · 2026-05-22 · unverdicted · novelty 5.0

MVProbe is a multi-perspective probing framework for weight-space learning that combines first-order and Gram-based views and outperforms ProbeX on the Model Jungle benchmark.

citing papers explorer

Showing 2 of 2 citing papers.

You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories cs.LG · 2026-05-20 · unverdicted · none · ref 12
RELEX extrapolates LLM checkpoints from short RLVR prefixes by projecting deltas onto a rank-1 subspace and fitting a linear trend, matching full training performance at 15% of the steps.
What Linear Probes Miss: Multi-View Probing for Weight-Space Learning cs.LG · 2026-05-22 · unverdicted · none · ref 16
MVProbe is a multi-perspective probing framework for weight-space learning that combines first-order and Gram-based views and outperforms ProbeX on the Model Jungle benchmark.

arXiv preprint arXiv:2506.07998 , year=

fields

years

verdicts

representative citing papers

citing papers explorer