pith. sign in

Understanding llm behaviors via compression: Data generation, knowledge acquisition and scaling laws

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

citation-role summary

other 1

citation-polarity summary

fields

cs.LG 3 cs.CL 2

years

2026 4 2025 1

roles

other 1

polarities

unclear 1

representative citing papers

Critical Percolation as a Synthetic Data Model for Interpretability

cs.LG · 2026-06-18 · unverdicted · novelty 6.0

Critical percolation clusters embedded in high dimensions, combined with taxonomic latent variables, form an analytically tractable synthetic data model whose ground-truth hierarchy can be linearly decoded from network activations.

Truth as a Compression Artifact in Language Model Training

cs.CL · 2026-03-12 · unverdicted · novelty 6.0

Controlled experiments show language models extract correct answers from contradictory data only when errors are structurally incoherent, supporting the hypothesis that gradient descent selects the most compressible answer cluster.

citing papers explorer

Showing 5 of 5 citing papers.