pith. machine review for the scientific record.

arXiv preprint arXiv:2203.09168

3 Pith papers cite this work. Polarity classification is still indexing.

years: 2026 (3)

verdicts: unverdicted (3)

representative citing papers

Variance-aware Reward Modeling with Anchor Guidance

stat.ML · 2026-05-12 · unverdicted · novelty 7.0

Anchor-guided variance-aware reward modeling uses two response-level anchors to resolve non-identifiability in Gaussian models of pluralistic preferences, yielding provable identification, a joint training objective, and improved RLHF performance.

citing papers explorer

  • Variance-aware Reward Modeling with Anchor Guidance stat.ML · 2026-05-12 · unverdicted · none · ref 16

    Anchor-guided variance-aware reward modeling uses two response-level anchors to resolve non-identifiability in Gaussian models of pluralistic preferences, yielding provable identification, a joint training objective, and improved RLHF performance.

  • Probabilistic denoising for reliable signal extraction in spectroscopy cond-mat.str-el · 2026-05-08 · unverdicted · none · ref 14

    A probabilistic denoising model recovers spectral features from Poisson-noisy 3D ARPES data at 0.02 electrons per voxel and propagates uncertainties into superconducting gap fits for cuprate superconductors.

  • Monte Carlo PDE Solvers for Nonlinear Radiative Boundary Conditions cs.GR · 2026-04-22 · unverdicted · none · ref 53

    A relaxed Picard iteration combined with heteroscedastic boundary denoising lets Monte Carlo PDE solvers handle heat equations with nonlinear radiative boundary conditions more accurately than linearization.