pith. sign in

hub Mixed citations

Decoupled weight decay regularization

Mixed citation behavior. Most common role is background (60%).

10 Pith papers citing it
5 external citations · external index
Background 60% of classified citations

hub tools

citation-role summary

background 3 method 1 other 1

citation-polarity summary

years

2026 10

verdicts

UNVERDICTED 10

clear filters

representative citing papers

Inverse Design for Conditional Distribution Matching

cs.LG · 2026-05-10 · unverdicted · novelty 7.0

Defines Conditional Distribution Matching (CDM) as finding inputs whose induced conditional distributions match a target distribution and proposes the MLGD-F inference-time algorithm using pretrained diffusion models to solve it without retraining.

Stateful Agent Backdoor

cs.CR · 2026-05-07 · unverdicted · novelty 7.0

A stateful backdoor for LLM agents, modeled as a Mealy machine with a decomposition framework, enables incremental malicious actions across sessions and achieves 80-95% attack success rate on four models.

citing papers explorer

Showing 3 of 3 citing papers after filters.

  • Inverse Design for Conditional Distribution Matching cs.LG · 2026-05-10 · unverdicted · none · ref 24

    Defines Conditional Distribution Matching (CDM) as finding inputs whose induced conditional distributions match a target distribution and proposes the MLGD-F inference-time algorithm using pretrained diffusion models to solve it without retraining.

  • LaTER: Efficient Test-Time Reasoning via Latent Exploration and Explicit Verification cs.CL · 2026-05-08 · unverdicted · none · ref 26

    LaTER reduces LLM token usage 16-33% on reasoning benchmarks by exploring in latent space then switching to explicit CoT verification, with gains like 70% to 73.3% on AIME 2025 in the training-free version.

  • Signal Reshaping for GRPO in Weak-Feedback Agentic Code Repair cs.AI · 2026-05-08 · unverdicted · none · ref 23

    Reshaping outcome rewards, process signals, and rollout comparability in GRPO raises strict compile-and-semantic accuracy in agentic code repair from 0.385 to 0.535 under weak feedback.