hints” which can then be provided to psmall.

• In addition to generating hints, use the large models as “gatekeepers

1 Pith paper cites this work. Polarity classification is still being indexed.

fields: cs.AI (1)

years: 2025 (1)

verdicts: UNVERDICTED (1)

representative citing papers

Consensus Sampling for Safer Generative AI

cs.AI · 2025-11-12 · unverdicted · novelty 5.0

Consensus sampling aggregates k distributions to achieve risk competitive with the average of the safest s, abstaining when agreement is low. The paper formalizes this via R-robustness, which bounds leakage and adversarial influence for generative AI safety.

citing papers explorer

  • Consensus Sampling for Safer Generative AI cs.AI · 2025-11-12 · unverdicted · none · ref 2

    Consensus sampling aggregates k distributions to achieve risk competitive with the average of the safest s, abstaining on low agreement, and formalizes this via R-robustness that bounds leakage and adversarial influence for generative AI safety.