DAPRO provides the first dynamic, theoretically guaranteed way to allocate interaction budgets across test cases for bounding time-to-event in multi-turn LLM evaluations, achieving tighter coverage than static conformal survival methods.
Inductive confidence machines for regression
2 Pith papers cite this work. Polarity classification is still indexing.
years
2026 2verdicts
UNVERDICTED 2representative citing papers
A new kernel nonconformity score for multivariate conformal prediction that adapts to residual geometry, provides finite-sample coverage, and achieves convergence rates based on effective kernel rank rather than ambient dimension.
citing papers explorer
-
How Many Iterations to Jailbreak? Dynamic Budget Allocation for Multi-Turn LLM Evaluation
DAPRO provides the first dynamic, theoretically guaranteed way to allocate interaction budgets across test cases for bounding time-to-event in multi-turn LLM evaluations, achieving tighter coverage than static conformal survival methods.
-
A Kernel Nonconformity Score for Multivariate Conformal Prediction
A new kernel nonconformity score for multivariate conformal prediction that adapts to residual geometry, provides finite-sample coverage, and achieves convergence rates based on effective kernel rank rather than ambient dimension.