Therefore the regret bound of Theorem 1 applies to ideal posterior sampling over the unknown graph and reward parameters

Then Assumption 2 holds with cA,θ = nX i=1 αi, s j =τ j + nX i=1 ηij · 2020

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Adaptive Policy Learning Under Unknown Network Interference

stat.ML · 2026-05-11 · unverdicted · novelty 8.0

A Thompson sampling algorithm jointly infers unknown network interference and learns optimal individual-level treatments, with sublinear regret for additive spillover models and an explore-then-commit variant for general neighborhood interference.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Adaptive Policy Learning Under Unknown Network Interference stat.ML · 2026-05-11 · unverdicted · none · ref 5
A Thompson sampling algorithm jointly infers unknown network interference and learns optimal individual-level treatments, with sublinear regret for additive spillover models and an explore-then-commit variant for general neighborhood interference.

Therefore the regret bound of Theorem 1 applies to ideal posterior sampling over the unknown graph and reward parameters

fields

years

verdicts

representative citing papers

citing papers explorer