Control Consistency Losses for Diffusion Bridges

Samuel Howard , Nikolas N\"usken , Jakiw Pidstrigach

Authors on Pith no claims yet

classification 📊 stat.ML cs.LG

keywords diffusioncontroldynamicsbridgesconditionedoptimalself-consistencyterminal

read the original abstract

Simulating the conditioned dynamics of diffusion processes, given their initial and terminal states, is an important but challenging problem in the sciences. The difficulty is particularly pronounced for rare events, for which the unconditioned dynamics rarely reach the terminal state. In this work, we propose a novel approach for learning diffusion bridges based on a self-consistency property of the optimal control. The resulting algorithm learns the conditioned dynamics in an iterative online manner, and exhibits strong performance in a range of empirical settings without requiring differentiation through simulated trajectories. Beyond the diffusion bridge setting, we draw connections between our self-consistency framework and recent advances in the wider stochastic optimal control literature.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Reinforce Adjoint Matching: Scaling RL Post-Training of Diffusion and Flow-Matching Models
cs.LG 2026-05 unverdicted novelty 7.0

Reinforce Adjoint Matching derives a simple consistency loss for RL post-training of diffusion models by tilting the clean distribution toward higher-reward samples under KL regularization while keeping the noising pr...