Improving and generalizing flow-based generative models with minibatch optimal transport

Alexander Tong, Kilian Fatras, Nikolay Malkin, Guillaume Huguet, Yanlei Zhang, Jarrid Rector-Brooks · 2023 · cs.LG · arXiv 2302.00482

Mixed citation behavior. Most common role is background (50%).

77 Pith papers citing it

Background: 50% of classified citations

abstract

Continuous normalizing flows (CNFs) are an attractive generative modeling technique, but they have been held back by limitations in their simulation-based maximum likelihood training. We introduce the generalized conditional flow matching (CFM) technique, a family of simulation-free training objectives for CNFs. CFM features a stable regression objective like that used to train the stochastic flow in diffusion models but enjoys the efficient inference of deterministic flow models. In contrast to both diffusion models and prior CNF training algorithms, CFM does not require the source distribution to be Gaussian or require evaluation of its density. A variant of our objective is optimal transport CFM (OT-CFM), which creates simpler flows that are more stable to train and lead to faster inference, as evaluated in our experiments. Furthermore, we show that when the true OT plan is available, our OT-CFM method approximates dynamic OT. Training CNFs with CFM improves results on a variety of conditional and unconditional generation tasks, such as inferring single cell dynamics, unsupervised image translation, and Schrödinger bridge inference.
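
The abstract does not spell out the objective, so the following is only a minimal sketch of how minibatch OT pairing and a flow-matching regression target might fit together. It assumes equal-sized batches with uniform weights (so the OT plan reduces to a permutation), a squared Euclidean cost, and a noise-free straight-line interpolant between paired points. The function names `sq_dist`, `minibatch_ot_pairing`, and `cfm_targets` are hypothetical and are not taken from the paper's code.

```python
import itertools


def sq_dist(a, b):
    """Squared Euclidean distance between two points given as lists."""
    return sum((ai - bi) ** 2 for ai, bi in zip(a, b))


def minibatch_ot_pairing(x0, x1):
    """Find the permutation of x1 that minimizes total squared distance to x0.

    Assumption: brute force over all permutations, which is only viable
    for tiny illustrative batches (cost grows as n!).
    """
    n = len(x0)
    best_perm, best_cost = None, float("inf")
    for perm in itertools.permutations(range(n)):
        cost = sum(sq_dist(x0[i], x1[j]) for i, j in enumerate(perm))
        if cost < best_cost:
            best_perm, best_cost = perm, cost
    return list(best_perm), best_cost


def cfm_targets(x0, x1, t):
    """Return interpolated points x_t and regression targets u_t.

    Assumption: x_t = (1 - t) * x0 + t * x1 with no added noise, so the
    target velocity along each pair is the constant x1 - x0.
    """
    xt = [[(1 - t) * a + t * b for a, b in zip(p, q)] for p, q in zip(x0, x1)]
    ut = [[b - a for a, b in zip(p, q)] for p, q in zip(x0, x1)]
    return xt, ut
```

A training step under these assumptions would draw a source batch and a target batch, reorder the target batch with the returned permutation, sample a time `t` in [0, 1], and regress a network's output at `(x_t, t)` onto `u_t` with a squared-error loss. For realistic batch sizes the brute-force search would be replaced by a linear assignment solver such as `scipy.optimize.linear_sum_assignment`.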

citation-role summary

background: 9 · method: 4 · baseline: 2 · dataset: 1

citation-polarity summary

background: 8 · use method: 3 · baseline: 2 · extend: 1 · unclear: 1 · use dataset: 1

representative citing papers

What Time Is It? How Data Geometry Makes Time Conditioning Optional for Flow Matching

cs.LG · 2026-05-08 · unverdicted · novelty 8.0

Data geometry makes time identifiable from noisy interpolants at rate O(1/sqrt(d-k)), rendering the time-blindness gap asymptotically negligible relative to coupling variance.

Generative Modeling with Flux Matching

cs.LG · 2026-05-08 · unverdicted · novelty 8.0

Flux Matching generalizes score-based generative modeling by using a weaker objective that admits infinitely many non-conservative vector fields with the data as stationary distribution, enabling new design choices beyond traditional score matching.

Generative models on phase space

hep-ph · 2026-04-02 · unverdicted · novelty 8.0

Generative diffusion and flow models are constructed to remain exactly on the Lorentz-invariant massless N-particle phase space manifold during sampling for particle physics applications.

FlowHijack: A Dynamics-Aware Backdoor Attack on Flow-Matching Vision-Language-Action Models

cs.CV · 2026-03-30 · unverdicted · novelty 8.0

FlowHijack is the first dynamics-aware backdoor attack on flow-matching VLAs that achieves high success rates with stealthy triggers while preserving benign performance and making malicious actions kinematically indistinguishable from normal ones.

Cross-Space Distillation: Teaching One-Step Students with Modern Diffusion Teachers

cs.CV · 2026-06-30 · unverdicted · novelty 7.0

Introduces a Bridge latent interface that maps mismatched student latents into teacher space, enabling distillation from modern diffusion teachers to compact one-step students and raising SD 1.5 HPSv3 from 5.4 to 9.4 while keeping one-step speed.

A Distributionally Robust Framework for Learned Reconstructions in Inverse Problems

math.OC · 2026-06-29 · unverdicted · novelty 7.0

Introduces structured DRO for learned inverse problem reconstructions with ambiguity sets aligned to the forward operator, yielding explicit dual representations and a worst-case bound that induces Tikhonov regularization on the operator Lipschitz constant.

Bridging Vision and Language Concepts through Optimal Transport Semantic Flow

cs.CV · 2026-06-25 · unverdicted · novelty 7.0

OTF-CBM replaces static cosine similarity in vision-language CBMs with data-driven optimal transport flow to improve concept alignment, accuracy, and faithfulness.

Patched Flow Matching: Generative Wall-Pressure Reconstruction Beyond Training-Domain Scales from Sparse Sensors

physics.flu-dyn · 2026-06-20 · unverdicted · novelty 7.0

Patched Flow Matching reconstructs full-resolution wall-pressure fields on domains four times larger than training data from 0.25% sensor coverage by fusing short-domain DNS patch priors with sparse measurements via training-free posterior sampling.

Intrinsic Flow Matching on Quantum Pure-State Manifolds with Phase-Aligned Transport

cs.LG · 2026-06-19 · unverdicted · novelty 7.0

IFM learns deterministic tangent velocity fields on CP^{d-1} via Pancharatnam phase-aligned paths, recovering marginal transport with endpoint and stability guarantees while showing empirical gains over Euclidean flow matching on quantum benchmarks.

TriFlow: Generating Artist-Like 3D Mesh Topology via Nearest-Vertex Vector Fields

cs.CV · 2026-06-18 · unverdicted · novelty 7.0

TriFlow synthesizes nearest-vertex vector fields via flow-matching to generate artist-like 3D mesh topology, then extracts meshes via clustering and topology-aware QEM simplification.

Start Right, Arrive Right: Asynchronous Execution via Initial Noise Selection

cs.RO · 2026-06-18 · unverdicted · novelty 7.0

PAINT reframes asynchronous flow-based action chunking as an initial noise selection problem solved via backward Euler inversion and a repainting rule.

Learning to Distort: Weakly-Supervised Image Quality Transfer for Prostate DWI Correction

cs.CV · 2026-06-17 · unverdicted · novelty 7.0

A weakly-supervised image quality transfer method generates synthetic distorted DWI images from quality labels to train improved distortion correction models for prostate MRI.

Learning Individual Dynamics from Sparse Cross-Sectional Snapshots

cs.LG · 2026-05-22 · unverdicted · novelty 7.0

CADENCE recovers individualized continuous-time trajectories from cross-sectional snapshots via context-anchored latent dynamics, a bijective score-based encoder, and SMoE routing, with claimed identifiability guarantees and benchmark performance matching dense-data models.

Increasing the Precision of Surrogate Models for Weak Lensing Mass Maps with Flow Matching

astro-ph.CO · 2026-05-22 · unverdicted · novelty 7.0

A flow matching generative model produces weak lensing mass maps with fidelity improved to below 1% and 5% on basic and higher-order statistics relative to GAN benchmarks.

Learning Unbiased Permutations via Flow Matching

cs.LG · 2026-05-16 · unverdicted · novelty 7.0

PermFlow applies conditional flow matching on the affine subspace of doubly stochastic matrices with a closed-form tangent projector and nearest-target coupling to capture multimodal permutation distributions.

The Velocity Deficit: Initial Energy Injection for Flow Matching

cs.CV · 2026-05-14 · unverdicted · novelty 7.0

Flow matching underestimates velocities due to MSE loss leading to integration lag; Initial Energy Injection corrects the start-end asymmetry, improving FID by 44.6% and achieving 5x speedup on ImageNet-1k.

Aligning Flow Map Policies with Optimal Q-Guidance

cs.LG · 2026-05-12 · unverdicted · novelty 7.0

Flow map policies enable fast one-step inference for flow-based RL policies, and FMQ provides an optimal closed-form Q-guided target for offline-to-online adaptation under trust-region constraints, achieving SOTA performance.

Generative Transfer for Entropic Optimal Transport with Unknown Costs

math.OC · 2026-05-12 · unverdicted · novelty 7.0

A generative transfer framework using iterative path-wise tilting integrated with conditional flow matching recovers target entropic optimal transport couplings from reference samples, achieving O(δ) convergence in Wasserstein-1 distance.

A Call to Lagrangian Action: Learning Population Mechanics from Temporal Snapshots

cs.LG · 2026-05-08 · unverdicted · novelty 7.0 · 3 refs

Wasserstein Lagrangian Mechanics formalizes second-order dynamics in Wasserstein space and provides an algorithm to learn them from observed marginals without specifying the Lagrangian, outperforming gradient flows on various dynamics.

Stochastic Transition-Map Distillation for Fast Probabilistic Inference

cs.LG · 2026-05-08 · unverdicted · novelty 7.0

STMD distills the full transition map of diffusion sampling SDEs into a conditional Mean Flow model to enable fast one- or few-step stochastic sampling without teacher models or bi-level optimization.

Generative Modeling with Orbit-Space Particle Flow Matching

cs.GR · 2026-05-04 · unverdicted · novelty 7.0

OGPP is a particle flow-matching method using orbit-space canonicalization and geometric paths that achieves lower error and fewer steps than prior approaches on 3D benchmarks.

Talker-T2AV: Joint Talking Audio-Video Generation with Autoregressive Diffusion Modeling

cs.CV · 2026-04-26 · unverdicted · novelty 7.0

Talker-T2AV achieves better lip-sync accuracy, video quality, and audio quality than dual-branch baselines by separating high-level shared autoregressive modeling from modality-specific low-level diffusion refinement in a joint audio-video generation framework.

Is Flow Matching Just Trajectory Replay for Sequential Data?

stat.ML · 2026-02-09 · unverdicted · novelty 7.0

Flow matching on time series targets a closed-form nonparametric velocity field that is a similarity-weighted mixture of observed transition velocities, making neural models approximations to an ideal memory-augmented dynamical system sampler.

DisRFM: Polar Riemannian Flow Matching for Structure-Preserving Graph Domain Adaptation

cs.LG · 2026-01-31 · unverdicted · novelty 7.0

DisRFM uses polar Riemannian flow matching on constant-curvature manifolds to align graph domains while preserving label-relevant topology via radial Wasserstein and angular confidence matching.

citing papers explorer

Showing 17 of 17 citing papers after filters.

FlowHijack: A Dynamics-Aware Backdoor Attack on Flow-Matching Vision-Language-Action Models

cs.CV · 2026-03-30 · unverdicted · none · ref 34 · internal anchor

FlowHijack is the first dynamics-aware backdoor attack on flow-matching VLAs that achieves high success rates with stealthy triggers while preserving benign performance and making malicious actions kinematically indistinguishable from normal ones.

Cross-Space Distillation: Teaching One-Step Students with Modern Diffusion Teachers

cs.CV · 2026-06-30 · unverdicted · none · ref 47 · internal anchor

Introduces a Bridge latent interface that maps mismatched student latents into teacher space, enabling distillation from modern diffusion teachers to compact one-step students and raising SD 1.5 HPSv3 from 5.4 to 9.4 while keeping one-step speed.

Bridging Vision and Language Concepts through Optimal Transport Semantic Flow

cs.CV · 2026-06-25 · unverdicted · none · ref 37 · internal anchor

OTF-CBM replaces static cosine similarity in vision-language CBMs with data-driven optimal transport flow to improve concept alignment, accuracy, and faithfulness.

TriFlow: Generating Artist-Like 3D Mesh Topology via Nearest-Vertex Vector Fields

cs.CV · 2026-06-18 · unverdicted · none · ref 56 · internal anchor

TriFlow synthesizes nearest-vertex vector fields via flow-matching to generate artist-like 3D mesh topology, then extracts meshes via clustering and topology-aware QEM simplification.

Learning to Distort: Weakly-Supervised Image Quality Transfer for Prostate DWI Correction

cs.CV · 2026-06-17 · unverdicted · none · ref 21 · internal anchor

A weakly-supervised image quality transfer method generates synthetic distorted DWI images from quality labels to train improved distortion correction models for prostate MRI.

The Velocity Deficit: Initial Energy Injection for Flow Matching

cs.CV · 2026-05-14 · unverdicted · none · ref 5 · internal anchor

Flow matching underestimates velocities due to MSE loss leading to integration lag; Initial Energy Injection corrects the start-end asymmetry, improving FID by 44.6% and achieving 5x speedup on ImageNet-1k.

Talker-T2AV: Joint Talking Audio-Video Generation with Autoregressive Diffusion Modeling

cs.CV · 2026-04-26 · unverdicted · none · ref 20 · internal anchor

Talker-T2AV achieves better lip-sync accuracy, video quality, and audio quality than dual-branch baselines by separating high-level shared autoregressive modeling from modality-specific low-level diffusion refinement in a joint audio-video generation framework.

CurvSegFlow: Time-Conditioned Flow Matching for Robust Segmentation of Curvilinear Structures in Noisy Biomedical Images

cs.CV · 2026-06-19 · unverdicted · none · ref 83 · internal anchor

CurvSegFlow applies time-conditioned flow matching with a U-Net backbone and triple-term loss to progressively refine segmentations of thin structures in noisy images, reporting competitive performance on microtubule, vessel, and nerve datasets.

Towards Continuous Sign Language Conversation from Isolated Signs

cs.CV · 2026-05-14 · unverdicted · none · ref 79 · internal anchor

Constructs continuous sign conversation data from isolated signs using retrieval and diffusion models to train a direct sign-to-sign conversational AI.

DanceOPD: On-Policy Generative Field Distillation

cs.CV · 2026-06-25 · unverdicted · none · ref 94 · internal anchor

DanceOPD routes samples across capability velocity fields in flow-matching models and trains via on-policy student-induced states to compose T2I, local editing, and global editing without mutual interference.

SynVA: A Modular Toolkit for Vessel Generation and Aneurysm Editing

cs.CV · 2026-05-13 · unverdicted · none · ref 88 · internal anchor

SynVA toolkit generates realistic vascular meshes and anatomically plausible aneurysms, releasing 50,000 labeled samples for medical vision tasks.

PixelFlowCast: Latent-Free Precipitation Nowcasting via Pixel Mean Flows

cs.CV · 2026-05-11 · unverdicted · none · ref 20 · internal anchor

PixelFlowCast delivers high-fidelity precipitation nowcasts from radar sequences using a latent-free Pixel Mean Flows predictor guided by a deterministic coarse stage and KANCondNet features.

FluxFlow: Conservative Flow-Matching for Astronomical Image Super-Resolution

cs.CV · 2026-05-05 · unverdicted · none · ref 20 · 2 links · internal anchor

FluxFlow uses conservative pixel-space flow-matching with uncertainty weights and Wiener test-time correction to outperform baselines on photometric and scientific accuracy for ground-to-space super-resolution, validated on a new real 19,500-pair DESI-HST dataset.

Unifying Deep Stochastic Processes for Image Enhancement

cs.CV · 2026-05-02 · unverdicted · none · ref 46 · internal anchor

Stochastic image enhancement methods are shown to be variants of a shared SDE differing in drift, diffusion, terminal distributions and boundary conditions, with controlled experiments revealing no single dominant family and a new modular library released.

The Amazing Stability of Flow Matching

cs.CV · 2026-04-17 · unverdicted · none · ref 31 · internal anchor

Flow matching generative models preserve sample quality, diversity, and latent representations despite pruning 50% of the CelebA-HQ dataset or altering architecture and training configurations.

FlowDec: Temporal Conditional Flow Decorruptor for Robust Continuous Vision-Language Navigation

cs.CV · 2026-06-21 · unverdicted · none · ref 40 · internal anchor

FlowDec is a novel image restoration framework using hybrid temporal conditioning and action-centroid filtering that claims to outperform prior decorruption methods on navigation accuracy and latency in VLN-CE.

Stabilizing, Scaling & Enhancing MeanFlow for Large-scale Diffusion Distillation

cs.CV · 2026-05-18 · unverdicted · none · ref 30 · internal anchor

Stabilizes MeanFlow for large-scale diffusion distillation via discrete warm-up and trajectory alignment, reporting better results on FLUX.1-dev and HunyuanImage 3.0.
