Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=

Dreambooth: Fine tuning text-to-image diffusion models for subject-driven generation , author=

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

browse 4 citing papers

representative citing papers

Designing streetscapes from street-view imagery using diffusion models

cs.CV · 2026-05-17 · conditional · novelty 7.0

A multimodal diffusion model generates controllable alternative streetscapes from street-view imagery using visual metrics and text, shown on Chicago and Orlando data with gains in semantic consistency.

Polyphonia: Zero-Shot Timbre Transfer in Polyphonic Music with Acoustic-Informed Attention Calibration

cs.SD · 2026-05-11 · unverdicted · novelty 7.0

Polyphonia improves zero-shot stem-specific timbre transfer in polyphonic music by 15.5% target alignment via acoustic-informed attention calibration that uses probabilistic priors to set coarse boundaries.

Geometric Decoupling: Diagnosing the Structural Instability of Latent

cs.CV · 2026-04-20 · unverdicted · novelty 6.0

Latent diffusion models exhibit geometric decoupling where curvature in out-of-distribution generation is misallocated to unstable semantic boundaries instead of image details, identifying geometric hotspots as the structural cause of editing instability.

What Linear Probes Miss: Multi-View Probing for Weight-Space Learning

cs.LG · 2026-05-22 · unverdicted · novelty 5.0

MVProbe is a multi-perspective probing framework for weight-space learning that combines first-order and Gram-based views and outperforms ProbeX on the Model Jungle benchmark.

citing papers explorer

Showing 4 of 4 citing papers.

Designing streetscapes from street-view imagery using diffusion models cs.CV · 2026-05-17 · conditional · none · ref 12
A multimodal diffusion model generates controllable alternative streetscapes from street-view imagery using visual metrics and text, shown on Chicago and Orlando data with gains in semantic consistency.
Polyphonia: Zero-Shot Timbre Transfer in Polyphonic Music with Acoustic-Informed Attention Calibration cs.SD · 2026-05-11 · unverdicted · none · ref 30
Polyphonia improves zero-shot stem-specific timbre transfer in polyphonic music by 15.5% target alignment via acoustic-informed attention calibration that uses probabilistic priors to set coarse boundaries.
Geometric Decoupling: Diagnosing the Structural Instability of Latent cs.CV · 2026-04-20 · unverdicted · none · ref 15
Latent diffusion models exhibit geometric decoupling where curvature in out-of-distribution generation is misallocated to unstable semantic boundaries instead of image details, identifying geometric hotspots as the structural cause of editing instability.
What Linear Probes Miss: Multi-View Probing for Weight-Space Learning cs.LG · 2026-05-22 · unverdicted · none · ref 34
MVProbe is a multi-perspective probing framework for weight-space learning that combines first-order and Gram-based views and outperforms ProbeX on the Model Jungle benchmark.

Proceedings of the IEEE/CVF conference on computer vision and pattern recognition , pages=

fields

years

verdicts

representative citing papers

citing papers explorer