Systematic benchmarking of diffusion model optimizations on Apple M3 Ultra produces 22.7 FPS real-time img2img at 512x512 and demonstrates that CUDA-derived techniques do not transfer directly to Apple Silicon.
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation,
2 Pith papers cite this work. Polarity classification is still indexing.
years
2026 2representative citing papers
Coherence-first rendering with 15 FPS anchors plus FSR4 upsampling to 30 FPS preserves scene geometry and identity longer than native 30 FPS generation across tested forest, sword, desert, and snow scenes, with LPIPS favoring the coherence branch.
citing papers explorer
-
Systematic Optimization of Real-Time Diffusion Model Inference on Apple M3 Ultra
Systematic benchmarking of diffusion model optimizations on Apple M3 Ultra produces 22.7 FPS real-time img2img at 512x512 and demonstrates that CUDA-derived techniques do not transfer directly to Apple Silicon.
-
Fewer, Better Frames: A Compute-Normalized Proof of Concept for Coherence-First World-Model Rendering with Model-Guided FSR4 Frame Generation
Coherence-first rendering with 15 FPS anchors plus FSR4 upsampling to 30 FPS preserves scene geometry and identity longer than native 30 FPS generation across tested forest, sword, desert, and snow scenes, with LPIPS favoring the coherence branch.