Increasing the sampling steps to25allows the synthesis of an image that closely adheres to the prompt

With just five steps, Show-o can produce an image that is roughly related to the given prompt · 2024

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Show-o: One Single Transformer to Unify Multimodal Understanding and Generation

cs.CV · 2024-08-22 · unverdicted · novelty 5.0

Show-o unifies autoregressive and discrete diffusion modeling inside one transformer to support multimodal understanding and generation tasks with competitive benchmark performance.

citing papers explorer

Showing 1 of 1 citing paper.

Show-o: One Single Transformer to Unify Multimodal Understanding and Generation cs.CV · 2024-08-22 · unverdicted · none · ref 35
Show-o unifies autoregressive and discrete diffusion modeling inside one transformer to support multimodal understanding and generation tasks with competitive benchmark performance.

Increasing the sampling steps to25allows the synthesis of an image that closely adheres to the prompt

fields

years

verdicts

representative citing papers

citing papers explorer