OctoT2I uses an unsupervised PSEL loop to discover model capability frontiers and route T2I tasks, reaching a GenEval score of 0.96 with a 90.3% speedup over Flow-GRPO.
Talkphoto: A versatile training-free conversational assistant for intelligent image editing. arXiv preprint arXiv:2601.01915, 2026
2 Pith papers cite this work. Polarity classification is still indexing.
Years: 2026 (2)
Verdicts: UNVERDICTED (2)
Representative citing papers
The Image Reconstruction Game: Drawing Common Ground Through Iterative Multimodal Dialogue
Introduces Image Reconstruction Game benchmark showing describer model dominates reconstruction quality in multi-turn VLM-generator dialogue, with math images hardest and token budget affecting convergence.