Introduces WorldCoder-Bench and StateProbe for evaluating LLM-generated physically grounded 3D browser worlds, with frontier models reaching at most 27.8% verification coverage.
Gt23d-bench: A comprehensive general text-to-3d gen- eration benchmark.arXiv preprint arXiv:2412.09997, 2024
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
years
2026 2verdicts
UNVERDICTED 2representative citing papers
DB-3DME supplies a human-rated 3D mesh dataset and shows that fine-tuning the visual encoder of Qwen-2.5-VL-7B produces automatic evaluations that align better with humans than prior VLMs.
citing papers explorer
-
WorldCoder-Bench: Benchmarking Physically Grounded 3D World Synthesis
Introduces WorldCoder-Bench and StateProbe for evaluating LLM-generated physically grounded 3D browser worlds, with frontier models reaching at most 27.8% verification coverage.