3d-moe: A mixture-of-experts multi-modal LLM for 3d vision and pose diffusion via rectified flow

· 2025 · arXiv 2501.16698

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

read on arXiv browse 2 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

DoReMi: Bridging 3D Domains via Topology-Aware Domain-Representation Mixture of Experts

cs.CV · 2025-11-14 · unverdicted · novelty 6.0

DoReMi uses self-supervised pre-training on topological and texture variations plus domain-aware experts with spatial-guided routing and entropy-controlled allocation to reach 80.1% mIoU on ScanNet and 77.2% mIoU on S3DIS.

A Survey on Vision-Language-Action Models for Embodied AI

cs.RO · 2024-05-23 · unverdicted · novelty 6.0

This is the first survey on vision-language-action models, providing a taxonomy across three lines, plus summaries of datasets, simulators, benchmarks, challenges, and future directions in embodied AI.

citing papers explorer

Showing 2 of 2 citing papers.

DoReMi: Bridging 3D Domains via Topology-Aware Domain-Representation Mixture of Experts cs.CV · 2025-11-14 · unverdicted · none · ref 22
DoReMi uses self-supervised pre-training on topological and texture variations plus domain-aware experts with spatial-guided routing and entropy-controlled allocation to reach 80.1% mIoU on ScanNet and 77.2% mIoU on S3DIS.
A Survey on Vision-Language-Action Models for Embodied AI cs.RO · 2024-05-23 · unverdicted · none · ref 134
This is the first survey on vision-language-action models, providing a taxonomy across three lines, plus summaries of datasets, simulators, benchmarks, challenges, and future directions in embodied AI.

3d-moe: A mixture-of-experts multi-modal LLM for 3d vision and pose diffusion via rectified flow

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer