hub

Cad-mllm: Unifying multimodality-conditioned cad generation with mllm

· 2025 · arXiv 2411.04954

19 Pith papers cite this work. Polarity classification is still indexing.

19 Pith papers citing it

read on arXiv browse 19 citing papers

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 3

citation-polarity summary

background 3

representative citing papers

FllumaOne: A Code-Native Multimodal CAD Dataset with Executable Programs and Kernel-Validated Feature Histories

cs.AI · 2026-06-16 · unverdicted · novelty 7.0

FllumaOne releases 100,000 kernel-validated CAD models as executable Python programs with aligned multimodal data including feature histories and geometry exports.

BRepCLIP: Contrastive Multimodal Pretraining on BRep Primitives for CAD Understanding

cs.CV · 2026-06-03 · unverdicted · novelty 7.0

BRepCLIP is the first contrastive pretraining framework that tokenizes BRep CAD geometry into surface and curve vocabularies and aligns the resulting embeddings with CLIP text and image encoders, reporting large gains in retrieval and zero-shot classification over point-based baselines.

UniCAD: A Unified Benchmark and Universal Model for Multi-Modal Multi-Task CAD

cs.CV · 2026-06-03 · unverdicted · novelty 7.0

UniCAD supplies a unified multi-modal benchmark and an end-to-end MLLM that performs reconstruction, generation, and QA on CAD data, reporting SOTA results on UniCAD and Fusion360.

MUSE: Benchmarking Manufacturable, Functional, and Assemblable Text-to-CAD Generation

cs.AI · 2026-05-27 · unverdicted · novelty 7.0

MUSE is a new benchmark and three-stage evaluation protocol for text-to-CAD generation that assesses functionality, manufacturability, and assemblability of B-Rep assemblies beyond geometric similarity.

BrepForge: Factorized B-rep Synthesis via Wireframe Composition and Boundary-Conditioned Surface Instantiation

cs.GR · 2026-05-19 · unverdicted · novelty 7.0

BrepForge factorizes B-rep synthesis into face-aware autoregressive wireframe composition followed by boundary-conditioned surface instantiation using learning-free geometric priors.

Img2CADSeq: Image-to-CAD Generation via Sequence-Based Diffusion

cs.CV · 2026-05-13 · unverdicted · novelty 7.0

Img2CADSeq generates standard CAD sequences from images via a multi-stage pipeline with three-level hierarchical codebook encoding, importance-guided compression, and contrastive point-cloud conditioning of a VQ-Diffusion model, outperforming prior methods on new CAD-220K and PrintCAD datasets.

AssemblyBench: Physics-Aware Assembly of Complex Industrial Objects

cs.CV · 2026-05-13 · unverdicted · novelty 7.0

AssemblyBench dataset and AssemblyDyno transformer model enable physics-aware prediction of assembly sequences and trajectories for complex industrial objects from multimodal instructions and 3D shapes.

CADBench: A Multimodal Benchmark for AI-Assisted CAD Program Generation

cs.CV · 2026-05-11 · unverdicted · novelty 7.0 · 2 refs

CADBench is a new multimodal benchmark for CAD program generation that combines 18k samples from DeepCAD, Fusion 360, ABC, MCB, and Objaverse across clean/noisy meshes and various renders, used to test 11 models and reveal failure modes.

ArtiCAD: Articulated CAD Assembly Design via Multi-Agent Code Generation

cs.CV · 2026-04-13 · unverdicted · novelty 7.0

ArtiCAD presents the first training-free multi-agent framework that generates articulated, editable CAD assemblies from text or images by predicting assembly relationships early and using validation with rollback.

CAD-Coder: Text-to-CAD Generation with Chain-of-Thought and Geometric Reward

cs.GR · 2025-05-26 · unverdicted · novelty 7.0

CAD-Coder generates valid CadQuery scripts from text via supervised fine-tuning followed by reinforcement learning with geometric Chamfer Distance rewards and chain-of-thought planning.

MV-GEL: Language-Driven Multi-View Geometric Entity Localization on Meshes

cs.CV · 2026-06-30 · unverdicted · novelty 6.0

MV-GEL localizes fine-grained geometric entities on 3D meshes from natural language by ranking informative views with GELviews, applying VLM segmentation, and lifting masks via geometry-aware ray casting, reporting up to 1.7X face IoU and 4.5X edge F1 gains over baselines.

Physics-in-the-Loop: A Hybrid Agentic Architecture for Validated CAD Engineering Design

cs.CV · 2026-05-19 · unverdicted · novelty 6.0

A hybrid agentic architecture integrates knowledge-based physical verification tools into LLM-driven CAD design loops, producing more complex and functionally valid designs than prior agentic baselines.

CADFit: Precise Mesh-to-CAD Program Generation with Hybrid Optimization

cs.CV · 2026-05-02 · unverdicted · novelty 6.0

CADFit recovers complex editable CAD construction sequences from meshes via IoU-driven hybrid optimization over structured programs, outperforming prior methods on volumetric IoU, Chamfer Distance, and invalid ratio.

Agent-Aided Design for Dynamic CAD Models

cs.AI · 2026-04-16 · unverdicted · novelty 6.0

AADvark extends agent-aided CAD design to dynamic 3D assemblies with movable parts by integrating constraint solvers and visual feedback to create a verification signal for the agent.

Pointer-CAD: Unifying B-Rep and Command Sequences via Pointer-based Edges & Faces Selection

cs.CV · 2026-03-04 · unverdicted · novelty 6.0

Pointer-CAD unifies B-Rep geometry with command sequences via pointer-based entity selection, allowing LLMs to perform complex CAD edits while cutting topological errors from quantization.

Pointer-CAD v2: Plan-Then-Construct CAD Generation with Dimension-Aware Parametric Precision

cs.CV · 2026-06-28 · unverdicted · novelty 5.0

Pointer-CAD v2 decouples planning from construction in LLM-based CAD generation by using a pointer mechanism to reference continuous parameters from a design plan, paired with new hierarchical accuracy metrics.

Memory-Augmented Reinforcement Learning Agent for CAD Generation

cs.AI · 2026-05-19 · unverdicted · novelty 5.0

Memory-augmented RL agent with case and skill libraries plus dynamic retrieval improves success rate and geometric consistency for complex CAD model generation.

CADDesigner: Conceptual CAD Model Generation with a General-Purpose Agent

cs.AI · 2025-08-01 · unverdicted · novelty 5.0 · 2 refs

CADDesigner is an LLM agent that generates conceptual CAD models from text and sketches via requirement analysis, the ECIP paradigm, and iterative visual feedback, outperforming baselines in experiments.

Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation

cs.CV · 2025-01-21 · unverdicted · novelty 4.0

Hunyuan3D 2.0 scales flow-based diffusion transformers and texture synthesis models to generate high-resolution textured 3D assets that outperform prior state-of-the-art in geometry, alignment, and texture quality.

citing papers explorer

Showing 19 of 19 citing papers.

FllumaOne: A Code-Native Multimodal CAD Dataset with Executable Programs and Kernel-Validated Feature Histories cs.AI · 2026-06-16 · unverdicted · none · ref 31
FllumaOne releases 100,000 kernel-validated CAD models as executable Python programs with aligned multimodal data including feature histories and geometry exports.
BRepCLIP: Contrastive Multimodal Pretraining on BRep Primitives for CAD Understanding cs.CV · 2026-06-03 · unverdicted · none · ref 39
BRepCLIP is the first contrastive pretraining framework that tokenizes BRep CAD geometry into surface and curve vocabularies and aligns the resulting embeddings with CLIP text and image encoders, reporting large gains in retrieval and zero-shot classification over point-based baselines.
UniCAD: A Unified Benchmark and Universal Model for Multi-Modal Multi-Task CAD cs.CV · 2026-06-03 · unverdicted · none · ref 50
UniCAD supplies a unified multi-modal benchmark and an end-to-end MLLM that performs reconstruction, generation, and QA on CAD data, reporting SOTA results on UniCAD and Fusion360.
MUSE: Benchmarking Manufacturable, Functional, and Assemblable Text-to-CAD Generation cs.AI · 2026-05-27 · unverdicted · none · ref 30
MUSE is a new benchmark and three-stage evaluation protocol for text-to-CAD generation that assesses functionality, manufacturability, and assemblability of B-Rep assemblies beyond geometric similarity.
BrepForge: Factorized B-rep Synthesis via Wireframe Composition and Boundary-Conditioned Surface Instantiation cs.GR · 2026-05-19 · unverdicted · none · ref 93
BrepForge factorizes B-rep synthesis into face-aware autoregressive wireframe composition followed by boundary-conditioned surface instantiation using learning-free geometric priors.
Img2CADSeq: Image-to-CAD Generation via Sequence-Based Diffusion cs.CV · 2026-05-13 · unverdicted · none · ref 50
Img2CADSeq generates standard CAD sequences from images via a multi-stage pipeline with three-level hierarchical codebook encoding, importance-guided compression, and contrastive point-cloud conditioning of a VQ-Diffusion model, outperforming prior methods on new CAD-220K and PrintCAD datasets.
AssemblyBench: Physics-Aware Assembly of Complex Industrial Objects cs.CV · 2026-05-13 · unverdicted · none · ref 42
AssemblyBench dataset and AssemblyDyno transformer model enable physics-aware prediction of assembly sequences and trajectories for complex industrial objects from multimodal instructions and 3D shapes.
CADBench: A Multimodal Benchmark for AI-Assisted CAD Program Generation cs.CV · 2026-05-11 · unverdicted · none · ref 13 · 2 links
CADBench is a new multimodal benchmark for CAD program generation that combines 18k samples from DeepCAD, Fusion 360, ABC, MCB, and Objaverse across clean/noisy meshes and various renders, used to test 11 models and reveal failure modes.
ArtiCAD: Articulated CAD Assembly Design via Multi-Agent Code Generation cs.CV · 2026-04-13 · unverdicted · none · ref 56
ArtiCAD presents the first training-free multi-agent framework that generates articulated, editable CAD assemblies from text or images by predicting assembly relationships early and using validation with rollback.
CAD-Coder: Text-to-CAD Generation with Chain-of-Thought and Geometric Reward cs.GR · 2025-05-26 · unverdicted · none · ref 33
CAD-Coder generates valid CadQuery scripts from text via supervised fine-tuning followed by reinforcement learning with geometric Chamfer Distance rewards and chain-of-thought planning.
MV-GEL: Language-Driven Multi-View Geometric Entity Localization on Meshes cs.CV · 2026-06-30 · unverdicted · none · ref 49
MV-GEL localizes fine-grained geometric entities on 3D meshes from natural language by ranking informative views with GELviews, applying VLM segmentation, and lifting masks via geometry-aware ray casting, reporting up to 1.7X face IoU and 4.5X edge F1 gains over baselines.
Physics-in-the-Loop: A Hybrid Agentic Architecture for Validated CAD Engineering Design cs.CV · 2026-05-19 · unverdicted · none · ref 6
A hybrid agentic architecture integrates knowledge-based physical verification tools into LLM-driven CAD design loops, producing more complex and functionally valid designs than prior agentic baselines.
CADFit: Precise Mesh-to-CAD Program Generation with Hybrid Optimization cs.CV · 2026-05-02 · unverdicted · none · ref 13
CADFit recovers complex editable CAD construction sequences from meshes via IoU-driven hybrid optimization over structured programs, outperforming prior methods on volumetric IoU, Chamfer Distance, and invalid ratio.
Agent-Aided Design for Dynamic CAD Models cs.AI · 2026-04-16 · unverdicted · none · ref 38
AADvark extends agent-aided CAD design to dynamic 3D assemblies with movable parts by integrating constraint solvers and visual feedback to create a verification signal for the agent.
Pointer-CAD: Unifying B-Rep and Command Sequences via Pointer-based Edges & Faces Selection cs.CV · 2026-03-04 · unverdicted · none · ref 52
Pointer-CAD unifies B-Rep geometry with command sequences via pointer-based entity selection, allowing LLMs to perform complex CAD edits while cutting topological errors from quantization.
Pointer-CAD v2: Plan-Then-Construct CAD Generation with Dimension-Aware Parametric Precision cs.CV · 2026-06-28 · unverdicted · none · ref 49
Pointer-CAD v2 decouples planning from construction in LLM-based CAD generation by using a pointer mechanism to reference continuous parameters from a design plan, paired with new hierarchical accuracy metrics.
Memory-Augmented Reinforcement Learning Agent for CAD Generation cs.AI · 2026-05-19 · unverdicted · none · ref 78
Memory-augmented RL agent with case and skill libraries plus dynamic retrieval improves success rate and geometric consistency for complex CAD model generation.
CADDesigner: Conceptual CAD Model Generation with a General-Purpose Agent cs.AI · 2025-08-01 · unverdicted · none · ref 18 · 2 links
CADDesigner is an LLM agent that generates conceptual CAD models from text and sketches via requirement analysis, the ECIP paradigm, and iterative visual feedback, outperforming baselines in experiments.
Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation cs.CV · 2025-01-21 · unverdicted · none · ref 103
Hunyuan3D 2.0 scales flow-based diffusion transformers and texture synthesis models to generate high-resolution textured 3D assets that outperform prior state-of-the-art in geometry, alignment, and texture quality.

Cad-mllm: Unifying multimodality-conditioned cad generation with mllm

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer