Towards seamless interaction: Causal turn-level modeling of inter- active 3d conversational head dynamics

Junjie Chen, Fei Wang, Zhihao Hunag, Qing Zhou, Kun Li, Dan Guo, Linfeng Zhang, Xun Yang · 2025 · arXiv 2512.15340

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

EmbodiedHead: Real-Time Listening and Speaking Avatar for Conversational Agents

cs.CV · 2026-04-19 · unverdicted · novelty 6.0

EmbodiedHead introduces a Rectified-Flow Diffusion Transformer with differentiable renderer and single-stream listening-speaking conditioning to achieve real-time high-fidelity conversational avatars.

3D Smoke Scene Reconstruction Guided by Vision Priors from Multimodal Large Language Models

cs.CV · 2026-04-07 · unverdicted · novelty 5.0

A framework that combines MLLM-based image enhancement with a medium-aware 3D Gaussian Splatting model to reconstruct and render smoke scenes.

citing papers explorer

Showing 2 of 2 citing papers.

EmbodiedHead: Real-Time Listening and Speaking Avatar for Conversational Agents cs.CV · 2026-04-19 · unverdicted · none · ref 3
EmbodiedHead introduces a Rectified-Flow Diffusion Transformer with differentiable renderer and single-stream listening-speaking conditioning to achieve real-time high-fidelity conversational avatars.
3D Smoke Scene Reconstruction Guided by Vision Priors from Multimodal Large Language Models cs.CV · 2026-04-07 · unverdicted · none · ref 7
A framework that combines MLLM-based image enhancement with a medium-aware 3D Gaussian Splatting model to reconstruct and render smoke scenes.

Towards seamless interaction: Causal turn-level modeling of inter- active 3d conversational head dynamics

fields

years

verdicts

representative citing papers

citing papers explorer