Polyglot introduces a unified diffusion model for multilingual speech-driven facial animation that jointly conditions on language via transcript embeddings and personal style via reference sequences without requiring explicit labels.
3 Pith papers cite this work. Polarity classification is still indexing.
verdicts
UNVERDICTED 3
representative citing papers
A parametric multi-objective Bayesian optimizer amortizes optimization across continuous task spaces by alternating generative solution sampling and acquisition-driven search to enable direct prediction for unseen problems without re-evaluations.
A data-derived baseline using feature effects on binary outcomes provides a model-agnostic way to check if machine learning explanations align with the underlying data structure.
citing papers explorer
- Polyglot: Multilingual Style Preserving Speech-Driven Facial Animation
  Polyglot introduces a unified diffusion model for multilingual speech-driven facial animation that jointly conditions on language via transcript embeddings and personal style via reference sequences without requiring explicit labels.
- Amortized Multi-Objective Optimization Across Tasks with Generative Solution Modeling
  A parametric multi-objective Bayesian optimizer amortizes optimization across continuous task spaces by alternating generative solution sampling and acquisition-driven search to enable direct prediction for unseen problems without re-evaluations.
- Does the Model Say What the Data Says? A Simple Heuristic for Model Data Alignment
  A data-derived baseline using feature effects on binary outcomes provides a model-agnostic way to check if machine learning explanations align with the underlying data structure.