Title resolution pending

Lavida: A large diffusion language model for multimodal understanding · 2025 · arXiv 2505.24496

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

Chatterbox-Flash: Prior-Calibrated Block Diffusion for Streaming Zero-Shot TTS

cs.SD · 2026-05-29 · unverdicted · novelty 6.0

Block-diffusion decoder with prior-calibrated scoring and early stopping produces streaming zero-shot TTS at quality comparable to AR and NAR baselines with lower real-time factor.

A Simple Method to Enhance Pre-trained Language Models with Speech Tokens for Classification

cs.CL · 2025-12-08 · unverdicted · novelty 5.0

Lasso-selected speech tokens enhance text LLMs for multimodal classification by reducing long audio sequences to task-relevant features via self-supervised adaptation.

citing papers explorer

Showing 2 of 2 citing papers.

Chatterbox-Flash: Prior-Calibrated Block Diffusion for Streaming Zero-Shot TTS cs.SD · 2026-05-29 · unverdicted · none · ref 5
Block-diffusion decoder with prior-calibrated scoring and early stopping produces streaming zero-shot TTS at quality comparable to AR and NAR baselines with lower real-time factor.
A Simple Method to Enhance Pre-trained Language Models with Speech Tokens for Classification cs.CL · 2025-12-08 · unverdicted · none · ref 20
Lasso-selected speech tokens enhance text LLMs for multimodal classification by reducing long audio sequences to task-relevant features via self-supervised adaptation.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer