pith. sign in

Qwen3-omni technical report

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

fields

cs.CV 3 cs.AI 1

years

2026 4

verdicts

UNVERDICTED 4

representative citing papers

UAF: A Unified Audio Front-end LLM for Full-Duplex Speech Interaction

cs.AI · 2026-04-21 · unverdicted · novelty 6.0

UAF is the first unified audio front-end LLM that turns multiple front-end tasks into one sequence prediction model processing streaming audio chunks and reference prompts to output semantic and control tokens for full-duplex interaction.

HumanOmni-Speaker: Identifying Who said What and When

cs.CV · 2026-03-23 · unverdicted · novelty 6.0

HumanOmni-Speaker introduces a Visual Delta Encoder and VR-SDR benchmark that enable end-to-end speaker diarization and recognition by sampling video at 25 fps and compressing inter-frame motion residuals into 6 tokens per frame.

citing papers explorer

Showing 4 of 4 citing papers.