pith. sign in

Xinshen Zhang

Identifiers

  • name variant Xinshen Zhang 0.60 · backfill

Papers (3)

  1. Which Speech Representation Better Matches Text-Native Reasoning? A Study of Speech-Text Alignment on Frame Rate and Representation eess.AS · 2026 · author #10
  2. EARL: Towards a Unified Analysis-Guided Reinforcement Learning Framework for Egocentric Interaction Reasoning and Pixel Grounding cs.CV · 2026 · author #2
  3. Audio-FLAN: An Instruction-Following Dataset for Unified Audio Understanding and Generation of Speech, Music, and Sound cs.SD · 2025 · author #14

Mentions

  • 2606.12199 #10 · arxiv_oai · confidence 0.70 Xinshen Zhang
  • 2502.16584 #14 · arxiv_oai · confidence 0.70 Xinshen Zhang

Frequent Coauthors