Hierarchical Policy Optimization post-trains LLMs for simultaneous speech translation on imperfect data, yielding over +7 COMET and +1.25 MetricX improvements at 1.5-second latency on English-to-Chinese/German/Japanese tasks.
InProceed- ings of the 16th Conference of the European Chap- ter of the Association for Computational Linguistics: Main Volume, pages 3222–3233, Online
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CL 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Hierarchical Policy Optimization for Simultaneous Translation of Unbounded Speech
Hierarchical Policy Optimization post-trains LLMs for simultaneous speech translation on imperfect data, yielding over +7 COMET and +1.25 MetricX improvements at 1.5-second latency on English-to-Chinese/German/Japanese tasks.