hub

Findings of the WMT 25 general machine translation shared task: Time to stop evaluating on easy test sets

Kocmi, Tom, Artemova, Ekaterina, Avramidis, Eleftherios, Bawden, Rachel, Bojar, Ond r ej, Dranch, Konstantin · 2025 · DOI 10.18653/v1/2025.wmt-1.22

19 Pith papers cite this work. Polarity classification is still indexing.

19 Pith papers citing it

open at publisher browse 19 citing papers

hub tools

JSON dossier citing papers JSON publisher DOI

citation-role summary

other 1

citation-polarity summary

unclear 1

representative citing papers

Rewrite to Translate, Translate to Reward: Reinforcement Learning for Source Rewriting in Machine Translation

cs.CL · 2026-06-06 · unverdicted · novelty 7.0

RLSR trains source rewriters via RL with translation-quality improvement as the reward, outperforming prompt baselines at 4B scale while matching larger models.

Ouvia: A User-centered Framework for Measuring Usability of Speech Translation in Real-World Communication Scenarios

cs.CL · 2026-06-04 · unverdicted · novelty 7.0

Ouvia is a user-centered evaluation framework for speech translation usability in real-world scenarios, showing limited usability rates and the superiority of QA-based metrics.

Creativity Bias: How Machine Evaluation Struggles with Creativity in Literary Translations

cs.CL · 2026-05-13 · unverdicted · novelty 7.0

Automatic evaluation tools for literary translations correlate poorly with expert human judgments on creativity and exhibit bias favoring machine-translated texts.

What Does LLM Refinement Actually Improve? A Systematic Study on Document-Level Literary Translation

cs.CL · 2026-05-13 · accept · novelty 7.0

Document-level machine translation followed by segment-level LLM refinement provides the strongest and most stable improvements in literary translation quality, mainly enhancing fluency and style rather than adequacy.

AI translation of literary texts is "fine", but readers still prefer human translations

cs.CL · 2026-06-24 · unverdicted · novelty 6.0

Human readers prefer human literary translations over AI-generated ones for immersion and clarity despite finding MT adequate and struggling to identify the source.

Misaligned by Reward: Socially Undesirable Preferences in LLMs

cs.CL · 2026-05-06 · unverdicted · novelty 6.0

Reward models for LLMs frequently select socially undesirable options across four social domains, show no overall best performer, and exhibit a bias-avoidance versus context-sensitivity trade-off.

Speaking in Self-Assessing Tongues: On the Verbalized Confidence of LLMs in Machine Translation

cs.CL · 2026-06-15 · unverdicted · novelty 5.0

Empirical study finds verbalized per-token confidence methods in LLMs for MT perform similarly to internal signals on error detection and calibration but show little correlation.

Better Literary Translation: A Multi-Aspect Data Generation and LLM Training Approach

cs.CL · 2026-06-04 · unverdicted · novelty 5.0

Multi-aspect iterative refinement with specialized LLMs generates superior literary translation data, enabling SFT and GRPO to produce LitMT-8B and LitMT-14B models scoring 67.25 and 69.07 CEA100 on MetaphorTrans, competitive with Claude Sonnet 4.5.

A Systematic Analysis of Linguistic Features in AI-Generated Text Detection Across Domains and Models

cs.CL · 2026-06-02 · unverdicted · novelty 5.0

Lexical richness is a robust linguistic signal for AI-generated text detection across models and domains, while most other features are context-dependent.

Why Low-Resource NLP Needs More Than Cross-Lingual Transfer: Lessons Learned from Luxembourgish

cs.CL · 2026-05-11 · unverdicted · novelty 4.0

Cross-lingual transfer and language-specific data efforts are interdependent and complementary for effective low-resource NLP, as demonstrated through Luxembourgish case studies and synthesis.

CAT-Translate: Building Compact Open-Source Models for Japanese-English Translation

cs.CL · 2026-06-19 · unverdicted · novelty 3.0

Compact 0.8B-7B models for bidirectional Japanese-English translation outperform large multilingual models on real-world domain benchmarks.

LLM Consumer Behavior Theory: Foundations of a Novel Research Field

cs.AI · 2026-06-16 · unverdicted · novelty 3.0

Introduces LLM Consumer Behavior Theory to analyze consumer behavior when LLMs serve as autonomous decision-making agents in markets.

MLLP-VRAIN UPV system for the IWSLT 2026 Simultaneous Speech Translation task

cs.CL · 2026-06-15 · unverdicted · novelty 3.0

A cascaded SimulST system using Parakeet and Qwen 3.5 with adaptive black-box policies and RAG context achieves +5.82 XCOMET-XL improvement on En→De for IWSLT 2026.

FMI_SU_Yotkova_Kastreva at SemEval-2026 Task 13: Lightweight Detection of LLM-Generated Code via Stylometric Signals

cs.CL · 2026-05-05 · unverdicted · novelty 3.0

A feature-based decision tree with parsing-derived signals and heuristics detects LLM-generated code in a lightweight, CPU-only setup for SemEval-2026 Task 13.

Hy-MT2: A Family of Fast, Efficient and Powerful Multilingual Translation Models in the Wild

cs.CL · 2026-05-21

A Recipe for Long-Context Reasoning in Large Language Models via On-Policy Optimization and Distillation

cs.CL · 2026-05-12

Dynamic Meta-Metrics: Source-Sentence Conditioned Weighting for MT Evaluation

cs.CL · 2026-05-09

Psychologically Potent, Computationally Invisible: LLMs Generate Social-Comparison-Eliciting Posts They Fail to Detect

cs.CL · 2026-05-01

Syntax as a Rosetta Stone: Universal Dependencies for In-Context Coptic Translation

cs.CL · 2026-04-20

citing papers explorer

Showing 1 of 1 citing paper after filters.

LLM Consumer Behavior Theory: Foundations of a Novel Research Field cs.AI · 2026-06-16 · unverdicted · none · ref 270
Introduces LLM Consumer Behavior Theory to analyze consumer behavior when LLMs serve as autonomous decision-making agents in markets.

Findings of the WMT 25 general machine translation shared task: Time to stop evaluating on easy test sets

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer