Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing , month=

Andrews, Pierre, Artetxe, Mikel, Meglioli, Mariano Coria, Costa-juss · 2025 · DOI 10.18653/v1/2025.emnlp-main.1400

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

open at publisher browse 5 citing papers

representative citing papers

ALEE: Any-Language Evaluation of Embeddings via English-Centric Minimal Pairs

cs.CL · 2026-06-30 · unverdicted · novelty 7.0

ALEE generates AMR-based English minimal pairs with fine-grained semantic shifts, translates them, and evaluates embedding models on 275+ languages to expose cross-lingual gaps linked to training data and tokenization.

OpenBibleTTS: Large-Scale Speech Resources and TTS Models for Low-Resource Languages

cs.CL · 2026-06-08 · accept · novelty 7.0

OpenBibleTTS supplies speech data and alignments for 37 underrepresented languages and shows that no single TTS system leads on all metrics, with Gemini-TTS highest in listener ratings but monolingual EveryVoice models strongest on intelligibility for several African languages.

Want Better Synthetic Data? Steer It: Activation Steering for Low-Resource Language Generation

cs.CL · 2026-06-16 · unverdicted · novelty 6.0

Activation steering on early layers improves diversity of synthetic data for low-resource languages and often boosts downstream classifier performance compared to non-steered prompting.

Beyond "To whom it may concern": Tailoring Machine Translation to Audience and Intent

cs.CL · 2026-06-02 · unverdicted · novelty 6.0

Explicit purpose instructions improve LLM translation adaptedness across 50 languages and 8 domains, with larger gains on informal text, while standard metrics often penalize the adapted outputs.

Model-Based Quality Assessment for Massively Multilingual Parallel Data

cs.CL · 2026-05-29 · unverdicted · novelty 4.0

Large-scale benchmarks of multilingual embeddings and QE models show no universal performer; direction-aware routing and calibration recommended for parallel data assessment.

citing papers explorer

Showing 5 of 5 citing papers after filters.

ALEE: Any-Language Evaluation of Embeddings via English-Centric Minimal Pairs cs.CL · 2026-06-30 · unverdicted · none · ref 41
ALEE generates AMR-based English minimal pairs with fine-grained semantic shifts, translates them, and evaluates embedding models on 275+ languages to expose cross-lingual gaps linked to training data and tokenization.
OpenBibleTTS: Large-Scale Speech Resources and TTS Models for Low-Resource Languages cs.CL · 2026-06-08 · accept · none · ref 36
OpenBibleTTS supplies speech data and alignments for 37 underrepresented languages and shows that no single TTS system leads on all metrics, with Gemini-TTS highest in listener ratings but monolingual EveryVoice models strongest on intelligibility for several African languages.
Want Better Synthetic Data? Steer It: Activation Steering for Low-Resource Language Generation cs.CL · 2026-06-16 · unverdicted · none · ref 87
Activation steering on early layers improves diversity of synthetic data for low-resource languages and often boosts downstream classifier performance compared to non-steered prompting.
Beyond "To whom it may concern": Tailoring Machine Translation to Audience and Intent cs.CL · 2026-06-02 · unverdicted · none · ref 1
Explicit purpose instructions improve LLM translation adaptedness across 50 languages and 8 domains, with larger gains on informal text, while standard metrics often penalize the adapted outputs.
Model-Based Quality Assessment for Massively Multilingual Parallel Data cs.CL · 2026-05-29 · unverdicted · none · ref 69
Large-scale benchmarks of multilingual embeddings and QE models show no universal performer; direction-aware routing and calibration recommended for parallel data assessment.

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing , month=

fields

years

verdicts

representative citing papers

citing papers explorer