Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing , year =

· 2024 · DOI 10.18653/v1/2024.emnlp-main.726

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

open at publisher browse 5 citing papers

representative citing papers

SciNLP: A Domain-Specific Benchmark for Full-Text Scientific Entity and Relation Extraction in NLP

cs.CL · 2025-09-09 · accept · novelty 7.0

SciNLP is the first full-text entity and relation extraction benchmark for the NLP domain, built from 60 manually annotated publications and used to evaluate models and construct a domain knowledge graph.

MultiSynt/MT: Trillion-Token Multi-Parallel Pre-Training Data Translated Across 36 Languages

cs.CL · 2026-07-01 · unverdicted · novelty 6.0

MultiSynt/MT supplies 4.8 trillion translated tokens in 36 languages from 100B English tokens, letting LLMs match native-data baselines with 72% fewer tokens and beat them by 15% at equal budget.

Scaling Performance and Low-Resource Annotation with Many-Shot In-Context Learning for Named Entity Recognition

cs.CL · 2026-06-20 · unverdicted · novelty 6.0

Many-shot ICL with LLMs matches or exceeds supervised BERT on NER and generates high-quality labels for low-resource settings, producing ~10% absolute F1 gains when used to fine-tune BERT.

Building Agent Harnesses for Scientific Curation from Multimodal Sources

cs.AI · 2026-06-19 · unverdicted · novelty 5.0

Beaver agent harness achieves 81.0 GRAS on multimodal scientific curation, outperforming frontier agents by over 23 points through scaffolding and evidence tooling.

Task Decomposition for Efficient Annotation

cs.CL · 2026-06-23 · unverdicted · novelty 4.0

Decomposing annotation tasks using centers from centering theory reduces aggregate inferential load via a degrees-of-freedom model and enables better sub-task allocation.

citing papers explorer

Showing 4 of 4 citing papers after filters.

MultiSynt/MT: Trillion-Token Multi-Parallel Pre-Training Data Translated Across 36 Languages cs.CL · 2026-07-01 · unverdicted · none · ref 103
MultiSynt/MT supplies 4.8 trillion translated tokens in 36 languages from 100B English tokens, letting LLMs match native-data baselines with 72% fewer tokens and beat them by 15% at equal budget.
Scaling Performance and Low-Resource Annotation with Many-Shot In-Context Learning for Named Entity Recognition cs.CL · 2026-06-20 · unverdicted · none · ref 62
Many-shot ICL with LLMs matches or exceeds supervised BERT on NER and generates high-quality labels for low-resource settings, producing ~10% absolute F1 gains when used to fine-tune BERT.
Building Agent Harnesses for Scientific Curation from Multimodal Sources cs.AI · 2026-06-19 · unverdicted · none · ref 27
Beaver agent harness achieves 81.0 GRAS on multimodal scientific curation, outperforming frontier agents by over 23 points through scaffolding and evidence tooling.
Task Decomposition for Efficient Annotation cs.CL · 2026-06-23 · unverdicted · none · ref 47
Decomposing annotation tasks using centers from centering theory reduces aggregate inferential load via a degrees-of-freedom model and enables better sub-task allocation.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing , year =

fields

years

verdicts

representative citing papers

citing papers explorer