(eds.) Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)

Zhang, S · 2018 · DOI 10.18653/v1/p18-1205

12 Pith papers cite this work. Polarity classification is still indexing.

12 Pith papers citing it

open at publisher browse 12 citing papers

representative citing papers

CheckMIABench: Firm Foundations For Membership Inference Attacks on Language Models

cs.LG · 2026-06-16 · conditional · novelty 7.0

CheckMIABench converts LLMs with intermediate checkpoints into clean MIA testbeds by using pre- and post-checkpoint training data from the same distribution and evaluates published attacks on Pythia and OLMo models while releasing an open-source library.

ProMediate: A Socio-cognitive framework for evaluating proactive agents in multi-party negotiation

cs.CL · 2025-10-29 · conditional · novelty 7.0

ProMediate introduces a theory-grounded simulation testbed and socio-cognitive metrics to evaluate proactive AI mediator agents in multi-party multi-topic negotiations, with experiments showing a socially intelligent mediator improves consensus change and intervention speed over a generic baseline.

PeReGrINE: Evaluating Personalized Review Fidelity with User Item Graph Context

cs.IR · 2026-04-09 · unverdicted · novelty 6.0

PeReGrINE is a graph-based benchmark that restructures Amazon Reviews 2023 with temporal cutoffs and introduces dissonance analysis to measure how well retrieval-conditioned models match user style and product consensus.

On Emotion-Sensitive Decision Making of Small Language Model Agents

cs.AI · 2026-04-08 · unverdicted · novelty 6.0

Emotional perturbations induced via activation steering systematically alter strategic choices made by small language model agents in cooperative and competitive game templates, yet the resulting behaviors remain unstable and only partially aligned with human patterns.

Toxic Subword Pruning for Dialogue Response Generation on Large Language Models

cs.CL · 2024-10-05 · unverdicted · novelty 6.0

ToxPrune prunes toxic subwords from BPE tokenizers in LLMs to mitigate toxic dialogue responses and improve diversity on both toxic and non-toxic models.

PEC-Home: Interpretation of Progressively Elliptical Commands in Smart Homes

cs.CL · 2026-06-17 · unverdicted · novelty 5.0

Presents PEC-Home dataset for elliptical smart-home commands and shows LLMs achieve lower execution accuracy on elliptical inputs than complete commands even with dialogue history access.

Resonant Minds: Closed-Loop Social Avatars with Theory of Mind

cs.CV · 2026-06-04 · unverdicted · novelty 5.0

A dual-agent closed-loop system integrates Theory of Mind reasoning with multimodal video generation to create social avatars that outperform full-information baselines on dialogue quality under information asymmetry.

RECAP: Transparent Inference-Time Emotion Alignment for Medical Dialogue Systems

cs.CL · 2025-09-12 · unverdicted · novelty 5.0

RECAP is an inference-time framework using cognitive appraisal theory to enhance emotional alignment and transparency in medical dialogue systems across model scales.

Large Language Models as Virtual Survey Respondents: Evaluating Sociodemographic Response Generation

cs.AI · 2025-09-08 · conditional · novelty 5.0

Introduces PAS and FAS task abstractions plus the LLM-S^3 benchmark to evaluate LLMs on generating sociodemographic survey responses across 11 real datasets and multiple models.

Creating Multilingual Mental Health Dialogue Datasets: Limits of Persona-Based Localization via Nationality and Language

cs.CL · 2026-06-17 · unverdicted · novelty 4.0

Modifying nationality and language parameters in English-centric personas for mental health dialogues introduces clinical inconsistencies across languages and causes LLM judges to perform inaccurately on non-English depression severity assessments.

Learn-To-Learn on Arbitrary Textual Conditioning: A Hypernetwork-Driven Meta-Gated LLM

cs.CL · 2026-05-03

Strategic Persuasion with Trait-Conditioned Multi-Agent Systems for Iterative Legal Argumentation

cs.MA · 2026-04-08

citing papers explorer

Showing 12 of 12 citing papers.

CheckMIABench: Firm Foundations For Membership Inference Attacks on Language Models cs.LG · 2026-06-16 · conditional · none · ref 260
CheckMIABench converts LLMs with intermediate checkpoints into clean MIA testbeds by using pre- and post-checkpoint training data from the same distribution and evaluates published attacks on Pythia and OLMo models while releasing an open-source library.
ProMediate: A Socio-cognitive framework for evaluating proactive agents in multi-party negotiation cs.CL · 2025-10-29 · conditional · none · ref 1
ProMediate introduces a theory-grounded simulation testbed and socio-cognitive metrics to evaluate proactive AI mediator agents in multi-party multi-topic negotiations, with experiments showing a socially intelligent mediator improves consensus change and intervention speed over a generic baseline.
PeReGrINE: Evaluating Personalized Review Fidelity with User Item Graph Context cs.IR · 2026-04-09 · unverdicted · none · ref 9
PeReGrINE is a graph-based benchmark that restructures Amazon Reviews 2023 with temporal cutoffs and introduces dissonance analysis to measure how well retrieval-conditioned models match user style and product consensus.
On Emotion-Sensitive Decision Making of Small Language Model Agents cs.AI · 2026-04-08 · unverdicted · none · ref 5
Emotional perturbations induced via activation steering systematically alter strategic choices made by small language model agents in cooperative and competitive game templates, yet the resulting behaviors remain unstable and only partially aligned with human patterns.
Toxic Subword Pruning for Dialogue Response Generation on Large Language Models cs.CL · 2024-10-05 · unverdicted · none · ref 42
ToxPrune prunes toxic subwords from BPE tokenizers in LLMs to mitigate toxic dialogue responses and improve diversity on both toxic and non-toxic models.
PEC-Home: Interpretation of Progressively Elliptical Commands in Smart Homes cs.CL · 2026-06-17 · unverdicted · none · ref 62
Presents PEC-Home dataset for elliptical smart-home commands and shows LLMs achieve lower execution accuracy on elliptical inputs than complete commands even with dialogue history access.
Resonant Minds: Closed-Loop Social Avatars with Theory of Mind cs.CV · 2026-06-04 · unverdicted · none · ref 49
A dual-agent closed-loop system integrates Theory of Mind reasoning with multimodal video generation to create social avatars that outperform full-information baselines on dialogue quality under information asymmetry.
RECAP: Transparent Inference-Time Emotion Alignment for Medical Dialogue Systems cs.CL · 2025-09-12 · unverdicted · none · ref 56
RECAP is an inference-time framework using cognitive appraisal theory to enhance emotional alignment and transparency in medical dialogue systems across model scales.
Large Language Models as Virtual Survey Respondents: Evaluating Sociodemographic Response Generation cs.AI · 2025-09-08 · conditional · none · ref 42
Introduces PAS and FAS task abstractions plus the LLM-S^3 benchmark to evaluate LLMs on generating sociodemographic survey responses across 11 real datasets and multiple models.
Creating Multilingual Mental Health Dialogue Datasets: Limits of Persona-Based Localization via Nationality and Language cs.CL · 2026-06-17 · unverdicted · none · ref 60
Modifying nationality and language parameters in English-centric personas for mental health dialogues introduces clinical inconsistencies across languages and causes LLM judges to perform inaccurately on non-English depression severity assessments.
Learn-To-Learn on Arbitrary Textual Conditioning: A Hypernetwork-Driven Meta-Gated LLM cs.CL · 2026-05-03 · unreviewed · ref 105
Strategic Persuasion with Trait-Conditioned Multi-Agent Systems for Iterative Legal Argumentation cs.MA · 2026-04-08 · unreviewed · ref 19

(eds.) Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)

fields

years

verdicts

representative citing papers

citing papers explorer