Stefan Larson, Anish Mahendran, Joseph J Peper, Christopher Clarke, Andrew Lee, Parker Hill, Jonathan K · 2019 · DOI 10.18653/v1/d19-1131

6 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

IPQA: A Benchmark for Core Intent Identification in Personalized Question Answering

cs.CL · 2025-10-27 · conditional · novelty 7.0

IPQA is a new benchmark that measures how well models identify core user intents from history in personalized question answering, finding that performance is poor and declines with greater question complexity.

SCOPE: Sequential Conformal Probing for Reliable OOD Rejection in LLM Services

cs.CL · 2026-06-19 · unverdicted · novelty 6.0

SCOPE selects readable hidden layers, constructs conformal gates with IND calibration, and uses supermartingale e-processes to certify persistent service-boundary evidence, improving rejection over final-layer detectors across multiple LLMs and boundary conditions.

Template-assisted Contrastive Learning of Task-oriented Dialogue Sentence Embeddings

cs.CL · 2023-05-23 · unverdicted · novelty 6.0

TaDSE learns dialogue sentence embeddings via template-guided self-supervised contrastive learning plus synthetic slot-filling augmentation and reports gains on five downstream benchmarks.

TextClusterLab: An Integrated Framework for Reliable Text Clustering Studies

cs.IR · 2026-05-17 · unverdicted · novelty 5.0

TextClusterLab introduces an LLM-driven generator for synthetic text clustering datasets with tunable attributes and a suitability benchmark for evaluation.

Training LLMs with Reinforcement Learning for Intent-Aware Personalized Question Answering

cs.CL · 2026-05-12 · unverdicted · novelty 5.0

IAP uses RL to train LLMs to explicitly infer and apply implicit user intent in single-turn personalized QA, achieving ~7.5% average macro-score gains over baselines on LaMP-QA.

Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions

cs.CL · 2025-07-07

citing papers explorer

IPQA: A Benchmark for Core Intent Identification in Personalized Question Answering · cs.CL · 2025-10-27 · conditional · none · ref 16
SCOPE: Sequential Conformal Probing for Reliable OOD Rejection in LLM Services · cs.CL · 2026-06-19 · unverdicted · none · ref 50
Template-assisted Contrastive Learning of Task-oriented Dialogue Sentence Embeddings · cs.CL · 2023-05-23 · unverdicted · none · ref 22
TextClusterLab: An Integrated Framework for Reliable Text Clustering Studies · cs.IR · 2026-05-17 · unverdicted · none · ref 19
Training LLMs with Reinforcement Learning for Intent-Aware Personalized Question Answering · cs.CL · 2026-05-12 · unverdicted · none · ref 46
Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions · cs.CL · 2025-07-07 · unreviewed · ref 19
