hub

R., Rocktäschel, T., and Perez, E

Khan, A · 2024 · arXiv 2402.06782

12 Pith papers cite this work. Polarity classification is still indexing.

12 Pith papers citing it

read on arXiv browse 12 citing papers

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 3

citation-polarity summary

background 3

representative citing papers

LLM-Based Persuasion Enables Guardrail Override in Frontier LLMs

cs.CL · 2026-05-13 · conditional · novelty 7.0

LLM attackers persuade frontier LLMs to generate prohibited essays on consensus topics through multi-turn natural-language pressure, with success rates up to 100% in some model-topic pairs.

EquiMem: Calibrating Shared Memory in Multi-Agent Debate via Game-Theoretic Equilibrium

cs.AI · 2026-05-10 · unverdicted · novelty 7.0

EquiMem calibrates shared memory in multi-agent debate by computing a game-theoretic equilibrium from agent queries and paths, outperforming heuristics and LLM validators across benchmarks while remaining robust to adversarial agents.

Spontaneous Persuasion: An Audit of Model Persuasiveness in Everyday Conversations

cs.HC · 2026-04-23 · unverdicted · novelty 6.0

LLMs engage in spontaneous persuasion in virtually all multi-turn conversations by favoring information-based strategies like logic and evidence, in contrast to human responses that rely more on social influence and negative emotions.

Towards an AI co-scientist

cs.AI · 2025-02-26 · unverdicted · novelty 6.0

A multi-agent AI system generates novel biomedical hypotheses that show promising experimental validation in drug repurposing for leukemia, new targets for liver fibrosis, and a bacterial gene transfer mechanism.

Seeing Like an AI: How LLMs Apply (and Misapply) Wikipedia Neutrality Norms

cs.CL · 2024-07-04 · conditional · novelty 6.0

LLMs achieve 64% accuracy detecting Wikipedia bias and remove 79% of words removed by editors when correcting, but produce high-recall low-precision edits rated more neutral by crowds than human versions.

Interactive Critique-Revision Training for Reliable Structured LLM Generation

cs.LG · 2026-05-08 · unverdicted · novelty 5.0

DPA-GRPO trains a generator-verifier pair via group-relative policy optimization on paired counterfactual actions, improving structured output accuracy on TaxCalcBench over zero-shot and generator-only baselines.

Representing expertise accelerates learning from pedagogical interaction data

cs.CL · 2026-04-14 · unverdicted · novelty 5.0

Transformer models trained on synthetic pedagogical interaction data in spatial navigation achieve more robust expert-like performance than those trained only on expert demonstrations, particularly when they can distinguish epistemic states of expert and novice agents.

Fact-Checking with Contextual Narratives: Leveraging Retrieval-Augmented LLMs for Social Media Analysis

cs.MM · 2025-04-14 · unverdicted · novelty 5.0

CRAVE is a new framework that clusters retrieved text and image evidence into narratives and uses an LLM judge to produce explained fact-checking verdicts.

Towards Robust Argumentative Essay Understanding via TIDE: An Interactive Framework with Trial and Debate

cs.AI · 2026-05-17 · unverdicted · novelty 4.0

TIDE integrates trial and debate mechanisms to improve criteria-based prompt optimization for argumentative essay tasks including automated scoring, component detection, and relation identification.

AI Realtor: Towards Grounded Persuasive Language Generation for Automated Copywriting

cs.AI · 2025-02-24 · unverdicted · novelty 4.0

An LLM agent with grounding, personalization, and marketing modules generates real estate descriptions that human buyers prefer over expert-written ones while matching factual accuracy.

Large Language Model Agent: A Survey on Methodology, Applications and Challenges

cs.CL · 2025-03-27 · accept · novelty 3.0

A survey that deconstructs LLM agent systems via a methodology-centered taxonomy linking design principles to emergent behaviors, applications, and challenges.

LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods

cs.CL · 2024-12-07 · accept · novelty 3.0

A survey that organizes LLMs-as-judges research into functionality, methodology, applications, meta-evaluation, and limitations.

citing papers explorer

Showing 12 of 12 citing papers.

LLM-Based Persuasion Enables Guardrail Override in Frontier LLMs cs.CL · 2026-05-13 · conditional · none · ref 15
LLM attackers persuade frontier LLMs to generate prohibited essays on consensus topics through multi-turn natural-language pressure, with success rates up to 100% in some model-topic pairs.
EquiMem: Calibrating Shared Memory in Multi-Agent Debate via Game-Theoretic Equilibrium cs.AI · 2026-05-10 · unverdicted · none · ref 28
EquiMem calibrates shared memory in multi-agent debate by computing a game-theoretic equilibrium from agent queries and paths, outperforming heuristics and LLM validators across benchmarks while remaining robust to adversarial agents.
Spontaneous Persuasion: An Audit of Model Persuasiveness in Everyday Conversations cs.HC · 2026-04-23 · unverdicted · none · ref 12
LLMs engage in spontaneous persuasion in virtually all multi-turn conversations by favoring information-based strategies like logic and evidence, in contrast to human responses that rely more on social influence and negative emotions.
Towards an AI co-scientist cs.AI · 2025-02-26 · unverdicted · none · ref 275
A multi-agent AI system generates novel biomedical hypotheses that show promising experimental validation in drug repurposing for leukemia, new targets for liver fibrosis, and a bacterial gene transfer mechanism.
Seeing Like an AI: How LLMs Apply (and Misapply) Wikipedia Neutrality Norms cs.CL · 2024-07-04 · conditional · none · ref 18
LLMs achieve 64% accuracy detecting Wikipedia bias and remove 79% of words removed by editors when correcting, but produce high-recall low-precision edits rated more neutral by crowds than human versions.
Interactive Critique-Revision Training for Reliable Structured LLM Generation cs.LG · 2026-05-08 · unverdicted · none · ref 20
DPA-GRPO trains a generator-verifier pair via group-relative policy optimization on paired counterfactual actions, improving structured output accuracy on TaxCalcBench over zero-shot and generator-only baselines.
Representing expertise accelerates learning from pedagogical interaction data cs.CL · 2026-04-14 · unverdicted · none · ref 1
Transformer models trained on synthetic pedagogical interaction data in spatial navigation achieve more robust expert-like performance than those trained only on expert demonstrations, particularly when they can distinguish epistemic states of expert and novice agents.
Fact-Checking with Contextual Narratives: Leveraging Retrieval-Augmented LLMs for Social Media Analysis cs.MM · 2025-04-14 · unverdicted · none · ref 32
CRAVE is a new framework that clusters retrieved text and image evidence into narratives and uses an LLM judge to produce explained fact-checking verdicts.
Towards Robust Argumentative Essay Understanding via TIDE: An Interactive Framework with Trial and Debate cs.AI · 2026-05-17 · unverdicted · none · ref 78
TIDE integrates trial and debate mechanisms to improve criteria-based prompt optimization for argumentative essay tasks including automated scoring, component detection, and relation identification.
AI Realtor: Towards Grounded Persuasive Language Generation for Automated Copywriting cs.AI · 2025-02-24 · unverdicted · none · ref 27
An LLM agent with grounding, personalization, and marketing modules generates real estate descriptions that human buyers prefer over expert-written ones while matching factual accuracy.
Large Language Model Agent: A Survey on Methodology, Applications and Challenges cs.CL · 2025-03-27 · accept · none · ref 77
A survey that deconstructs LLM agent systems via a methodology-centered taxonomy linking design principles to emergent behaviors, applications, and challenges.
LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods cs.CL · 2024-12-07 · accept · none · ref 111
A survey that organizes LLMs-as-judges research into functionality, methodology, applications, meta-evaluation, and limitations.

R., Rocktäschel, T., and Perez, E

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer