Title resolution pending

Zhang, Honghua, Dang, Meihua, Peng, Nanyun, Broeck, Guy Van den , month = nov, year = · 2023 · arXiv 2304.07438

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

REALISTA: Realistic Latent Adversarial Attacks that Elicit LLM Hallucinations

cs.CL · 2026-05-12 · unverdicted · novelty 8.0

REALISTA optimizes continuous combinations of valid editing directions in latent space to produce realistic adversarial prompts that elicit hallucinations more effectively than prior methods, including on large reasoning models.

Binary Rewards and Reinforcement Learning: Fundamental Challenges

cs.LG · 2026-05-04 · unverdicted · novelty 6.0

Binary rewards make the set of reward-maximizing policies infinite in policy gradients; KL control selects the filtered base model but misspecification drives collapse to concentrated valid outputs instead.

citing papers explorer

Showing 2 of 2 citing papers.

REALISTA: Realistic Latent Adversarial Attacks that Elicit LLM Hallucinations cs.CL · 2026-05-12 · unverdicted · none · ref 102
REALISTA optimizes continuous combinations of valid editing directions in latent space to produce realistic adversarial prompts that elicit hallucinations more effectively than prior methods, including on large reasoning models.
Binary Rewards and Reinforcement Learning: Fundamental Challenges cs.LG · 2026-05-04 · unverdicted · none · ref 21
Binary rewards make the set of reward-maximizing policies infinite in policy gradients; KL control selects the filtered base model but misspecification drives collapse to concentrated valid outputs instead.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer