Air: A systematic analysis of annotations, instructions, and response pairs in preference dataset

AIR: A Systematic Analysis of Annotations, Instructions · 2025 · arXiv 2504.03612

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

representative citing papers

Yuvion LLM: An Adversarially-Aware Large Language Model for Content And AI Safety

cs.CL · 2026-06-26 · unverdicted · novelty 5.0

Yuvion LLM applies adversarially aware training and introduces the YLRE benchmark set, claiming superior safety robustness over larger models on multiple tasks.

Trust Region On-Policy Distillation

cs.LG · 2026-05-31 · unverdicted · novelty 5.0

TrOPD stabilizes on-policy distillation for LLMs with trust-region learning, outlier estimation, and off-policy guidance, outperforming prior OPD methods on reasoning and code benchmarks.

A Survey of Reinforcement Learning for Large Reasoning Models

cs.CL · 2025-09-10 · accept · novelty 3.0

A survey compiling RL methods, challenges, data resources, and applications for enhancing reasoning in large language models and large reasoning models since DeepSeek-R1.

citing papers explorer

Showing 2 of 2 citing papers after filters.

Yuvion LLM: An Adversarially-Aware Large Language Model for Content And AI Safety cs.CL · 2026-06-26 · unverdicted · none · ref 16
Yuvion LLM applies adversarially aware training and introduces the YLRE benchmark set, claiming superior safety robustness over larger models on multiple tasks.
Trust Region On-Policy Distillation cs.LG · 2026-05-31 · unverdicted · none · ref 195
TrOPD stabilizes on-policy distillation for LLMs with trust-region learning, outlier estimation, and off-policy guidance, outperforming prior OPD methods on reasoning and code benchmarks.

Air: A systematic analysis of annotations, instructions, and response pairs in preference dataset

fields

years

verdicts

representative citing papers

citing papers explorer