Personalisation within bounds: A risk taxonomy and policy framework for the alignment of large language models with personalised feedback

Scott A · 2023 · arXiv 2303.05453

5 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 2

citation-polarity summary

background 2

representative citing papers

"Label from Somewhere": Reflexive Annotating for Situated AI Alignment

cs.HC · 2026-01-25 · unverdicted · novelty 6.0

Reflexive annotating elicits intersectional and positional metadata from crowd workers to make AI alignment annotations more situated and less assumed-neutral.

Simple synthetic data reduces sycophancy in large language models

cs.CL · 2023-08-07 · unverdicted · novelty 6.0

Scaling and instruction tuning increase sycophancy in LLMs on opinion and fact tasks, but a synthetic data fine-tuning intervention reduces it on held-out prompts.

AI-Augmented Surveys: Leveraging Large Language Models and Surveys for Opinion Prediction

cs.CL · 2023-05-16 · unverdicted · novelty 6.0

LLM embeddings enable strong retrodiction of masked GSS opinions via cross-validation and external validation but only modest performance on entirely unasked opinions.

When to Ask a Question: Understanding Communication Strategies in Generative AI Tools

cs.GT · 2026-05-11 · unverdicted · novelty 5.0

A tradeoff model shows generative AI can reduce bias against diverse preferences by strategically eliciting information instead of always inferring from majority patterns.

Opportunities and Challenges of Large Language Models for Low-Resource Languages in Humanities Research

cs.CL · 2024-11-30 · unverdicted · novelty 2.0

This survey paper identifies opportunities for LLMs in low-resource language humanities research along with challenges in data accessibility, model adaptability, and cultural sensitivity.

citing papers explorer

Showing 5 of 5 citing papers.

"Label from Somewhere": Reflexive Annotating for Situated AI Alignment

cs.HC · 2026-01-25 · unverdicted · none · ref 62

Reflexive annotating elicits intersectional and positional metadata from crowd workers to make AI alignment annotations more situated and less assumed-neutral.

Simple synthetic data reduces sycophancy in large language models

cs.CL · 2023-08-07 · unverdicted · none · ref 17

Scaling and instruction tuning increase sycophancy in LLMs on opinion and fact tasks, but a synthetic data fine-tuning intervention reduces it on held-out prompts.

AI-Augmented Surveys: Leveraging Large Language Models and Surveys for Opinion Prediction

cs.CL · 2023-05-16 · unverdicted · none · ref 60

LLM embeddings enable strong retrodiction of masked GSS opinions via cross-validation and external validation but only modest performance on entirely unasked opinions.

When to Ask a Question: Understanding Communication Strategies in Generative AI Tools

cs.GT · 2026-05-11 · unverdicted · none · ref 27

A tradeoff model shows generative AI can reduce bias against diverse preferences by strategically eliciting information instead of always inferring from majority patterns.

Opportunities and Challenges of Large Language Models for Low-Resource Languages in Humanities Research

cs.CL · 2024-11-30 · unverdicted · none · ref 69

This survey paper identifies opportunities for LLMs in low-resource language humanities research along with challenges in data accessibility, model adaptability, and cultural sensitivity.
