LLMs show practical but imperfect ability to replicate human annotations of language ideologies in Luxembourgish comments, with some gains from machine translation to high-resource languages.
Language Ideologies in a Multilingual Society: An LLM-based Analysis of Luxembourgish News Comments
2 Pith papers cite this work. Polarity classification is still indexing.
abstract
Detecting language ideologies is a valuable yet complex task for understanding how identities are constructed through discourse. In Luxembourg's multicultural and multilingual society, language ideologies reflect more than simple preferences: they carry deep cultural and social meanings, shaping identities and social belonging. Following recent developments in applying Natural Language Processing tools to linguistics and social science, this paper explores the potential of large language models to assist in the detection of language ideologies. We manually annotate a corpus of user comments in Luxembourgish with predefined ideological categories and then evaluate the performance of large language models under varying prompt conditions to assess their ability to replicate these human annotations. Since Luxembourgish is a small language and poorly represented in the LLMs' training data, we also investigate whether machine-translating the data to high-resource languages increases performance on the ideology detection task. Our findings suggest that, while LLMs are not yet fully optimized for a multi-class ideological annotation task, they are practical tools to identify language ideological content.
years
2026 2verdicts
UNVERDICTED 2representative citing papers
Ctx2Skill uses a self-evolving multi-agent loop with Challenger, Reasoner, Judge, and Cross-time Replay to discover context-specific skills, improving task-solving rates on CL-bench benchmarks across models.
citing papers explorer
-
Language Ideologies in a Multilingual Society: An LLM-based Analysis of Luxembourgish News Comments
LLMs show practical but imperfect ability to replicate human annotations of language ideologies in Luxembourgish comments, with some gains from machine translation to high-resource languages.
-
From Context to Skills: Can Language Models Learn from Context Skillfully?
Ctx2Skill uses a self-evolving multi-agent loop with Challenger, Reasoner, Judge, and Cross-time Replay to discover context-specific skills, improving task-solving rates on CL-bench benchmarks across models.