Introduces Defensibility Index, Ambiguity Index, and Probabilistic Defensibility Signal to evaluate AI moderation decisions by logical derivability from explicit rules rather than agreement with historical labels, with validation on 193k+ Reddit cases showing 33-46.6 pp metric gaps and a Governance
Title resolution pending
5 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
years
2026 5roles
background 1polarities
background 1representative citing papers
Large-scale observational analysis of Reddit moderation shows bot moderators yield higher compliance and lower self-censorship than human or team moderators, with linguistic strategies' effectiveness depending on violation severity.
Admins in India used Meta AI to help create WhatsApp group rules, appreciating reduced workload but remaining cautious about privacy, relational trust, and contextual tone.
Qualitative analysis of Reddit discussions reveals four tensions users face with AI-generated fitness feedback, showing resistance to AI that limits personal interpretations of lived experiences.
Reddit data analysis shows reply-based mobile scams growing nearly twice as fast as click-based ones while evading commercial and open-source detectors.
citing papers explorer
-
Escaping the Agreement Trap: Defensibility Signals for Evaluating Rule-Governed AI
Introduces Defensibility Index, Ambiguity Index, and Probabilistic Defensibility Signal to evaluate AI moderation decisions by logical derivability from explicit rules rather than agreement with historical labels, with validation on 193k+ Reddit cases showing 33-46.6 pp metric gaps and a Governance
-
Who, Why, and How: Disentangling the Effects of Moderation Source, Context, and Language on Post-Removal Behavior
Large-scale observational analysis of Reddit moderation shows bot moderators yield higher compliance and lower self-censorship than human or team moderators, with linguistic strategies' effectiveness depending on violation severity.
-
Creating Group Rules with AI: Human-AI Collaboration in WhatsApp Moderation
Admins in India used Meta AI to help create WhatsApp group rules, appreciating reduced workload but remaining cautious about privacy, relational trust, and contextual tone.
-
Who Gets to Interpret the Workout? User Tensions with AI-Generated Fitness Feedback
Qualitative analysis of Reddit discussions reveals four tensions users face with AI-generated fitness feedback, showing resistance to AI that limits personal interpretations of lived experiences.
-
Read This Paper to Get $50 Million:* An Analysis of Mobile Messaging Scams Using Reddit Data
Reddit data analysis shows reply-based mobile scams growing nearly twice as fast as click-based ones while evading commercial and open-source detectors.