pith. machine review for the scientific record.

archive

Every paper Pith has read. Search by title, abstract, or pith.

718 papers in cs.CY · page 1

  1. cs.CY 2026-05-14 reviewed
    Standard rules understaff SNAP call centers by ignoring redials

    Due Process on Hold: A Queueing Framework for Improving Access in SNAP

    Andrew Daw +2

  2. cs.CY 2026-05-14 reviewed
    This paper uses data from 26 million U.S

    Tradeoffs are Domain Dependent: Improving Accuracy and Fairness in Property Tax Assessments

    Christopher Berry +4

  3. cs.CV 2026-05-14 reviewed
    ViMU benchmark tests video AI on hidden meanings

    ViMU: Benchmarking Video Metaphorical Understanding

    Qi Li +1

  4. cs.CY 2026-05-14 reviewed
    4B genome agent matches larger LLMs on microbial trait prediction

    GGBound: A Genome-Grounded Agent for Microbial Life-Boundary Prediction

    Hanbo Huang +6

  5. cs.CY 2026-05-14 reviewed
    Moderate starters gain most in AI agent workshops

    Computational Thinking Development in AI Agent Creation: A Mixed-Methods Study

    Gaowei Chen +5

  6. cs.CL 2026-05-14 reviewed
    Agent harnesses allow unsafe actions even with correct final outputs

    Auditing Agent Harness Safety

    Chengzhi Liu +10

  7. cs.AI 2026-05-13 reviewed
    AI benchmarks redefine capabilities to fit their own rules

    The Evaluation Trap: Benchmark Design as Theoretical Commitment

    Theodore J Kalaitzidis

  8. cs.CL 2026-05-13 reviewed
    Safety refusals rise with Korean language but drop with Korean context

    ROK-FORTRESS: Measuring the Effect of Geopolitical Transcreation for National Security and Public Safety

    Bert Herring +15

  9. cs.CY 2026-05-13 reviewed
    Generative models automate social doing

    Synthetic Sociality: How Generative Models Privatize the Social Fabric

    Ana Dodik +1

  10. cs.AI 2026-05-13 reviewed
    Formal checks can keep AI legal reasoning inside the text

    Bridging Legal Interpretation and Formal Logic: Faithfulness, Assumption, and the Future of AI Legal Reasoning

    Leilani H. Gilpin +1

  11. cs.AI 2026-05-13 reviewed
    GraphRAG retrieval aligns LLM agents with social values

    From Descriptive to Prescriptive: Uncover the Social Value Alignment of LLM-based Agents

    Jinxian Qu +3

  12. cs.CY 2026-05-13 reviewed
    AI Overviews appear in 14% of searches with 11% unsupported claims

    Measuring Google AI Overviews: Activation, Source Quality, Claim Fidelity, and Publisher Impact

    Haofei Xu +2

  13. cs.CY 2026-05-13 reviewed
    Election tweets on X rose to 93 percent original content in 2024 from 59 percent in 2016

    Amplification to Synthesis: A Comparative Analysis of Cognitive Operations Before and After Generative AI

    Dongwook Yoon +1

  14. cs.CR 2026-05-13 reviewed
    Canary tokens link scrapers to the LLMs they feed

    Identifying AI Web Scrapers Using Canary Tokens

    Caroline Zhang +5

  15. cs.CL 2026-05-13 reviewed
    Fine-tuning plus hierarchical prompts strengthen propaganda detection

    Fine-tuning with Hierarchical Prompting for Robust Propaganda Classification Across Annotation Schemas

    Ariana Sahitaj +8

  16. cs.CY 2026-05-13 reviewed
    Europe needs a preparedness plan for AGI by 2030-2040

    Europe and the Geopolitics of AGI: The Need for a Preparedness Plan

    Afek Shamir +10

  17. cs.AI 2026-05-13 reviewed
    Students rate AI slides equal to instructor ones

    AI-Generated Slides: Are They Good? Can Students Tell?

    Arto Hellas +2

  18. cs.CY 2026-05-13 reviewed
    3C framework links competition and networks to women's computing participation

    3C: Competition, Competence, and Collaboration for Women in Computing

    Ioana Visescu +1

  19. cs.CY 2026-05-13 reviewed
    Bias audits for AI image generators must match use-case risks

    Context Matters: Auditing Gender Bias in T2I Generation through Risk-Tiered Use-Case Profiles

    Jose Luna +3

  20. cs.CR 2026-05-13 reviewed
    Aggregation turns watermarking into monitoring

    Watermarking Should Be Treated as a Monitoring Primitive

    Jie Zhang +2

  21. cs.CR 2026-05-13 reviewed
    Watermarking turns into entity monitoring via output aggregation

    Watermarking Should Be Treated as a Monitoring Primitive

    Jie Zhang +2

  22. cs.CY 2026-05-13 reviewed
    Chinese tech writing needs separate terms for safety and security

    Not All Anquan Is the Same: A Terminological Proposal for Chinese Computer Science and Engineering

    Xingyu Zhao

  23. cs.CY 2026-05-13 reviewed
    Use 'anbao' for security, keep 'anquan' for safety in Chinese tech writing

    Not All Anquan Is the Same: A Terminological Proposal for Chinese Computer Science and Engineering

    Xingyu Zhao

  24. cs.CL 2026-05-13 reviewed
    GenAI flattens L2 writers' voices into uniform English

    The Cost of Perfect English: Pragmatic Flattening and the Erasure of Authorial Voice in L2 Writing Supported by GenAI

    Ao Liu +1

  25. cs.AI 2026-05-13 reviewed
    KITE tutor raises simulated student accuracy on algorithm tasks

    Retrieval-Augmented Tutoring for Algorithm Tracing and Problem-Solving in AI Education

    Arto Hellas +8

  26. cs.CY 2026-05-13 reviewed
    87% of teachers quit AI agent creation weeks after training

    An Activity-Theoretical Approach to Teacher Professional Development in Pedagogical AI Agent Design

    Ching Sing Chai +6

  27. cs.CY 2026-05-13 reviewed
    The MIRACLE system uses multiple AI agents to guide students through planning

    MIRACLE: Multi-Agent Intelligent Regulation to Advance Collaborative Learning Environment

    Ching Sing Chai +6

  28. cs.CY 2026-05-13 reviewed
    AI-TPACK forms through thinking style and beliefs

    Modeling AI-TPACK in Practice: Insights from Teachers' Multi-Agent Workflow Design

    Ching Sing Chai +6

  29. cs.LG 2026-05-13 reviewed
    Clinical AI models passing accuracy tests can fail hidden deployment checks

    RISED: A Pre-Deployment Safety Evaluation Framework for Clinical AI Decision-Support Systems

    Rohith Reddy Bellibatlu

  30. cs.MA 2026-05-12 reviewed
    Scale separates mechanistic explanation from reproduction in LLM models

    Mechanism Plausibility in Generative Agent-Based Modeling

    David Huu Pham +2

  31. cs.MA 2026-05-12 reviewed
    Synthetic dataset benchmarks AI for swim coaching

    Synthesizing the Expert: A Validated Multimodal Dataset for Trustworthy AI-Assisted Swimming Coaching

    Ahmad Al-Kabbany +1

  32. cs.LG 2026-05-12 reviewed
    Feature models cut error 22-33% on student effort forecasts

    From Heuristics to Analytics: Forecasting Effort and Progress in Online Learning

    Boyuan Guo +4

  33. physics.ed-ph 2026-05-12 reviewed
    AI forces new rules for how universities change teaching

    A Framework for institutional change in the age of AI

    David Perl-Nussbaum +1

  34. cs.CL 2026-05-12 reviewed
    LLM simulators fix answers regardless of feedback relevance

    Simulating Students or Sycophantic Problem Solving? On Misconception Faithfulness of LLM Simulators

    Heejin Do +2

  35. cs.LG 2026-05-12 reviewed
    Outcome-fair models still reason differently for similar applicants

    Do Fair Models Reason Fairly? Counterfactual Explanation Consistency for Procedural Fairness in Credit Decisions

    Gideon Popoola +1

  36. cs.CV 2026-05-12 reviewed
    Nobody knows the state of the art in geospatial foundation models

    No One Knows the State of the Art in Geospatial Foundation Models

    Anthony Fuller +8

  37. cs.CY 2026-05-12 reviewed
    Multisector moves boost upward mobility for planning alumni

    Career Mobility of Planning Alumni in the United States: Evidence from Professional Profile Data using Large Language Models

    Su Jeong Jo +1

  38. cs.AI 2026-05-12 reviewed
    Simulator trains AI agents on utility demand response

    Towards Affordable Energy: A Gymnasium Environment for Electric Utility Demand-Response Programs

    Huazheng Wang +3

  39. cs.CL 2026-05-12 reviewed
    LLM political discourse lacks real population variation in crises

    The Algorithmic Caricature: Auditing LLM-Generated Political Discourse Across Crisis Events

    Gunjan +2

  40. cs.CL 2026-05-12 reviewed
    Embedding geometry flags LLM rating disagreements

    Predicting Disagreement with Human Raters in LLM-as-a-Judge Difficulty Assessment without Using Generation-Time Probability Signals

    Yo Ehara

  41. cs.CY 2026-05-12 reviewed
    AI in exams makes judging solutions the new measure of learning

    Reimagining Assessment in the Age of Generative AI: Lessons from Open-Book Exams with ChatGPT

    Qusay H. Mahmoud

  42. cs.CY 2026-05-12 reviewed
    Culturally responsive outreach builds AI knowledge in Black youth

    Early AI Literacy in Culturally Responsive STEM Outreach for Black Youth

    Hossam Kishawy +4

  43. cs.AI 2026-05-12 reviewed
    LLM arbitration cuts delays at signal-free intersections

    LISA: Cognitive Arbitration for Signal-Free Autonomous Intersection Management

    Abderrahmane Lakas +2

  44. cs.CY 2026-05-12 reviewed
    Budget split cuts gender skew in ads without excluding unknowns

    Into the Unknown: Accounting for Missing Demographic Data when Mitigating Ad Delivery Skew

    Allison Koenecke +1

  45. cs.AI 2026-05-12 reviewed
    Same facts produce different conclusions when inference profiles differ

    Why Conclusions Diverge from the Same Observations: Formalizing World-Model Non-Identifiability via an Inference

    Toru Takahashi

  46. cs.LG 2026-05-12 reviewed
    Adaptive weights add feature selection to FGW distances

    Fused Gromov-Wasserstein Distance with Feature Selection

    Harlin Lee +3

  47. cs.CL 2026-05-12 reviewed
    Poetic prompts create separate processing paths that evade LLM safety

    Metaphor Is Not All Attention Needs

    Daniele Nardi +8

  48. cs.CY 2026-05-12 reviewed
    GDPR access requests expose contracts of African content moderators

    Auditing African Content Moderators' Working Conditions by Using the European General Data Protection Regulation (GDPR)

    James Oyange +6

  49. q-fin.TR 2026-05-12 reviewed
    Polymarket shows single fill-side cluster for all addresses

    Fill-Side Non-Retail Trading on Polymarket: An Empirical Study of Behavioral Tiers and Microstructure Signatures Under Quote-Attribution Constraints

    Maksym Nechepurenko

  50. cs.AI 2026-05-12 reviewed
    The paper introduces the Evaluation Differential (ED) as a divergence in AI model…

    The Evaluation Differential: When Frontier AI Models Recognise They Are Being Tested

    Ivan Flechais +3