pith. sign in

arxiv: 2606.12430 · v1 · pith:VTVKQBW6new · submitted 2026-05-15 · 💻 cs.CY · cs.AI

Will AI Agents Free Us From Meaningless Work? A Human-Centered Analysis

Pith reviewed 2026-06-30 19:20 UTC · model grok-4.3

classification 💻 cs.CY cs.AI
keywords AI delegationbullshit jobstask-level analysisworker preferencesmeaningful workautomation feasibility
0
0 comments X

The pith

Workers want AI agents to take over tasks they rate as bullshit, which they also see as needing little oversight.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper asks which parts of work people would hand to AI agents by focusing on individual tasks rather than whole jobs. Drawing on Graeber's concept of bullshit jobs, it collects ratings from 202 workers across 171 tasks and validates a five-item scale that measures how meaningless each task feels. The scale turns out to be a strong predictor of which tasks workers want to delegate to AI, and those same tasks are judged to require less human supervision. This suggests that perceived bullshitness lines up with both worker preferences and practical feasibility for automation.

Core claim

Tasks perceived as bullshit are natural candidates for AI delegation, aligning worker preferences with perceived feasibility.

What carries the argument

Five-item scale of perceived bullshitness at the task level, which predicts desire for AI delegation and low oversight needs.

If this is right

  • Perceived bullshitness can serve as a practical signal for choosing which tasks to automate first.
  • Task-level variation in meaning within the same job becomes visible and actionable for AI design.
  • Delegation decisions can be guided by worker input rather than top-down occupation lists.
  • Tasks rated high on bullshitness are expected to need less ongoing human monitoring once automated.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • AI tools aimed at high-bullshit tasks may see faster acceptance because they match what workers already want removed.
  • Job redesign efforts could use the scale to protect high-meaning tasks while automating the rest.
  • Long-term studies could test whether removing these tasks raises overall job satisfaction or shifts what counts as meaningful work.

Load-bearing premise

The five-item scale validly measures Graeber's bullshit-jobs idea at the level of single tasks and that workers' self-reported desire to delegate reflects real feasibility without response bias.

What would settle it

A workplace study that tracks actual AI task delegation and finds no correlation between bullshitness ratings and delegation rates.

Figures

Figures reproduced from arXiv: 2606.12430 by Daniele Quercia, Davide Ghia, Jaspreet Ranjit, Tania Cerquitelli.

Figure 1
Figure 1. Figure 1: Perceived bullshitness in relation to (a) desire for automation and (b) required human agency. Tasks perceived as [PITH_FULL_IMAGE:figures/full_fig_p004_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Preferred AI traits for tasks perceived as bullshit. [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗
read the original abstract

Some claim that AI agents will free workers from the boring parts of their jobs, yet little is known about how workers themselves identify which tasks should be automated. Prior research focuses on occupations, overlooking that workers experience varying levels of meaning across tasks within the same role. We address this gap with a task-level analysis grounded in Graeber's theory of bullshit jobs. Using ratings from 202 workers on 171 workplace tasks, we (1) validate a five-item scale of perceived bullshitness, (2) show that perceived bullshitness strongly predicts desire for AI delegation, and (3) find that such tasks are also seen as requiring less human oversight. Together, these findings suggest that tasks perceived as bullshit are natural candidates for AI delegation, aligning worker preferences with perceived feasibility.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 1 minor

Summary. The manuscript reports results from 202 workers rating 171 workplace tasks. It validates a five-item scale of perceived bullshitness grounded in Graeber's theory, shows that bullshitness strongly predicts desire for AI delegation, and finds that such tasks are perceived to require less human oversight. The central claim is that tasks perceived as bullshit are natural candidates for AI delegation, aligning worker preferences with perceived feasibility.

Significance. If the scale validly operationalizes the construct at the task level and self-reported delegation preferences reflect genuine feasibility without bias, the work supplies task-level empirical evidence extending Graeber's framework to AI contexts and identifies a potential alignment between perceived meaninglessness and automation suitability. The direct collection of worker ratings on concrete tasks is a clear methodological asset.

major comments (2)
  1. [Methods] Methods (scale validation subsection): the five-item bullshitness scale is load-bearing for the prediction and oversight claims, yet the abstract and reported results give no indication of convergent validity against Graeber-derived items, discriminant validity from mere task unpleasantness, or controls for social-desirability bias in delegation preferences. Without these, the observed correlations do not establish that the scale specifically captures societal pointlessness rather than personal dislike.
  2. [Abstract/Results] Abstract and Results: the abstract states scale validation, a strong predictor relationship, and an oversight finding but supplies no error bars, per-task sample sizes, or data exclusion rules. This prevents assessment of whether post-hoc analytic choices affect the central correlations between bullshitness and delegation desire.
minor comments (1)
  1. [Abstract] Abstract: the opening sentence could explicitly note the sample (202 workers, 171 tasks) to give immediate context for the scale-validation claim.

Simulated Author's Rebuttal

2 responses · 0 unresolved

Thank you for the referee's constructive comments on our manuscript. We address each major point below with the strongest honest defense possible, indicating where revisions are warranted.

read point-by-point responses
  1. Referee: [Methods] Methods (scale validation subsection): the five-item bullshitness scale is load-bearing for the prediction and oversight claims, yet the abstract and reported results give no indication of convergent validity against Graeber-derived items, discriminant validity from mere task unpleasantness, or controls for social-desirability bias in delegation preferences. Without these, the observed correlations do not establish that the scale specifically captures societal pointlessness rather than personal dislike.

    Authors: The five-item scale was constructed with items directly derived from Graeber's definition of bullshit jobs to ensure content validity by design, and the manuscript reports reliability and factor-analytic validation of its internal structure. We acknowledge that separate convergent validity tests against additional Graeber-derived items, explicit discriminant validity checks against unpleasantness measures, and social-desirability controls were not included in the survey design or reported results. The survey was anonymous, which mitigates some response bias, but we agree this leaves open the possibility that ratings partly reflect personal dislike. We will revise the methods and limitations sections to clarify the theoretical grounding and explicitly discuss these gaps. revision: partial

  2. Referee: [Abstract/Results] Abstract and Results: the abstract states scale validation, a strong predictor relationship, and an oversight finding but supplies no error bars, per-task sample sizes, or data exclusion rules. This prevents assessment of whether post-hoc analytic choices affect the central correlations between bullshitness and delegation desire.

    Authors: The abstract is space-constrained, but the full results section reports the overall sample (202 workers, 171 tasks) and the key correlations. We will revise the abstract to include the primary correlation coefficients with 95% confidence intervals. In the results, we will add the mean and range of per-task ratings (each task was rated by a subset of participants), along with explicit data exclusion rules such as removal of incomplete responses. These changes will improve transparency regarding analytic choices and robustness. revision: yes

Circularity Check

0 steps flagged

Empirical survey study with no mathematical derivation or self-referential loops

full rationale

The paper is a human-subjects survey collecting ratings on 171 tasks from 202 workers. It validates a five-item bullshitness scale via standard psychometric methods on the collected data, then reports correlations between bullshitness ratings and delegation desire/oversight needs. No equations, fitted parameters, or predictions are defined in terms of themselves; the central claims rest directly on the external survey responses rather than any self-citation chain or definitional reduction. This matches the default case of a self-contained empirical study.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The study rests on the assumption that Graeber's bullshit-jobs framework can be operationalized into a five-item scale that workers can reliably apply to individual tasks; no free parameters or invented entities are described in the abstract.

axioms (1)
  • domain assumption Graeber's theory of bullshit jobs applies directly to granular workplace tasks and can be measured with a short self-report scale.
    The paper grounds its analysis in this theory and validates a five-item scale derived from it.

pith-pipeline@v0.9.1-grok · 5671 in / 1119 out tokens · 26640 ms · 2026-06-30T19:20:57.766240+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

17 extracted references · 10 canonical work pages

  1. [1]

    2022.New Frontiers: The Origins and Content of New Work, 1940–2018

    David Autor, Caroline Chin, Anna M Salomons, and Bryan Seegmiller. 2022.New Frontiers: The Origins and Content of New Work, 1940–2018. Working Paper 30389. National Bureau of Economic Research. doi:10.3386/w30389

  2. [2]

    Catherine Bailey, Ruth Yeoman, Adrian Madden, Marc Thompson, and Gary Kerridge. 2019. A Review of the Empirical Literature on Meaningful Work: Progress and Research Agenda.Human Resource Development Review18, 1 (2019), 83–113. doi:10.1177/1534484318804653

  3. [3]

    Butler, S

    J. Butler, S. Jaffe, R. Janßen, N. Baym, B. Hecht, J. Hofman, S. Rintel, B. Sarrafzadeh, A. Sellen, M. Vorvoreanu, and J. Teevan. 2025.Microsoft New Future of Work Report 2025. Technical Report MSR-TR-2025-58. Microsoft Research. https: //aka.ms/nfw2025

  4. [4]

    M. Dong, J. R. Conway, J.-F. Bonnefon, A. Shariff, and I. Rahwan. 2024. Fears about Artificial Intelligence across 20 Countries and Six Domains of Application. American Psychologist(2024)

  5. [5]

    Tyna Eloundou, Sam Manning, Pamela Mishkin, and Daniel Rock. 2023. GPTs are GPTs: An Early Look at the Labor Market Impact Potential of Large Language Models. arXiv:2303.10130 [econ.GN] https://arxiv.org/abs/2303.10130

  6. [6]

    Erik Engberg, Michael Koch, Magnus Lodefalk, and Sarah Schroeder. 2025. Arti- ficial intelligence, tasks, skills, and wages: Worker-level evidence from Germany. Research Policy54, 8 (2025), 105285. doi:10.1016/j.respol.2025.105285

  7. [7]

    David Graeber. 2013. On the Phenomenon of Bullshit Jobs: A Work Rant.Strike Magazine3, 1 (2013), 2

  8. [8]

    2018.Bullshit Jobs: A Theory

    David Graeber. 2018.Bullshit Jobs: A Theory. Simon & Schuster, New York, NY, USA

  9. [9]

    Abraham H. Maslow. 1943. A Theory of Human Motivation.Psychological Review 50, 4 (1943), 370–396. doi:10.1037/h0054346

  10. [10]

    Mariano Méndez-Suárez, Maja Ćukušić, and Ivana Ninčević-Pašalić. 2026. AI FoMO (fear of missing out) in the workplace.Technology in Society84 (2026), 103052. doi:10.1016/j.techsoc.2025.103052

  11. [11]

    National Center for O*NET Development. 2025. O*NET OnLine. https://www. onetonline.org/ Will AI Agents Free Us From Meaningless Work? A Human-Centered Analysis Conference’17, July 2017, Washington, DC, USA

  12. [12]

    Jaspreet Ranjit, Ke Zhou, Swabha Swayamdipta, and Daniele Quercia. 2026. Are We Automating the Joy Out of Work? Designing AI to Augment Work, Not Meaning. InProceedings of the 2026 CHI Conference on Human Factors in Computing Systems. 1–46

  13. [13]

    Ali Akbar Septiandri, Marios Constantinides, and Daniele Quercia. 2024. The potential impact of AI innovations on US occupations.PNAS Nexus3, 9 (09 2024), pgae320. doi:10.1093/pnasnexus/pgae320

  14. [14]

    Y. Shao, H. Zope, Y. Jiang, J. Pei, D. Nguyen, Erik Brynjolfsson, and D. Yang

  15. [15]

    Workforce.arXiv(2025)

    Future of Work with AI Agents: Auditing Automation and Augmentation Potential across the U.S. Workforce.arXiv(2025). arXiv:2506.06576 https: //arxiv.org/abs/2506.06576

  16. [16]

    The training process of many deep networks explores the same low-dimensional manifold

    Tyler J. VanderWeele. 2017. On the Promotion of Human Flourishing.Proceedings of the National Academy of Sciences114, 31 (2017), 8148–8156. doi:10.1073/pnas. 1702996114

  17. [17]

    Monika Westphal, Patrick Hemmer, Michael Vössing, Max Schemmer, Sebastian Vetter, and Gerhard Satzger. 2025. Towards Understanding AI Delegation: The Role of Self-Efficacy and Visual Processing Ability.ACM Trans. Interact. Intell. Syst.15, 1, Article 5 (Feb. 2025), 24 pages. doi:10.1145/3696423