pith. sign in

arxiv: 2501.10551 · v4 · submitted 2025-01-17 · 💻 cs.HC

An Empirical Study to Understand How Students Use ChatGPT for Writing Essays

Pith reviewed 2026-05-23 05:45 UTC · model grok-4.3

classification 💻 cs.HC
keywords ChatGPT usageessay writingAI usage patternsstudent self-efficacyperceived ownershipgender differencesAI in educationhuman-AI interaction
0
0 comments X

The pith

Gender, race and self-efficacy predict distinct ChatGPT query patterns in student essay writing, which relate to enjoyment and ownership.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper reports results from an online study of 70 students who wrote essays on a custom platform that logged every query sent to ChatGPT. By manually sorting those queries into usage types, the authors show that gender, race, and students' own ratings of self-efficacy forecast which patterns appear. Students who followed different patterns then reported different levels of enjoyment during the task and different degrees of ownership over the finished essay. The work matters because any claim about whether AI tools help or harm learning must first account for these varying styles of use rather than assume uniform behavior.

Core claim

Through an online study with 70 participants, the authors categorized students' queries to ChatGPT during an essay task. They determined that gender, race, and perceived self-efficacy predict different usage patterns. These patterns in turn relate to different levels of enjoyment and perceived ownership over the resulting essay. The findings are offered as input to discussions on how writing classrooms should incorporate generative AI tools.

What carries the argument

Manual categorization of each student query to ChatGPT into usage types, followed by statistical analysis linking those types to demographic variables, self-efficacy ratings, enjoyment, and perceived ownership.

Load-bearing premise

The manual categorization of student queries into usage types is reliable and reproducible, and the self-reported metrics of enjoyment, ownership, and self-efficacy accurately capture the underlying constructs without substantial social-desirability bias.

What would settle it

An independent re-coding of the same query logs that produces markedly different category assignments for a large share of queries, or a new study in which self-reported self-efficacy shows no statistical association with observed query behavior.

Figures

Figures reproduced from arXiv: 2501.10551 by Alice Jang, Andrew Jelson, Daniel Dunlap, Daniel Manesh, Sang Won Lee, Young-Ho Kim.

Figure 1
Figure 1. Figure 1: The overview of Research Questions We analyzed the interaction traces to understand how the resulting essay was composed (e.g., the percentage of words pasted from ChatGPT) and how these essay characteristics varied by student group. The result of this question will characterize how each group used ChatGPT queries to affect the essay quantitatively. Lastly, we investigated how students’ use of ChatGPT rela… view at source ↗
Figure 2
Figure 2. Figure 2: The editor view of the website 3.1 Instrument Development: Writing Platform + ChatGPT Development To understand how students use ChatGPT, we developed a platform that tracked their queries and the corresponding responses. Since ChatGPT is an independent app, we built a system that integrates an in-house ChatGPT — referred to simply as ChatGPT from this point — within the writing platform, using the default… view at source ↗
Figure 3
Figure 3. Figure 3: The view of our ChatGPT page To track how users interact with ChatGPT, we implemented a custom version of ChatGPT using the OpenAI API (model GPT-3.5-turbo), as shown in [PITH_FULL_IMAGE:figures/full_fig_p008_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Data Flow Diagram between Editor, Participant, and ChatGPT [PITH_FULL_IMAGE:figures/full_fig_p011_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: K-means clustering SSE and Cluster information [PITH_FULL_IMAGE:figures/full_fig_p012_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: Query distribution per group 4.1.6 Interaction Patterns and Writing Stage Transitions. We examined how many times each participant asked questions to ChatGPT to show the distribution of query counts, as shown in Figure 6a. Overall, nearly half of the participants (39 out of 77) used ChatGPT less than or equal to twice; six participants, classified as Group N, never used it at all. Meanwhile, some participa… view at source ↗
Figure 7
Figure 7. Figure 7: Transition matrix and state diagram for ChatGPT inquiries [PITH_FULL_IMAGE:figures/full_fig_p020_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: Box Plots for significant Kruskal-Wallis tests with Post Hoc analysis (* indicates [PITH_FULL_IMAGE:figures/full_fig_p022_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: Box Plots for Perceived Ownership and CSI - Enjoyment [PITH_FULL_IMAGE:figures/full_fig_p023_9.png] view at source ↗
Figure 10
Figure 10. Figure 10: Box Plots for CSI Expressiveness and Immersion [PITH_FULL_IMAGE:figures/full_fig_p023_10.png] view at source ↗
read the original abstract

As large language models (LLMs) advance and become widespread, students increasingly turn to systems like ChatGPT for assistance with writing tasks. Educators are concerned with students' usage of ChatGPT beyond cheating; using ChatGPT may reduce their critical engagement with writing, hindering students' learning processes. The negative or positive impact of using LLM-powered tools for writing will depend on how students use them; however, how students use ChatGPT remains largely unknown, resulting in a limited understanding of its impact on learning. To better understand how students use these tools, we conducted an online study $(n=70)$ where students were given an essay-writing task using a custom platform we developed to capture the queries they made to ChatGPT. To characterize their ChatGPT usage, we categorized each of the queries students made to ChatGPT. We then analyzed the relationship between ChatGPT usage and a variety of other metrics, including students' self-perception, attitudes towards AI, and the resulting essay itself. We found that factors such as gender, race, and perceived self-efficacy can help predict different AI usage patterns. Additionally, we found that different usage patterns were associated with varying levels of enjoyment and perceived ownership over the essay. The results of this study contribute to discussions about how writing education should incorporate generative AI-powered tools in the classroom.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The paper reports results from an online study (n=70) in which participants completed an essay-writing task on a custom platform that logged all queries sent to ChatGPT. Queries were manually categorized into usage patterns; the authors then examined statistical associations between these patterns and demographic variables (gender, race), perceived self-efficacy, attitudes toward AI, enjoyment, perceived ownership of the final essay, and essay quality. The central claims are that demographic and self-efficacy factors predict distinct usage patterns and that usage patterns are associated with differences in enjoyment and ownership.

Significance. If the query categorization proves reliable and the reported associations hold after appropriate controls, the work supplies one of the first behavioral datasets on how students actually interact with LLMs during writing, moving beyond self-report surveys. The combination of logged queries with self-report measures is a methodological strength that could support more nuanced classroom guidelines. The study also surfaces potential equity issues (gender/race differences) that warrant follow-up.

major comments (2)
  1. [Methods] Methods section (query categorization procedure): No information is supplied on the number of coders, inter-rater reliability statistic, codebook construction (inductive vs. deductive), or disagreement-resolution protocol. Because the usage-pattern variable is the independent variable for all subsequent predictive and correlational claims, the absence of these details renders the reported associations between patterns and demographics/enjoyment/ownership difficult to interpret or replicate.
  2. [Results] Results section (statistical tests): The manuscript reports associations involving multiple demographic and self-report predictors but provides no mention of multiple-comparison correction, effect-size reporting, or a priori exclusion criteria. With n=70 and several tested relationships, uncorrected significance tests risk inflated Type I error and weaken the claim that gender, race, and self-efficacy reliably predict usage patterns.
minor comments (2)
  1. [Abstract] Abstract: The abstract states that queries were 'categorized' but does not name the resulting categories or their derivation, leaving readers without an immediate sense of the granularity of the usage typology.
  2. [Discussion] Discussion: Self-report scales for enjoyment, ownership, and self-efficacy are presented without reference to prior validation studies or discussion of social-desirability bias, which is relevant given the educational context.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive feedback, which highlights important areas for improving the transparency of our methods and statistical reporting. We will revise the manuscript to address these concerns.

read point-by-point responses
  1. Referee: [Methods] Methods section (query categorization procedure): No information is supplied on the number of coders, inter-rater reliability statistic, codebook construction (inductive vs. deductive), or disagreement-resolution protocol. Because the usage-pattern variable is the independent variable for all subsequent predictive and correlational claims, the absence of these details renders the reported associations between patterns and demographics/enjoyment/ownership difficult to interpret or replicate.

    Authors: We agree that these details are critical for the interpretability and replicability of our findings. In the study, two authors independently coded the queries using an inductive approach, developing the codebook iteratively based on the data. Disagreements were resolved through discussion until consensus was reached. We computed Cohen's kappa as a measure of inter-rater reliability. We will update the Methods section to fully describe this process, including the number of coders, the inductive nature of the codebook, the disagreement resolution protocol, and the reliability statistic. revision: yes

  2. Referee: [Results] Results section (statistical tests): The manuscript reports associations involving multiple demographic and self-report predictors but provides no mention of multiple-comparison correction, effect-size reporting, or a priori exclusion criteria. With n=70 and several tested relationships, uncorrected significance tests risk inflated Type I error and weaken the claim that gender, race, and self-efficacy reliably predict usage patterns.

    Authors: We appreciate this point regarding statistical rigor. Our study was exploratory in nature, as it is one of the first to log and categorize actual ChatGPT queries in an essay-writing task. We did not apply multiple-comparison corrections because the analyses were not confirmatory, but we will explicitly state this in the revised Results section. We will also report effect sizes for all key associations to provide a fuller picture of the practical significance. For exclusion criteria, we had no a priori exclusions beyond incomplete data or technical logging failures; we will add a clear statement on participant exclusion in the Methods or Results. These changes will strengthen the reporting without altering the core findings. revision: yes

Circularity Check

0 steps flagged

Empirical observational study with no derivations or self-referential reductions

full rationale

The paper describes an online study (n=70) collecting ChatGPT queries during an essay task, followed by categorization of queries and analysis of correlations with demographics, self-efficacy, enjoyment, and ownership. No equations, fitted parameters, predictions derived from models, or derivations appear in the provided text. Central claims rest on data collection and statistical associations rather than any step that reduces by construction to its own inputs or prior self-citations. This is a standard empirical design with no load-bearing circular elements.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

No free parameters or invented entities; relies on standard statistical assumptions for observational data analysis and the validity of self-report instruments.

axioms (2)
  • domain assumption Student self-reports of enjoyment, ownership, and self-efficacy accurately reflect internal states without substantial response bias.
    Invoked implicitly when linking usage patterns to these metrics; common in HCI survey studies but untested here.
  • domain assumption Manual categorization of ChatGPT queries produces stable, reproducible categories.
    Central to characterizing usage patterns; no inter-rater reliability reported in abstract.

pith-pipeline@v0.9.0 · 5776 in / 1339 out tokens · 50109 ms · 2026-05-23T05:45:11.162699+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 4 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. NIRVANA: A Comprehensive Dataset for Reproducing How Students Use Generative AI for Essay Writing

    cs.HC 2026-04 unverdicted novelty 7.0

    NIRVANA supplies keystroke-level logs, complete ChatGPT dialogues, and copied content from 77 students to reconstruct AI-assisted essay writing and classify students into four behavioral profiles: Lead Authors, Collab...

  2. Your Brain on ChatGPT: Accumulation of Cognitive Debt when Using an AI Assistant for Essay Writing Task

    cs.AI 2025-06 unverdicted novelty 6.0

    LLM use for essay writing correlates with reduced brain network connectivity, lower self-reported ownership, and poorer recall of one's own content compared to unaided or search-based writing.

  3. Characterizing Students' LLM Usage Behaviors and Their Association with Learning in Critical Thinking Tasks

    cs.HC 2026-05 unverdicted novelty 4.0

    Refined bottom-up categories of LLM usage in critical thinking homework, labeled by student initiative, are examined for associations with midterm performance across two course offerings.

  4. The Crutch or the Ceiling? How Different Generations of LLMs Shape EFL Student Writings

    cs.HC 2026-04 unverdicted novelty 4.0

    Advanced LLMs improve EFL writing scores and diversity for lower-proficiency students but correlate with lower expert ratings on deep coherence, acting more as crutches than scaffolds.

Reference graph

Works this paper leans on

103 extracted references · 103 canonical work pages · cited by 4 Pith papers · 3 internal anchors

  1. [1]

    Catherine Adams, Patti Pente, Gillian Lemermeyer, Joni Turville, and Geoffrey Rockwell. 2022. Artificial Intelligence and Teachers’ New Ethical Obligations.The International Review of Information Ethics31, 1 (Nov. 2022). https://doi.org/10.29173/irie483 Number: 1

  2. [2]

    Tazin Afrin, Omid Kashefi, Christopher Olshefski, Diane Litman, Rebecca Hwa, and Amanda Godley. 2021. Effective Interfaces for Student-Driven Revision Sessions for Argumentative Writing. InProceedings of the 2021 CHI Conference on Human Factors in Computing Systems(Yokohama, Japan) (CHI ’21). Association for Computing Machinery, New York, NY, USA, Article...

  3. [3]

    AlAfnan, Samira Dishari, Marina Jovic, and Koba Lomidze

    M. AlAfnan, Samira Dishari, Marina Jovic, and Koba Lomidze. 2023. ChatGPT as an Educational Tool: Opportunities, Challenges, and Rec- ommendations for Communication, Business Writing, and Composition Courses.Journal of Artificial Intelligence and Technology(2023). https://doi.org/10.37965/jait.2023.0184

  4. [4]

    Jamal Kaid Mohammed Ali, Muayad Abdulhalim Ahmad Shamsan, Taha Ahmed Hezam, and Ahmed A. Q. Mohammed. 2023. Impact of ChatGPT on Learning Motivation: Teachers and Students’ Voices.Journal of English Studies in Arabia Felix2, 1 (March 2023), 41–49. https: //doi.org/10.56540/jesaf.v2i1.51 Number: 1

  5. [5]

    Tawfiq Ammari, Meilun Chen, SM Zaman, and Kiran Garimella. 2025. How Students (Really) Use ChatGPT: Uncovering Experiences Among Undergraduate Students.arXiv preprint arXiv:2505.24126(2025)

  6. [6]

    Applebee

    Arthur N. Applebee. 1984. Writing and Reasoning.Review of Educational Research54 (1984), 577 – 596. https://doi.org/10.3102/00346543054004577

  7. [7]

    Avey, Bruce J

    James B. Avey, Bruce J. Avolio, Craig D. Crossley, and Fred Luthans. 2009. Psychological ownership: theoretical extensions, measure- ment and relation to work outcomes.Journal of Organizational Behavior30, 2 (2009), 173–191. https://doi.org/10.1002/job.583 _eprint: https://onlinelibrary.wiley.com/doi/pdf/10.1002/job.583

  8. [8]

    Aruleba, and Solomon Sunday Oyelere

    Musa Adekunle Ayanwale, Ismaila Temitayo Sanusi, Owolabi Paul Adelana, Kehinde D. Aruleba, and Solomon Sunday Oyelere. 2022. Teachers’ readiness and intention to teach artificial intelligence in schools.Computers and Education: Artificial Intelligence3 (Jan. 2022), 100099. https: //doi.org/10.1016/j.caeai.2022.100099

  9. [9]

    David Baidoo-Anu and Leticia Owusu Ansah. 2023. Education in the Era of Generative Artificial Intelligence (AI): Understanding the Potential Benefits of ChatGPT in Promoting Teaching and Learning. https://doi.org/10.2139/ssrn.4337484

  10. [10]

    Alex Barrett and Austin Pack. 2023. Not quite eye to A.I.: student and teacher perspectives on the use of generative artificial intelligence in the writing process.International Journal of Educational Technology in Higher Education20, 1 (Nov. 2023), 59. https://doi.org/10.1186/s41239-023-00427-0

  11. [11]

    2013.The psychology of written composition

    Carl Bereiter and Marlene Scardamalia. 2013.The psychology of written composition. Routledge

  12. [12]

    Arne Bewersdorff, Marie Hornberger, Claudia Nerdel, and Daniel S. Schiff. 2025. AI advocates and cautious critics: How AI attitudes, AI interest, use of AI, and AI literacy build university students’ AI self-efficacy.Computers and Education: Artificial Intelligence8 (June 2025), 100340. https://doi.org/10.1016/j.caeai.2024.100340

  13. [13]

    Advait Bhat, Saaket Agashe, Niharika Mohile, Parth Oberoi, Ravi Jangir, and Anirudha Joshi. 2023. Interacting with next-phrase suggestions: How suggestion systems aid and influence the cognitive processes of writing. InProceedings of the 28th International Conference on Intelligent User Interfaces. 436–452. https://doi.org/10.1145/3581641.3584060 arXiv:22...

  14. [14]

    Som Biswas. 2023. Role of Chat GPT in Education. https://papers.ssrn.com/abstract=4369981

  15. [15]

    Rebecca W Black and Bill Tomlinson. 2025. University students describe how they adopt AI for writing and research in a general education course. Scientific reports15, 1 (2025), 8799

  16. [16]

    Matt Bower, Jodie Torrington, Jennifer W. M. Lai, Peter Petocz, and Mark Alfano. 2024. How should we change teaching and assessment in response to increasingly powerful generative Artificial Intelligence? Outcomes of the ChatGPT teacher survey.Education and Information Technologies29, 12 (Aug. 2024), 15403–15439. https://doi.org/10.1007/s10639-023-12405-0

  17. [17]

    Roger Bruning, Michael Dempsey, Douglas F Kauffman, Courtney McKim, and Sharon Zumbrunn. 2013. Examining dimensions of self-efficacy for writing.Journal of educational psychology105, 1 (2013), 25

  18. [18]

    Zana Buçinca, Maja Barbara Malaya, and Krzysztof Z. Gajos. 2021. To Trust or to Think: Cognitive Forcing Functions Can Reduce Overreliance on AI in AI-assisted Decision-making.Proceedings of the ACM on Human-Computer Interaction5, CSCW1 (April 2021), 188:1–188:21. https: //doi.org/10.1145/3449287

  19. [19]

    Cristina Cachero, David Tomás, and Francisco A. Pujol. 2025. Gender Bias in Self-Perception of AI Knowledge, Impact, and Support among Higher Education Students: An Observational Study.ACM Trans. Comput. Educ.25, 2, Article 15 (May 2025), 26 pages. https://doi.org/10.1145/3721295

  20. [20]

    Tuhin Chakrabarty, Vishakh Padmakumar, Faeze Brahman, and Smaranda Muresan. 2024. Creativity Support in the Age of Large Language Models: An Empirical Study Involving Professional Writers. InProceedings of the 16th Conference on Creativity & Cognition(Chicago, IL, USA)(C&C ’24). Association for Computing Machinery, New York, NY, USA, 132–155. https://doi....

  21. [21]

    Cecilia Ka Yuk Chan and Katherine K. W. Lee. 2023. The AI generation gap: Are Gen Z students more interested in adopting generative AI such as ChatGPT in teaching and learning than their Gen X and Millennial Generation teachers? https://doi.org/10.48550/arXiv.2305.02878 arXiv:2305.02878 [cs]

  22. [22]

    Olckers Chantal. 2012. Psychological ownership: Development of an instrument.SA Journal of Industrial Psychology39 (Dec. 2012), 1–13. https://doi.org/10.4102/sajip.v39i2.1105 Manuscript submitted to ACM An Empirical Study to Understand How Students Use ChatGPT for Writing Essays 29

  23. [23]

    Erin Cherry and Celine Latulipe. 2014. Quantifying the Creativity Support of Digital Tools through the Creativity Support Index.ACM Trans. Comput.-Hum. Interact.21, 4 (June 2014), 21:1–21:25. https://doi.org/10.1145/2617588

  24. [24]

    William Condon and Diane Kelly-Riley. 2004. Assessing and teaching what we value: The relationship between college-level writing and critical thinking abilities.Assessing Writing9, 1 (2004), 56–75

  25. [26]

    Helen Crompton and Diane Burke. 2023. Artificial intelligence in higher education: the state of the field.International Journal of Educational Technology in Higher Education20, 1 (April 2023), 22. https://doi.org/10.1186/s41239-023-00392-8

  26. [27]

    Fred D. Davis. 1989. Perceived Usefulness, Perceived Ease of Use, and User Acceptance of Information Technology.MIS Quarterly13, 3 (1989), 319–340. https://doi.org/10.2307/249008 Publisher: Management Information Systems Research Center, University of Minnesota

  27. [28]

    Cotton Debby R

    Peter A. Cotton Debby R. E. Cotton and J. Reuben Shipway. 2024. Chatting and cheating: Ensuring academic integrity in the era of ChatGPT.Innovations in Education and Teaching International61, 2 (2024), 228–239. https://doi.org/10.1080/14703297.2023.2190148 arXiv:https://doi.org/10.1080/14703297.2023.2190148

  28. [29]

    Yanning Dong and Ling Shi. 2021. Using Grammarly to support students’ source-based writing practices.Assessing Writing50 (Oct. 2021), 100564. https://doi.org/10.1016/j.asw.2021.100564

  29. [30]

    Damian Okaibedi Eke. 2023. ChatGPT and the rise of generative AI: Threat to academic integrity?Journal of Responsible Technology13 (2023), 100060. https://doi.org/10.1016/j.jrt.2023.100060

  30. [31]

    J. Emig. 1977. Writing as a Mode of Learning.College Composition and Communication28 (1977), 122. https://doi.org/10.2307/356095

  31. [32]

    Juan Escalante, Austin Pack, and Alex Barrett. 2023. AI-generated feedback on writing: Insights into efficacy and ENL student preference. International Journal of Educational Technology in Higher Education20, 1 (2023), 57

  32. [33]

    Aidan Z Fitzsimons, Elizabeth Gerber, and Duri Long. 2025. Pressure to use AI for college admissions: implications for adolescent self-concept and intelligent coaching design. InProceedings of the 2025 Conference on Creativity and Cognition. Association for Computing Machinery, New York, NY, USA, 975–987. https://doi.org/10.1145/3698061.3726909

  33. [34]

    Linda Flower and John R. Hayes. 1981. A Cognitive Process Theory of Writing.College Composition and Communication32, 4 (1981), 365–387. https://doi.org/10.2307/356600 Publisher: National Council of Teachers of English

  34. [35]

    Hanying Gan and Wei Ji. 2025. Research on Evaluating College Students’ Usage Behaviors and Patterns of Generative AI Tools Using Natural Language Processing. InProceedings of the 2nd Guangdong-Hong Kong-Macao Greater Bay Area International Conference on Digital Economy and Artificial Intelligence (DEAI ’25). Association for Computing Machinery, New York, ...

  35. [36]

    Michael Gerlich. 2025. AI Tools in Society: Impacts on Cognitive Offloading and the Future of Critical Thinking.Societies15, 1 (2025). https: //doi.org/10.3390/soc15010006

  36. [37]

    Katy Ilonka Gero, Vivian Liu, and Lydia Chilton. 2022. Sparks: Inspiration for Science Writing using Language Models. InProceedings of the 2022 ACM Designing Interactive Systems Conference (DIS ’22). Association for Computing Machinery, New York, NY, USA, 1002–1019. https: //doi.org/10.1145/3532106.3533533

  37. [38]

    Katy Ilonka Gero, Tao Long, and Lydia B Chilton. 2023. Social Dynamics of AI Support in Creative Writing. InProceedings of the 2023 CHI Conference on Human Factors in Computing Systems. ACM, Hamburg Germany, 1–15. https://doi.org/10.1145/3544548.3580782

  38. [39]

    Andreas Göldi, Thiemo Wambsganss, Seyed Parsa Neshaei, and Roman Rietsche. 2024. Intelligent Support Engages Writers Through Relevant Cognitive Processes. InProceedings of the 2024 CHI Conference on Human Factors in Computing Systems(Honolulu, HI, USA)(CHI ’24). Association for Computing Machinery, New York, NY, USA, Article 1047, 12 pages. https://doi.or...

  39. [40]

    grammarly. 2023. grammarly. https://www.grammarly.com/. Accessed: 2024-10-29

  40. [41]

    Simone Grassini and Alexander Sævild Ree. 2023. Hope or Doom AI-ttitude? Examining the Impact of Gender, Age, and Cultural Differences on the Envisioned Future Impact of Artificial Intelligence on Humankind. InProceedings of the European Conference on Cognitive Ergonomics 2023 (Swansea, United Kingdom)(ECCE ’23). Association for Computing Machinery, New Y...

  41. [42]

    Mohanad Halaweh. 2023. ChatGPT in education: Strategies for responsible implementation.TBD(2023). https://digitallibrary.aau.ac.ae/handle/ 123456789/980 Accepted: 2023-04-13T06:53:37Z Publisher: Bastas

  42. [43]

    Jieun Han, Haneul Yoo, Yoonsu Kim, Junho Myung, Minsun Kim, Hyunseung Lim, Juho Kim, Tak Yeon Lee, Hwajung Hong, So-Yeon Ahn, and Alice Oh. 2023. RECIPE: How to Integrate ChatGPT into EFL Writing Education. InProceedings of the Tenth ACM Conference on Learning @ Scale (L@S ’23). Association for Computing Machinery, New York, NY, USA, 416–420. https://doi....

  43. [44]

    Don’t Forget the Teachers

    Emma Harvey, Allison Koenecke, and Rene F. Kizilcec. 2025. "Don’t Forget the Teachers": Towards an Educator-Centered Understanding of Harms from Large Language Models in Education. https://doi.org/10.48550/arXiv.2502.14592 arXiv:2502.14592 [cs]

  44. [45]

    Daniel Herman. 2022. The End of High-School English.The Atlantic(9 December 2022). https://www.theatlantic.com/technology/archive/2022/ 12/openai-chatgpt-writing-high-school-english-essay/672412/

  45. [46]

    2023.Guidance for generative AI in education and research

    Wayne Holmes, Fengchun Miao, et al. 2023.Guidance for generative AI in education and research. UNESCO Publishing

  46. [47]

    Hui-Wen Huang, Zehui Li, and Linda Taylor. 2020. The Effectiveness of Using Grammarly to Improve Students’ Writing Skills. InProceedings of the 5th International Conference on Distance Education and Learning (ICDEL ’20). Association for Computing Machinery, New York, NY, USA, 122–127. Manuscript submitted to ACM 30 Jelson, Manesh, Lee https://doi.org/10.1...

  47. [48]

    Gwo-Jen Hwang and Ching-Yi Chang. 2021. A review of opportunities and challenges of chatbots in education.Interactive Learning Environments (July 2021). https://doi.org/10.1080/10494820.2021.1952615

  48. [49]

    Takumi Ito, Tatsuki Kuribayashi, Masatoshi Hidaka, Jun Suzuki, and Kentaro Inui. 2020. Langsmith: An Interactive Academic Text Revision System. InProceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, Qun Liu and David Schlangen (Eds.). Association for Computational Linguistics, Online, 216–226. http...

  49. [50]

    Zorana Ivcevic and Mike Grandinetti. 2024. Artificial intelligence as a tool for creativity.Journal of Creativity34, 2 (Aug. 2024), 100079. https://doi.org/10.1016/j.yjoc.2024.100079

  50. [51]

    Maurice Jakesch, Advait Bhat, Daniel Buschek, Lior Zalmanson, and Mor Naaman. 2023. Co-Writing with Opinionated Language Models Affects Users’ Views. InProceedings of the 2023 CHI Conference on Human Factors in Computing Systems. ACM, Hamburg Germany, 1–15. https://doi.org/10.1145/3544548.3581196

  51. [52]

    Kalpana Jayavalan and Abu Bakar Razali. 2018. Effectiveness of Online Grammar Checker to Improve Secondary Students’ English Narrative Essay Writing. 2, 1 (2018)

  52. [53]

    Jaeho Jeon and Seongyong Lee. 2023. Large language models in education: A focus on the complementary relationship between human teachers and ChatGPT.Education and Information Technologies(May 2023). https://doi.org/10.1007/s10639-023-11834-1

  53. [54]

    Jisuake. 2023. CodeMirror Record. https://github.com/Jisuanke/CodeMirror-Record. Accessed: 2023-10-29

  54. [55]

    Nikhita Joshi and Daniel Vogel. 2025. Writing with AI Lowers Psychological Ownership, but Longer Prompts Can Help. https://doi.org/10.48550/ arXiv.2404.03108 arXiv:2404.03108 [cs]

  55. [56]

    Laksnoria Karyuatry. 2018. Grammarly as a Tool to Improve Students’ Writing Quality: Free Online-Proofreader across the Boundaries.JSSH (Jurnal Sains Sosial dan Humaniora)2, 1 (May 2018), 83–89. https://doi.org/10.30595/jssh.v2i1.2297 Number: 1

  56. [57]

    Enkelejda Kasneci, Kathrin Sessler, Frank Fischer, Urs Gasser, and Georg Groh. [n. d.]. ChatGPT for Good? On Opportunities and Challenges of Large Language Models for Education. ([n. d.])

  57. [58]

    Enkelejda Kasneci, Kathrin Seßler, Stefan Küchemann, Maria Bannert, Daryna Dementieva, Frank Fischer, Urs Gasser, Georg Groh, Stephan Günnemann, Eyke Hüllermeier, et al. 2023. ChatGPT for good? On opportunities and challenges of large language models for education.Learning and individual differences103 (2023), 102274

  58. [59]

    Simon Knight, Antonette Shibani, Sophie Abel, Andrew Gibson, Philippa Ryan, Nicole Sutton, Raechel Wight, Cherie Lucas, Ágnes Sándor, Kirsty Kitto, Ming Liu, Radhika Vijay Mogarkar, and Simon Buckingham Shum. 2020. AcaWriter: A learning analytics tool for formative feedback on academic writing.Journal of Writing Research12, 1 (June 2020), 141–186. https:/...

  59. [60]

    Svetlana Koltovskaia. 2020. Student engagement with automated written corrective feedback (AWCF) provided byGrammarly: A multiple case study.Assessing Writing44 (April 2020), 100450. https://doi.org/10.1016/j.asw.2020.100450

  60. [61]

    Nataliya Kosmyna, Eugene Hauptmann, Ye Tong Yuan, Jessica Situ, Xian-Hao Liao, Ashly Vivian Beresnitzky, Iris Braunstein, and Pattie Maes

  61. [62]

    Your Brain on ChatGPT: Accumulation of Cognitive Debt when Using an AI Assistant for Essay Writing Task

    Your Brain on ChatGPT: Accumulation of Cognitive Debt when Using an AI Assistant for Essay Writing Task. arXiv:2506.08872 [cs.AI] https://arxiv.org/abs/2506.08872

  62. [63]

    Hao-Ping (Hank) Lee, Advait Sarkar, Lev Tankelevitch, Ian Drosos, Sean Rintel, Richard Banks, and Nicholas Wilson. 2025. The Impact of Generative AI on Critical Thinking: Self-Reported Reductions in Cognitive Effort and Confidence Effects From a Survey of Knowledge Workers. InProceedings of the 2025 CHI Conference on Human Factors in Computing Systems (CH...

  63. [64]

    Alghamdi, Tal August, Avinash Bhat, Madiha Zahrah Choksi, Senjuti Dutta, Jin L.C

    Mina Lee, Katy Ilonka Gero, John Joon Young Chung, Simon Buckingham Shum, Vipul Raheja, Hua Shen, Subhashini Venugopalan, Thiemo Wambsganss, David Zhou, Emad A. Alghamdi, Tal August, Avinash Bhat, Madiha Zahrah Choksi, Senjuti Dutta, Jin L.C. Guo, Md Naimul Hoque, Yewon Kim, Simon Knight, Seyed Parsa Neshaei, Antonette Shibani, Disha Shrivastava, Lila Shr...

  64. [65]

    Mina Lee, Percy Liang, and Qian Yang. 2022. CoAuthor: Designing a Human-AI Collaborative Writing Dataset for Exploring Language Model Capabilities. InProceedings of the 2022 CHI Conference on Human Factors in Computing Systems (CHI ’22). Association for Computing Machinery, New York, NY, USA, 1–19. https://doi.org/10.1145/3491102.3502030

  65. [66]

    Lee, Denise Pope, Sarah Miles, and Rosalía C

    Victor R. Lee, Denise Pope, Sarah Miles, and Rosalía C. Zárate. 2024. Cheating in the age of generative AI: A high school survey study of cheating behaviors before and after the release of ChatGPT.Computers and Education: Artificial Intelligence7 (2024), 100253. https://doi.org/10.1016/j.caeai. 2024.100253

  66. [67]

    Rongxin Liu, Carter Zenke, Charlie Liu, Andrew Holmes, Patrick Thornton, and David J. Malan. 2024. Teaching CS50 with AI: Leveraging Generative Artificial Intelligence in Computer Science Education. InProceedings of the 55th ACM Technical Symposium on Computer Science Education V. 1 (SIGCSE 2024). Association for Computing Machinery, New York, NY, USA, 75...

  67. [68]

    Victoria Livingstone. 2024. I Quit Teaching Because of ChatGPT.TIME(30 September 2024). https://time.com/7026050/chatgpt-quit-teaching-ai- essay/ Manuscript submitted to ACM An Empirical Study to Understand How Students Use ChatGPT for Writing Essays 31

  68. [69]

    Shuai Ma, Ying Lei, Xinru Wang, Chengbo Zheng, Chuhan Shi, Ming Yin, and Xiaojuan Ma. 2023. Who Should I Trust: AI or Myself? Leveraging Human and AI Correctness Likelihood to Promote Appropriate Trust in AI-Assisted Decision-Making. https://doi.org/10.48550/arXiv.2301.05809 arXiv:2301.05809 [cs]

  69. [70]

    2024.The Dawn of the AI Era: Teens, Parents, and the Adoption of Generative AI at Home and School

    Mary Madden, Amanda Calvin, Abigail Hasse, and Amanda Lenhart. 2024.The Dawn of the AI Era: Teens, Parents, and the Adoption of Generative AI at Home and School. Common Sense, San Francisco, CA

  70. [71]

    Stephen Marche. 2022. The College Essay Is Dead.The Atlantic(December 2022). https://www.theatlantic.com/technology/archive/2022/12/chatgpt- ai-writing-college-student-essays/672371/

  71. [72]

    Patricia McCarthy, Scott Meier, and Regina Rinderer. [n. d.]. Self-Efficacy and Writing: A Different View of Self-Evaluation.CollegeCompositionand Communication([n. d.])

  72. [73]

    Ruth ONeill and Alex Russell. 2019. Stop! Grammar time: University students’ perceptions of the automated feedback program Grammarly. Australasian Journal of Educational Technology35, 1 (March 2019). https://doi.org/10.14742/ajet.3795 Number: 1

  73. [74]

    Michael Sheinman Orenstrakh, Oscar Karnalim, Carlos Aníbal Suárez, and Michael Liut. 2024. Detecting LLM-Generated Text in Computing Education: Comparative Study for ChatGPT Cases. In2024 IEEE 48th Annual Computers, Software, and Applications Conference (COMPSAC). 121–126. https://doi.org/10.1109/COMPSAC61105.2024.00027 ISSN: 2836-3795

  74. [75]

    Frank Pajares, Gio Valiante, and Yuk Fai Cheong. 2006. Writing Self-Efficacy and Its Relation to Gender, Writing Motivation and Writing Competence: A Developmental Perspective. InWriting and Motivation, P. Boscolo and S. Hidi (Eds.). Studies in Writing, Vol. 19. Brill, Leiden, 141–159. https://doi.org/10.1163/9781849508216_009

  75. [76]

    Hyanghee Park and Daehwan Ahn. 2024. The Promise and Peril of ChatGPT in Higher Education: Opportunities, Challenges, and Design Implications. InProceedings of the 2024 CHI Conference on Human Factors in Computing Systems (CHI ’24). Association for Computing Machinery, New York, NY, USA, 1–21. https://doi.org/10.1145/3613904.3642785

  76. [77]

    Mike Perkins. 2023. Academic integrity considerations of AI Large Language Models in the post-pandemic era: ChatGPT and beyond.Journal of University Teaching and Learning Practice(2023). https://doi.org/10.53761/1.20.02.07

  77. [78]

    Ali Quidwai, Chunhui Li, and Parijat Dube. 2023. Beyond Black Box AI generated Plagiarism Detection: From Sentence to Document Level. InProceedings of the 18th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2023). Association for Computational Linguistics, Toronto, Canada, 727–735. https://doi.org/10.18653/v1/2023.bea-1.58

  78. [79]

    Alnfiai, and Mohammad Arif

    Naveed Saif, Sajid Ullah Khan, Imrab Shaheen, Faiz Abdullah ALotaibi, Mrim M. Alnfiai, and Mohammad Arif. 2024. Chat-GPT; validating Technology Acceptance Model (TAM) in education sector via ubiquitous learning mechanism.Computers in Human Behavior154 (May 2024), 108097. https://doi.org/10.1016/j.chb.2023.108097

  79. [80]

    Malik Sallam. 2023. ChatGPT Utility in Healthcare Education, Research, and Practice: Systematic Review on the Promising Perspectives and Valid Concerns.Healthcare11 (2023). https://doi.org/10.3390/healthcare11060887

  80. [81]

    Advait Sarkar and Ian Drosos. 2025. Vibe coding: programming through conversation with artificial intelligence.arXiv preprint arXiv:2506.23253 (2025)

Showing first 80 references.