Gemini 3.0 Pro with rubric prompts reached ICC 0.888 agreement with human graders on low-complexity Linux/bash responses but lower agreement at higher taxonomy levels across 1200 student answers from three expert raters.
IEEE Access 13, 113449–113460
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
years
2026 2verdicts
UNVERDICTED 2representative citing papers
Semester-long tracking of 96 students showed LLM usage fell 42.7 points and verification dropped 19.4 points in team work, with 18.9% of consistent individual users stopping entirely in teams.
citing papers explorer
-
Automated grading of Linux/bash examinations using large language models: a four-level cognitive taxonomy approach
Gemini 3.0 Pro with rubric prompts reached ICC 0.888 agreement with human graders on low-complexity Linux/bash responses but lower agreement at higher taxonomy levels across 1200 student answers from three expert raters.
-
Less Deliberate in Teams: Student LLM Use Across Individual and Collaborative Work
Semester-long tracking of 96 students showed LLM usage fell 42.7 points and verification dropped 19.4 points in team work, with 18.9% of consistent individual users stopping entirely in teams.