Instructions and Guide for Diagnostic Questions: The NeurIPS 2020 Education Challenge
Digital technologies are becoming increasingly prevalent in education, enabling personalized, high-quality educational resources to be accessible by students across the world. Importantly, among these resources are diagnostic questions: the answers that the students give to these questions reveal key information about the specific nature of misconceptions that the students may hold. Analyzing the massive quantities of data stemming from students' interactions with these diagnostic questions can help us more accurately understand the students' learning status and thus allow us to automate learning curriculum recommendations. In this competition, participants will focus on the students' answer records to these multiple-choice diagnostic questions, with the aim of 1) accurately predicting which answers the students provide; 2) accurately predicting which questions have high quality; and 3) determining a personalized sequence of questions for each student that best predicts the student's answers. These tasks closely mimic the goals of a real-world educational platform and are highly representative of the educational challenges faced today. We provide over 20 million examples of students' answers to mathematics questions from Eedi, a leading educational platform which thousands of students interact with daily around the globe. Participants in this competition have a chance to make a lasting, real-world impact on the quality of personalized education for millions of students across the world.
Forward citations
Cited by 7 Pith papers
- KG-SoftMAP: Soft Knowledge-Graph Priors for Bayesian Network Structure Learning from Sparse Discrete Data
  KG-SoftMAP incorporates soft, confidence-weighted priors from a knowledge graph into MAP estimation for Bayesian network structure learning, recovering substantial directed structure from sparse discrete data where da...
- Skill Neologisms: Towards Skill-based Continual Learning
  Skill neologisms are optimized soft tokens that improve LLM performance on targeted skills without weight updates and allow zero-shot composition for continual learning.
- Skill Neologisms: Towards Skill-based Continual Learning
  Skill neologisms are optimized soft tokens that enhance specific LLM skills and support zero-shot composition on synthetic and Skill-Mix tasks.
- Embedding Enhancement via Fine-Tuned Language Models for Learner-Item Cognitive Modeling
  EduEmbed fine-tunes language models in two stages to add semantic information to learner-item embeddings and improve performance on cognitive diagnosis and adaptive testing tasks.
- Graph-Based Alternatives to LLMs for Human Simulation
  GEMS formulates close-ended human-behavior simulation as link prediction on a heterogeneous graph and matches or exceeds LLM performance with three orders of magnitude fewer parameters across three datasets and three ...
- MCQ Difficulty Prediction via Modeling Learner Heterogeneity Using Data-Driven Cognitive Profiling
  A framework using latent class analysis on student data to define personas, LLM simulations of their responses, and ridge regression improves IRT difficulty prediction for MCQs over baselines.
- A Case Study Reexamining the Cold-Start Problem in Knowledge Tracing Models and Implications for SafeInsights, an Education Research Infrastructure
  Replication of cold-start analysis in KT models on FoundationalASSIST shows performance varies by practice opportunities and problem types, highlighting reproduction challenges and SafeInsights utility.