CommAI: Evaluating the first steps towards a useful general AI

Marco Baroni , Armand Joulin , Allan Jabri , Germ\`an Kruszewski , Angeliki Lazaridou , Klemen Simonic , Tomas Mikolov

Authors on Pith no claims yet

classification 💻 cs.LG cs.AIcs.CL

keywords generalmachinedesideratatowardsalmostapplicationsappliedattainable

read the original abstract

With machine learning successfully applied to new daunting problems almost every day, general AI starts looking like an attainable goal. However, most current research focuses instead on important but narrow applications, such as image classification or machine translation. We believe this to be largely due to the lack of objective ways to measure progress towards broad machine intelligence. In order to fill this gap, we propose here a set of concrete desiderata for general AI, together with a platform to test machines on how well they satisfy such desiderata, while keeping all further complexities to a minimum.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

CommonWhy: A Dataset for Evaluating Entity-Based Causal Commonsense Reasoning in Large Language Models
cs.CL 2026-05 unverdicted novelty 7.0

CommonWhy is a new dataset of 15,000 why-questions for evaluating LLMs on entity-based causal commonsense reasoning grounded in Wikidata.