Llm-hanabi: Evaluating multi- agent gameplays with theory-of-mind and rationale inference in imperfect information collaboration game

Fangzhou Liang, Tianshi Zheng, Chunkit Chan, Yauwai Yim, Yangqiu Song · 2025 · arXiv 2510.04980

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

read on arXiv browse 3 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

SMAC-Talk: A Natural Language Extension of the StarCraft Multi-Agent Challenge for Large Language Models

cs.AI · 2026-06-02 · unverdicted · novelty 7.0

SMAC-Talk is a new benchmark that adds natural language messaging and deceptive-agent scenarios to SMAC for testing LLM coordination in multi-agent environments.

SciLens: Multi-modal Scientific Claim Verification with Agentic Entailment and Grounding

cs.CL · 2026-06-18 · unverdicted · novelty 5.0

SciLens introduces an evidence-conditioned atomic entailment framework that grounds claims to modality-specific witnesses in tables and figures, achieving 79.2% macro-F1 on SciClaimEval.

Towards Generalist Game Players: An Investigation of Foundation Models in the Game Multiverse

cs.CV · 2026-05-11 · unverdicted · novelty 5.0 · 2 refs

The paper organizes research on generalist game AI into Dataset, Model, Harness, and Benchmark pillars and charts a five-level progression from single-game mastery to agents that create and live inside game multiverses.

citing papers explorer

Showing 0 of 0 citing papers after filters.

No citing papers match the current filters.

Llm-hanabi: Evaluating multi- agent gameplays with theory-of-mind and rationale inference in imperfect information collaboration game

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer