NARRA-Gym is an executable benchmark that generates complete interactive narrative episodes from emotional seeds and logs full model trajectories to expose gaps in coherence, adaptation, and personalization that static story tests miss.
arXiv preprint arXiv:2411.02316 , year =
3 Pith papers cite this work. Polarity classification is still indexing.
years
2026 3verdicts
UNVERDICTED 3representative citing papers
Frontier LLMs generate creative ideas with excess population-level crowding below human-relative parity across tasks, but targeted generation protocols can reduce it.
Forking Garden generates branching dungeon graphs from user-provided storylines by creating a pool of independent nodes and assembling them via arc-guided constraint algorithms that enforce multimodal alignment of gameplay elements.
citing papers explorer
-
NARRA-Gym for Evaluating Interactive Narrative Agents
NARRA-Gym is an executable benchmark that generates complete interactive narrative episodes from emotional seeds and logs full model trajectories to expose gaps in coherence, adaptation, and personalization that static story tests miss.
-
Ex Ante Evaluation of AI-Induced Idea Diversity Collapse
Frontier LLMs generate creative ideas with excess population-level crowding below human-relative parity across tasks, but targeted generation protocols can reduce it.
-
The Garden of Forking Paths: Narrative Arc-Conditioned Gameplay Planning
Forking Garden generates branching dungeon graphs from user-provided storylines by creating a pool of independent nodes and assembling them via arc-guided constraint algorithms that enforce multimodal alignment of gameplay elements.