From Next Token Prediction to (STRIPS) World Models
Pith reviewed 2026-05-18 16:17 UTC · model grok-4.3
The pith
Next-token prediction on action traces yields STRIPS world models accurate enough for planning on unseen states and goals.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
In a controlled symbolic setting where propositional STRIPS action models are learned solely from action traces and correctness can be checked exactly, next-token prediction produces models that serve as drop-in replacements for hand-written STRIPS operators. These models enable classical planners to solve planning tasks over exponentially many unseen initial states and goals. The STRIPS Transformer incorporates theoretical links to the formal language of STRIPS domains, whereas a standard transformer augmented with stick-breaking attention achieves comparable or better results in accuracy and generalization without explicit symbolic structure.
What carries the argument
Next-token prediction on sequences of STRIPS actions and states, used to learn implicit action models that are then extracted and fed to off-the-shelf symbolic planners.
If this is right
- Both architectures support planning across five classical domains and varying problem sizes without retraining.
- Generalization holds for exponentially many new initial states and goals drawn from the same domain.
- Stick-breaking attention enables reliable performance on long traces where standard attention fails.
- A symbolic STRIPS model can be extracted from a transformer trained only on short traces to restore planning capability.
Where Pith is reading between the lines
- This suggests data-driven sequence models can replace manual STRIPS engineering in domains where action traces are cheap to collect.
- The approach may extend to domains with partial observability if trace data includes observations.
- It opens the possibility of using large language models directly as implicit planners without separate symbolic extraction.
- Testable extension: apply the same training regime to domains with numeric fluents or durative actions.
Load-bearing premise
The learned next-token predictors are accurate and complete enough to serve as correct STRIPS action models for arbitrary unseen states and goals.
What would settle it
A held-out planning problem whose true solution requires an action sequence that the learned model either forbids or fails to reach within the planner's search budget.
Figures
read the original abstract
We study whether next-token prediction can yield world models that truly support planning, in a controlled symbolic setting where propositional STRIPS action models are learned from action traces alone and correctness can be evaluated exactly. We introduce two architectures. The first is the STRIPS Transformer, a symbolically aligned model grounded in theoretical results linking transformers and the formal language structure of STRIPS domains. The second is a standard transformer architecture without explicit symbolic structure built in, for which we study different positional encoding schemes and attention aggregation mechanisms. We evaluate both architectures on five classical planning domains, measuring training accuracy, generalization, and planning performance across domains and problem sizes. Interestingly, both approaches can be used to produce models that support planning with off-the-shelf STRIPS planners over exponentially many unseen initial states and goals. Although the STRIPS Transformer incorporates a strong symbolic inductive bias, it is harder to optimize and requires larger datasets to generalize reliably. In contrast, a standard transformer with stick-breaking attention achieves near-perfect training accuracy and strong generalization. Finally, standard transformers without stick-breaking attention do not generalize to long traces, whereas a symbolic STRIPS model extracted from a transformer trained on shorter traces does.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper studies whether next-token prediction can produce world models supporting planning in a controlled symbolic setting. It introduces the STRIPS Transformer, grounded in theoretical links between transformers and STRIPS formal language, and compares it to standard transformers using different positional encodings and attention mechanisms (including stick-breaking). Both are trained on action traces from five classical planning domains and evaluated on training accuracy, generalization to unseen initial states and goals, and planning performance with off-the-shelf STRIPS planners.
Significance. If the results hold, the work shows that next-token predictors can yield extractable STRIPS operators that enable exact planning over exponentially many unseen states and goals, bridging neural sequence modeling and symbolic planning. The finding that a standard transformer with stick-breaking attention achieves strong generalization while the symbolically biased STRIPS Transformer is harder to optimize is a useful empirical contribution, as is the observation that standard transformers without stick-breaking fail to generalize to long traces.
major comments (1)
- [Evaluation / planning performance] Evaluation section (planning performance results): The central claim that the learned models support planning over unseen states and goals requires that the extracted STRIPS operators remain sound and complete for states generated during search. While planning success is reported, the manuscript does not describe an explicit check (e.g., enumeration or verification of predicted effects against ground-truth semantics) for states encountered by the planner itself, which may lie outside the training trace distribution. This leaves open the possibility of silent failures that could invalidate plans or dead-end detection.
minor comments (2)
- [Abstract] The abstract states that both architectures 'can be used to produce models that support planning,' but the text should clarify whether this holds uniformly across all five domains or if there are domain-specific exceptions in the reported results.
- [Method] Notation for the extracted STRIPS operators (preconditions and effects) should be introduced earlier and used consistently when describing how next-token outputs are converted to formal action models.
Simulated Author's Rebuttal
We thank the referee for their careful reading and constructive feedback. We address the single major comment below and will revise the manuscript to incorporate an explicit verification procedure as suggested.
read point-by-point responses
-
Referee: Evaluation section (planning performance results): The central claim that the learned models support planning over unseen states and goals requires that the extracted STRIPS operators remain sound and complete for states generated during search. While planning success is reported, the manuscript does not describe an explicit check (e.g., enumeration or verification of predicted effects against ground-truth semantics) for states encountered by the planner itself, which may lie outside the training trace distribution. This leaves open the possibility of silent failures that could invalidate plans or dead-end detection.
Authors: We agree that an explicit verification step strengthens the central claim. In the current experiments, planning success was defined as the off-the-shelf STRIPS planner returning a plan that, when executed in the ground-truth simulator, reaches the goal without violating any preconditions or producing incorrect effects. Because the planner only applies operators that are deemed applicable and because success is measured by actual execution in the true environment, any unsound operator would have produced either an invalid plan or a failure to reach the goal. Nevertheless, this verification was performed only at the level of final plan execution rather than for every intermediate state visited during search. To address the referee's concern directly, we will add a new paragraph in Section 5 (Evaluation) that describes an additional post-hoc check: for every solved planning instance we record all states generated by the planner and verify that the predicted add and delete effects of each applied operator exactly match the ground-truth transition function. We will report the fraction of states for which this holds and will include the verification code in the supplementary material. This change will be made in the revised version. revision: yes
Circularity Check
No significant circularity: derivation relies on external empirical validation against held-out states and planners
full rationale
The paper trains next-token models on action traces, extracts or uses them as STRIPS operators, and evaluates training accuracy, generalization to unseen initial states/goals, and planning success with off-the-shelf external STRIPS planners. No equation or claim reduces a reported prediction to a fitted quantity by construction, nor does any load-bearing step invoke a self-citation chain or uniqueness theorem that collapses to the paper's own inputs. Results are measured against independent benchmarks outside the training distribution, making the central claims empirically falsifiable rather than tautological.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Next-token prediction on action traces is sufficient to recover the full propositional STRIPS transition function.
Forward citations
Cited by 1 Pith paper
-
Differentiable Learning of Lifted Action Schemas for Classical Planning
A differentiable neural model recovers ground-truth lifted action schemas from state traces by jointly learning schemas and inferring unobserved action arguments.
Reference graph
Works this paper leans on
-
[1]
Aineto, D.; Celorrio, S. J.; and Onaindia, E. 2019. Learning action models with minimal observability. Artificial Intelligence, 275: 104--137
work page 2019
-
[2]
Asai, M.; Kajino, H.; Fukunaga, A.; and Muise, C. 2022. Classical planning in deep latent space. Journal of Artificial Intelligence Research, 74: 1599--1686
work page 2022
-
[3]
Bonet, B.; and Geffner, H. 2020. Learning First-Order Symbolic Representations for Planning from the Structure of the State Space. In ECAI 2020 - 24th European Conference on Artificial Intelligence , volume 325, 2322--2329. IOS Press
work page 2020
-
[4]
Cresswell, S. N.; McCluskey, T. L.; and West, M. M. 2013. Acquiring planning domain models using LOCM. The Knowledge Engineering Review, 28(2): 195--213
work page 2013
-
[5]
Dosovitskiy, A.; Beyer, L.; Kolesnikov, A.; Weissenborn, D.; Zhai, X.; Unterthiner, T.; Dehghani, M.; Minderer, M.; Heigold, G.; Gelly, S.; Uszkoreit, J.; and Houlsby, N. 2021. An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. In International Conference on Learning Representations
work page 2021
- [6]
-
[7]
Lamanna, L.; Serafini, L.; Saetti, A.; Gerevini, A. E.; and Traverso, P. 2025. Lifted action models learning from partial traces. Artificial Intelligence, 339: 104256
work page 2025
-
[8]
K.; Bau, D.; Vi \'e gas, F.; Pfister, H.; and Wattenberg, M
Li, K.; Hopkins, A. K.; Bau, D.; Vi \'e gas, F.; Pfister, H.; and Wattenberg, M. 2023. Emergent World Representations: Exploring a Sequence Model Trained on a Synthetic Task. In The Eleventh International Conference on Learning Representations
work page 2023
-
[9]
Lin, S.; and Bercher, P. 2022. On the Expressive Power of Planning Formalisms in Conjunction with LTL. Proceedings of the International Conference on Automated Planning and Scheduling, 32(1): 231--240
work page 2022
-
[10]
Lin, T.-Y.; Goyal, P.; Girshick, R.; He, K.; and Doll \'a r, P. 2017. Focal loss for dense object detection. In Proceedings of the IEEE international conference on computer vision, 2980--2988
work page 2017
- [11]
-
[12]
D.; Bonet, B.; Romero, J.; and Geffner, H
Rodriguez, I. D.; Bonet, B.; Romero, J.; and Geffner, H. 2021. Learning First-Order Representations for Planning from Black Box States: New Results . In Proceedings of the 18th International Conference on Principles of Knowledge Representation and Reasoning , 539--548
work page 2021
-
[13]
Tan, S.; Yang, S.; Courville, A.; Panda, R.; and Shen, Y. 2025. Scaling Stick-Breaking Attention: An Efficient Implementation and In-depth Study. In The Thirteenth International Conference on Learning Representations
work page 2025
-
[14]
Y.; Rambachan, A.; Kleinberg, J.; and Mullainathan, S
Vafa, K.; Chen, J. Y.; Rambachan, A.; Kleinberg, J.; and Mullainathan, S. 2024 a . Evaluating the world model implicit in a generative model. Advances in Neural Information Processing Systems, 37: 26941--26975
work page 2024
-
[15]
Y.; Rambachan, A.; Kleinberg, J.; and Mullainathan, S
Vafa, K.; Chen, J. Y.; Rambachan, A.; Kleinberg, J.; and Mullainathan, S. 2024 b . Evaluating the world model implicit in a generative model. Advances in Neural Information Processing Systems, 37: 26941--26975
work page 2024
-
[16]
N.; Kaiser, .; and Polosukhin, I
Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A. N.; Kaiser, .; and Polosukhin, I. 2017. Attention is all you need. Advances in neural information processing systems, 30
work page 2017
-
[17]
Weiss, G.; Goldberg, Y.; and Yahav, E. 2021. Thinking like transformers. In International Conference on Machine Learning, 11080--11090. PMLR
work page 2021
-
[18]
Xi, K.; Gould, S.; and Thi \'e baux, S. 2024. Neuro-Symbolic Learning of Lifted Action Models from Visual Traces. In Proceedings of the International Conference on Automated Planning and Scheduling, volume 34, 653--662
work page 2024
-
[19]
Yang, A.; Chiang, D.; and Angluin, D. 2024. Masked Hard-Attention Transformers Recognize Exactly the Star-Free Languages. In The Thirty-eighth Annual Conference on Neural Information Processing Systems
work page 2024
-
[20]
Yildirim, I.; and Paul, L. 2024. From task structures to world models: what do LLMs know? Trends in Cognitive Sciences, 28(5): 404--415
work page 2024
-
[21]
Zhuo, H. H.; and Kambhampati, S. 2013. Action-model acquisition from noisy plan traces. In Proceedings of the Twenty-Third International Joint Conference on Artificial Intelligence, 2444–2450. AAAI Press
work page 2013
-
[22]
, " * write output.state after.block = add.period write newline
ENTRY address archivePrefix author booktitle chapter edition editor eid eprint howpublished institution isbn journal key month note number organization pages publisher school series title type volume year label extra.label sort.label short.list INTEGERS output.state before.all mid.sentence after.sentence after.block FUNCTION init.state.consts #0 'before.a...
-
[23]
" write newline "" before.all 'output.state := FUNCTION n.dashify 't := "" t empty not t #1 #1 substring "-" = t #1 #2 substring "--" = not "--" * t #2 global.max substring 't := t #1 #1 substring "-" = "-" * t #2 global.max substring 't := while if t #1 #1 substring * t #2 global.max substring 't := if while FUNCTION word.in bbl.in capitalize " " * FUNCT...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.