Constructing Non-Markovian Decision Process via History Aggregator

Wenxin Li; Yongyi Wang

arxiv: 2506.24026 · v1 · pith:CU47OMCHnew · submitted 2025-06-30 · 💻 cs.AI

Constructing Non-Markovian Decision Process via History Aggregator

Yongyi Wang , Wenxin Li This is my paper

classification 💻 cs.AI

keywords non-markoviandecisiondynamicscategorydecision-makingaggregatoralgorithmseffectiveness

0 comments

read the original abstract

In the domain of algorithmic decision-making, non-Markovian dynamics manifest as a significant impediment, especially for paradigms such as Reinforcement Learning (RL), thereby exerting far-reaching consequences on the advancement and effectiveness of the associated systems. Nevertheless, the existing benchmarks are deficient in comprehensively assessing the capacity of decision algorithms to handle non-Markovian dynamics. To address this deficiency, we have devised a generalized methodology grounded in category theory. Notably, we established the category of Markov Decision Processes (MDP) and the category of non-Markovian Decision Processes (NMDP), and proved the equivalence relationship between them. This theoretical foundation provides a novel perspective for understanding and addressing non-Markovian dynamics. We further introduced non-Markovianity into decision-making problem settings via the History Aggregator for State (HAS). With HAS, we can precisely control the state dependency structure of decision-making problems in the time series. Our analysis demonstrates the effectiveness of our method in representing a broad range of non-Markovian dynamics. This approach facilitates a more rigorous and flexible evaluation of decision algorithms by testing them in problem settings where non-Markovian dynamics are explicitly constructed.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Synthetic POMDPs to Challenge Memory-Augmented RL: Memory Demand Structure Modeling
cs.AI 2025-08 unverdicted novelty 6.0

Presents MDS framework, linear-dynamics construction method, and tunable synthetic POMDP suite for controlled testing of memory-augmented reinforcement learning.