A new evaluation protocol shows agent memory reliability degrades variably with added irrelevant sessions depending on agent, memory interface, and scale.
Explaining context length scaling and bounds for language models,
6 Pith papers cite this work. Polarity classification is still indexing.
verdicts
UNVERDICTED 6representative citing papers
AgenticAI-DialogGen uses LLM agents to generate persona-grounded, topic-guided conversations and QA pairs encoding short- and long-term memory, producing the TGC dataset that improves LLM performance on memory tasks.
LZ78 sources are almost stationary ergodic processes satisfying a Shannon-McMillan-Breiman property and local i.i.d. convergence, yet their finite-state compressibility exceeds the entropy rate by a Jensen gap.
SCM-GRPO grounds multi-hop fact verification in structural causal models and applies GRPO reinforcement learning to optimize reasoning chain length, outperforming baselines on HoVer and EX-FEVER.
Video Parallel Scaling improves VideoLLM performance by aggregating outputs from parallel inferences on complementary disjoint frame subsets, effectively contracting the Chinchilla scaling law via uncorrelated visual evidence.
RAM outperforms prior methods on PoseTrack and 3DPW for zero-shot multi-person 3D motion tracking and reconstruction by fusing semantic tracking, memory-augmented pose estimation, and predictive fusion.
citing papers explorer
-
When Stored Evidence Stops Being Usable: Scale-Conditioned Evaluation of Agent Memory
A new evaluation protocol shows agent memory reliability degrades variably with added irrelevant sessions depending on agent, memory interface, and scale.
-
AgenticAI-DialogGen: Topic-Guided Conversation Generation for Fine-Tuning and Evaluating Short- and Long-Term Memories of LLMs
AgenticAI-DialogGen uses LLM agents to generate persona-grounded, topic-guided conversations and QA pairs encoding short- and long-term memory, producing the TGC dataset that improves LLM performance on memory tasks.
-
The LZ78 Source
LZ78 sources are almost stationary ergodic processes satisfying a Shannon-McMillan-Breiman property and local i.i.d. convergence, yet their finite-state compressibility exceeds the entropy rate by a Jensen gap.
-
Grounding Multi-Hop Reasoning in Structural Causal Models via Group Relative Policy Optimization
SCM-GRPO grounds multi-hop fact verification in structural causal models and applies GRPO reinforcement learning to optimize reasoning chain length, outperforming baselines on HoVer and EX-FEVER.
-
Video Parallel Scaling: Aggregating Diverse Frame Subsets for VideoLLMs
Video Parallel Scaling improves VideoLLM performance by aggregating outputs from parallel inferences on complementary disjoint frame subsets, effectively contracting the Chinchilla scaling law via uncorrelated visual evidence.
-
RAM: Recover Any 3D Human Motion in-the-Wild
RAM outperforms prior methods on PoseTrack and 3DPW for zero-shot multi-person 3D motion tracking and reconstruction by fusing semantic tracking, memory-augmented pose estimation, and predictive fusion.