pith. machine review for the scientific record. sign in

Your agent may misevolve: Emergent risks in self-evolving llm agents

8 Pith papers cite this work. Polarity classification is still indexing.

8 Pith papers citing it

years

2026 8

representative citing papers

Belief Memory: Agent Memory Under Partial Observability

cs.AI · 2026-05-07 · unverdicted · novelty 7.0 · 2 refs

BeliefMem is a probabilistic memory architecture for LLM agents that retains multiple candidate conclusions with probabilities updated by Noisy-OR, achieving superior average performance over deterministic baselines on LoCoMo and ALFWorld.

MemEvoBench: Benchmarking Memory MisEvolution in LLM Agents

cs.CL · 2026-04-17 · unverdicted · novelty 7.0

MemEvoBench is the first benchmark for long-horizon memory safety in LLM agents, using QA tasks across 7 domains and 36 risks plus workflow tasks with noisy tools to measure behavioral drift from biased memory updates.

citing papers explorer

Showing 8 of 8 citing papers.