Clonemem: Benchmarking long-term memory for ai clones.arXiv preprint arXiv:2601.07023, 2026

SenHu,ZhiyuZhang,YuxiangWei,XueranHan,ZhenhengTang,HuacanWang,andRonghaoChen · 2026 · arXiv 2601.07023

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

HEART-Bench: Do LLM Agents Exhibit Human-like Psychology?

cs.CL · 2026-05-28 · unverdicted · novelty 7.0

HEART-Bench evaluates LLM agents on psychological consistency using 11 Big-Five-grounded characters with 1,000 episodic memories each and 64 DIAMONDS-based decision scenarios, yielding 673 validated MCQs.

citing papers explorer

Showing 1 of 1 citing paper after filters.

HEART-Bench: Do LLM Agents Exhibit Human-like Psychology? cs.CL · 2026-05-28 · unverdicted · none · ref 19
HEART-Bench evaluates LLM agents on psychological consistency using 11 Big-Five-grounded characters with 1,000 episodic memories each and 64 DIAMONDS-based decision scenarios, yielding 673 validated MCQs.

Clonemem: Benchmarking long-term memory for ai clones.arXiv preprint arXiv:2601.07023, 2026

fields

years

verdicts

representative citing papers

citing papers explorer