HGMEM: Hypergraph-based Working Memory to Improve Multi-step RAG for Long-Context Complex Relational Modeling

Judging llm-as-a-judge with mt-bench, chatbot arena · 2025 · cs.CL · arXiv 2512.23959

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

open full Pith review browse 3 citing papers arXiv PDF

abstract

Multi-step retrieval-augmented generation (RAG) has become a widely adopted strategy for enhancing large language models (LLMs) on tasks that demand global comprehension and intensive reasoning. Although many RAG systems incorporate a working memory to consolidate information, existing designs primarily function as a passive storage for isolated facts. This static nature overlooks crucial high-order correlations among primitive facts, thereby limiting models' capacity for multi-step reasoning and resulting in fragmented reasoning and weak global sense-making within extended contexts. We introduce HGMem, a hypergraph-based working memory system, extending the concept of memory beyond simple storage into a dynamic, expressive structure for complex reasoning and global understanding. In our approach, memory is represented as a hypergraph where hyperedges correspond to distinct memory units, enabling the progressive formation of high-order interactions within memory. This mechanism connects facts and thoughts around the focal problem, evolving the memory into an integrated and situated knowledge structure that provides strong propositions for deeper reasoning. We evaluate HGMem on several challenging global sense-making benchmarks. Extensive experiments and in-depth analyses demonstrate that our method consistently improves multi-step RAG and substantially outperforms strong baseline systems across diverse datasets.

representative citing papers

HKVM-RAG: Key-Value-Separated Hypergraph Evidence Organization for Multi-Hop RAG

cs.IR · 2026-06-05 · unverdicted · novelty 6.0

HKVM-RAG uses key-value-separated hypergraphs to organize LLM evidence tuples into answer-path hyperedges, yielding F1 gains over KG-PPR on two multi-hop QA benchmarks and further gains when combined with dense retrievers.

MemGraphRAG: Memory-based Multi-Agent System for Graph Retrieval-Augmented Generation

cs.IR · 2026-05-30 · unverdicted · novelty 6.0

MemGraphRAG uses a memory-based multi-agent system for globally consistent graph construction from fragmented corpora plus a memory-aware hierarchical retriever, claiming better benchmark performance than prior GraphRAG methods at similar cost.

Judge Like Human Examiners: A Weighted Importance Multi-Point Evaluation Framework for Generative Tasks with Long-form Answers

cs.CL · 2026-04-13 · unverdicted · novelty 6.0

WIMPE factorizes reference answers into weighted context-bound points and applies alignment (WPA) and conflict penalty (PCP) metrics, yielding higher human correlation than prior rubric or checklist methods across 10 generative tasks.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Judge Like Human Examiners: A Weighted Importance Multi-Point Evaluation Framework for Generative Tasks with Long-form Answers cs.CL · 2026-04-13 · unverdicted · none · ref 3 · internal anchor
WIMPE factorizes reference answers into weighted context-bound points and applies alignment (WPA) and conflict penalty (PCP) metrics, yielding higher human correlation than prior rubric or checklist methods across 10 generative tasks.

HGMEM: Hypergraph-based Working Memory to Improve Multi-step RAG for Long-Context Complex Relational Modeling

fields

years

verdicts

representative citing papers

citing papers explorer