Can long-context language models subsume retrieval, rag, sql, and more? arXiv preprint arXiv:2406.13121

· 2024 · arXiv 2406.13121

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

representative citing papers

RULER: What's the Real Context Size of Your Long-Context Language Models?

cs.CL · 2024-04-09 · accept · novelty 8.0

RULER shows most long-context LMs drop sharply in performance on complex tasks as length and difficulty increase, with only half maintaining results at 32K tokens.

Scalable Model-Based Clustering with Sequential Monte Carlo

stat.ML · 2026-04-16 · unverdicted · novelty 7.0

A memory-efficient SMC clustering method decomposes problems into approximately independent subproblems to handle large-scale online clustering with complex distributions.

LLM as Attention-Informed NTM and Topic Modeling as long-input Generation: Interpretability and long-Context Capability

cs.CL · 2025-10-03 · unverdicted · novelty 5.0

LLMs recover interpretable topic structures via attention and achieve competitive topic modeling performance as long-context generators.

World Model on Million-Length Video And Language With Blockwise RingAttention

cs.LG · 2024-02-13 · unverdicted · novelty 5.0

Presents open-source 7B models for million-token video and language understanding via Blockwise RingAttention, setting new benchmarks in retrieval and long video tasks.

Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities

cs.CL · 2025-07-07 · unverdicted · novelty 4.0

Gemini 2.5 Pro and Flash models are presented as achieving frontier performance in reasoning, coding, and long-context multimodal tasks while spanning a cost-capability Pareto curve.

citing papers explorer

Showing 5 of 5 citing papers.

RULER: What's the Real Context Size of Your Long-Context Language Models? cs.CL · 2024-04-09 · accept · none · ref 23
RULER shows most long-context LMs drop sharply in performance on complex tasks as length and difficulty increase, with only half maintaining results at 32K tokens.
Scalable Model-Based Clustering with Sequential Monte Carlo stat.ML · 2026-04-16 · unverdicted · none · ref 2
A memory-efficient SMC clustering method decomposes problems into approximately independent subproblems to handle large-scale online clustering with complex distributions.
LLM as Attention-Informed NTM and Topic Modeling as long-input Generation: Interpretability and long-Context Capability cs.CL · 2025-10-03 · unverdicted · none · ref 22
LLMs recover interpretable topic structures via attention and achieve competitive topic modeling performance as long-context generators.
World Model on Million-Length Video And Language With Blockwise RingAttention cs.LG · 2024-02-13 · unverdicted · none · ref 16
Presents open-source 7B models for million-token video and language understanding via Blockwise RingAttention, setting new benchmarks in retrieval and long video tasks.
Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities cs.CL · 2025-07-07 · unverdicted · none · ref 47
Gemini 2.5 Pro and Flash models are presented as achieving frontier performance in reasoning, coding, and long-context multimodal tasks while spanning a cost-capability Pareto curve.

Can long-context language models subsume retrieval, rag, sql, and more? arXiv preprint arXiv:2406.13121

fields

years

verdicts

representative citing papers

citing papers explorer