arXiv preprint arXiv:2602.11683 , year=

ThinkRouter: Efficient Reasoning via Routing Thinking between Latent, Discrete Spaces , author= · arXiv 2602.11683

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

cs.CL · 2026-06-01 · unverdicted · novelty 6.0

ALAR trains LLM agents to perform most reasoning in a latent space supervised by actions and escalates to explicit CoT only when needed, cutting tokens by up to 84.6% while preserving accuracy on search and tool-use benchmarks.

TARPO: Token-Wise Latent-Explicit Reasoning via Action-Routing Policy Optimization

cs.CL · 2026-06-04 · unverdicted · novelty 5.0

TARPO is a pure RL framework using a token-wise action router to switch between discrete token generation and latent reasoning in LLMs, with joint optimization showing outperformance on benchmarks.

citing papers explorer

Showing 2 of 2 citing papers after filters.

Adaptive Latent Agentic Reasoning cs.CL · 2026-06-01 · unverdicted · none · ref 29
ALAR trains LLM agents to perform most reasoning in a latent space supervised by actions and escalates to explicit CoT only when needed, cutting tokens by up to 84.6% while preserving accuracy on search and tool-use benchmarks.
TARPO: Token-Wise Latent-Explicit Reasoning via Action-Routing Policy Optimization cs.CL · 2026-06-04 · unverdicted · none · ref 31
TARPO is a pure RL framework using a token-wise action router to switch between discrete token generation and latent reasoning in LLMs, with joint optimization showing outperformance on benchmarks.

arXiv preprint arXiv:2602.11683 , year=

fields

years

verdicts

representative citing papers

citing papers explorer