Tree of thoughts: Deliberate problem solving with large language models

Shunyu Yao, Dian Yu, Jeffrey Zhao, Izhak Shafran, Tom Griffiths, Yuan Cao, Karthik Narasimhan · 2023

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

browse 3 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Same Signal, Opposite Meaning: Direction-Informed Adaptive Learning for LLM Agents

cs.LG · 2026-05-07 · unverdicted · novelty 6.0

Common gating signals for adaptive LLM compute have unstable directions across settings, and DIAL learns per-setting utility directions from signal-agnostic counterfactuals to outperform fixed-direction baselines.

Imitate, Explore, and Self-Improve: A Reproduction Report on Slow-thinking Reasoning Systems

cs.AI · 2024-12-12 · unverdicted · novelty 3.0

STILL-2 uses imitation of distilled long-form thoughts, multi-rollout exploration on difficult problems, and iterative self-improvement of the dataset to train reasoning models that reach competitive performance on three challenging benchmarks.

Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey

cs.CV · 2025-03-16 · unverdicted · novelty 2.0

The paper provides the first comprehensive survey of multimodal chain-of-thought reasoning, including foundational concepts, a taxonomy of methodologies, application analyses, challenges, and future directions.

citing papers explorer

Showing 3 of 3 citing papers.

Same Signal, Opposite Meaning: Direction-Informed Adaptive Learning for LLM Agents cs.LG · 2026-05-07 · unverdicted · none · ref 6
Common gating signals for adaptive LLM compute have unstable directions across settings, and DIAL learns per-setting utility directions from signal-agnostic counterfactuals to outperform fixed-direction baselines.
Imitate, Explore, and Self-Improve: A Reproduction Report on Slow-thinking Reasoning Systems cs.AI · 2024-12-12 · unverdicted · none · ref 23
STILL-2 uses imitation of distilled long-form thoughts, multi-rollout exploration on difficult problems, and iterative self-improvement of the dataset to train reasoning models that reach competitive performance on three challenging benchmarks.
Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey cs.CV · 2025-03-16 · unverdicted · none · ref 24
The paper provides the first comprehensive survey of multimodal chain-of-thought reasoning, including foundational concepts, a taxonomy of methodologies, application analyses, challenges, and future directions.

Tree of thoughts: Deliberate problem solving with large language models

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer