← back to paper
arxiv: 2510.05921 · 2 revisions
Prompt reinforcing for long-term planning of large language models