pith. sign in

arxiv: 2606.09371 · v1 · pith:734O2Z5Anew · submitted 2026-06-08 · 💻 cs.AI

Capability-Aligned Hierarchical Learning for Tool-Augmented LLMs

classification 💻 cs.AI
keywords hierarchicalhigh-levellearninglow-leveltaskscahlcapability-alignedeffectiveness
0
0 comments X
read the original abstract

Tool learning enables LLMs to invoke external tools to accomplish tasks. Prior studies have demonstrated the effectiveness of a hierarchical structure: a high-level policy handles global planning and decomposes tasks into manageable sub-tasks, and a low-level policy focuses on invoking tools to solve these sub-tasks. However, these works typically optimize the high-level and low-level policies separately, leading to planner-executor misalignment and limiting LLM performance on tool-use tasks. In this paper, we propose a method called Capability-Aligned Hierarchical Learning (CAHL), which leverages RLVR to jointly optimize both policies, enabling better alignment between the high-level planner and the low-level executor. Experiments on constrained tool-use benchmarks (API-Bank and BFCL) and an open-ended environment (Bamboogle) demonstrate the effectiveness of CAHL.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.