Preprint, arXiv:2502.08820

Can a single model master both multi-turn conversations and tool use? CoALM: A unified conversational agentic language model · 2025 · arXiv 2502.08820

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

ToolRL: Reward is All Tool Learning Needs

cs.LG · 2025-04-16 · conditional · novelty 6.0

A principled reward design for tool selection and application in RL-trained LLMs delivers 17% gains over base models and 15% over SFT across benchmarks.

A Survey of Context Engineering for Large Language Models

cs.CL · 2025-07-17 · accept · novelty 4.0

The survey organizes Context Engineering into retrieval, processing, management, and integrated systems like RAG and multi-agent setups while identifying an asymmetry where LLMs handle complex inputs well but struggle with equally sophisticated long outputs.

citing papers explorer

Showing 2 of 2 citing papers.

ToolRL: Reward is All Tool Learning Needs · cs.LG · 2025-04-16 · conditional · none · ref 1
A Survey of Context Engineering for Large Language Models · cs.CL · 2025-07-17 · accept · none · ref 9
