Reflexion: Language agents with verbal reinforcement learning

· 2023

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

representative citing papers

Multi-Dimensional Behavioral Evaluation of Agentic Stock Prediction Systems Using Large Language Model Judges with Closed-Loop Reinforcement Learning Feedback

cs.LG · 2026-05-07 · unverdicted · novelty 7.0 · 2 refs

A multi-dimensional behavioral scoring system using LLM judges evaluates agentic stock predictors and feeds scores into closed-loop RL to improve one-day MAPE by 11.5% on held-out data.

CastFlow: Learning Role-Specialized Agentic Workflows for Time Series Forecasting

cs.LG · 2026-04-30 · unverdicted · novelty 7.0

CastFlow introduces a role-specialized agentic workflow with memory retrieval and multi-view toolkit for iterative ensemble time series forecasting, using two-stage SFT+RLVR training on a domain-specific LLM to outperform static baselines.

citing papers explorer

Showing 2 of 2 citing papers.

Multi-Dimensional Behavioral Evaluation of Agentic Stock Prediction Systems Using Large Language Model Judges with Closed-Loop Reinforcement Learning Feedback cs.LG · 2026-05-07 · unverdicted · none · ref 16 · 2 links
A multi-dimensional behavioral scoring system using LLM judges evaluates agentic stock predictors and feeds scores into closed-loop RL to improve one-day MAPE by 11.5% on held-out data.
CastFlow: Learning Role-Specialized Agentic Workflows for Time Series Forecasting cs.LG · 2026-04-30 · unverdicted · none · ref 59
CastFlow introduces a role-specialized agentic workflow with memory retrieval and multi-view toolkit for iterative ensemble time series forecasting, using two-stage SFT+RLVR training on a domain-specific LLM to outperform static baselines.

Reflexion: Language agents with verbal reinforcement learning

fields

years

verdicts

representative citing papers

citing papers explorer