Revisit mixture models for multi-agent simulation: Experimental study within a unified framework, 2025

Longzhong Lin, Xuewu Lin, Kechun Xu, Haojian Lu, Lichao Huang, Rong Xiong, Yue Wang · 2025 · arXiv 2501.17015

3 Pith papers cite this work. Polarity classification is still indexing.


representative citing papers

PyFi: Toward Pyramid-like Financial Image Understanding for VLMs via Adversarial Agents

q-fin.CP · 2025-12-11 · unverdicted · novelty 7.0

PyFi generates a 600K pyramid QA dataset for financial images using adversarial MCTS agents, allowing fine-tuned VLMs to decompose complex questions and achieve 19.52% and 8.06% accuracy gains on Qwen2.5-VL models.

Goal-Oriented Reactive Simulation for Closed-Loop Trajectory Prediction

cs.RO · 2026-03-25 · conditional · novelty 6.0

Closed-loop on-policy training with a reactive goal-oriented scene decoder cuts collision rates by up to 79.5% in dense traffic compared to standard open-loop baselines.

RLFTSim: Realistic and Controllable Multi-Agent Traffic Simulation via Reinforcement Learning Fine-Tuning

cs.RO · 2026-05-18 · unverdicted · novelty 5.0

RLFTSim uses RL fine-tuning on a pre-trained model with a balanced reward to align traffic simulator rollouts to real data distributions and distill goal-conditioned controllability, reporting SOTA realism on the Waymo Open Motion Dataset.

citing papers explorer


PyFi: Toward Pyramid-like Financial Image Understanding for VLMs via Adversarial Agents
q-fin.CP · 2025-12-11 · unverdicted · none · ref 3

Goal-Oriented Reactive Simulation for Closed-Loop Trajectory Prediction
cs.RO · 2026-03-25 · conditional · none · ref 30

RLFTSim: Realistic and Controllable Multi-Agent Traffic Simulation via Reinforcement Learning Fine-Tuning
cs.RO · 2026-05-18 · unverdicted · none · ref 17
