Deep FinResearch Bench evaluates AI financial research reports on rigor, forecasting accuracy, and verifiability, finding them inferior to human-authored ones.
Tianyu Zhou, Pinqiao Wang, Yilin Wu, and Hongyang Yang
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
years
2026 2verdicts
UNVERDICTED 2representative citing papers
MimirRAG, a multi-agent RAG framework with metadata integration and table-aware chunking, reaches 89.3% accuracy on FinanceBench and outperforms prior baselines for financial document retrieval.
citing papers explorer
-
Deep FinResearch Bench: Evaluating AI's Ability to Conduct Professional Financial Investment Research
Deep FinResearch Bench evaluates AI financial research reports on rigor, forecasting accuracy, and verifiability, finding them inferior to human-authored ones.