hub

A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem

· 2017 · q-fin.CP · arXiv 1706.10059

13 Pith papers cite this work. Polarity classification is still indexing.

13 Pith papers citing it

open full Pith review browse 13 citing papers arXiv PDF

abstract

Financial portfolio management is the process of constant redistribution of a fund into different financial products. This paper presents a financial-model-free Reinforcement Learning framework to provide a deep machine learning solution to the portfolio management problem. The framework consists of the Ensemble of Identical Independent Evaluators (EIIE) topology, a Portfolio-Vector Memory (PVM), an Online Stochastic Batch Learning (OSBL) scheme, and a fully exploiting and explicit reward function. This framework is realized in three instants in this work with a Convolutional Neural Network (CNN), a basic Recurrent Neural Network (RNN), and a Long Short-Term Memory (LSTM). They are, along with a number of recently reviewed or published portfolio-selection strategies, examined in three back-test experiments with a trading period of 30 minutes in a cryptocurrency market. Cryptocurrencies are electronic and decentralized alternatives to government-issued money, with Bitcoin as the best-known example of a cryptocurrency. All three instances of the framework monopolize the top three positions in all experiments, outdistancing other compared trading algorithms. Although with a high commission rate of 0.25% in the backtests, the framework is able to achieve at least 4-fold returns in 50 days.

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

A Three-Phase Foundation Model for Tax-Aware Personalized Portfolio Management

cs.AI · 2026-06-30 · unverdicted · novelty 7.0

A three-phase DRL framework for personalized portfolio management using a ticker-free encoder pretrained with a time series foundation model, an objective-conditioned MoE actor-critic, and inference-time LoRA adaptation from brokerage data.

Prediction Arena: Benchmarking AI Models on Real-World Prediction Markets

cs.LG · 2026-03-28 · unverdicted · novelty 7.0

Frontier AI models lose 16-31% trading on Kalshi over 57 days but show better results on Polymarket, with platform design strongly affecting outcomes and prediction accuracy mattering more than research volume.

Counterfactual Transport Flows for Offline Conservative Trajectory Refinement

cs.LG · 2026-06-08 · unverdicted · novelty 6.0

Counterfactual transport flows enable conservative, instance-specific trajectory refinement in offline RL by constructing local preference pairs in latent space from offline data and learning refinement directions controlled by a strength parameter.

A Meta Reinforcement Learning Approach to Goals-Based Wealth Management

cs.LG · 2026-05-04 · unverdicted · novelty 6.0

MetaRL pre-trained on GBWM problems delivers near-optimal dynamic strategies in 0.01s achieving 97.8% of DP optimal utility and handles larger problems where DP fails.

SBCA: Cross-Modal BERT-driven Actor-Critic for Multi-Asset Portfolio Optimization

q-fin.CP · 2026-05-02 · unverdicted · novelty 6.0

SBCA is a reinforcement learning framework using BERT cross-modal fusion and Actor-Critic to integrate price data with sentiment text for multi-asset portfolio optimization with practical trading constraints.

When Missing Becomes Structure: Intent-Preserving Policy Completion from Financial KOL Discourse

cs.LG · 2026-04-15 · unverdicted · novelty 6.0

KICL completes execution decisions in KOL financial discourse using offline RL, achieving top returns and Sharpe ratios with no unsupported trades or direction changes on YouTube and X data from 2022-2025.

Mitigating Bias in Low-SNR Financial Reinforcement Learning via Quantum Representations

cs.LG · 2026-06-09 · unverdicted · novelty 5.0

FPQC-SAC adds a bounded parameterized quantum circuit to SAC to constrain representations in low-SNR financial environments, reporting 66.89% higher cumulative returns than standard SAC on real portfolio tasks.

Addressing Market Regime Changes and Heavy-Tailed Returns in Portfolio Optimization via Bayesian VAR and Elliptical Black-Litterman

cs.LG · 2026-06-08 · unverdicted · novelty 4.0

BAVAR-BLED combines BAVAR for regime-aware priors and BLED with Student's t-distributions inside TD3, reporting Sharpe 1.72 and Sortino 2.70 on 29 DJIA stocks over 10 years.

Macro Economists in the Machine: A Multi-Agent LLM Framework for Commodity-Related ETF Portfolio Construction

q-fin.PM · 2026-06-06 · conditional · novelty 4.0

LLM agents (hawkish, dovish, debate) outperform a deterministic z-score rule agent in Sharpe ratio for commodity ETF portfolios by 0.04-0.044, with advantage concentrated in the soft-landing sub-period and preserved up to 30bp trading costs.

Dynamic Multi-Pair Trading Strategy in Cryptocurrency Markets with Deep Reinforcement Learning

cs.LG · 2026-06-03 · unverdicted · novelty 4.0

A hybrid DRL system for multi-pair crypto trading with deterministic risk shielding outperforms a heuristic baseline at 10% significance on Binance futures data.

Regime-Adaptive Continual Learning for Portfolio Management

q-fin.PM · 2026-05-29 · unverdicted · novelty 4.0

ReCAP segments markets into regimes, builds a policy library via continual learning, and uses a regime-gate to adapt trading policies, claiming superior returns and fast adaptation on five real datasets.

Portfolio Optimization Proxies under Label Scarcity and Regime Shifts via Bayesian and Deterministic Students under Semi-Supervised Sandwich Training

cs.LG · 2026-04-04 · unverdicted · novelty 4.0

A semi-supervised teacher-student framework enables neural networks to proxy CVaR portfolio optimization using synthetic data augmentation for scarce labels and regime shifts.

A Systematic Review of Recent Advancements in PINN Augmented Deep Learning and Mathematical Modeling for Efficient Portfolio Management

math.OC · 2026-04-30 · unverdicted · novelty 2.0

A systematic review of physics-informed neural networks and mathematical modeling approaches for portfolio optimization and management in finance.

citing papers explorer

Showing 1 of 1 citing paper after filters.

A Three-Phase Foundation Model for Tax-Aware Personalized Portfolio Management cs.AI · 2026-06-30 · unverdicted · none · ref 10 · internal anchor
A three-phase DRL framework for personalized portfolio management using a ticker-free encoder pretrained with a time series foundation model, an objective-conditioned MoE actor-critic, and inference-time LoRA adaptation from brokerage data.

A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer