Huizhen Yu
Identifiers
- name variant Huizhen Yu 0.60 · backfill
Papers (17)
- Asynchronous Stochastic Approximation with Applications to Average-Reward Reinforcement Learning cs.LG · 2024 · author #1
- On Strategic Measures and Optimality Properties in Discrete-Time Stochastic Control with Universally Measurable Policies math.OC · 2022 · author #1
- On Linear Programming for Constrained and Unconstrained Average-Cost Markov Decision Processes with Countable Action Spaces and Strictly Unbounded Costs math.OC · 2019 · author #1
- On Markov Decision Processes with Borel Spaces and an Average Cost Criterion math.OC · 2019 · author #1
- Two geometric input transformation methods for fast online reinforcement learning with neural nets cs.LG · 2018 · author #2
- On Convergence of some Gradient-based Temporal-Differences Algorithms for Off-Policy Learning cs.LG · 2017 · author #1
- On Generalized Bellman Equations and Temporal-Difference Learning cs.LG · 2017 · author #1
- Multi-step Off-policy Learning Without Importance Sampling Ratios cs.LG · 2017 · author #2
- Some Simulation Results for Emphatic Temporal-Difference Learning Algorithms cs.LG · 2016 · author #1
- Weak Convergence Properties of Constrained Emphatic Temporal-difference Learning with Constant and Slowly Diminishing Stepsize cs.LG · 2015 · author #1
- Emphatic Temporal-Difference Learning cs.LG · 2015 · author #2
- On Convergence of Emphatic Temporal-Difference Learning cs.LG · 2015 · author #1
- Stochastic Shortest Path Games and Q-Learning math.OC · 2014 · author #1
- On Convergence of Value Iteration for a Class of Total Cost Markov Decision Processes math.OC · 2014 · author #1
- A Mixed Value and Policy Iteration Method for Stochastic Control with Universally Measurable Policies math.OC · 2013 · author #1
- Discretized Approximations for POMDP with Average Cost cs.AI · 2012 · author #1
- A Function Approximation Approach to Estimation of Policy Gradient for POMDP with Structured Policies cs.LG · 2012 · author #1
Mentions
- 1511.07471 #1 · backfill · confidence 0.70 Huizhen Yu
- 1507.01569 #2 · backfill · confidence 0.70 Huizhen Yu
- 1506.02582 #1 · backfill · confidence 0.70 Huizhen Yu
- 1207.4154 #1 · arxiv_oai · confidence 0.70 Huizhen Yu
- 1412.8570 #1 · backfill · confidence 0.70 Huizhen Yu
- 2409.03915 #1 · arxiv_oai · confidence 0.70 Huizhen Yu
- 2206.06492 #1 · arxiv_oai · confidence 0.70 Huizhen Yu
- 1905.12095 #1 · arxiv_oai · confidence 0.70 Huizhen Yu
- 1411.1459 #1 · backfill · confidence 0.70 Huizhen Yu
- 1308.3814 #1 · backfill · confidence 0.70 Huizhen Yu
- 1207.4154 #1 · backfill · confidence 0.70 Huizhen Yu
- 1207.1421 #1 · backfill · confidence 0.70 Huizhen Yu
Frequent Coauthors
- Richard S. Sutton 5 shared papers
- A. Rupam Mahmood 2 shared papers
- Ashique Rupam Mahmood 1 shared papers
- Banafsheh Rafiee 1 shared papers
- Dimitri Bertsekas 1 shared papers
- Dimitri P. Bertsekas 1 shared papers
- Martha White 1 shared papers
- Sina Ghiassian 1 shared papers
- Yi Wan 1 shared papers