pith. sign in

Huizhen Yu

Identifiers

  • name variant Huizhen Yu 0.60 · backfill

Papers (17)

  1. Asynchronous Stochastic Approximation with Applications to Average-Reward Reinforcement Learning cs.LG · 2024 · author #1
  2. On Strategic Measures and Optimality Properties in Discrete-Time Stochastic Control with Universally Measurable Policies math.OC · 2022 · author #1
  3. On Linear Programming for Constrained and Unconstrained Average-Cost Markov Decision Processes with Countable Action Spaces and Strictly Unbounded Costs math.OC · 2019 · author #1
  4. On Markov Decision Processes with Borel Spaces and an Average Cost Criterion math.OC · 2019 · author #1
  5. Two geometric input transformation methods for fast online reinforcement learning with neural nets cs.LG · 2018 · author #2
  6. On Convergence of some Gradient-based Temporal-Differences Algorithms for Off-Policy Learning cs.LG · 2017 · author #1
  7. On Generalized Bellman Equations and Temporal-Difference Learning cs.LG · 2017 · author #1
  8. Multi-step Off-policy Learning Without Importance Sampling Ratios cs.LG · 2017 · author #2
  9. Some Simulation Results for Emphatic Temporal-Difference Learning Algorithms cs.LG · 2016 · author #1
  10. Weak Convergence Properties of Constrained Emphatic Temporal-difference Learning with Constant and Slowly Diminishing Stepsize cs.LG · 2015 · author #1
  11. Emphatic Temporal-Difference Learning cs.LG · 2015 · author #2
  12. On Convergence of Emphatic Temporal-Difference Learning cs.LG · 2015 · author #1
  13. Stochastic Shortest Path Games and Q-Learning math.OC · 2014 · author #1
  14. On Convergence of Value Iteration for a Class of Total Cost Markov Decision Processes math.OC · 2014 · author #1
  15. A Mixed Value and Policy Iteration Method for Stochastic Control with Universally Measurable Policies math.OC · 2013 · author #1
  16. Discretized Approximations for POMDP with Average Cost cs.AI · 2012 · author #1
  17. A Function Approximation Approach to Estimation of Policy Gradient for POMDP with Structured Policies cs.LG · 2012 · author #1

Mentions

  • 1511.07471 #1 · backfill · confidence 0.70 Huizhen Yu
  • 1507.01569 #2 · backfill · confidence 0.70 Huizhen Yu
  • 1506.02582 #1 · backfill · confidence 0.70 Huizhen Yu
  • 1207.4154 #1 · arxiv_oai · confidence 0.70 Huizhen Yu
  • 1412.8570 #1 · backfill · confidence 0.70 Huizhen Yu
  • 2409.03915 #1 · arxiv_oai · confidence 0.70 Huizhen Yu
  • 2206.06492 #1 · arxiv_oai · confidence 0.70 Huizhen Yu
  • 1905.12095 #1 · arxiv_oai · confidence 0.70 Huizhen Yu
  • 1411.1459 #1 · backfill · confidence 0.70 Huizhen Yu
  • 1308.3814 #1 · backfill · confidence 0.70 Huizhen Yu
  • 1207.4154 #1 · backfill · confidence 0.70 Huizhen Yu
  • 1207.1421 #1 · backfill · confidence 0.70 Huizhen Yu

Frequent Coauthors