Huizhen Yu — Pith Author Registry

Identifiers

name variant Huizhen Yu 0.60 · backfill

Papers (17)

Asynchronous Stochastic Approximation with Applications to Average-Reward Reinforcement Learning cs.LG · 2024 · author #1
On Strategic Measures and Optimality Properties in Discrete-Time Stochastic Control with Universally Measurable Policies math.OC · 2022 · author #1
On Linear Programming for Constrained and Unconstrained Average-Cost Markov Decision Processes with Countable Action Spaces and Strictly Unbounded Costs math.OC · 2019 · author #1
On Markov Decision Processes with Borel Spaces and an Average Cost Criterion math.OC · 2019 · author #1
Two geometric input transformation methods for fast online reinforcement learning with neural nets cs.LG · 2018 · author #2
On Convergence of some Gradient-based Temporal-Differences Algorithms for Off-Policy Learning cs.LG · 2017 · author #1
On Generalized Bellman Equations and Temporal-Difference Learning cs.LG · 2017 · author #1
Multi-step Off-policy Learning Without Importance Sampling Ratios cs.LG · 2017 · author #2
Some Simulation Results for Emphatic Temporal-Difference Learning Algorithms cs.LG · 2016 · author #1
Weak Convergence Properties of Constrained Emphatic Temporal-difference Learning with Constant and Slowly Diminishing Stepsize cs.LG · 2015 · author #1
Emphatic Temporal-Difference Learning cs.LG · 2015 · author #2
On Convergence of Emphatic Temporal-Difference Learning cs.LG · 2015 · author #1
Stochastic Shortest Path Games and Q-Learning math.OC · 2014 · author #1
On Convergence of Value Iteration for a Class of Total Cost Markov Decision Processes math.OC · 2014 · author #1
A Mixed Value and Policy Iteration Method for Stochastic Control with Universally Measurable Policies math.OC · 2013 · author #1
Discretized Approximations for POMDP with Average Cost cs.AI · 2012 · author #1
A Function Approximation Approach to Estimation of Policy Gradient for POMDP with Structured Policies cs.LG · 2012 · author #1

Mentions

1511.07471 #1 · backfill · confidence 0.70 Huizhen Yu
1507.01569 #2 · backfill · confidence 0.70 Huizhen Yu
1506.02582 #1 · backfill · confidence 0.70 Huizhen Yu
1207.4154 #1 · arxiv_oai · confidence 0.70 Huizhen Yu
1412.8570 #1 · backfill · confidence 0.70 Huizhen Yu
2409.03915 #1 · arxiv_oai · confidence 0.70 Huizhen Yu
2206.06492 #1 · arxiv_oai · confidence 0.70 Huizhen Yu
1905.12095 #1 · arxiv_oai · confidence 0.70 Huizhen Yu
1411.1459 #1 · backfill · confidence 0.70 Huizhen Yu
1308.3814 #1 · backfill · confidence 0.70 Huizhen Yu
1207.4154 #1 · backfill · confidence 0.70 Huizhen Yu
1207.1421 #1 · backfill · confidence 0.70 Huizhen Yu

Frequent Coauthors

Richard S. Sutton 5 shared papers
A. Rupam Mahmood 2 shared papers
Ashique Rupam Mahmood 1 shared papers
Banafsheh Rafiee 1 shared papers
Dimitri Bertsekas 1 shared papers
Dimitri P. Bertsekas 1 shared papers
Martha White 1 shared papers
Sina Ghiassian 1 shared papers
Yi Wan 1 shared papers