Doina Precup

Identifiers

  • name variant Doina Precup 0.60 · backfill

Papers (58)

  1. Human Adults and LLMs as Scientists: Who Benefits from Active Exploration? cs.CL · 2026 · author #8
  2. Using Reward Uncertainty to Induce Diverse Behaviour in Reinforcement Learning cs.LG · 2026 · author #8
  3. Reinforcement Learning with Pairwise Preferences in Long-Term Decision Problems cs.LG · 2026 · author #3
  4. Balancing Plasticity and Stability with Fast and Slow Successor Features cs.LG · 2026 · author #2
  5. Rotation-Preserving Supervised Fine-Tuning cs.LG · 2026 · author #6
  6. RL Fine-Tuning Heals OOD Forgetting in SFT cs.LG · 2025 · author #7
  7. Training Language Models to Self-Correct via Reinforcement Learning cs.LG · 2024 · author #16
  8. Recurrent Value Functions cs.LG · 2019 · author #4
  9. Uncertainty Aware Learning from Demonstrations in Multiple Contexts using Bayesian Neural Networks cs.RO · 2019 · author #4
  10. Learning Modular Safe Policies in the Bandit Setting with Application to Adaptive Clinical Trials cs.LG · 2019 · author #2
  11. The Termination Critic cs.AI · 2019 · author #6
  12. Clustering-Oriented Representation Learning with Attractive-Repulsive Loss cs.LG · 2018 · author #6
  13. Environments for Lifelong Reinforcement Learning cs.AI · 2018 · author #4
  14. The Barbados 2018 List of Open Issues in Continual Learning cs.AI · 2018 · author #10
  15. Temporal Regularization in Markov Decision Process cs.LG · 2018 · author #4
  16. Combined Reinforcement Learning via Abstract Representations cs.LG · 2018 · author #3
  17. Undersampling and Bagging of Decision Trees in the Analysis of Cardiorespiratory Behavior for the Prediction of Extubation Readiness in Extremely Preterm Infants cs.LG · 2018 · author #7
  18. Predicting Extubation Readiness in Extreme Preterm Infants based on Patterns of Breathing cs.LG · 2018 · author #7
  19. A Semi-Markov Chain Approach to Modeling Respiratory Patterns Prior to Extubation in Preterm Infants eess.SP · 2018 · author #7
  20. Exploring Uncertainty Measures in Deep Networks for Multiple Sclerosis Lesion Detection and Segmentation cs.CV · 2018 · author #2
  21. Attend Before you Act: Leveraging human visual attention for continual learning cs.AI · 2018 · author #2
  22. Connecting Weighted Automata and Recurrent Neural Networks through Spectral Learning cs.LG · 2018 · author #3
  23. Resolving Event Coreference with Supervised Representation Learning and Clustering-Oriented Regularization cs.CL · 2018 · author #3
  24. Dyna Planning using a Feature Based Generative Model cs.LG · 2018 · author #2
  25. Learning Safe Policies with Expert Guidance cs.LG · 2018 · author #3
  26. Disentangling the independently controllable factors of variation by interacting with the world stat.ML · 2018 · author #8
  27. Learning Robust Options cs.AI · 2018 · author #4
  28. Learnings Options End-to-End for Continuous Action Tasks cs.LG · 2017 · author #4
  29. Ubenwa: Cry-based Diagnosis of Birth Asphyxia stat.ML · 2017 · author #5
  30. Learning with Options that Terminate Off-Policy cs.AI · 2017 · author #4
  31. OptionGAN: Learning Joint Reward-Policy Options using Generative Adversarial Inverse Reinforcement Learning cs.LG · 2017 · author #6
  32. Deep Reinforcement Learning that Matters cs.LG · 2017 · author #5
  33. When Waiting is not an Option : Learning Options with a Deliberation Cost cs.AI · 2017 · author #4
  34. Neural Network Based Nonlinear Weighted Finite Automata cs.FL · 2017 · author #3
  35. Reproducibility of Benchmarked Deep Reinforcement Learning Tasks for Continuous Control cs.LG · 2017 · author #4
  36. Independently Controllable Factors cs.LG · 2017 · author #8
  37. Variational Generative Stochastic Networks with Collaborative Shaping cs.LG · 2017 · author #2
  38. Convergent Tree Backup and Retrace with Function Approximation cs.LG · 2017 · author #3
  39. Investigating Recurrence and Eligibility Traces in Deep Q-Networks cs.AI · 2017 · author #2
  40. Independently Controllable Features cs.LG · 2017 · author #4
  41. Multi-Timescale, Gradient Descent, Temporal Difference Learning with Linear Options cs.AI · 2017 · author #2
  42. A Matrix Splitting Perspective on Planning with Options cs.AI · 2016 · author #2
  43. The Option-Critic Architecture cs.AI · 2016 · author #3
  44. Leveraging Lexical Resources for Learning Entity Embeddings in Multi-Relational Data cs.CL · 2016 · author #4
  45. Differentially Private Policy Evaluation cs.LG · 2016 · author #3
  46. Policy Gradient Methods for Off-policy Control cs.AI · 2015 · author #2
  47. Conditional Computation in Neural Networks for faster models cs.LG · 2015 · author #4
  48. Testing Visual Attention in Dynamic Environments cs.LG · 2015 · author #3
  49. Data Generation as Sequential Decision Making cs.LG · 2015 · author #2
  50. A Canonical Form for Weighted Automata and Applications to Approximate Minimization cs.FL · 2015 · author #3
  51. Learning with Pseudo-Ensembles stat.ML · 2014 · author #3
  52. Practical Kernel-Based Reinforcement Learning cs.LG · 2014 · author #2
  53. Classification-based Approximate Policy Iteration: Experiments and Extended Discussions cs.LG · 2014 · author #2
  54. Algorithms for multi-armed bandit problems cs.AI · 2014 · author #2
  55. Bellman Error Based Feature Generation using Random Projections on Sparse Spaces cs.LG · 2012 · author #5
  56. Metrics for Finite Markov Decision Processes cs.AI · 2012 · author #3
  57. Metrics for Markov Decision Processes with Infinite State Spaces cs.AI · 2012 · author #3
  58. Methods for computing state similarity in Markov Decision Processes cs.AI · 2012 · author #3

Mentions

  • 1511.06297 #4 · backfill · confidence 0.70 Doina Precup
  • 1510.08949 #3 · backfill · confidence 0.70 Doina Precup
  • 2606.06464 #8 · arxiv_oai · confidence 0.70 Doina Precup
  • 1407.0449 #2 · arxiv_oai · confidence 0.70 Doina Precup
  • 1506.03504 #2 · backfill · confidence 0.70 Doina Precup
  • 2606.03962 #8 · arxiv_oai · confidence 0.70 Doina Precup
  • 1501.06841 #3 · backfill · confidence 0.70 Doina Precup
  • 1412.4864 #3 · backfill · confidence 0.70 Doina Precup
  • 2606.00367 #3 · arxiv_oai · confidence 0.70 Doina Precup
  • 1407.5358 #2 · backfill · confidence 0.70 Doina Precup
  • 1407.0449 #2 · backfill · confidence 0.70 Doina Precup
  • 1402.6028 #2 · backfill · confidence 0.70 Doina Precup
  • 2605.26357 #2 · arxiv_oai · confidence 0.70 Doina Precup
  • 1207.5554 #5 · backfill · confidence 0.70 Doina Precup
  • 1207.4114 #3 · backfill · confidence 0.70 Doina Precup
  • 1207.1386 #3 · backfill · confidence 0.70 Doina Precup
  • 1206.6836 #3 · backfill · confidence 0.70 Doina Precup
  • 2409.12917 #16 · arxiv_oai · confidence 0.70 Doina Precup

Frequent Coauthors