Beyond Pure Sampling: Hybrid Optimization Mechanisms for Non-Convex Model Predictive Control
Pith reviewed 2026-06-28 18:32 UTC · model grok-4.3
The pith
A hybrid DDP-then-Hessian-sampling mechanism in ME-DDP improves success rates over pure sampling in non-convex robotic MPC tasks.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The Maximum Entropy Differential Dynamic Programming framework employs an initial DDP phase to exploit local gradient information in the cost landscape followed by a sampling phase that draws from policies defined by the inverse Hessian of the action-value function; this dual mechanism allows the optimizer to escape suboptimal local minima while preserving closed-loop stability, yielding consistently higher or more stable success rates than either pure DDP or MPPI across low- and high-dimensional navigation tasks in cluttered environments.
What carries the argument
The dual-step optimization mechanism of gradient exploitation via DDP followed by disruption via sampling from policies characterized by the inverse Hessian of the action-value function, realized in Unimodal Gaussian, Multimodal Gaussian, and Stein Variational variants of ME-DDP.
If this is right
- In low-dimensional robotic systems the ME-DDP variants achieve higher task performance than MPPI with matching policy parameterizations.
- In high-dimensional robotic systems ME-DDP maintains higher success rates than MPPI even when MPPI occasionally produces faster trajectories.
- DDP-based methods including the ME-DDP variants exhibit greater overall stability than pure sampling schemes.
- The framework transfers to hardware, as shown by successful quadrotor flights through dense non-convex obstacle fields.
Where Pith is reading between the lines
- The inverse-Hessian sampling step could be grafted onto other gradient-based trajectory optimizers to improve escape from local minima.
- The observed low-versus-high dimensional performance split suggests that the value of the sampling phase scales with the complexity of the cost landscape.
- Hardware validation on one platform implies the method may extend to additional underactuated systems without major retuning.
Load-bearing premise
That sampling from policies defined by the inverse Hessian of the action-value function reliably disrupts convergence to suboptimal local minima without degrading closed-loop stability or requiring extensive tuning of entropy parameters.
What would settle it
A set of navigation experiments in which ME-DDP variants produce lower success rates or greater instability than the corresponding MPPI variants in both low-dimensional and high-dimensional robotic systems.
Figures
read the original abstract
This paper investigates the optimization mechanisms of non-convex Model Predictive Control (MPC) using the Maximum Entropy Differential Dynamic Programming (ME-DDP) framework. Navigating non-convex cost landscapes induced by nonlinear dynamics, multiple obstacles, etc. remains a fundamental challenge in robotics, where gradient-based methods frequently converge to suboptimal local minima. We demonstrate a dual-step optimization mechanism designed to overcome these traps. (1) an initial phase of using DDP to exploit the gradient of the cost landscape, followed by (2) disruption of the optimization via sampling from policies characterized by the inverse Hessian of the action-value function. We provide a rigorous analysis of this sampling mechanism of three ME-DDP variants: Unimodal Gaussian ME-DDP, Multimodal Gaussian ME-DDP, and Stein Variational DDP. Furthermore, with navigation tasks of four robotic systems under cluttered environments, we conduct extensive benchmarking of three variants of the ME-DDP, against deterministic DDP, and one of the most successful sampling-based schemes, Model Predictive Path Integral (MPPI) control with three policy parameterizations and update laws that correspond to those of ME-DDPs. The results show that in low-dimensional systems where the cost landscapes are relatively simple and local information is sufficiently representative, our framework consistently outperforms MPPIs. In high-dimensional systems, MPPI can occasionally discover aggressive maneuvers that enable it to steer the systems faster than DDP-based methods, whereas our method maintains a higher, more stable success rate. Finally, we validate the practical efficacy of the framework through hardware experiments with a quadrotor navigating a dense, non-convex obstacle field, confirming the robustness of the proposed framework for real-world deployment.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes the Maximum Entropy Differential Dynamic Programming (ME-DDP) framework as a hybrid optimization approach for non-convex MPC problems in robotics. It describes a dual-step process consisting of an initial DDP phase to follow cost gradients followed by sampling from policies defined via the inverse Hessian of the action-value function. Three variants (Unimodal Gaussian ME-DDP, Multimodal Gaussian ME-DDP, and Stein Variational DDP) are analyzed, and the framework is benchmarked against deterministic DDP and MPPI (with matched policy parameterizations and update laws) on navigation tasks involving four robotic systems in cluttered environments. The central empirical claim is that ME-DDP variants consistently outperform MPPI in low-dimensional cases and achieve higher, more stable success rates than MPPI in high-dimensional cases, with additional hardware validation on a quadrotor navigating a dense obstacle field.
Significance. If the benchmarking results and stability claims hold under scrutiny, the work would demonstrate a practical hybrid mechanism that combines gradient exploitation with targeted sampling to mitigate local minima in non-convex MPC without requiring extensive entropy tuning, supported by direct comparisons to established baselines and real-world deployment evidence.
major comments (2)
- [Abstract; Section describing ME-DDP variants and sampling analysis] The abstract and methods description assert a 'rigorous analysis' of the three ME-DDP sampling mechanisms, yet provide no explicit equations, invertibility conditions, or derivation steps for the inverse-Hessian policy sampling; this is load-bearing for the claim that the sampling reliably disrupts suboptimal minima.
- [Benchmarking section and results tables/figures] Benchmarking results claim consistent outperformance and higher success rates, but the text supplies no details on number of trials, error bars, statistical tests, or the precise criterion used to identify local minima versus successful trajectories; these omissions undermine evaluation of the low- versus high-dimensional performance distinction.
minor comments (2)
- [Benchmarking setup] Clarify whether the three policy parameterizations for MPPI are exactly matched to the ME-DDP variants in all reported experiments.
- [Discussion or hardware validation section] Add a short discussion of any observed cases where the Hessian-based sampling degraded closed-loop stability or required retuning.
Simulated Author's Rebuttal
We thank the referee for the constructive feedback on the presentation of the ME-DDP analysis and benchmarking. We address each major comment below and will incorporate clarifications and additional details in the revised manuscript.
read point-by-point responses
-
Referee: [Abstract; Section describing ME-DDP variants and sampling analysis] The abstract and methods description assert a 'rigorous analysis' of the three ME-DDP sampling mechanisms, yet provide no explicit equations, invertibility conditions, or derivation steps for the inverse-Hessian policy sampling; this is load-bearing for the claim that the sampling reliably disrupts suboptimal minima.
Authors: We agree that the abstract and high-level methods overview do not contain the full set of equations. The manuscript does derive the inverse-Hessian sampling in the sections on each ME-DDP variant (Unimodal Gaussian, Multimodal Gaussian, and Stein Variational), including the policy update rules that follow from the second-order expansion of the action-value function. However, to strengthen accessibility and directly support the claim about disrupting local minima, we will add an explicit subsection with the core sampling equations, the invertibility condition (requiring the Hessian of the action-value function to be positive definite, which is ensured by the quadratic approximation in DDP), and the derivation steps from the maximum-entropy objective. This will be placed immediately after the variant descriptions. revision: yes
-
Referee: [Benchmarking section and results tables/figures] Benchmarking results claim consistent outperformance and higher success rates, but the text supplies no details on number of trials, error bars, statistical tests, or the precise criterion used to identify local minima versus successful trajectories; these omissions undermine evaluation of the low- versus high-dimensional performance distinction.
Authors: The referee is correct that these experimental details are missing from the current text. The underlying experiments were run with 50 independent trials per method per environment (with different random seeds for initial states and obstacle placements), success defined as reaching the goal without collision within the time horizon, and local-minima failures identified by trajectories that remain trapped in obstacle-induced cost basins (verified by cost-value stagnation). We will add a dedicated experimental-setup paragraph specifying the trial count, report mean success rates with standard-error bars in the tables and figures, and include a brief description of the success criterion. No formal statistical hypothesis tests were performed beyond the reported means and variances; we will note this limitation explicitly. revision: yes
Circularity Check
No significant circularity; empirical claims only
full rationale
The paper's central claims rest on empirical benchmarking of three ME-DDP variants against deterministic DDP and MPPI across four robotic navigation tasks plus quadrotor hardware validation. No derivation chain, prediction step, or uniqueness theorem is presented that reduces by construction to a fitted quantity, self-citation, or ansatz internal to the paper. The dual-step mechanism and sampling analysis are design choices whose performance is evaluated externally against independent baselines with matched parameterizations. This is the most common honest finding for an applied robotics methods paper whose load-bearing evidence is experimental rather than deductive.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[3]
A Barrier Function Method for the Optimization of Trajectory Functionals with Constraints , year=
Hauser, John and Saccon, Alessandro , booktitle=. A Barrier Function Method for the Optimization of Trajectory Functionals with Constraints , year=
-
[4]
Bryson Jr
Arthur E. Bryson Jr. and Yu-Chi Ho , title =. 1975 , address =
1975
-
[5]
Betts , title =
John T. Betts , title =. 2001 , address =
2001
-
[6]
Relaxed Logarithmic Barrier Function Based Model Predictive Control of Linear Systems , year=
Feller, Christian and Ebenbauer, Christian , journal=. Relaxed Logarithmic Barrier Function Based Model Predictive Control of Linear Systems , year=
-
[9]
Transactions on Machine Learning Research , issn=
Constrained Reinforcement Learning with Smoothed Log Barrier Function , author=. Transactions on Machine Learning Research , issn=. 2025 , url=
2025
-
[10]
Constraint Handling in Continuous-Time
Sleiman, Jean-Pierre and Farshidian, Farbod and Hutter, Marco , booktitle=. Constraint Handling in Continuous-Time. 2021 , volume=
2021
-
[11]
1968 , publisher=
Nonlinear Programming: Sequential Unconstrained Minimization Techniques , author=. 1968 , publisher=
1968
-
[16]
, title =
Dong, Jing and Tong, Xin T. , title =. J. Mach. Learn. Res. , month =. 2021 , issue_date =
2021
-
[18]
and Mayne, David Q
Jacobson, David H. and Mayne, David Q. Differential dynamic programming
-
[19]
Proceedings of the 35th International Conference on Machine Learning , pages =
Message Passing Stein Variational Gradient Descent , author =. Proceedings of the 35th International Conference on Machine Learning , pages =. 2018 , editor =
2018
-
[20]
Projected Stein Variational Gradient Descent , url =
Chen, Peng and Ghattas, Omar , booktitle =. Projected Stein Variational Gradient Descent , url =
-
[21]
Projected Stein Variational Newton: A Fast and Scalable Bayesian Inference Method in High Dimensions , url =
Chen, Peng and Wu, Keyi and Chen, Joshua and O Leary-Roseberry, Tom and Ghattas, Omar , booktitle =. Projected Stein Variational Newton: A Fast and Scalable Bayesian Inference Method in High Dimensions , url =
-
[22]
Robotics: Science and Systems (RSS) , year =
Variational Inference MPC using Tsallis Divergence , author =. Robotics: Science and Systems (RSS) , year =
-
[23]
Stein Variational Gradient Descent as Gradient Flow , url =
Liu, Qiang , booktitle =. Stein Variational Gradient Descent as Gradient Flow , url =
-
[24]
Advances in Neural Information Processing Systems (NeurIPS) , volume =
Liu, Qiang and Wang, Dilin , title =. Advances in Neural Information Processing Systems (NeurIPS) , volume =
-
[25]
Proceedings of the Thirtieth Conference on Uncertainty in Artificial Intelligence , pages =
Trivedi, Shubhendu and Wang, Jialei and Kpotufe, Samory and Shakhnarovich, Gregory , title =. Proceedings of the Thirtieth Conference on Uncertainty in Artificial Intelligence , pages =. 2014 , isbn =
2014
-
[26]
A Stein variational Newton method , url =
Detommaso, Gianluca and Cui, Tiangang and Marzouk, Youssef and Spantini, Alessio and Scheichl, Robert , booktitle =. A Stein variational Newton method , url =
-
[27]
Proceedings of the 2020 Conference on Robot Learning , pages =
Stein Variational Model Predictive Control , author =. Proceedings of the 2020 Conference on Robot Learning , pages =. 2021 , editor =
2020
-
[28]
Stein Variational Policy Gradient , booktitle =
Yang Liu and Prajit Ramachandran and Qiang Liu and Jian Peng , editor =. Stein Variational Policy Gradient , booktitle =. 2017 , url =
2017
-
[29]
2018 , journal =
Large sample analysis of the median heuristic , author=. 2018 , journal =
2018
-
[30]
and Murray, Walter and Saunders, Michael A
Gill, Philip E. and Murray, Walter and Saunders, Michael A. , title =. SIAM Journal on Optimization , volume =. 2002 , doi =
2002
-
[32]
2023 , eprint=
Barkour: Benchmarking Animal-level Agility with Quadruped Robots , author=. 2023 , eprint=
2023
-
[33]
Proceedings of Robotics: Science and Systems (RSS) , year=
Stein Variational Ergodic Search , author=. Proceedings of Robotics: Science and Systems (RSS) , year=
-
[34]
Daniel Freeman and Erik Frey and Anton Raichuk and Sertan Girgin and Igor Mordatch and Olivier Bachem , title =
C. Daniel Freeman and Erik Frey and Anton Raichuk and Sertan Girgin and Igor Mordatch and Olivier Bachem , title =
-
[35]
2011 , url =
Teppo Luukkonen , title =. 2011 , url =
2011
-
[36]
and Theodorou, Evangelos , journal=
Boutselis, George I. and Theodorou, Evangelos , journal=. Discrete-Time Differential Dynamic Programming on Lie Groups: Derivation, Convergence Analysis, and Numerical Results , year=
-
[37]
PX4: A node-based multithreaded open source robotics framework for deeply embedded platforms , year=
Meier, Lorenz and Honegger, Dominik and Pollefeys, Marc , booktitle=. PX4: A node-based multithreaded open source robotics framework for deeply embedded platforms , year=
-
[38]
and Shoemaker, C.A
Liao, L.-Z. and Shoemaker, C.A. , journal=. Convergence in unconstrained discrete-time differential dynamic programming , year=
-
[39]
, title =
Murray, Walter and Wright, Margaret H. , title =. SIAM Journal on Optimization , volume =. 1994 , doi =
1994
-
[40]
2006 , publisher=
Numerical optimization , author=. 2006 , publisher=
2006
-
[43]
Journal of Guidance, Control, and Dynamics , volume=
Model predictive path integral control: From theory to parallel computation , author=. Journal of Guidance, Control, and Dynamics , volume=. 2017 , publisher=
2017
-
[47]
Multimodal trajectory optimization for motion planning , volume =
Osa, Takayuki , year =. Multimodal trajectory optimization for motion planning , volume =. The International Journal of Robotics Research , doi =
-
[48]
Proceedings of the 30th International Conference on Machine Learning , pages =
Guided Policy Search , author =. Proceedings of the 30th International Conference on Machine Learning , pages =. 2013 , editor =
2013
-
[49]
The International Journal of Robotics Research , volume =
John Schulman and Yan Duan and Jonathan Ho and Alex Lee and Ibrahim Awwal and Henry Bradlow and Jia Pan and Sachin Patil and Ken Goldberg and Pieter Abbeel , title =. The International Journal of Robotics Research , volume =. 2014 , doi =
2014
-
[50]
CuRobo: Parallelized Collision-Free Robot Motion Generation , year=
Sundaralingam, Balakumar and Hari, Siva Kumar Sastry and Fishman, Adam and Garrett, Caelan and Van Wyk, Karl and Blukis, Valts and Millane, Alexander and Oleynikova, Helen and Handa, Ankur and Ramos, Fabio and Ratliff, Nathan and Fox, Dieter , booktitle=. CuRobo: Parallelized Collision-Free Robot Motion Generation , year=
-
[51]
Dragan and Mihail Pivtoraiko and Matthew Klingensmith and Christopher M
Matt Zucker and Nathan Ratliff and Anca D. Dragan and Mihail Pivtoraiko and Matthew Klingensmith and Christopher M. Dellin and J. Andrew Bagnell and Siddhartha S. Srinivasa , title =. The International Journal of Robotics Research , volume =. 2013 , doi =
2013
-
[52]
2011 IEEE international conference on robotics and automation , pages=
STOMP: Stochastic trajectory optimization for motion planning , author=. 2011 IEEE international conference on robotics and automation , pages=. 2011 , organization=
2011
-
[57]
Proceedings of the 8th Annual Conference on Robot Learning (CoRL) , address=
DiffusionSeeder: Seeding Motion Optimization with Diffusion for Rapid Motion Planning , author=. Proceedings of the 8th Annual Conference on Robot Learning (CoRL) , address=
-
[58]
Advances in Neural Information Processing Systems (NeurIPS) , year=
Accelerating Motion Planning via Optimal Transport , author=. Advances in Neural Information Processing Systems (NeurIPS) , year=
-
[59]
Advances in Neural Information Processing Systems , year =
Pan, Chaoyi and Yi, Zeji and Shi, Guanya and Qu, Guannan , title =. Advances in Neural Information Processing Systems , year =
-
[60]
2021 , eprint=
Entropy Regularized Motion Planning via Stein Variational Inference , author=. 2021 , eprint=
2021
-
[61]
Diverse Motion Planning with Stein Diffusion Trajectory Inference , year=
Yin, Zeya and Lai, Tin and Barcelos, Lucas and Jacob, Jayadeep and Li, Yonghui and Ramos, Fabio , booktitle=. Diverse Motion Planning with Stein Diffusion Trajectory Inference , year=
-
[62]
Constrained Stein Variational Trajectory Optimization , year=
Power, Thomas and Berenson, Dmitry , journal=. Constrained Stein Variational Trajectory Optimization , year=
-
[63]
and Huang, Philip and Heiden, Eric and Jatavallabhula, Krishna Murthy and Damken, Fabian and Smith, Kevin and Nowrouzezahrai, Derek and Ramos, Fabio and Shkurti, Florian , journal=
Lee, Yewon and Li, Andrew Z. and Huang, Philip and Heiden, Eric and Jatavallabhula, Krishna Murthy and Damken, Fabian and Smith, Kevin and Nowrouzezahrai, Derek and Ramos, Fabio and Shkurti, Florian , journal=. STAMP: Differentiable Task and Motion Planning via Stein Variational Gradient Descent , year=
-
[65]
The International Journal of Robotics Research , volume =
Lucas Barcelos and Tin Lai and Rafael Oliveira and Paulo Borges and Fabio Ramos , title =. The International Journal of Robotics Research , volume =. 2024 , doi =
2024
-
[66]
Full-Order Sampling-Based MPC for Torque-Level Locomotion Control via Diffusion-Style Annealing , year=
Xue, Haoru and Pan, Chaoyi and Yi, Zeji and Qu, Guannan and Shi, Guanya , booktitle=. Full-Order Sampling-Based MPC for Torque-Level Locomotion Control via Diffusion-Style Annealing , year=
-
[67]
, booktitle=
Aoyama, Yuichiro and Lehmann, Peter and Theodorou, Evangelos A. , booktitle=. Second-Order Stein Variational Dynamic Optimization , year=
-
[68]
2016 , publisher=
Messy: How to Be Creative and Resilient in a Tidy-Minded World , author=. 2016 , publisher=
2016
-
[69]
Proceedings of the 1st International Conference on Informatics in Control, Automation and Robotics (ICINCO) , year =
Li, Weiwei and Todorov, Emanuel , title =. Proceedings of the 1st International Conference on Informatics in Control, Automation and Robotics (ICINCO) , year =
-
[70]
2012 IEEE/RSJ International Conference on Intelligent Robots and Systems , pages=
Synthesis and Stabilization of Complex Behaviors through Online Trajectory Optimization , author=. 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems , pages=. 2012 , organization=
2012
-
[71]
Proceedings of the 34th International Conference on Machine Learning , pages =
Measuring Sample Quality with Kernels , author =. Proceedings of the 34th International Conference on Machine Learning , pages =. 2017 , volume =
2017
-
[72]
Proceedings of the 34th International Conference on Machine Learning , series =
Measuring Sample Quality with Kernels , author =. Proceedings of the 34th International Conference on Machine Learning , series =. 2017 , publisher =
2017
-
[73]
, booktitle=
Aoyama, Yuichiro and Theodorou, Evangelos A. , booktitle=. Generalized Maximum Entropy Differential Dynamic Programming , year=
-
[74]
Spline-Interpolated Model Predictive Path Integral Control with Stein Variational Inference for Reactive Navigation , year=
Miura, Takato and Akai, Naoki and Honda, Kohei and Hara, Susumu , booktitle=. Spline-Interpolated Model Predictive Path Integral Control with Stein Variational Inference for Reactive Navigation , year=
-
[75]
1996 , institution=
Global Convergence of Differential Dynamic Programming and Newton's Method for Discrete-Time Optimal Control , author=. 1996 , institution=
1996
-
[76]
1994 , publisher =
Markov Decision Processes: Discrete Stochastic Dynamic Programming , author =. 1994 , publisher =
1994
-
[77]
The International Journal of Robotics Research , volume =
Motion Planning with Sequential Convex Optimization and Convex Collision Checking , author =. The International Journal of Robotics Research , volume =. 2014 , doi =
2014
-
[78]
James Bradbury and Roy Frostig and Peter Hawkins and Matthew James Johnson and Yash Katariya and Chris Leary and Dougal Maclaurin and George Necula and Adam Paszke and Jake Vander
-
[79]
2010 , school =
Modeling Purposeful Adaptive Behavior with the Principle of Maximum Causal Entropy , author =. 2010 , school =
2010
-
[82]
Powell, Michael J. D. , title =. Optimization , editor =. 1969 , publisher =
1969
-
[83]
Tyrrell , title =
Rockafellar, R. Tyrrell , title =. Journal of Optimization Theory and Applications , volume =. 1973 , publisher =
1973
-
[84]
A simplicial algorithm for concave programming , author=
-
[86]
The International Journal of Robotics Research , volume=
Computing large convex regions of obstacle-free space for robotic navigation , author=. The International Journal of Robotics Research , volume=
-
[87]
Autonomous Robots , volume=
Optimization-based locomotion planning, estimation, and control design for the Atlas humanoid robot , author=. Autonomous Robots , volume=. 2016 , publisher=
2016
-
[88]
2014 IEEE International Conference on Robotics and Automation (ICRA) , pages=
Control-limited differential dynamic programming , author=. 2014 IEEE International Conference on Robotics and Automation (ICRA) , pages=. 2014 , organization=
2014
-
[89]
, journal=
Kim, Min-Gyeom and Jung, Minchan and Hong, JunGee and Kim, Kwang-Ki K. , journal=. MPPI-IPDDP: A Hybrid Method of Collision-Free Smooth Trajectory Generation for Autonomous Robots , year=
-
[90]
Cham: Springer International Publishing
Aguiar AP, Bayer FA, Hauser J, H \"a usler AJ, Notarstefano G, Pascoal AM, Rucco A and Saccon A (2017) Constrained Optimal Motion Planning for Autonomous Vehicles Using PRONTO. Cham: Springer International Publishing. ISBN 978-3-319-55372-6, pp. 207--226. doi:10.1007/978-3-319-55372-6_10
-
[91]
doi: 10.1109/ICRA55743.2025.11128816
Aoyama Y, Lehmann P and Theodorou EA (2025) Second-order stein variational dynamic optimization. In: 2025 IEEE International Conference on Robotics and Automation (ICRA). pp. 7209--7216. doi:10.1109/ICRA55743.2025.11128381
-
[92]
Data-Driven Output-Feed back Controller Synthesis for Dissipativity: a Dualization-Based Approac h,
Aoyama Y and Theodorou EA (2024) Generalized maximum entropy differential dynamic programming. In: 2024 IEEE 63rd Conference on Decision and Control (CDC). pp. 8825--8831. doi:10.1109/CDC56724.2024.10886519
-
[93]
The International Journal of Robotics Research 43(11): 1693--1710
Barcelos L, Lai T, Oliveira R, Borges P and Ramos F (2024) Path signatures for diversity in probabilistic trajectory optimisation. The International Journal of Robotics Research 43(11): 1693--1710. doi:10.1177/02783649241233300
-
[94]
Philadelphia, PA: SIAM
Betts JT (2001) Practical Methods for Optimal Control Using Nonlinear Programming. Philadelphia, PA: SIAM
2001
-
[95]
IEEE Transactions on Automatic Control 66(10): 4636--4651
Boutselis GI and Theodorou E (2021) Discrete-time differential dynamic programming on lie groups: Derivation, convergence analysis, and numerical results. IEEE Transactions on Automatic Control 66(10): 4636--4651. doi:10.1109/TAC.2020.3034206
-
[96]
Bradbury J, Frostig R, Hawkins P, Johnson MJ, Katariya Y, Leary C, Maclaurin D, Necula G, Paszke A, Vander P las J, Wanderman- M ilne S and Zhang Q (2018) JAX : composable transformations of P ython+ N um P y programs
2018
-
[97]
Caluwaerts K, Iscen A, Kew JC, Yu W, Zhang T, Freeman D, Lee KH, Lee L, Saliceti S, Zhuang V, Batchelor N, Bohez S, Casarini F, Chen JE, Cortes O, Coumans E, Dostmohamed A, Dulac-Arnold G, Escontrela A, Frey E, Hafner R, Jain D, Jyenis B, Kuang Y, Lee E, Luu L, Nachum O, Oslund K, Powell J, Reyes D, Romano F, Sadeghi F, Sloat R, Tabanpour B, Zheng D, Neun...
2023
-
[98]
Adam Wiktor, Dexter Scobee, Sean Messenger, and Christopher Clark
Carvalho J, Le AT, Baierl M, Koert D and Peters J (2023) Motion planning diffusion: Learning and planning of robot motions with diffusion models. In: 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). pp. 1916--1923. doi:10.1109/IROS55552.2023.10342382
-
[99]
In: Larochelle H, Ranzato M, Hadsell R, Balcan M and Lin H (eds.) Advances in Neural Information Processing Systems, volume 33
Chen P and Ghattas O (2020) Projected stein variational gradient descent. In: Larochelle H, Ranzato M, Hadsell R, Balcan M and Lin H (eds.) Advances in Neural Information Processing Systems, volume 33. Curran Associates, Inc., pp. 1947--1958
2020
-
[100]
In: Wallach H, Larochelle H, Beygelzimer A, d Alch\' e -Buc F, Fox E and Garnett R (eds.) Advances in Neural Information Processing Systems, volume 32
Chen P, Wu K, Chen J, O Leary-Roseberry T and Ghattas O (2019) Projected stein variational newton: A fast and scalable bayesian inference method in high dimensions. In: Wallach H, Larochelle H, Beygelzimer A, d Alch\' e -Buc F, Fox E and Garnett R (eds.) Advances in Neural Information Processing Systems, volume 32. Curran Associates, Inc
2019
-
[101]
Applied Mathematics and Computation 164(1): 45--69
Cho GM (2005) Log-barrier method for two-stage quadratic stochastic programming. Applied Mathematics and Computation 164(1): 45--69. doi:https://doi.org/10.1016/j.amc.2004.04.095
-
[102]
The International Journal of Robotics Research 34(7): 955--977
Deits R and Tedrake R (2015) Computing large convex regions of obstacle-free space for robotic navigation. The International Journal of Robotics Research 34(7): 955--977
2015
-
[103]
In: Bengio S, Wallach H, Larochelle H, Grauman K, Cesa-Bianchi N and Garnett R (eds.) Advances in Neural Information Processing Systems, volume 31
Detommaso G, Cui T, Marzouk Y, Spantini A and Scheichl R (2018) A stein variational newton method. In: Bengio S, Wallach H, Larochelle H, Grauman K, Cesa-Bianchi N and Garnett R (eds.) Advances in Neural Information Processing Systems, volume 31. Curran Associates, Inc
2018
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.