Recognition: unknown
Statistical Estimation of Monge Transport Maps via Brenier Potentials
Pith reviewed 2026-05-08 10:59 UTC · model grok-4.3
The pith
A closed-form estimator for Monge transport maps is obtained from the dual solution of the finite-sample quadratic optimal transport problem.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
For measures known only through finite samples, the Brenier potential is given by a simple closed-form expression based on the dual solution of the discrete sampled optimal transport problem. The resulting estimator of the Monge map is the gradient of this potential and requires no further computation beyond the primal-dual solutions of the finite-dimensional problem. This yields convergence rates under a new error bound for the quadratic optimal transport problem.
What carries the argument
The Brenier potential constructed from the dual variables of the discrete quadratic optimal transport problem on samples, whose gradient estimates the Monge map.
Load-bearing premise
The source measure is absolutely continuous, ensuring the Monge map exists and is unique as the gradient of a convex function.
What would settle it
Generate samples from two known distributions with explicit Monge map, such as standard normal to a shifted and scaled normal, apply the estimator for increasing sample sizes, and check if the empirical error matches the predicted convergence rate.
Figures
read the original abstract
We introduce and analyze a statistical estimator for Monge transport maps: solutions to the quadratic optimal transport problem in Euclidean space. For absolutely continuous source measures, this map is uniquely defined as the gradient of a convex function, a result known as Brenier's theorem. Without absolute continuity, the problem is relaxed, maps are replaced by coupling measures, and optimal couplings are supported on the subdifferential of a convex function, a Brenier potential. This characterization is the basis for our statistical estimator of Monge transport maps for measures known only through finite samples. The resulting Brenier potential has a simple closed-form expression based on the dual solution of the discrete sampled problem. In particular, our methodology does not rely on smoothness or continuity of the Monge transport map and requires no computation beyond primal-dual solutions of the discrete finite-dimensional problem. We exhibit convergence rates for this estimator based on a new error bound for the quadratic optimal transport problem. In the semi-discrete setting, where the target measure is finitely supported, our estimator enjoys sharper convergence rates. Finally, using similar proof techniques, we provide a novel convergence rate for empirical couplings.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper introduces a statistical estimator for Monge transport maps (gradients of Brenier potentials) obtained in closed form from the dual solution of the discrete quadratic optimal transport problem on finite samples of the source and target measures. For absolutely continuous sources, the estimator is claimed to converge at explicit rates derived from a new error bound on the quadratic OT problem; sharper rates hold in the semi-discrete setting, and the same techniques yield a novel rate for empirical couplings. The method requires no smoothness or continuity assumptions on the map and uses only standard discrete OT solvers.
Significance. If the new quadratic OT error bound holds under the paper's stated conditions (absolute continuity of the source, no further regularity), the work would be significant: it supplies a computationally lightweight estimator that remains valid for discontinuous Monge maps, a regime where many existing statistical OT estimators fail. The explicit rates, the semi-discrete sharpening, and the empirical-coupling result are all grounded in the same bound, so confirmation of that bound would strengthen several related results in the literature.
major comments (2)
- [Abstract and new error bound section] Abstract and the section stating the new error bound: the convergence rates for the Brenier-potential estimator rest entirely on this novel bound controlling the gap between the discrete dual and the continuous potential. The abstract asserts that the bound (and thus the rates) hold without smoothness or continuity of the map, yet the precise assumptions (moment conditions, support restrictions, or implicit Lipschitz requirements) are not enumerated; if the bound fails in the discontinuous regime advertised, the central claim is unsupported.
- [Semi-discrete and empirical-coupling results] The semi-discrete sharper rates and the empirical-coupling result (both using the same proof technique) inherit the same gap; any hidden regularity assumption in the quadratic OT bound would propagate to these corollaries and undermine the claim that the methodology requires no continuity of the Monge map.
minor comments (1)
- [Estimator definition] The closed-form expression for the Brenier potential from the discrete dual is stated clearly in the abstract but would benefit from an explicit formula or pseudocode in the main text for immediate reproducibility.
Simulated Author's Rebuttal
We thank the referee for the careful reading and constructive comments. We address the major comments point by point below. The error bound and its corollaries are derived under absolute continuity of the source and finite second moments; no continuity of the map is used. We will revise to enumerate assumptions explicitly.
read point-by-point responses
-
Referee: [Abstract and new error bound section] Abstract and the section stating the new error bound: the convergence rates for the Brenier-potential estimator rest entirely on this novel bound controlling the gap between the discrete dual and the continuous potential. The abstract asserts that the bound (and thus the rates) hold without smoothness or continuity of the map, yet the precise assumptions (moment conditions, support restrictions, or implicit Lipschitz requirements) are not enumerated; if the bound fails in the discontinuous regime advertised, the central claim is unsupported.
Authors: The new error bound is proven under absolute continuity of the source measure with respect to Lebesgue measure together with finite second moments for both measures. The proof relies on convexity, Brenier’s theorem for uniqueness of the potential, and standard moment controls; it invokes no continuity, Lipschitz, or smoothness properties of the Monge map. The map is permitted to be discontinuous. We will revise the abstract and the error-bound section to list these assumptions explicitly. revision: yes
-
Referee: [Semi-discrete and empirical-coupling results] The semi-discrete sharper rates and the empirical-coupling result (both using the same proof technique) inherit the same gap; any hidden regularity assumption in the quadratic OT bound would propagate to these corollaries and undermine the claim that the methodology requires no continuity of the Monge map.
Authors: Both the semi-discrete rates and the empirical-coupling rate are obtained by applying the same quadratic OT error bound. Because that bound requires only absolute continuity of the source and finite second moments, the corollaries inherit exactly the same (non-)assumptions on the map. We will update the statements of these results to reference the assumptions of the main bound explicitly. revision: yes
Circularity Check
No significant circularity; derivation is self-contained
full rationale
The estimator is obtained directly from the dual solution of the standard discrete quadratic OT problem, which is an independent computational primitive. Convergence rates are supported by a new error bound for the quadratic OT problem that the paper presents as a separate contribution rather than a self-referential fit or renamed input. No load-bearing self-citations, self-definitional steps, or fitted quantities relabeled as predictions appear in the provided derivation outline. The approach remains non-circular even under the advertised lack of smoothness assumptions.
Axiom & Free-Parameter Ledger
axioms (2)
- standard math Brenier's theorem: for absolutely continuous source measures, the Monge map is the gradient of a convex Brenier potential.
- standard math Optimal couplings are supported on the subdifferential of a convex function for general measures.
Reference graph
Works this paper leans on
-
[1]
and Dur \'a n, R
Acosta, G. and Dur \'a n, R. (2004). An optimal poincar \'e inequality in l^1 for convex domains. Proceedings of the american mathematical society , 132(1):195--202
2004
-
[2]
Aurenhammer, F. (1987). Power diagrams: properties, algorithms and applications. SIAM Journal on Computing , 16(1):78--96
1987
-
[3]
Aurenhammer, F., Hoffmann, F., and Aronov, B. (1998). Minkowski-type theorems and least-squares clustering. Algorithmica , 20(1):61--76
1998
-
[4]
Bakry, D., Barthe, F., Cattiaux, P., and Guillin, A. (2008). A simple proof of the P oincaré inequality for a large class of probability measures. Electronic Communications in Probability , 13:60--66
2008
- [5]
-
[6]
Brenier, Y. (1991). Polar factorization and monotone rearrangement of vector-valued functions. Communications on pure and applied mathematics , 44(4):375--417
1991
-
[7]
Caffarelli, L. A. (1992a). Boundary regularity of maps with convex potentials. Communications on pure and applied mathematics , 45(9):1141--1151
-
[8]
Caffarelli, L. A. (1992b). The regularity of mappings with a convex potential. Journal of the American Mathematical Society , 5(1):99--104
-
[9]
Caffarelli, L. A. (1996). Boundary regularity of maps with convex potentials-- II . Annals of mathematics , 144(3):453--496
1996
-
[10]
Carlier, G. (2023). Fenchel--young inequality with a remainder and applications to convex duality and optimal transport. SIAM Journal on Optimization , 33(3):1463--1472
2023
-
[11]
Carlier, G., Delalande, A., and M \'e rigot, Q. (2025). Quantitative stability of the pushforward operation by an optimal transport map. Foundations of Computational Mathematics , 25(4):1259--1286
2025
-
[12]
Chewi, S., Niles-Weed, J., and Rigollet, P. (2025). Estimation of transport maps. In Statistical Optimal Transport: \'E cole d' \'E t \'e de Probabilit \'e s de Saint-Flour XLIX--2019 , pages 77--94. Springer
2025
- [13]
-
[14]
and Mustata, C
Cobzas, S. and Mustata, C. (1978). Norm preserving extension of convex L ipschitz functions. J. Approx. theory , 24(3):236--244
1978
-
[15]
and Mart \' n, J
Cominetti, R. and Mart \' n, J. S. (1994). Asymptotic analysis of the exponential penalty trajectory in linear programming. Mathematical Programming , 67(1):169--187
1994
-
[16]
Cuturi, M. (2013). S inkhorn distances: lightspeed computation of optimal transport. Advances in neural information processing systems , 26
2013
- [17]
-
[18]
and Merigot, Q
Delalande, A. and Merigot, Q. (2023). Quantitative stability of optimal transport maps under variations of the target measure. Duke Mathematical Journal , 172(17):3321--3357
2023
-
[19]
Divol, V., Niles-Weed, J., and Pooladian, A.-A. (2025). Optimal transport map estimation in general function spaces. The Annals of Statistics , 53(3):963--988
2025
-
[20]
Z., Boisbunon, A., Chambon, S., Chapel, L., Corenflos, A., Fatras, K., Fournier, N., et al
Flamary, R., Courty, N., Gramfort, A., Alaya, M. Z., Boisbunon, A., Chambon, S., Chapel, L., Corenflos, A., Fatras, K., Fournier, N., et al. (2021). POT : Python optimal transport. Journal of Machine Learning Research , 22(78):1--8
2021
-
[21]
Fournier, N. (2023). Convergence of the empirical measure in expected W asserstein distance: non-asymptotic explicit bounds in R ^d . ESAIM: Probability and Statistics , 27:749--775
2023
-
[22]
and Guillin, A
Fournier, N. and Guillin, A. (2015). On the rate of convergence in W asserstein distance of the empirical measure. Probability theory and related fields , 162(3):707--738
2015
-
[23]
Genevay, A., Cuturi, M., Peyr \'e , G., and Bach, F. (2016). Stochastic optimization for large-scale optimal transport. Advances in neural information processing systems , 29
2016
-
[24]
Gigli, N. (2011). On H \"o lder continuity-in-time of the optimal transport map towards measures along a curve. Proceedings of the Edinburgh Mathematical Society , 54(2):401--409
2011
-
[25]
and Nickl, R
Gin \'e , E. and Nickl, R. (2021). Mathematical foundations of infinite-dimensional statistical models . Cambridge university press
2021
- [26]
-
[27]
Gunsilius, F. (2022). On the convergence rate of potentials of B renier maps. Econometric Theory , 38(2):381--417
2022
-
[28]
Hallin, M., del Barrio, E., Cuesta-Albertos, J., and Matr \'a n, C. (2021). Distribution and quantile functions, ranks and signs in dimension d . The Annals of Statistics , 49(2):1139--1165
2021
-
[29]
and Rigollet, P
H \"u tter, J.-C. and Rigollet, P. (2021). Minimax estimation of smooth optimal transport maps. The Annals of Statistics , 49
2021
-
[30]
Lei, N., Su, K., Cui, L., Yau, S.-T., and Gu, X. D. (2019). A geometric view of optimal transportation and generative model. Computer Aided Geometric Design , 68:1--21
2019
-
[31]
The flow map of the Fokker–Planck equation does not provide optimal transport
Letrouit, C. and M \'e rigot, Q. (2024). Gluing methods for quantitative stability of optimal transport maps. arXiv preprint arXiv:2411.04908
- [32]
-
[33]
and Nochetto, R
Li, W. and Nochetto, R. H. (2021). Quantitative stability and error estimates for optimal transport plans. IMA Journal of Numerical Analysis , 41(3):1941--1965
2021
-
[34]
Makkuva, A., Taghvaei, A., Oh, S., and Lee, J. (2020). Optimal transport mapping via input convex neural networks. In International Conference on Machine Learning , pages 6672--6681. PMLR
2020
-
[35]
Manole, T., Balakrishnan, S., Niles-Weed, J., and Wasserman, L. (2024). Plugin estimation of smooth optimal transport maps. The Annals of Statistics , 52(3):966--998
2024
-
[36]
McCann, R. J. (2001). Polar factorization of maps on R iemannian manifolds. Geometric & Functional Analysis GAFA , 11(3):589--608
2001
-
[37]
M \'e rigot, Q., Delalande, A., and Chazal, F. (2020). Quantitative stability of optimal transport maps and linearization of the 2- W asserstein space. In International Conference on Artificial Intelligence and Statistics , pages 3186--3196. PMLR
2020
-
[38]
W., Lozier, D
Olver, F. W., Lozier, D. W., Boisvert, R. F., and Clark, C. W. (2010). NIST handbook of mathematical functions hardback and CD-ROM . Cambridge university press
2010
-
[39]
Paty, F.-P., d’Aspremont, A., and Cuturi, M. (2020). Regularity as regularization: smooth and strongly convex B renier potentials in optimal transport. In International Conference on Artificial Intelligence and Statistics , pages 1222--1232. PMLR
2020
-
[40]
Perrot, M., Courty, N., Flamary, R., and Habrard, A. (2016). Mapping estimation for discrete optimal transport. Advances in Neural Information Processing Systems , 29
2016
-
[41]
and Cuturi, M
Peyr \'e , G. and Cuturi, M. (2019). Computational optimal transport: with applications to data science. Foundations and Trends in Machine Learning , 11(5-6):355--607
2019
-
[42]
Pooladian, A.-A., Divol, V., and Niles-Weed, J. (2023). Minimax estimation of discontinuous optimal transport maps: The semi-discrete case. In International Conference on Machine Learning , pages 28128--28150. PMLR
2023
-
[43]
arXiv preprint arXiv:2109.12004 , year =
Pooladian, A.-A. and Niles-Weed, J. (2021). Entropic estimation of optimal transport maps. arXiv preprint:2109.12004
-
[44]
Rockafellar, R. (1970). Convex Analysis . Princeton Landmarks in Mathematics and Physics. Princeton University Press
1970
-
[45]
Sadhu, R., Goldfeld, Z., and Kato, K. (2024). Stability and statistical inference for semidiscrete optimal transport maps. The Annals of Applied Probability , 34(6):5694--5736
2024
-
[46]
Santambrogio, F. (2015). Optimal transport for applied mathematicians. Birk \"a user, NY , 55(58-63):94
2015
- [47]
-
[48]
B., Flamary, R., Courty, N., Rolet, A., and Blondel, M
Seguy, V., Damodaran, B. B., Flamary, R., Courty, N., Rolet, A., and Blondel, M. (2018). Large-scale optimal transport and mapping estimation. In ICLR 2018-International Conference on Learning Representations , pages 1--15
2018
-
[49]
and Vialard, F.-X
Vacher, A. and Vialard, F.-X. (2022). Parameter tuning and model selection in optimal transport with semi-dual B renier formulation. Advances in Neural Information Processing Systems , 35:23098--23108
2022
-
[50]
Villani, C. (2008). Optimal Transport: Old and New , volume 338 of Grundlehren der mathematischen Wissenschaften . Springer Berlin Heidelberg
2008
-
[51]
Villani, C. (2021). Topics in optimal transportation , volume 58. American Mathematical Soc
2021
-
[52]
Weed, J. (2018). An explicit analysis of the entropic penalty in linear programming. In Conference On Learning Theory , pages 1841--1855. PMLR
2018
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.