Non-asymptotic convergence bounds for Sinkhorn iterates and their gradients: a coupling approach.

被引:0
|
作者
Greco, Giacomo [1 ]
Noble, Maxence [2 ]
Conforti, Giovanni [2 ]
Oliviero-Durmus, Alain [2 ]
机构
[1] Eindhoven Univ Technol, Eindhoven, Netherlands
[2] Ecole Polytech, Palaiseau, France
基金
英国工程与自然科学研究理事会;
关键词
optimal transport; Sinkhorn algorithm; stochastic optimal control; Schrodinger bridge; OPTIMAL TRANSPORT; ENTROPY MINIMIZATION; MATRICES; TIME;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Computational optimal transport (OT) has recently emerged as a powerful framework with applications in various fields. In this paper we focus on a relaxation of the original OT problem, the entropic OT problem, which allows to implement efficient and practical algorithmic solutions, even in high dimensional settings. This formulation, also known as the Schrodinger Bridge problem, notably connects with Stochastic Optimal Control (SOC) and can be solved with the popular Sinkhorn algorithm. In the case of discrete-state spaces, this algorithm is known to have exponential convergence; however, achieving a similar rate of convergence in a more general setting is still an active area of research. In this work, we analyze the convergence of the Sinkhorn algorithm for probability measures defined on the d-dimensional torus T-L(d), that admit densities with respect to the Haar measure of T-L(d). In particular, we prove pointwise exponential convergence of Sinkhorn iterates and their gradient. Our proof relies on the connection between these iterates and the evolution along the Hamilton-Jacobi-Bellman equations of value functions obtained from SOC-problems. Our approach is novel in that it is purely probabilistic and relies on coupling by reflection techniques for controlled diffusions on the torus.
引用
收藏
页码:716 / 746
页数:31
相关论文
共 50 条
  • [21] Non-asymptotic capacity lower bounds for non-coherent SISO channels
    Zhang, Jianqiu
    2006 40th Annual Conference on Information Sciences and Systems, Vols 1-4, 2006, : 1697 - 1702
  • [22] Faster Non-asymptotic Convergence for Double Q-learning
    Zhao, Lin
    Xiong, Huaqing
    Liang, Yingbin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [23] Non-asymptotic bounds for the l∞ estimator in linear regression with uniform noise
    Yi, Yufei
    Neykov, Matey
    BERNOULLI, 2024, 30 (01) : 534 - 553
  • [24] Non-asymptotic sub-Gaussian error bounds for hypothesis testing
    Li, Yanpeng
    Tian, Boping
    STATISTICS & PROBABILITY LETTERS, 2022, 189
  • [25] Sharp non-asymptotic performance bounds for and Huber robust regression estimators
    Flores, Salvador
    TEST, 2015, 24 (04) : 796 - 812
  • [26] Non-asymptotic Delay Bounds for Networks with Heavy-Tailed Traffic
    Liebeherr, Joerg
    Burchard, Almut
    Ciucu, Florin
    2010 PROCEEDINGS IEEE INFOCOM, 2010,
  • [27] Non-asymptotic performance bounds for downlink MU-MIMO scheduling
    Zois, Georgios
    Michaloliakos, Antonios
    Psounis, Konstantinos
    Vassalos, Vasilis
    Mourtos, Ioannis
    2016 12TH ANNUAL CONFERENCE ON WIRELESS ON-DEMAND NETWORK SYSTEMS AND SERVICES (WONS), 2016, : 162 - 169
  • [28] Non-Asymptotic Gap-Dependent Regret Bounds for Tabular MDPs
    Simchowitz, Max
    Jamieson, Kevin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [29] Non-asymptotic upper bounds on the probability of the ε-atypical set for Markov chains
    Lastras-Montaño, LA
    2004 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY, PROCEEDINGS, 2004, : 222 - 222
  • [30] Non-Asymptotic Bounds on Fixed Length Source Coding for Markov Chains
    Hayashi, Masahito
    Watanabe, Shun
    2013 51ST ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2013, : 875 - 882