Sinkhorn Natural Gradient for Generative Models

Cited by: 0

Authors:

Shen, Zebang [1]

Wang, Zhenfu [2]

Ribeiro, Alejandro [1]

Hassani, Hamed [1]

Affiliations:

[1] Univ Penn, Dept Elect & Syst Engn, Philadelphia, PA 19104 USA

[2] Univ Penn, Dept Math, Philadelphia, PA 19104 USA

Source:

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020 | 2020 / Vol. 33

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

We consider the problem of minimizing a functional over a parametric family of probability measures, where the parameterization is characterized via a push-forward structure. An important application of this problem is in training generative adversarial networks. In this regard, we propose a novel Sinkhorn Natural Gradient (SiNG) algorithm which acts as a steepest descent method on the probability space endowed with the Sinkhorn divergence. We show that the Sinkhorn information matrix (SIM), a key component of SiNG, has an explicit expression and can be evaluated accurately in complexity that scales logarithmically with respect to the desired accuracy. This is in sharp contrast to existing natural gradient methods that can only be carried out approximately. Moreover, in practical applications when only Monte-Carlo type integration is available, we design an empirical estimator for SIM and provide the stability analysis. In our experiments, we quantitatively compare SiNG with state-of-the-art SGD-type solvers on generative tasks to demonstrate the efficiency and efficacy of our method.
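For readers unfamiliar with the Sinkhorn divergence named in the abstract, the following is a minimal sketch of how it is commonly computed between two empirical samples. It is not the paper's SiNG algorithm and does not compute the Sinkhorn information matrix. The function names (`entropic_ot`, `sinkhorn_divergence`), the uniform sample weights, the cost 0.5 * squared Euclidean distance, and the values of `eps` and `n_iters` are all assumptions chosen for illustration. The debiased form OT(x, y) - 0.5 * OT(x, x) - 0.5 * OT(y, y) is the standard definition of the divergence.

```python
import numpy as np


def _logsumexp(a, axis):
    # Numerically stable log-sum-exp along one axis.
    m = np.max(a, axis=axis, keepdims=True)
    return np.squeeze(m, axis=axis) + np.log(np.sum(np.exp(a - m), axis=axis))


def entropic_ot(x, y, eps=1.0, n_iters=200):
    """Entropy-regularized OT value between two uniform empirical measures.

    Assumptions (illustrative only): uniform weights, cost 0.5 * ||x - y||^2,
    and a fixed number of log-domain Sinkhorn iterations.
    """
    n, m = x.shape[0], y.shape[0]
    log_a = np.full(n, -np.log(n))
    log_b = np.full(m, -np.log(m))
    # Pairwise cost matrix of shape (n, m).
    C = 0.5 * np.sum((x[:, None, :] - y[None, :, :]) ** 2, axis=-1)
    f = np.zeros(n)
    g = np.zeros(m)
    for _ in range(n_iters):
        # Alternating updates of the two dual potentials.
        f = -eps * _logsumexp(log_b[None, :] + (g[None, :] - C) / eps, axis=1)
        g = -eps * _logsumexp(log_a[:, None] + (f[:, None] - C) / eps, axis=0)
    # Dual objective evaluated at the current potentials.
    return float(np.dot(np.exp(log_a), f) + np.dot(np.exp(log_b), g))


def sinkhorn_divergence(x, y, eps=1.0, n_iters=200):
    # Debiased divergence: subtract the two self-transport terms.
    return (
        entropic_ot(x, y, eps, n_iters)
        - 0.5 * entropic_ot(x, x, eps, n_iters)
        - 0.5 * entropic_ot(y, y, eps, n_iters)
    )
```

Under these assumptions, the divergence between a sample and itself is zero, and it grows as one sample is shifted away from the other.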


Pages: 11
