Learning Functors using Gradient Descent

Cited by: 1
Author(s):
Gavranovic, Bruno [1 ]
Affiliation(s):
[1] Univ Strathclyde, Math Struct Programming Grp, Glasgow, Scotland
DOI: 10.4204/EPTCS.323.15
Chinese Library Classification: TP301 [Theory, Methods]
Subject Classification Code: 081202
Abstract:
Neural networks are a general framework for differentiable optimization which includes many other machine learning approaches as special cases. In this paper we build a category-theoretic formalism around a neural network system called CycleGAN [15]. CycleGAN is a general approach to unpaired image-to-image translation that has received attention in recent years. Inspired by categorical database systems, we show that CycleGAN is a "schema", i.e. a specific category presented by generators and relations, whose parameter instantiations are set-valued functors on this schema. We show that enforcing cycle-consistencies amounts to enforcing composition invariants in this category. We generalize the learning procedure to arbitrary such categories and show that a special class of functors, rather than functions, can be learned using gradient descent. Using this framework we design a novel neural network system capable of learning to insert and delete objects in images without paired data. We qualitatively evaluate the system on the CelebA dataset and obtain promising results.
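As a concrete, purely illustrative reading of the abstract: the schema's generating arrows become parameterised maps, and each relation, such as the CycleGAN cycle-consistency g ∘ f = id_A, becomes a penalty minimised by gradient descent. The following minimal Python sketch assumes PyTorch; the linear maps, dimensions, and L1 penalty are assumptions for illustration, not the paper's implementation.

import torch
import torch.nn as nn
import torch.nn.functional as F

# Hypothetical images of the schema's generating arrows f: A -> B and
# g: B -> A under a parameterised set-valued functor (here: linear maps).
f = nn.Linear(64, 64)
g = nn.Linear(64, 64)

opt = torch.optim.Adam(list(f.parameters()) + list(g.parameters()), lr=1e-3)

for step in range(1000):
    x = torch.randn(32, 64)  # stand-in samples from object A
    y = torch.randn(32, 64)  # stand-in samples from object B
    # Penalise deviation from the relations g . f = id_A and f . g = id_B,
    # i.e. enforce the schema's composition invariants by gradient descent.
    loss = F.l1_loss(g(f(x)), x) + F.l1_loss(f(g(y)), y)
    opt.zero_grad()
    loss.backward()
    opt.step()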
Pages: 230-245
Page count: 16
Related papers (showing 10 of 50):
  • [1] Hochreiter S, Younger AS, Conwell PR. Learning to learn using gradient descent. Artificial Neural Networks - ICANN 2001, Proceedings, 2001, 2130: 87-94.
  • [2] Andrychowicz M, Denil M, Colmenarejo SG, Hoffman MW, Pfau D, Schaul T, Shillingford B, de Freitas N. Learning to learn by gradient descent by gradient descent. Advances in Neural Information Processing Systems 29 (NIPS 2016), 2016, 29.
  • [3] Chen Y, Hoffman MW, Colmenarejo SG, Denil M, Lillicrap TP, Botvinick M, de Freitas N. Learning to Learn without Gradient Descent by Gradient Descent. International Conference on Machine Learning, 2017, 70.
  • [4] Vardi G, Yehudai G, Shamir O. Learning a Single Neuron with Bias Using Gradient Descent. Advances in Neural Information Processing Systems 34 (NeurIPS 2021), 2021, 34.
  • [5] Sun T, Tang K, Li D. Gradient Descent Learning With Floats. IEEE Transactions on Cybernetics, 2022, 52(3): 1763-1771.
  • [6] Tu C-H, Chen H-Y, Carlyn D, Chao W-L. Learning Fractals by Gradient Descent. Thirty-Seventh AAAI Conference on Artificial Intelligence, Vol 37, No 2, 2023: 2456-2464.
  • [7] Biehl M, Schwarze H. Learning by online gradient descent. Journal of Physics A: Mathematical and General, 1995, 28(3): 643-656.
  • [8] Sum J, Leung C-S, Ho K. A Limitation of Gradient Descent Learning. IEEE Transactions on Neural Networks and Learning Systems, 2020, 31(6): 2227-2232.
  • [9] Cai J, Wang H, Zhou D-X. Gradient learning in a classification setting by gradient descent. Journal of Approximation Theory, 2009, 161(2): 674-692.
  • [10] Ji J, Chen X, Wang Q, Yu L, Li P. Learning to Learn Gradient Aggregation by Gradient Descent. Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019: 2614-2620.