Input Convex Neural Networks

被引：0

作者：

Amos, Brandon ^{[2
]}

Xu, Lei ^{[1
,3
]}

Kolter, J. Zico ^{[2
]}

机构：

[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA

[2] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA

[3] Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China

来源：

INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70 | 2017年 / 70卷

基金：

美国国家科学基金会;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper presents the input convex neural network architecture. These are scalar-valued (potentially deep) neural networks with constraints on the network parameters such that the output of the network is a convex function of (some of) the inputs. The networks allow for efficient inference via optimization over some inputs to the network given others, and can be applied to settings including structured prediction, data imputation, reinforcement learning, and others. In this paper we lay the basic groundwork for these models, proposing methods for inference, optimization and learning, and analyze their representational power. We show that many existing neural network architectures can be made input-convex with a minor modification, and develop specialized optimization algorithms tailored to this setting. Finally, we highlight the performance of the methods on multi-label prediction, image completion, and reinforcement learning problems, where we show improvement over the existing state of the art in many cases.

引用

页数：10

共 50 条

[1] Principled Weight Initialisation for Input-Convex Neural Networks
Hoedt, Pieter-Jan
Klambauer, Guenter
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[2] A computational framework for nanotrusses: Input convex neural networks approach
Canadija, Marko
Kosmerl, Valentina
Zlatic, Martin
Vrtovsnik, Domagoj
Munjas, Neven
[J]. EUROPEAN JOURNAL OF MECHANICS A-SOLIDS, 2024, 103
[3] Optimization-based control using input convex neural networks
Yang, Shu
Bequette, B. Wayne
[J]. COMPUTERS & CHEMICAL ENGINEERING, 2021, 144
[4] Scalable Computations of Wasserstein Barycenter via Input Convex Neural Networks
Fan, Jiaojiao
Taghvaei, Amirhossein
Chen, Yongxin
[J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
[5] Data-Driven Mirror Descent with Input-Convex Neural Networks
Tan, Hong Ye
Mukherjee, Subhadip
Tang, Junqi
Schonlieb, Carola-Bibiane
[J]. SIAM JOURNAL ON MATHEMATICS OF DATA SCIENCE, 2023, 5 (02): : 558 - 587
[6] Input convex neural networks in nonlinear predictive control: A multi- model approach
Lawrynczuk, Maciej
[J]. NEUROCOMPUTING, 2022, 513 : 273 - 293
[7] Learning Optimal Power Flow value functions with input-convex neural networks
Rosemberg, Andrew
Tanneau, Mathieu
Fanzeres, Bruno
Garcia, Joaquim
Van Hentenryck, Pascal
[J]. ELECTRIC POWER SYSTEMS RESEARCH, 2024, 235
[8] Data-Driven Optimal Voltage Regulation Using Input Convex Neural Networks
Chen, Yize
Shi, Yuanyuan
Zhang, Baosen
[J]. ELECTRIC POWER SYSTEMS RESEARCH, 2020, 189
[9] Emission-Constrained Optimization of Gas Networks: Input-Convex Neural Network Approach
Dvorkin, Vladimir
Chevalier, Samuel
Chatzivasileiadis, Spyros
[J]. 2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 1575 - 1579
[10] Path-following control of autonomous ground vehicles based on input convex neural networks
Jiang, Kai
Hu, Chuan
Yan, Fengjun
[J]. PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART D-JOURNAL OF AUTOMOBILE ENGINEERING, 2022, 236 (13) : 2806 - 2816

← 1 2 3 4 5 →