Input Convex Neural Networks

被引:0
|
作者
Amos, Brandon [2 ]
Xu, Lei [1 ,3 ]
Kolter, J. Zico [2 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[2] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA
[3] Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents the input convex neural network architecture. These are scalar-valued (potentially deep) neural networks with constraints on the network parameters such that the output of the network is a convex function of (some of) the inputs. The networks allow for efficient inference via optimization over some inputs to the network given others, and can be applied to settings including structured prediction, data imputation, reinforcement learning, and others. In this paper we lay the basic groundwork for these models, proposing methods for inference, optimization and learning, and analyze their representational power. We show that many existing neural network architectures can be made input-convex with a minor modification, and develop specialized optimization algorithms tailored to this setting. Finally, we highlight the performance of the methods on multi-label prediction, image completion, and reinforcement learning problems, where we show improvement over the existing state of the art in many cases.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Principled Weight Initialisation for Input-Convex Neural Networks
    Hoedt, Pieter-Jan
    Klambauer, Guenter
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [2] A computational framework for nanotrusses: Input convex neural networks approach
    Canadija, Marko
    Kosmerl, Valentina
    Zlatic, Martin
    Vrtovsnik, Domagoj
    Munjas, Neven
    [J]. EUROPEAN JOURNAL OF MECHANICS A-SOLIDS, 2024, 103
  • [3] Optimization-based control using input convex neural networks
    Yang, Shu
    Bequette, B. Wayne
    [J]. COMPUTERS & CHEMICAL ENGINEERING, 2021, 144
  • [4] Scalable Computations of Wasserstein Barycenter via Input Convex Neural Networks
    Fan, Jiaojiao
    Taghvaei, Amirhossein
    Chen, Yongxin
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [5] Data-Driven Mirror Descent with Input-Convex Neural Networks
    Tan, Hong Ye
    Mukherjee, Subhadip
    Tang, Junqi
    Schonlieb, Carola-Bibiane
    [J]. SIAM JOURNAL ON MATHEMATICS OF DATA SCIENCE, 2023, 5 (02): : 558 - 587
  • [6] Input convex neural networks in nonlinear predictive control: A multi- model approach
    Lawrynczuk, Maciej
    [J]. NEUROCOMPUTING, 2022, 513 : 273 - 293
  • [7] Learning Optimal Power Flow value functions with input-convex neural networks
    Rosemberg, Andrew
    Tanneau, Mathieu
    Fanzeres, Bruno
    Garcia, Joaquim
    Van Hentenryck, Pascal
    [J]. ELECTRIC POWER SYSTEMS RESEARCH, 2024, 235
  • [8] Data-Driven Optimal Voltage Regulation Using Input Convex Neural Networks
    Chen, Yize
    Shi, Yuanyuan
    Zhang, Baosen
    [J]. ELECTRIC POWER SYSTEMS RESEARCH, 2020, 189
  • [9] Emission-Constrained Optimization of Gas Networks: Input-Convex Neural Network Approach
    Dvorkin, Vladimir
    Chevalier, Samuel
    Chatzivasileiadis, Spyros
    [J]. 2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 1575 - 1579
  • [10] Path-following control of autonomous ground vehicles based on input convex neural networks
    Jiang, Kai
    Hu, Chuan
    Yan, Fengjun
    [J]. PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART D-JOURNAL OF AUTOMOBILE ENGINEERING, 2022, 236 (13) : 2806 - 2816