Deep learning theory of distribution regression with CNNs

被引:0
|
作者
Yu, Zhan [1 ]
Zhou, Ding-Xuan [2 ]
机构
[1] Hong Kong Baptist Univ, Dept Math, Kowloon Tong, 224 Waterloo Rd, Hong Kong, Peoples R China
[2] Univ Sydney, Sch Math & Stat, Sydney, NSW 2006, Australia
基金
美国国家科学基金会;
关键词
Learning theory; Deep learning; Distribution regression; Deep CNN; Oracle inequality; ReLU; NEURAL-NETWORKS; APPROXIMATION; BOUNDS;
D O I
10.1007/s10444-023-10054-y
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
We establish a deep learning theory for distribution regression with deep convolutional neural networks (DCNNs). Deep learning based on structured deep neural networks has been powerful in practical applications. Generalization analysis for regression with DCNNs has been carried out very recently. However, for the distribution regression problem in which the input variables are probability measures, there is no mathematical model or theoretical analysis of DCNN-based learning theory. One of the difficulties is that the classical neural network structure requires the input variable to be a Euclidean vector. When the input samples are probability distributions, the traditional neural network structure cannot be directly used. A well-defined DCNN framework for distribution regression is desirable. In this paper, we overcome the difficulty and establish a novel DCNN-based learning theory for a two-stage distribution regression model. Firstly, we realize an approximation theory for functionals defined on the set of Borel probability measures with the proposed DCNN framework. Then, we show that the hypothesis space is well-defined by rigorously proving its compactness. Furthermore, in the hypothesis space induced by the general DCNN framework with distribution inputs, by using a two-stage error decomposition technique, we derive a novel DCNN-based two-stage oracle inequality and optimal learning rates (up to a logarithmic factor) for the proposed algorithm for distribution regression.
引用
收藏
页数:40
相关论文
共 50 条
  • [41] When Deep Learning Meets Metric Learning: Remote Sensing Image Scene Classification via Learning Discriminative CNNs
    Cheng, Gong
    Yang, Ceyuan
    Yao, Xiwen
    Guo, Lei
    Han, Junwei
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2018, 56 (05): : 2811 - 2821
  • [42] Regression with Deep Learning for Sensor Performance Optimization
    Vaila, Ruthvik
    Lloyd, Denver
    Tetz, Kevin
    [J]. 2021 IEEE WORKSHOP ON MICROELECTRONICS AND ELECTRON DEVICES (WMED), 2021, : 19 - 22
  • [43] Deep Inverse Reinforcement Learning by Logistic Regression
    Uchibe, Eiji
    [J]. NEURAL INFORMATION PROCESSING, ICONIP 2016, PT I, 2016, 9947 : 23 - 31
  • [44] Parameter Distribution Balanced CNNs
    Liao, Lixin
    Zhao, Yao
    Wei, Shikui
    Wei, Yunchao
    Wang, Jingdong
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (11) : 4600 - 4609
  • [45] Enhancing Computer Vision Performance: A Hybrid Deep Learning Approach with CNNs and Vision Transformers
    Sardar, Abha Singh
    Ranjan, Vivek
    [J]. COMPUTER VISION AND IMAGE PROCESSING, CVIP 2023, PT II, 2024, 2010 : 591 - 602
  • [46] Self-paced active learning for deep CNNs via effective loss function
    Yin, Tianxiang
    Liu, Ningzhong
    Sun, Han
    [J]. NEUROCOMPUTING, 2021, 424 (424) : 1 - 8
  • [47] SOME DISTRIBUTION THEORY RESULTS FOR A REGRESSION MODEL
    JOSHI, PC
    [J]. ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 1975, 27 (02) : 309 - 317
  • [48] A novel approach for ear recognition: learning Mahalanobis distance features from deep CNNs
    Ibrahim Omara
    Ahmed Hagag
    Guangzhi Ma
    Fathi E. Abd El-Samie
    Enmin Song
    [J]. Machine Vision and Applications, 2021, 32
  • [49] A novel approach for ear recognition: learning Mahalanobis distance features from deep CNNs
    Omara, Ibrahim
    Hagag, Ahmed
    Ma, Guangzhi
    Abd El-Samie, Fathi E.
    Song, Enmin
    [J]. MACHINE VISION AND APPLICATIONS, 2021, 32 (01)
  • [50] Visual Semantic-Based Representation Learning Using Deep CNNs for Scene Recognition
    Gupta, Shikha
    Sharma, Krishan
    Dinesh, Dileep Aroor
    Thenkanidiyoor, Veena
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2021, 17 (02)