Multi-Level Ensemble Network for Scene Recognition

被引:4
|
作者
Zhang, Longhao [1 ]
Li, Lingqiao [1 ]
Pan, Xipeng [1 ]
Cao, Zhiwei [1 ]
Chen, Qianyu [2 ]
Yang, Huihua [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Automat Sch, Beijing Shi, Peoples R China
[2] Beijing Univ Posts & Telecommun, Beijing Shi, Peoples R China
关键词
Scene recognition; Neural network; Small object-supported scenes; Ensemble learning; Feature fusion;
D O I
10.1007/s11042-019-07933-2
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Scene recognition is an important branch of computer vision and a common task for deep learning. As is known to all, different scenes are supported by different "key objects". Therefore, the neural network used for the scene recognition task needs to extract the features of these key objects in the scene, sometimes even has to integrate the positional relation between objects to determine the class to which the scene belongs. Under some circumstances, key objects in the scenes are very small and the features of them become extremely inconspicuous or even disappear in the deep layers of the network. Such kind of phenomenon is called "small object-supported scenes". In this paper, Multi-Level Ensemble Network (MLEN), a convolutional neural network, has been proposed, to improve the recognition accuracy of these "small object-supported scenes". Features from multiple levels of the net are used to make separate predictions. Then ensemble learning is performed within the net to make the final prediction. Apart from all this, "Feature Transfer Path" is added and feature fusion methods are adopted to make full use of low-level and high-level features. Moreover, a class-weight loss function for the problem of non-uniform class distribution has been designed. This function can help further improve accuracy in most scene recognition datasets. The experiments involve the Urban Management Case (UMC) dataset collated from two smart urban management system databases by ourselves, and the Places-mini dataset, which is a subset of the well-known Places dataset [36]. The results show that our Multi-Level Ensemble Network achieves much higher accuracy than the state-of-the-art scene recognition networks on both datasets.
引用
收藏
页码:28209 / 28230
页数:22
相关论文
共 50 条
  • [1] Multi-Level Ensemble Network for Scene Recognition
    Longhao Zhang
    Lingqiao Li
    Xipeng Pan
    Zhiwei Cao
    Qianyu Chen
    Huihua Yang
    [J]. Multimedia Tools and Applications, 2019, 78 : 28209 - 28230
  • [2] Ensemble relation network with multi-level measure
    Li Xiaoxu
    Qu Xue
    Cao Jie
    [J]. The Journal of China Universities of Posts and Telecommunications, 2022, (03) : 15 - 24
  • [3] Ensemble relation network with multi-level measure
    Xiaoxu, Li
    Jie, Cao
    Xue, Qu
    Jie, Cao
    [J]. Journal of China Universities of Posts and Telecommunications, 2022, 29 (03): : 15 - 24
  • [4] An Ensemble Model for Multi-Level Speech Emotion Recognition
    Zheng, Chunjun
    Wang, Chunli
    Jia, Ning
    [J]. APPLIED SCIENCES-BASEL, 2020, 10 (01):
  • [5] A Multi-level Progressive Rectification Mechanism for Irregular Scene Text Recognition
    Liao, Qianying
    Lin, Qingxiang
    Jin, Lianwen
    Luo, Canjie
    Zhang, Jiaxin
    Peng, Dezhi
    Wang, Tianwei
    [J]. DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021, PT IV, 2021, 12824 : 140 - 155
  • [6] MFST: A Multi-Level Fusion Network for Remote Sensing Scene Classification
    Wang, Guoqing
    Zhang, Ning
    Liu, Wenchao
    Chen, He
    Xie, Yizhuang
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [7] Multi-Level Adaptive Network for Accented Mandarin Speech Recognition
    Wang, Huiyong
    Wang, Lan
    Liu, Xunying
    [J]. 2014 4TH IEEE INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST), 2014, : 602 - 605
  • [8] Multi-level spatial and semantic enhancement network for expression recognition
    Ma, Yingdong
    Wang, Xia
    Wei, Lihua
    [J]. APPLIED INTELLIGENCE, 2021, 51 (12) : 8565 - 8578
  • [9] Speech Emotion Recognition via Multi-Level Attention Network
    Liu, Ke
    Wang, Dekui
    Wu, Dongya
    Liu, Yutao
    Feng, Jun
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 2278 - 2282
  • [10] Multi-level Feature Fusion Facial Expression Recognition Network
    Hu, Qian
    Wu, Chengdong
    Chi, Jianning
    Yu, Xiaosheng
    Wang, Huan
    [J]. PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 5267 - 5272