Multi-Level Ensemble Network for Scene Recognition

Cited by: 0
Authors
Longhao Zhang
Lingqiao Li
Xipeng Pan
Zhiwei Cao
Qianyu Chen
Huihua Yang
Affiliations
[1] Beijing University of Posts and Telecommunications
Source
Keywords
Scene recognition; Neural network; Small object-supported scenes; Ensemble learning; Feature fusion;
DOI
Not available
Chinese Library Classification number
Subject classification number
Abstract
Scene recognition is an important branch of computer vision and a common task for deep learning. Different scenes are characterized by different “key objects”. A neural network for scene recognition therefore needs to extract the features of these key objects, and sometimes must also integrate the positional relations between objects to determine the class of the scene. In some scenes the key objects are very small, and their features become inconspicuous or even vanish in the deep layers of the network. Such cases are called “small object-supported scenes”. This paper proposes the Multi-Level Ensemble Network (MLEN), a convolutional neural network designed to improve recognition accuracy on these scenes. Features from multiple levels of the network are used to make separate predictions, and ensemble learning is then performed within the network to produce the final prediction. In addition, a “Feature Transfer Path” is added and feature fusion methods are adopted to make full use of both low-level and high-level features. A class-weight loss function is also designed to address non-uniform class distribution; it can further improve accuracy on most scene recognition datasets. Experiments are conducted on the Urban Management Case (UMC) dataset, which the authors collated from two smart urban management system databases, and on the Places-mini dataset, a subset of the well-known Places dataset [36]. The results show that MLEN achieves much higher accuracy than state-of-the-art scene recognition networks on both datasets.
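The abstract does not say how the per-level predictions are combined or how the class weights are computed. The sketch below is therefore only an illustration under stated assumptions: per-level probabilities are averaged, and class weights follow an inverse-frequency rule. The function names, logits, and class counts are hypothetical and are not taken from the paper.

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def ensemble_predict(level_logits):
    """Average per-level class probabilities (assumed combination rule)."""
    probs = [softmax(logits) for logits in level_logits]
    n_levels = len(probs)
    n_classes = len(probs[0])
    return [sum(p[c] for p in probs) / n_levels for c in range(n_classes)]

def inverse_frequency_weights(class_counts):
    """Assumed weighting w_c = N / (K * n_c): rarer classes get larger weights."""
    total = sum(class_counts)
    k = len(class_counts)
    return [total / (k * n) for n in class_counts]

def class_weighted_cross_entropy(probs, label, weights):
    """Cross-entropy for one sample, scaled by the weight of its true class."""
    return -weights[label] * math.log(probs[label])

# Hypothetical logits from three levels (shallow, middle, deep) for 3 classes.
level_logits = [
    [2.0, 0.5, 0.1],
    [0.3, 1.5, 0.2],
    [1.8, 0.4, 0.3],
]
probs = ensemble_predict(level_logits)
predicted = max(range(len(probs)), key=probs.__getitem__)

# Hypothetical, imbalanced training-set class counts.
weights = inverse_frequency_weights([800, 150, 50])
loss = class_weighted_cross_entropy(probs, label=0, weights=weights)
```

Here two of the three levels favor class 0, so the averaged prediction selects class 0 even though the middle level disagrees. A learned combination (for example, a small layer over the concatenated per-level outputs) would be an equally plausible reading of "ensemble learning within the net".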
Pages: 28209-28230
Page count: 21
Related papers
50 in total
  • [31] MAFN: multi-level attention fusion network for multimodal named entity recognition
    Zhou, Xiaoying
    Zhang, Yijia
    Wang, Zhuang
    Lu, Mingyu
    Liu, Xiaoxia
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (15) : 45047 - 45058
  • [32] Multi-level channel attention excitation network for human action recognition in videos
    Wu, Hanbo
    Ma, Xin
    Li, Yibin
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2023, 114
  • [33] Multi-level semantic fusion network for Chinese medical named entity recognition
    Shi, Jintong
    Sun, Mengxuan
    Sun, Zhengya
    Li, Mingda
    Gu, Yifan
    Zhang, Wensheng
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2022, 133
  • [34] Learning Multi-level Representations for Image Emotion Recognition in the Deep Convolutional Network
    Zhang, Hao
    Liu, Yanan
    Xu, Dan
    He, Kangjian
    Peng, Guoqin
    Yue, Yingying
    Liu, Ruhan
    [J]. THIRTEENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2021), 2022, 12083
  • [35] Concept-guided multi-level attention network for image emotion recognition
    Yang, Hansen
    Fan, Yangyu
    Lv, Guoyun
    Liu, Shiya
    Guo, Zhe
    [J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (05) : 4313 - 4326
  • [36] Multi-Scale Multi-Level Generative Model in Scene Classification
    Xie, Wenjie
    Xu, De
    Tang, Yingjun
    Cui, Geng
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2011, E94D (01) : 167 - 170
  • [37] Multi-view Fusion for Multi-level Robotic Scene Understanding
    Lin, Yunzhi
    Tremblay, Jonathan
    Tyree, Stephen
    Vela, Patricio A.
    Birchfield, Stan
    [J]. 2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021 : 6817 - 6824
  • [38] MCNet: Multi-level Correction Network for thermal image semantic segmentation of nighttime driving scene
    Xiong, Haitao
    Cai, Wenjie
    Liu, Qiong
    [J]. INFRARED PHYSICS & TECHNOLOGY, 2021, 113
  • [39] Cascade ensemble learning for multi-level reliability evaluation
    Song, Lu-Kai
    Li, Xue-Qin
    Choy, Yat-Sze
    Zhu, Shun-Peng
    [J]. AEROSPACE SCIENCE AND TECHNOLOGY, 2024, 148
  • [40] Multi-STMT: Multi-Level Network for Human Activity Recognition Based on Wearable Sensors
    Zhang, Haoran
    Xu, Linhai
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 12