Multi-Level Ensemble Network for Scene Recognition

Cited by: 4

Authors:

Zhang, Longhao [1]

Li, Lingqiao [1]

Pan, Xipeng [1]

Cao, Zhiwei [1]

Chen, Qianyu [2]

Yang, Huihua [1]

Affiliations:

[1] Beijing Univ Posts & Telecommun, Automat Sch, Beijing Shi, Peoples R China

[2] Beijing Univ Posts & Telecommun, Beijing Shi, Peoples R China

Source:

MULTIMEDIA TOOLS AND APPLICATIONS | 2019 / Vol. 78 / Issue 19

Keywords:

Scene recognition; Neural network; Small object-supported scenes; Ensemble learning; Feature fusion;

DOI:

10.1007/s11042-019-07933-2

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

Scene recognition is an important branch of computer vision and a common task for deep learning. Different scenes are characterized by different "key objects". A neural network used for scene recognition therefore needs to extract the features of these key objects, and sometimes must also integrate the positional relations between objects, to determine the class to which the scene belongs. In some cases the key objects in a scene are very small, and their features become extremely inconspicuous or even disappear in the deep layers of the network. Such scenes are called "small object-supported scenes". In this paper, the Multi-Level Ensemble Network (MLEN), a convolutional neural network, is proposed to improve the recognition accuracy on these "small object-supported scenes". Features from multiple levels of the network are used to make separate predictions, and ensemble learning is then performed within the network to make the final prediction. In addition, a "Feature Transfer Path" is added and feature fusion methods are adopted to make full use of low-level and high-level features. A class-weight loss function is also designed to address non-uniform class distribution; it can further improve accuracy on most scene recognition datasets. The experiments use the Urban Management Case (UMC) dataset, which we collated from two smart urban management system databases, and the Places-mini dataset, a subset of the well-known Places dataset [36]. The results show that our Multi-Level Ensemble Network achieves much higher accuracy than state-of-the-art scene recognition networks on both datasets.
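
The abstract names two mechanisms without giving formulas: separate predictions from multiple network levels that are combined into one final prediction, and a loss that weights classes to offset non-uniform class distribution. The following is only a minimal sketch of those two ideas, not the MLEN implementation. Averaging the per-level probabilities and using inverse-frequency class weights are both assumptions made here for illustration, and all function names are hypothetical.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def ensemble_predict(level_logits):
    """Combine per-level predictions by averaging their class probabilities.

    Assumption: simple averaging stands in for whatever ensembling rule
    the paper actually uses.
    """
    probs = [softmax(logits) for logits in level_logits]
    n_levels = len(probs)
    n_classes = len(probs[0])
    return [sum(p[c] for p in probs) / n_levels for c in range(n_classes)]


def inverse_frequency_weights(class_counts):
    """Assumed weighting scheme: total / (n_classes * count).

    Rare classes receive larger weights than frequent ones.
    """
    total = sum(class_counts)
    n_classes = len(class_counts)
    return [total / (n_classes * count) for count in class_counts]


def weighted_cross_entropy(probs, label, weights):
    """Cross-entropy for one sample, scaled by the weight of its true class."""
    return -weights[label] * math.log(probs[label])


# Hypothetical logits from three levels of a network, for three classes.
level_logits = [
    [2.0, 0.5, 0.1],  # low-level head
    [1.0, 1.2, 0.3],  # mid-level head
    [0.2, 2.5, 0.1],  # high-level head
]
probs = ensemble_predict(level_logits)

# Hypothetical, strongly imbalanced training-set class counts.
weights = inverse_frequency_weights([900, 90, 10])
loss = weighted_cross_entropy(probs, label=1, weights=weights)
```

Under this weighting, a misclassified sample from the rarest class contributes far more to the loss than one from the most frequent class, which is the general effect a class-weight loss aims for.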

Pages: 28209 - 28230

Number of pages: 22

50 items in total

[1] Multi-Level Ensemble Network for Scene Recognition
Longhao Zhang
Lingqiao Li
Xipeng Pan
Zhiwei Cao
Qianyu Chen
Huihua Yang
[J]. Multimedia Tools and Applications, 2019, 78 : 28209 - 28230
[2] Ensemble relation network with multi-level measure
Li Xiaoxu
Qu Xue
Cao Jie
[J]. The Journal of China Universities of Posts and Telecommunications, 2022, (03) : 15 - 24
[3] Ensemble relation network with multi-level measure
Xiaoxu, Li
Jie, Cao
Xue, Qu
Jie, Cao
[J]. Journal of China Universities of Posts and Telecommunications, 2022, 29 (03) : 15 - 24
[4] An Ensemble Model for Multi-Level Speech Emotion Recognition
Zheng, Chunjun
Wang, Chunli
Jia, Ning
[J]. APPLIED SCIENCES-BASEL, 2020, 10 (01):
[5] A Multi-level Progressive Rectification Mechanism for Irregular Scene Text Recognition
Liao, Qianying
Lin, Qingxiang
Jin, Lianwen
Luo, Canjie
Zhang, Jiaxin
Peng, Dezhi
Wang, Tianwei
[J]. DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021, PT IV, 2021, 12824 : 140 - 155
[6] MFST: A Multi-Level Fusion Network for Remote Sensing Scene Classification
Wang, Guoqing
Zhang, Ning
Liu, Wenchao
Chen, He
Xie, Yizhuang
[J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
[7] Multi-Level Adaptive Network for Accented Mandarin Speech Recognition
Wang, Huiyong
Wang, Lan
Liu, Xunying
[J]. 2014 4TH IEEE INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST), 2014, : 602 - 605
[8] Multi-level spatial and semantic enhancement network for expression recognition
Ma, Yingdong
Wang, Xia
Wei, Lihua
[J]. APPLIED INTELLIGENCE, 2021, 51 (12) : 8565 - 8578
[9] Speech Emotion Recognition via Multi-Level Attention Network
Liu, Ke
Wang, Dekui
Wu, Dongya
Liu, Yutao
Feng, Jun
[J]. IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 2278 - 2282
[10] Multi-level Feature Fusion Facial Expression Recognition Network
Hu, Qian
Wu, Chengdong
Chi, Jianning
Yu, Xiaosheng
Wang, Huan
[J]. PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 5267 - 5272
