A deep learning-based global and segmentation-based semantic feature fusion approach for indoor scene classification

被引:2
|
作者
Pereira, Ricardo [1 ]
Barros, Tiago [1 ]
Garrote, Luis [1 ]
Lopes, Ana [1 ,2 ]
Nunes, Urbano J. [1 ]
机构
[1] Univ Coimbra, Inst Syst & Robot, Dept Elect & Comp Engn, Rua Silvio Lima Polo II, P-3030290 Coimbra, Portugal
[2] Polytech Inst Tomar, P-2300313 Tomar, Portugal
关键词
Indoor scene classification; Scene representation; Visual recognition; Global and local features; Segmentation-based features; NETWORK;
D O I
10.1016/j.patrec.2024.01.022
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This work proposes a novel approach that uses a semantic segmentation mask to obtain a 2D spatial layout of the segmentation-categories across the scene, designated by segmentation-based semantic features (SSFs). These features represent, per segmentation-category, the pixel count, as well as the 2D average position and respective standard deviation values. Moreover, a two-branch network, GS2F2App, that exploits CNN-based global features extracted from RGB images and the segmentation-based features extracted from the proposed SSFs, is also proposed. GS2F2App was evaluated in two indoor scene benchmark datasets: the SUN RGB-D and the NYU Depth V2, achieving state-of-the-art results on both datasets.
引用
收藏
页码:24 / 30
页数:7
相关论文
共 50 条
  • [1] Deep-Learning based Global and Semantic Feature Fusion for Indoor Scene Classification
    Pereira, Ricardo
    Goncalves, Nuno
    Garrote, Luis
    Barros, Tiago
    Lopes, Ana
    Nunes, Urbano J.
    2020 IEEE INTERNATIONAL CONFERENCE ON AUTONOMOUS ROBOT SYSTEMS AND COMPETITIONS (ICARSC 2020), 2020, : 67 - 73
  • [2] Exploration of Deep Learning-based Multimodal Fusion for Semantic Road Scene Segmentation
    Zhang, Yifei
    Morel, Olivier
    Blanchon, Marc
    Seulin, Ralph
    Rastgoo, Mojdeh
    Sidibe, Desire
    PROCEEDINGS OF THE 14TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5, 2019, : 336 - 343
  • [3] A Deep Learning-based Indoor Scene Classification Approach Enhanced with Inter-Object Distance Semantic Features
    Pereira, Ricardo
    Garrote, Luis
    Banos, Tiago
    Lopes, Ana
    Nunes, Urbano J.
    2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 32 - 38
  • [4] Semantic Segmentation of 3D Scene based on Global Feature Fusion
    Wang, Dan
    Liu, Shuaijun
    Xu, Nansheng
    Lin, Xiaobo
    Wang, Zijiang
    2022 IEEE 6TH ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC), 2022, : 286 - 290
  • [5] Semantic Segmentation for Road Scene Based on Multiscale Feature Fusion
    Yi Qingming
    Zhang Wenting
    Shi Min
    Shen Jialin
    Luo Aiwen
    LASER & OPTOELECTRONICS PROGRESS, 2023, 60 (12)
  • [6] Review of Deep Learning-Based Semantic Segmentation
    Zhang Xiangfu
    Jian, Liu
    Shi Zhangsong
    Wu Zhonghong
    Zhi, Wang
    LASER & OPTOELECTRONICS PROGRESS, 2019, 56 (15)
  • [7] A Deep Learning-Based Watershed Feature Fusion Approach for Tunnel Crack Segmentation in Complex Backgrounds
    Wang, Haozheng
    Wang, Qiang
    Zhang, Weikang
    Zhai, Junli
    Yuan, Dongyang
    Tong, Junhao
    Xie, Xiongyao
    Zhou, Biao
    Tian, Hao
    MATERIALS, 2025, 18 (01)
  • [8] Deep learning-based hybrid feature selection for the semantic segmentation of crops and weeds
    Janneh, Lamin L.
    Youngjun, Zhang
    Hydara, Mbemba
    Cui, Zhongwei
    ICT EXPRESS, 2024, 10 (01): : 118 - 124
  • [9] Deep learning-based multidimensional feature fusion for classification of ECG arrhythmia
    Cui, Jianfeng
    Wang, Lixin
    He, Xiangmin
    De Albuquerque, Victor Hugo C.
    AlQahtani, Salman A.
    Hassan, Mohammad Mehedi
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (22): : 16073 - 16087
  • [10] Deep learning-based multidimensional feature fusion for classification of ECG arrhythmia
    Jianfeng Cui
    Lixin Wang
    Xiangmin He
    Victor Hugo C. De Albuquerque
    Salman A. AlQahtani
    Mohammad Mehedi Hassan
    Neural Computing and Applications, 2023, 35 : 16073 - 16087