Multi-Encoder Context Aggregation Network for Structured and Unstructured Urban Street Scene Analysis

Cited by: 0
Authors
Singha, Tanmay [1]
Pham, Duc-Son [1]
Krishna, Aneesh [1]
Affiliations
[1] Curtin Univ, Sch Elect Engn Comp & Math Sci, Perth, WA, Australia
Keywords
Semantic segmentation; feature scaling; feature aggregation; deep learning; scene understanding; convolutional neural networks; SEGMENTATION;
DOI
10.1109/ACCESS.2023.3289968
Chinese Library Classification
TP [Automation Technology, Computer Technology];
Discipline classification code
0812 ;
Abstract
Developing computationally efficient semantic segmentation models that are suitable for resource-constrained mobile devices is an open challenge in computer vision research. To address this challenge, we propose a novel real-time semantic scene segmentation model called Multi-encoder Context Aggregation Network (MCANet), which offers the best combination of low model complexity and state-of-the-art (SOTA) performance on benchmark datasets. While we follow the multi-encoder approach, our novelty lies in varying the number of scales to capture both global context and local details effectively. We introduce suitable lateral connections between sub-encoders for improved feature refinement. We also optimize the backbone by exploiting the residual block of MobileNet for resource-constrained applications. On the decoder side, the proposed model includes a new Local and Global Context Aggregation (LGCA) module that significantly enhances semantic details in the segmentation output. Finally, we use several known efficient convolution techniques in the classification module to make the model more computationally efficient. We provide a comprehensive evaluation of MCANet on multiple datasets containing structured and unstructured urban street scenes. Among the existing real-time models with fewer than 3 million parameters, the proposed model is more competitive, as it achieves SOTA performance without ImageNet pre-trained weights in both structured and unstructured environments while being more compact for resource-constrained applications.
Pages: 66227 - 66244
Number of pages: 18
Related papers
50 in total
  • [41] Encoder deep interleaved network with multi-scale aggregation for RGB-D salient object detection
    Feng, Guang
    Meng, Jinyu
    Zhang, Lihe
    Lu, Huchuan
    PATTERN RECOGNITION, 2022, 128
  • [42] Disentangling the effects of urban form and socio-demographic context on street tree cover: A multi-level analysis from Montreal
    Thi-Thanh-Hien Pham
    Apparicio, Philippe
    Landry, Shawn
    Lewnard, Joseph
    LANDSCAPE AND URBAN PLANNING, 2017, 157 : 422 - 433
  • [43] Analysis of Urban Drivable and Walkable Street Networks of the ASEAN Smart Cities Network
    Zhao, Pengjun
    Yen, Yat
    Bailey, Earl
    Sohail, Muhammad Tayyab
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2019, 8 (10)
  • [44] Context-Aware Multi-Scale Aggregation Network for Congested Crowd Counting
    Huang, Liangjun
    Shen, Shihui
    Zhu, Luning
    Shi, Qingxuan
    Zhang, Jianwei
    SENSORS, 2022, 22 (09)
  • [45] HEFANet: hierarchical efficient fusion and aggregation segmentation network for enhanced rgb-thermal urban scene parsing
    Shen, Zhengwen
    Pan, Zaiyu
    Weng, Yuchen
    Li, Yulian
    Wang, Jiangyu
    Wang, Jun
    APPLIED INTELLIGENCE, 2024, 54 (22) : 11248 - 11266
  • [46] Multi-scale inputs and context-aware aggregation network for stereo matching
    Shi, Liqing
    Xiong, Taiping
    Cui, Gengshen
    Pan, Minghua
    Cheng, Nuo
    Wu, Xiangjie
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (30) : 75171 - 75194
  • [47] Multi-Scale Context Aggregation Network with Attention-Guided for Crowd Counting
    Wang, Xin
    Lv, Rongrong
    Zhao, Yang
    Yang, Tangwen
    Ruan, Qiuqi
    PROCEEDINGS OF 2020 IEEE 15TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP 2020), 2020, : 240 - 245
  • [48] Deep Multi-Branch Aggregation Network for Real-Time Semantic Segmentation in Street Scenes
    Weng, Xi
    Yan, Yan
    Dong, Genshun
    Shu, Chang
    Wang, Biao
    Wang, Hanzi
    Zhang, Ji
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (10) : 17224 - 17240
  • [49] Real-time urban street view semantic segmentation based on cross-layer aggregation network
    Hou Z.
    Cheng M.
    Ma S.
    Qu M.
    Yang X.
    Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2024, 32 (08) : 1212 - 1226
  • [50] Multi-granular Aggregation of Network Flows for Security Analysis
    Ding, Tao
    AlEroud, Ahmed
    Karabatis, George
    2015 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENCE AND SECURITY INFORMATICS (ISI), 2015, : 173 - 175