Multi-Encoder Context Aggregation Network for Structured and Unstructured Urban Street Scene Analysis

Cited by: 0
Authors
Singha, Tanmay [1]
Pham, Duc-Son [1]
Krishna, Aneesh [1]
Affiliations
[1] Curtin Univ, Sch Elect Engn Comp & Math Sci, Perth, WA, Australia
Keywords
Semantic segmentation; feature scaling; feature aggregation; deep learning; scene understanding; convolutional neural networks; segmentation
DOI
10.1109/ACCESS.2023.3289968
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Developing computationally efficient semantic segmentation models that are suitable for resource-constrained mobile devices is an open challenge in computer vision research. To address this challenge, we propose a novel real-time semantic scene segmentation model called Multi-encoder Context Aggregation Network (MCANet), which offers the best combination of low model complexity and state-of-the-art (SOTA) performance on benchmark datasets. While we follow the multi-encoder approach, our novelty lies in the varying number of scales to capture both global context and local details effectively. We introduce suitable lateral connections between sub-encoders for improved feature refinement. We also optimize the backbone by exploiting the residual block of MobileNet for resource-constrained applications. On the decoder side, the proposed model includes a new Local and Global Context Aggregation (LGCA) module that significantly enhances semantic details in the segmentation output. Finally, we use several known efficient convolution techniques for the classification module to make the model more computationally efficient. We provide a comprehensive evaluation of MCANet on multiple datasets containing structured and unstructured urban street scenes. Among the existing real-time models with less than 3 million parameters, the proposed model is more competitive as it achieves the SOTA performance without ImageNet pre-trained weights on both structured and unstructured environments while being more compact for resource-constrained applications.
Pages: 66227 - 66244
Page count: 18
Related papers
50 in total
  • [31] MTSCANet: Multi temporal resolution temporal semantic context aggregation network
    Zhang, Haiping
    Ma, Conghao
    Yu, Dongjin
    Guan, Liming
    Wang, Dongjing
    Hu, Zepeng
    Liu, Xu
    IET COMPUTER VISION, 2023, 17 (03) : 366 - 378
  • [32] Multi-Scale Recursive Context Aggregation Network for Semantic Segmentation
    Yalcin, Abdullah
    Keskinoz, Mehmet
    32ND IEEE SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU 2024, 2024,
  • [33] A city is not a tree: a multi-city study on street network and urban life
    Huang, Jianxiang
    Cui, Yuming
    Chang, Haoliang
    Obracht-Prondzynska, Hanna
    Kamrowska-Zaluska, Dorota
    Li, Lishuai
    LANDSCAPE AND URBAN PLANNING, 2022, 226
  • [34] MCENET: Multi-Context Encoder Network for Homogeneous Agent Trajectory Prediction in Mixed Traffic
    Cheng, Hao
    Liao, Wentong
    Yang, Michael Ying
    Sester, Monika
    Rosenhahn, Bodo
    2020 IEEE 23RD INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2020,
  • [35] Robustness Analysis of Urban Street Networks Using Complex Network Method
    Tian J.
    Fang H.
    Liu J.
    Zhao F.
    Ren C.
    Wuhan Daxue Xuebao (Xinxi Kexue Ban)/Geomatics and Information Science of Wuhan University, 2019, 44 (05): 771 - 777
  • [36] Multi-Resolution Context Aggregation Network for Single Image Rain Removal
    Xie Q.
    Zhang H.
    Gai S.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2022, 34 (02): 232 - 244
  • [37] Modeling and Analysis of BS Deployment in Multi-scene Urban Areas
    Ma, Rui-qiang
    Wu, Mu-qing
    Zhang, Jian
    Wan, Xiu-sheng
    2017 2ND INTERNATIONAL CONFERENCE ON COMPUTATIONAL MODELING, SIMULATION AND APPLIED MATHEMATICS (CMSAM), 2017, : 45 - 49
  • [38] PMBANet: Progressive Multi-Branch Aggregation Network for Scene Depth Super-Resolution
    Ye, Xinchen
    Sun, Baoli
    Wang, Zhihui
    Yang, Jingyu
    Xu, Rui
    Li, Haojie
    Li, Baopu
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 (29) : 7427 - 7442
  • [39] Remote Sensing Scene Classification Method Based on Multi-Scale Graph Convolution Context Feature Aggregation
    Chen, Baolan
    Li, Huawang
    Wang, Yinxiao
    LASER & OPTOELECTRONICS PROGRESS, 2025, 62 (04)
  • [40] Iterative Convolutional Encoder-Decoder Network with Multi-Scale Context Learning for Liver Segmentation
    Zhang, Feiyan
    Yan, Shuhao
    Zhao, Yizhong
    Gao, Yuan
    Li, Zhi
    Lu, Xuesong
    APPLIED ARTIFICIAL INTELLIGENCE, 2022, 36 (01)