Small Object Augmentation of Urban Scenes for Real-Time Semantic Segmentation

被引:43
|
作者
Yang, Zhengeng [1 ,2 ,3 ]
Yu, Hongshan [1 ,2 ]
Feng, Mingtao [1 ,2 ]
Sun, Wei [1 ,2 ]
Lin, Xuefei [4 ]
Sun, Mingui [3 ,5 ,6 ]
Mao, Zhi-Hong [5 ,6 ]
Mian, Ajmal [7 ]
机构
[1] Hunan Univ, Natl Engn Lab Robot Visual Percept & Control Tech, Coll Elect & Informat Engn, Changsha 410082, Hunan, Peoples R China
[2] Hunan Univ, Shenzhen Inst, Shenzhen 518057, Peoples R China
[3] Univ Pittsburgh, Dept Neurol Surg, Pittsburgh, PA 15260 USA
[4] Hunan Agr Univ, Dept Art, Changsha 410128, Peoples R China
[5] Univ Pittsburgh, Dept Elect & Comp Engn, Pittsburgh, PA 15260 USA
[6] Univ Pittsburgh, Dept Bioengn, Pittsburgh, PA 15260 USA
[7] Univ Western Australia, Dept Comp Sci, Perth, WA 6009, Australia
基金
美国国家卫生研究院; 湖南省自然科学基金; 中国国家自然科学基金;
关键词
Semantic segmentation; scene understanding; autonomous driving; synthetic dataset; FEATURES; NETWORK;
D O I
10.1109/TIP.2020.2976856
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semantic segmentation is a key step in scene understanding for autonomous driving. Although deep learning has significantly improved the segmentation accuracy, current high-quality models such as PSPNet and DeepLabV3 are inefficient given their complex architectures and reliance on multi-scale inputs. Thus, it is difficult to apply them to real-time or practical applications. On the other hand, existing real-time methods cannot yet produce satisfactory results on small objects such as traffic lights, which are imperative to safe autonomous driving. In this paper, we improve the performance of real-time semantic segmentation from two perspectives, methodology and data. Specifically, we propose a real-time segmentation model coined Narrow Deep Network (NDNet) and build a synthetic dataset by inserting additional small objects into the training images. The proposed method achieves 65.7% mean intersection over union (mIoU) on the Cityscapes test set with only 8.4G floating-point operations (FLOPs) on $1024\times 2048$ inputs. Furthermore, by re-training the existing PSPNet and DeepLabV3 models on our synthetic dataset, we obtained an average 2% mIoU improvement on small objects.
引用
收藏
页码:5175 / 5190
页数:16
相关论文
共 50 条
  • [1] Joint pyramid attention network for real-time semantic segmentation of urban scenes
    Xuegang Hu
    Liyuan Jing
    Uroosa Sehar
    Applied Intelligence, 2022, 52 : 580 - 594
  • [2] Joint pyramid attention network for real-time semantic segmentation of urban scenes
    Hu, Xuegang
    Jing, Liyuan
    Sehar, Uroosa
    APPLIED INTELLIGENCE, 2022, 52 (01) : 580 - 594
  • [3] DSANet: Dilated spatial attention for real-time semantic segmentation in urban street scenes
    Elhassan, Mohammed A. M.
    Huang, Chenxi
    Yang, Chenhui
    Munea, Tewodros Legesse
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 183
  • [4] Real-Time High-Performance Semantic Image Segmentation of Urban Street Scenes
    Dong, Genshun
    Yan, Yan
    Shen, Chunhua
    Wang, Hanzi
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (06) : 3258 - 3274
  • [5] RTSNet: Real-Time Semantic Segmentation Network For Outdoor Scenes
    Ma, Mingyu
    Zou, Fengshan
    Xu, Fang
    Song, Jilai
    2019 9TH IEEE ANNUAL INTERNATIONAL CONFERENCE ON CYBER TECHNOLOGY IN AUTOMATION, CONTROL, AND INTELLIGENT SYSTEMS (IEEE-CYBER 2019), 2019, : 659 - 664
  • [6] A Real-Time Semantic Segmentation Approach for Autonomous Driving Scenes
    Qin, Feiwei
    Shen, Xiyue
    Peng, Yong
    Shao, Yanli
    Yuan, Wenqiang
    Ji, Zhongping
    Bai, Jing
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2021, 33 (07): : 1026 - 1037
  • [7] Small Target Augmentation for Urban Remote Sensing Image Real-Time Segmentation
    Ren, Shasha
    Liu, Qiong
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (02) : 2076 - 2088
  • [8] Research on Efficient Asymmetric Attention Module for Real-Time Semantic Segmentation Networks in Urban Scenes
    Su, Xu
    Li, Lihong
    Xiao, Jiejie
    Wang, Pengtao
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2024, 28 (03) : 562 - 572
  • [9] LCFNet: Loss Compensation Fusion Network for Real-time Semantic Segmentation of Urban Road Scenes
    Yang, Lu
    Bai, Yiwen
    Ren, Fenglei
    Zhang, Shiyu
    Bi, Chongke
    2023 IEEE 26TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, ITSC, 2023, : 347 - 354
  • [10] Real-time Hierarchical Fusion System for Semantic Segmentation in Offroad Scenes
    Dang, Kang
    Hoy, Michael
    Dauwels, Justin
    Yuan, Junsong
    2017 20TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2017, : 72 - 77