Real-time instance segmentation with assembly parallel task

被引:2
|
作者
Yang, Zhen [1 ,2 ]
Wang, Yang [1 ]
Yang, Fan [1 ]
Yin, Zhijian [1 ]
Zhang, Tao [3 ,4 ]
机构
[1] Jiangxi Sci & Technol Normal Univ, Sch Commun & Elect, Nanchang, Jiangxi, Peoples R China
[2] Guangdong Atv Acad Performing Arts, Dongguan, Peoples R China
[3] Shanghai Jiao Tong Univ, Dept Automat, 800 Dongchuan Rd, Shanghai, Peoples R China
[4] Shanghai Jiao Tong Univ, Shanghai Key Lab Intelligent Sensing & Recognit, 800 Dongchuan Rd, Shanghai 200240, Peoples R China
来源
VISUAL COMPUTER | 2023年 / 39卷 / 09期
基金
中国国家自然科学基金;
关键词
Real-time; Instance Segmentation; Assembly Parallel Task;
D O I
10.1007/s00371-022-02537-8
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Although instance segmentation has made significant progress in recent years, it is still a challenge to develop highly accurate algorithms with real-time performance. In this paper, we propose a real-time framework denoted by APTMask for instance segmentation, which builds on the real-time project YOLACT. In APTMask, we use Swin-Transformer Tiny with PA-FPN as the default feature backbone and a base image size of 544 x 544. We devise a new mask branch, which can more effectively exploit the semantic information of PA-FPN deeper features and the positional information of shallow features for mask representation, compared to the use of implicit parameterized forms. We replace fast NMS with Cluster NMS, which compensates for the performance penalty of fast NMS compiled to standard NMS. CIoU loss is also adopted to fully exploit the scale information of the aspect ratio of the bounding box. Experimental results show that APTMask can achieve 39.7/34.7 box/mask AP on COCO val2017 dataset at 31.8 fps evaluated with a single RTX 2080TI GPU card. Compared to YOLACT, APTMask improves the box AP by about 8.0% and the mask AP by 6.2%, which is encouraging and competitive. Given its simplicity and efficiency, we hope that our APTMask can serve as a simple but strong baseline for a variety of instance-wise prediction tasks.
引用
收藏
页码:3937 / 3947
页数:11
相关论文
共 50 条
  • [1] Real-time instance segmentation with assembly parallel task
    Zhen Yang
    Yang Wang
    Fan Yang
    Zhijian Yin
    Tao Zhang
    [J]. The Visual Computer, 2023, 39 : 3937 - 3947
  • [2] YOLACT Real-time Instance Segmentation
    Bolya, Daniel
    Zhou, Chong
    Xiao, Fanyi
    Lee, Yong Jae
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 9156 - 9165
  • [3] Sparse Instance Activation for Real-Time Instance Segmentation
    Cheng, Tianheng
    Wang, Xinggang
    Chen, Shaoyu
    Zhang, Wenqiang
    Zhang, Qian
    Huang, Chang
    Zhang, Zhaoxiang
    Liu, Wenyu
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4423 - 4432
  • [4] YolactEdge: Real-time Instance Segmentation on the Edge
    Liu, Haotian
    Soto, Rafael A. Rivera
    Xiao, Fanyi
    Lee, Yong Jae
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 9579 - 9585
  • [5] RISAT: real-time instance segmentation with adversarial training
    Songwen Pei
    Bo Ni
    Tianma Shen
    Zhenling Zhou
    Yewang Chen
    Meikang Qiu
    [J]. Multimedia Tools and Applications, 2023, 82 : 4063 - 4080
  • [6] RISAT: real-time instance segmentation with adversarial training
    Pei, Songwen
    Ni, Bo
    Shen, Tianma
    Zhou, Zhenling
    Chen, Yewang
    Qiu, Meikang
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (03) : 4063 - 4080
  • [7] Real-time Instance Segmentation with Discriminative Orientation Maps
    Du, Wentao
    Xiang, Zhiyu
    Chen, Shuya
    Qiao, Chengyu
    Chen, Yiman
    Bai, Tingming
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 7294 - 7303
  • [8] Explicit Shape Encoding for Real-Time Instance Segmentation
    Xu, Wenqiang
    Wang, Haiyang
    Qi, Fubo
    Lu, Cewu
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 5167 - 5176
  • [9] Real-time and accurate model of instance segmentation of foods
    Fan, Yuhe
    Zhang, Lixun
    Zheng, Canxing
    Zu, Yunqin
    Wang, Keyi
    Wang, Xingyuan
    [J]. JOURNAL OF REAL-TIME IMAGE PROCESSING, 2024, 21 (03)
  • [10] QuadroNet: Multi-Task Learning for Real-Time Semantic Depth Aware Instance Segmentation
    Goel, Kratarth
    Srinivasan, Praveen
    Tariq, Sarah
    Philbin, James
    [J]. 2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 315 - 324