Hybrid Task Cascade for Instance Segmentation

被引:1019
|
作者
Chen, Kai [1 ]
Pang, Jiangmiao [2 ,3 ]
Wang, Jiaqi [1 ]
Xiong, Yu [1 ]
Li, Xiaoxiao [1 ]
Sun, Shuyang [4 ]
Feng, Wansen [2 ]
Liu, Ziwei [1 ]
Shi, Jianping [2 ]
Ouyang, Wanli [4 ]
Loy, Chen Change [5 ]
Lin, Dahua [1 ]
机构
[1] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[2] SenseTime Res, Hong Kong, Peoples R China
[3] Zhejiang Univ, Hangzhou, Peoples R China
[4] Univ Sydney, Sydney, NSW, Australia
[5] Nanyang Technol Univ, Singapore, Singapore
关键词
D O I
10.1109/CVPR.2019.00511
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cascade is a classic yet powerful architecture that has boosted performance on various tasks. However, how to introduce cascade to instance segmentation remains an open question. A simple combination of Cascade R-CNN and Mask R-CNN only brings limited gain. In exploring a more effective approach, we find that the key to a successful instance segmentation cascade is to fully leverage the reciprocal relationship between detection and segmentation. In this work, we propose a new framework, Hybrid Task Cascade (HTC), which differs in two important aspects: (1) instead of performing cascaded refinement on these two tasks separately, it interweaves them for a joint multi-stage processing; (2) it adopts a fully convolutional branch to provide spatial context, which can help distinguishing hard foreground from cluttered background. Overall, this framework can learn more discriminative features progressively while integrating complementary features together in each stage. Without bells and whistles, a single HTC obtains 38.4% and 1.5% improvement over a strong Cascade Mask R-CNN baseline on MSCOCO dataset. Moreover, our overall system achieves 48.6 mask AP on the test-challenge split, ranking 1st in the COCO 2018 Challenge Object Detection Task.
引用
收藏
页码:4969 / 4978
页数:10
相关论文
共 50 条
  • [21] Semantic Instance Segmentation in a 3D Traffic Scene Reconstruction task
    Hadi, Shiqah
    Phon-Amnuaisuk, Somnuk
    Tan, Soon-Jiann
    [J]. 2020 59TH ANNUAL CONFERENCE OF THE SOCIETY OF INSTRUMENT AND CONTROL ENGINEERS OF JAPAN (SICE), 2020, : 186 - 191
  • [22] Incorporating Non-local and Task-specific Features for Instance Segmentation
    Yang, Longrong
    Meng, Fanman
    Wu, Qingbo
    Li, Hongliang
    [J]. 2019 IEEE 21ST INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP 2019), 2019,
  • [23] Query-Based Cascade Instance Segmentation Network for Remote Sensing Image Processing
    Chen, Enping
    Li, Maojun
    Zhang, Qian
    Chen, Man
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (17):
  • [24] Vectorized instance segmentation using periodic B-splines based on cascade architecture
    Wang, Fangjun
    Song, Yanzhi
    Huang, Zhangjin
    Yang, Zhouwang
    [J]. COMPUTERS & GRAPHICS-UK, 2022, 102 : 592 - 600
  • [25] Wheat-Net: An Automatic Dense Wheat Spike Segmentation Method Based on an Optimized Hybrid Task Cascade Model
    Zhang, Jiajing
    Min, An
    Steffenson, Brian J.
    Su, Wen-Hao
    Hirsch, Cory D.
    Anderson, James
    Wei, Jian
    Ma, Qin
    Yang, Ce
    [J]. FRONTIERS IN PLANT SCIENCE, 2022, 13
  • [26] Adapting Video Instance Segmentation for Instance Search
    Nguyen, An Thi
    [J]. 20TH INTERNATIONAL CONFERENCE ON CONTENT-BASED MULTIMEDIA INDEXING, CBMI 2023, 2023, : 256 - 260
  • [27] Instance Segmentation as Image Segmentation Annotation
    Watanabe, Thomio
    Wolf, Denis F.
    [J]. 2019 30TH IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV19), 2019, : 432 - 437
  • [28] Video Instance Segmentation by Instance Flow Assembly
    Li, Xiang
    Wang, Jinglu
    Li, Xiao
    Lu, Yan
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 7469 - 7479
  • [29] Iterative Instance Segmentation
    Li, Ke
    Hariharan, Bharath
    Malik, Jitendra
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3659 - 3667
  • [30] Amodal Instance Segmentation
    Li, Ke
    Malik, Jitendra
    [J]. COMPUTER VISION - ECCV 2016, PT II, 2016, 9906 : 677 - 693