A robust higher order potential for modeling the label consistency between object detection and semantic segmentation

Cited by: 0
Authors
[1] Yu, Miao
[2] Hu, Zhanyi
Source
Hu, Zhanyi (huzy@nlpr.ia.ac.cn) | Year 1600 / Vol. Institute of Computing Technology / Issue 28
Keywords
Graphic methods - Inference engines - Object recognition - Semantic Segmentation - Semantics
DOI
Not available
Abstract
Jointly solving object detection and semantic segmentation under a unified energy minimization framework is a promising route to holistic scene understanding. Two key issues are how to design expressive higher order potentials and how to construct efficient inference algorithms for them. In this work, we first introduce three design criteria for a higher order potential that appropriately models label consistency between object detection and semantic segmentation. Based on these criteria, we propose a robust higher order potential and a corresponding efficient inference algorithm. The proposed potential separately models the label consistency of the pixels within the bounding boxes for true, false, and inaccurate detectors, and can be represented as the lower envelope of three linear functions. We prove that, by introducing only two auxiliary binary variables, the higher order α-expansion move function can be transformed into a submodular pairwise energy, which can in turn be minimized efficiently via graph cuts. Comparative experiments against state-of-the-art algorithms on the PASCAL VOC 2010 dataset show that the proposed potential effectively models the label consistency between object detection and semantic segmentation for both accepted and rejected detectors, while remaining robust to false detectors caused by inaccurate localization. © 2016, Beijing China Science Journal Publishing Co. Ltd. All rights reserved.
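To make the "lower envelope of three linear functions" concrete, here is a minimal Python sketch. It is not the authors' formulation: the argument `n` (taken here to be the number of pixels in a bounding box whose label disagrees with the detector), the coefficients in `LINES`, and both function names are hypothetical choices for illustration. The second function shows one generic way a three-piece lower envelope can be written as a minimum over two auxiliary binary variables, assuming the lines are sorted by decreasing slope and each is the minimum on some interval. The paper's actual transformation of the α-expansion move function may differ.

```python
from itertools import product

# Hypothetical (slope, intercept) pairs, sorted by decreasing slope.
# With these values the lines cross at n = 4 and n = 8, so each line
# is the minimum on some interval.
LINES = [(1.0, 0.0), (0.25, 3.0), (0.0, 5.0)]


def lower_envelope(n, lines=LINES):
    """Cost as the pointwise minimum of the linear functions."""
    return min(a * n + b for a, b in lines)


def envelope_via_aux(n, lines=LINES):
    """Same cost, written as a minimum over two binary variables z1, z2.

    Assumes exactly three lines ordered as described above; this is an
    illustrative identity, not the construction from the paper.
    """
    f1, f2, f3 = (a * n + b for a, b in lines)
    return min(
        f1 + z1 * (f2 - f1) + z2 * (f3 - f2)
        for z1, z2 in product((0, 1), repeat=2)
    )


if __name__ == "__main__":
    for n in (0, 2, 4, 6, 8, 10):
        print(n, lower_envelope(n), envelope_via_aux(n))
```

With these assumed coefficients the cost grows linearly for small `n`, more slowly in the middle range, and saturates at a constant for large `n`. That saturation is what keeps a potential of this shape from over-penalizing a bounding box that turns out to be a false or poorly localized detection.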
Related papers
50 in total
  • [31] Rich feature hierarchies for accurate object detection and semantic segmentation
    Girshick, Ross
    Donahue, Jeff
    Darrell, Trevor
    Malik, Jitendra
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 580 - 587
  • [32] Leveraging Spatial-semantic Information in Object Detection and Segmentation
    Guo Q.-Z.
    Yuan C.
    Ruan Jian Xue Bao/Journal of Software, 2023, 34 (06): : 2776 - 2788
  • [33] Adaptive Generation of Weakly Supervised Semantic Segmentation for Object Detection
    Li, Shibao
    Liu, Yixuan
    Zhang, Yunwu
    Luo, Yi
    Liu, Jianhang
    NEURAL PROCESSING LETTERS, 2023, 55 (01) : 657 - 670
  • [34] Semantic segmentation guided pseudo label mining and instance re-detection for weakly supervised object detection in remote sensing images
    Qian, Xiaoliang
    Li, Chao
    Wang, Wei
    Yao, Xiwen
    Cheng, Gong
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2023, 119
  • [35] Robust object segmentation with constrained curve embedding potential field
    Ho, GHP
    Shi, PC
    MEDICAL IMAGING AND AUGMENTED REALITY, PROCEEDINGS, 2004, 3150 : 145 - 153
  • [36] HIGHER ORDER POTENTIALS WITH SUPERPIXEL NEIGHBOURHOOD (HSN) FOR SEMANTIC IMAGE SEGMENTATION
    Ibrahim, Mostafa S.
    El-Saban, Motaz
    2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011,
  • [37] Salient object detection employing robust sparse representation and local consistency
    Liu Yi
    Zhang Qiang
    Han Jungong
    Wang Long
    IMAGE AND VISION COMPUTING, 2018, 69 : 155 - 167
  • [38] Semantic Consistency Reasoning for 3-D Object Detection in Point Clouds
    Wei, Wenwen
    Wei, Ping
    Liao, Zhimin
    Qin, Jialu
    Cheng, Xiang
    Liu, Meiqin
    Zheng, Nanning
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, : 1 - 14
  • [39] BG-Net: boundary-guidance network for object consistency maintaining in semantic segmentation
    Cheng, Xiji
    Huang, Shiliang
    Liao, Bingyan
    Wang, Yayun
    Luo, Xiao
    VISUAL COMPUTER, 2024, 40 (01): : 373 - 391
  • [40] Higher-order potentials for video object segmentation in bilateral space
    Hao, Chuanyan
    Chen, Yadang
    Yang, Zhi-Xin
    Wu, Enhua
    NEUROCOMPUTING, 2020, 401 : 28 - 35