A robust higher order potential for modeling the label consistency between object detection and semantic segmentation

被引：0

作者：

机构：

[1] [1,Yu, Miao

[2] Hu, Zhanyi

来源：

Hu, Zhanyi (huzy@nlpr.ia.ac.cn) | 1600年 / Institute of Computing Technology卷 / 28期

关键词：

Graphic methods - Inference engines - Object recognition - Semantic Segmentation - Semantics;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Jointly solving the object detection and semantic segmentation under a unified energy minimization framework is a promising way towards a holistic scene understanding, in which how to design powerful expressive higher order potentials and how to construct the corresponding efficient inference algorithms are two key issues. In this work, we at first introduce three design criteria for suitable higher order potential to appropriately model label consistency between object detection and semantic segmentation, then based on these three criteria, a robust higher order potential and its corresponding efficient inference algorithm are proposed. Our proposed higher order potential separately models the label consistency of the pixels within the bounding boxes for true, false and inaccurate detectors, and can be represented as the lower envelope of three linear functions. By introducing only two auxiliary binary variables, it is proved the higher order α-expansion move function can be transformed to submodular pairwise energy, which in turn can be efficiently minimized via graph cuts. The comparative experiments on PASCAL VOC 2010 dataset with the state-of-the-art algorithms showed that our proposed robust higher order potential could effectively model the label consistency of object detection and semantic segmentation for both accepted and rejected detectors, while keeping robust to the false detectors resulting from inaccurate localization. © 2016, Beijing China Science Journal Publishing Co. Ltd. All right reserved.

引用

共 50 条

[31] Rich feature hierarchies for accurate object detection and semantic segmentation
Girshick, Ross
Donahue, Jeff
Darrell, Trevor
Malik, Jitendra
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 580 - 587
[32] Leveraging Spatial-semantic Information in Object Detection and Segmentation
Guo Q.-Z.
Yuan C.
Ruan Jian Xue Bao/Journal of Software, 2023, 34 (06): : 2776 - 2788
[33] Adaptive Generation of Weakly Supervised Semantic Segmentation for Object Detection
Li, Shibao
Liu, Yixuan
Zhang, Yunwu
Luo, Yi
Liu, Jianhang
NEURAL PROCESSING LETTERS, 2023, 55 (01) : 657 - 670
[34] Semantic segmentation guided pseudo label mining and instance re-detection for weakly supervised object detection in remote sensing images
Qian, Xiaoliang
Li, Chao
Wang, Wei
Yao, Xiwen
Cheng, Gong
INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2023, 119
[35] Robust object segmentation with constrained curve embedding potential field
Ho, GHP
Shi, PC
MEDICAL IMAGING AND AUGMENTED REALITY, PROCEEDINGS, 2004, 3150 : 145 - 153
[36] HIGHER ORDER POTENTIALS WITH SUPERPIXEL NEIGHBOURHOOD (HSN) FOR SEMANTIC IMAGE SEGMENTATION
Ibrahim, Mostafa S.
El-Saban, Motaz
2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011,
[37] Salient object detection employing robust sparse representation and local consistency
Liu Yi
Zhang Qiang
Han Jungong
Wang Long
IMAGE AND VISION COMPUTING, 2018, 69 : 155 - 167
[38] Semantic Consistency Reasoning for 3-D Object Detection in Point Clouds
Wei, Wenwen
Wei, Ping
Liao, Zhimin
Qin, Jialu
Cheng, Xiang
Liu, Meiqin
Zheng, Nanning
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, : 1 - 14
[39] BG-Net: boundary-guidance network for object consistency maintaining in semantic segmentation
Cheng, Xiji
Huang, Shiliang
Liao, Bingyan
Wang, Yayun
Luo, Xiao
VISUAL COMPUTER, 2024, 40 (01): : 373 - 391
[40] Higher-order potentials for video object segmentation in bilateral space
Hao, Chuanyan
Chen, Yadang
Yang, Zhi-Xin
Wu, Enhua
NEUROCOMPUTING, 2020, 401 : 28 - 35

← 1 2 3 4 5 →