A robust higher order potential for modeling the label consistency between object detection and semantic segmentation

被引:0
|
作者
机构
[1] [1,Yu, Miao
[2] Hu, Zhanyi
来源
Hu, Zhanyi (huzy@nlpr.ia.ac.cn) | 1600年 / Institute of Computing Technology卷 / 28期
关键词
Graphic methods - Inference engines - Object recognition - Semantic Segmentation - Semantics;
D O I
暂无
中图分类号
学科分类号
摘要
Jointly solving the object detection and semantic segmentation under a unified energy minimization framework is a promising way towards a holistic scene understanding, in which how to design powerful expressive higher order potentials and how to construct the corresponding efficient inference algorithms are two key issues. In this work, we at first introduce three design criteria for suitable higher order potential to appropriately model label consistency between object detection and semantic segmentation, then based on these three criteria, a robust higher order potential and its corresponding efficient inference algorithm are proposed. Our proposed higher order potential separately models the label consistency of the pixels within the bounding boxes for true, false and inaccurate detectors, and can be represented as the lower envelope of three linear functions. By introducing only two auxiliary binary variables, it is proved the higher order α-expansion move function can be transformed to submodular pairwise energy, which in turn can be efficiently minimized via graph cuts. The comparative experiments on PASCAL VOC 2010 dataset with the state-of-the-art algorithms showed that our proposed robust higher order potential could effectively model the label consistency of object detection and semantic segmentation for both accepted and rejected detectors, while keeping robust to the false detectors resulting from inaccurate localization. © 2016, Beijing China Science Journal Publishing Co. Ltd. All right reserved.
引用
收藏
相关论文
共 50 条
  • [41] BG-Net: boundary-guidance network for object consistency maintaining in semantic segmentation
    Xiji Cheng
    Shiliang Huang
    Bingyan Liao
    Yayun Wang
    Xiao Luo
    The Visual Computer, 2024, 40 : 373 - 391
  • [42] Automatic moving object and background segmentation by means of higher order statistics
    Neri, A
    Colonnese, S
    Russo, G
    VISUAL COMMUNICATIONS AND IMAGE PROCESSING '97, PTS 1-2, 1997, 3024 : 246 - 256
  • [43] One For All: A Mutual Enhancement Method for Object Detection and Semantic Segmentation
    Zhang, Shichao
    Zhang, Zhe
    Sun, Libo
    Qin, Wenhu
    APPLIED SCIENCES-BASEL, 2020, 10 (01):
  • [44] Using convolutional neural networks for image semantic segmentation and object detection
    Li, Shuangmei
    Huang, Chengning
    SYSTEMS AND SOFT COMPUTING, 2024, 6
  • [45] MaskLab: Instance Segmentation by Refining Object Detection with Semantic and Direction Features
    Chen, Liang-Chieh
    Hermans, Alexander
    Papandreou, George
    Schroff, Florian
    Wang, Peng
    Adam, Hartwig
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4013 - 4022
  • [46] Real-time Object Detection and Semantic Segmentation for Autonomous Driving
    Li, Baojun
    Liu, Shun
    Xu, Weichao
    Qiu, Wei
    MIPPR 2017: AUTOMATIC TARGET RECOGNITION AND NAVIGATION, 2018, 10608
  • [47] Segmentation-based multi-class semantic object detection
    Vieux, Remi
    Benois-Pineau, Jenny
    Domenger, Jean-Philippe
    Braquelaire, Achille
    MULTIMEDIA TOOLS AND APPLICATIONS, 2012, 60 (02) : 305 - 326
  • [48] Research on Multitask Deep Learning Network for Semantic Segmentation and Object Detection
    Rui, Ting
    Xiao, Feng
    Tang, Jian
    Zhang, Fukai
    Yang, Chengsong
    Liu, Min
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT III, 2018, 11166 : 708 - 718
  • [49] AdvNet: Multi-Task Fusion of Object Detection and Semantic Segmentation
    Liu, Xiaohan
    Wang, Heng
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 3359 - 3362
  • [50] Fusing Semantic Segmentation and Object Detection for Visual SLAM in Dynamic Scenes
    Yu, Peilin
    Guo, Chi
    Liu, Yang
    Zhang, Huyin
    PROCEEDINGS OF 27TH ACM SYMPOSIUM ON VIRTUAL REALITY SOFTWARE AND TECHNOLOGY, VRST 2021, 2021,