Robust object detection with interleaved categorization and segmentation

Cited by: 578
Authors
Leibe, Bastian [1]
Leonardis, Ales [2]
Schiele, Bernt [3]
Affiliations
[1] ETH, Comp Vis Lab, Zurich, Switzerland
[2] Univ Ljubljana, Fac Comp & Informat Sci, Ljubljana, Slovenia
[3] Tech Univ Darmstadt, Dept Comp Sci, Darmstadt, Germany
Keywords
object categorization; object detection; segmentation; clustering; Hough transform; hypothesis selection; MDL
DOI
10.1007/s11263-007-0095-3
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper presents a novel method for detecting and localizing objects of a visual category in cluttered real-world scenes. Our approach considers object categorization and figure-ground segmentation as two interleaved processes that closely collaborate towards a common goal. As shown in our work, the tight coupling between those two processes allows them to benefit from each other and improve the combined performance. The core part of our approach is a highly flexible learned representation for object shape that can combine the information observed on different training examples in a probabilistic extension of the Generalized Hough Transform. The resulting approach can detect categorical objects in novel images and automatically infer a probabilistic segmentation from the recognition result. This segmentation is then in turn used to again improve recognition by allowing the system to focus its efforts on object pixels and to discard misleading influences from the background. Moreover, the information from where in the image a hypothesis draws its support is employed in an MDL-based hypothesis verification stage to resolve ambiguities between overlapping hypotheses and factor out the effects of partial occlusion. An extensive evaluation on several large data sets shows that the proposed system is applicable to a range of different object categories, including both rigid and articulated objects. In addition, its flexible representation allows it to achieve competitive object detection performance already from training sets that are between one and two orders of magnitude smaller than those used in comparable systems.
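The voting idea behind a Generalized Hough Transform, as mentioned in the abstract, can be illustrated with a toy sketch. This is not the authors' implementation: the codebook contents, the function names `hough_vote` and `best_hypothesis`, the uniform vote weights, and the fixed-size binning of the voting space are all assumptions made here for illustration only. Each matched feature votes for possible object centers using offsets stored with its codebook entry, and the winning cell keeps a record of which features supported it.

```python
from collections import defaultdict

# Hypothetical codebook (an assumption for this sketch): each entry stores
# offsets (dx, dy) from a feature location to the object center, as they
# might have been observed on training examples.
CODEBOOK = {
    "wheel": [(20, -10), (-20, -10)],
    "roof": [(0, 15)],
}


def hough_vote(features, codebook, bin_size=5):
    """Accumulate weighted votes for object-center positions.

    features: list of (x, y, entry_id) tuples for matched image features.
    Returns (votes, support): per-cell vote mass and the indices of the
    features that voted for each cell.
    """
    votes = defaultdict(float)
    support = defaultdict(list)
    for idx, (x, y, entry) in enumerate(features):
        offsets = codebook.get(entry, [])
        for dx, dy in offsets:
            # Spread one unit of vote mass uniformly over the stored offsets.
            weight = 1.0 / len(offsets)
            cell = ((x + dx) // bin_size, (y + dy) // bin_size)
            votes[cell] += weight
            support[cell].append(idx)
    return votes, support


def best_hypothesis(votes, support, bin_size=5):
    """Return (center, score, supporting feature indices) of the top cell."""
    cell = max(votes, key=votes.get)
    center = (cell[0] * bin_size + bin_size / 2,
              cell[1] * bin_size + bin_size / 2)
    return center, votes[cell], support[cell]


# Toy input: three features consistent with one object, plus one outlier.
features = [
    (30, 60, "wheel"),
    (70, 60, "wheel"),
    (50, 35, "roof"),
    (200, 10, "roof"),
]

votes, support = hough_vote(features, CODEBOOK)
center, score, supporting = best_hypothesis(votes, support)
print(center, score, supporting)
```

The list of supporting feature indices is the part that connects to the abstract's point about knowing where in the image a hypothesis draws its support; in this sketch it is only a list of indices, not a probabilistic segmentation.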
Pages: 259-289
Number of pages: 31