Toward coherent object detection and scene layout understanding

Cited by: 16
Authors
Bao, Sid Yingze [1]
Sun, Min [1]
Savarese, Silvio [1]
Affiliation
[1] Univ Michigan, Ann Arbor, MI 48105 USA
Keywords
Object detection; Scene layout; Focal length estimation; Supporting surface estimation
DOI
10.1016/j.imavis.2011.08.001
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Detecting objects in complex scenes while recovering the scene layout is a critical capability in many vision-based applications. In this work, we advocate the importance of geometric contextual reasoning for object recognition. We start from the intuition that the locations and poses of objects in 3D space are not arbitrarily distributed but are constrained by the fact that objects must lie on one or more supporting surfaces. We model these supporting surfaces by means of hidden parameters (i.e. parameters that are not explicitly observed) and formulate joint scene reconstruction and object recognition as the problem of finding the set of parameters that maximizes the joint probability of having a number of detected objects on K supporting planes given the observations. As a key ingredient for solving this optimization problem, we demonstrate a novel relationship between object location and pose in the image and the scene layout parameters (i.e. the normals of one or more supporting planes in 3D, and the camera pose, location and focal length). Using a novel probabilistic formulation and the above relationship, our method has the unique ability to jointly: i) reduce the false alarm and false negative rates of object detection; ii) recover object locations and supporting planes within the 3D camera reference system; iii) infer camera parameters (viewpoint and focal length) from a single uncalibrated image. Quantitative and qualitative experimental evaluation on two datasets (the desk-top dataset [1] and LabelMe [2]) supports our theoretical claims. © 2011 Elsevier B.V. All rights reserved.
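The abstract does not state the relationship it derives, so the following is only an illustrative sketch of why an object's position in the image constrains the scene layout. It uses the textbook pinhole geometry for a simplified case that is an assumption here, not the paper's model: a single horizontal supporting plane, a camera with zero pitch and roll, and image rows that increase downward. The names `GroundPlaneCamera`, `depth_from_foot_row` and `object_height` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class GroundPlaneCamera:
    """Simplified camera above one horizontal supporting plane (assumed setup)."""
    focal_px: float      # focal length in pixels
    height_m: float      # camera height above the supporting plane, in metres
    horizon_row: float   # image row of the horizon (rows increase downward)


def depth_from_foot_row(cam: GroundPlaneCamera, foot_row: float) -> float:
    """Depth of the point where an object touches the plane.

    A plane point at depth Z projects to row v0 + f * h / Z,
    so Z = f * h / (foot_row - v0).
    """
    offset = foot_row - cam.horizon_row
    if offset <= 0:
        raise ValueError("contact point must project below the horizon")
    return cam.focal_px * cam.height_m / offset


def object_height(cam: GroundPlaneCamera, foot_row: float, top_row: float) -> float:
    """Metric height of an upright object standing on the plane.

    The top of an object of height H projects to row v0 + f * (h - H) / Z,
    which gives H = h * (foot_row - top_row) / (foot_row - v0).
    """
    offset = foot_row - cam.horizon_row
    if offset <= 0:
        raise ValueError("contact point must project below the horizon")
    return cam.height_m * (foot_row - top_row) / offset


# Made-up example values, chosen only to exercise the formulas.
cam = GroundPlaneCamera(focal_px=800.0, height_m=1.5, horizon_row=240.0)
depth = depth_from_foot_row(cam, foot_row=360.0)
height = object_height(cam, foot_row=360.0, top_row=280.0)
print(f"depth = {depth:.2f} m, height = {height:.2f} m")
```

In this simplified setting, a detection whose implied height is implausible for its object class is inconsistent with the assumed plane and camera. That is the kind of geometric cue the abstract appeals to, although the paper treats the plane and camera parameters as hidden variables to be estimated rather than as known inputs.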
Pages: 569-579
Page count: 11