Toward coherent object detection and scene layout understanding

被引:16
|
作者
Bao, Sid Yingze [1 ]
Sun, Min [1 ]
Savarese, Silvio [1 ]
机构
[1] Univ Michigan, Ann Arbor, MI 48105 USA
关键词
Object detection; Scene layout; Focal length estimation; Supporting surface estimation;
D O I
10.1016/j.imavis.2011.08.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Detecting objects in complex scenes while recovering the scene layout is a critical functionality in many vision-based applications. In this work, we advocate the importance of geometric contextual reasoning for object recognition. We start from the intuition that objects' location and pose in the 3D space are not arbitrarily distributed but rather constrained by the fact that objects must lie on one or multiple supporting surfaces. We model such supporting surfaces by means of hidden parameters (i.e. not explicitly observed) and formulate the problem of joint scene reconstruction and object recognition as the one of finding the set of parameters that maximizes the joint probability of having a number of detected objects on K supporting planes given the observations. As a key ingredient for solving this optimization problem, we have demonstrated a novel relationship between object location and pose in the image, and the scene layout parameters (i.e. normal of one or more supporting planes in 3D and camera pose, location and focal length). Using a novel probabilistic formulation and the above relationship our method has the unique ability to jointly: i) reduce false alarm and false negative object detection rate; ii) recover object location and supporting planes within the 3D camera reference system; iii) infer camera parameters (view point and the focal length) from just one single uncalibrated image. Quantitative and qualitative experimental evaluation on two datasets (desk-top dataset [1] and LabelMe [2]) demonstrates our theoretical claims. (C) 2011 Elsevier B.V. All rights reserved.
引用
收藏
页码:569 / 579
页数:11
相关论文
共 50 条
  • [21] MSA R-CNN: A comprehensive approach to remote sensing object detection and scene understanding
    Sagar, A. S. M. Sharifuzzaman
    Chen, Yu
    Xie, YaKun
    Kim, Hyung Seok
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 241
  • [22] Cross-Domain Object Detection Algorithm for Complex End-to-End Scene Understanding
    Chen, Aoran
    Huang, Hai
    Zhu, Yueyan
    Xue, Junsheng
    [J]. Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2024, 47 (04): : 57 - 62
  • [23] Automatic scene understanding and object identification in point clouds
    Bae, Egil
    [J]. ELECTRO-OPTICAL REMOTE SENSING XIII, 2019, 11160
  • [24] An Enhanced Light Object Detection for Indiscernible Object in the Special Scene
    Zhang, Quanyou
    Feng, Yong
    Wang, Yong-heng
    Qiang, Bao-hua
    Wang, Lufeng
    Zhang, Zebin
    [J]. JOURNAL OF ENGINEERING RESEARCH, 2022, 10
  • [25] A novel object detection technique for dynamic scene and static object
    Li, Xiu
    Chen, Liansheng
    Yang, Zhixiong
    Wang, Huimin
    [J]. 2016 7TH INTERNATIONAL CONFERENCE ON MECHANICAL, INDUSTRIAL, AND MANUFACTURING TECHNOLOGIES (MIMT 2016), 2016, 54
  • [26] Places in the Brain: Bridging Layout and Object Geometry in Scene-Selective Cortex
    Dillon, Moira R.
    Persichetti, Andrew S.
    Spelke, Elizabeth S.
    Dilks, Daniel D.
    [J]. CEREBRAL CORTEX, 2018, 28 (07) : 2365 - 2374
  • [27] Indoor scene perception for object detection and manipulation
    Manso, L. J.
    Bustos, P.
    Franco, J.
    Bachiller, P.
    [J]. COGNITIVE PROCESSING, 2012, 13 : S4 - S5
  • [28] Object Detection Algorithm Based on Moving Scene
    Zhou, Rong
    Zhang, Qi
    [J]. PROCEEDINGS OF THE 2ND INTERNATIONAL FORUM ON MANAGEMENT, EDUCATION AND INFORMATION TECHNOLOGY APPLICATION (IFMEITA 2017), 2017, 130 : 443 - 448
  • [29] A novel object detection technique for dynamic scene
    Li, Xiu
    Chen, Liansheng
    Yang, Zhixiong
    Zhang, Wanlu
    [J]. PROCEEDINGS OF THE 1ST INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGIES IN EDUCATION AND LEARNING, 2016, 32
  • [30] Moving object detection based on scene knowledge
    [J]. Yan, L. (yanlileeyan@sina.com), 1600, Editorial Board of Jilin University (43):