Indoor Objects and Outdoor Urban Scenes Recognition by 3D Visual Primitives

Cited by: 1
Authors
Fu, Junsheng [1, 3]
Kamarainen, Joni-Kristian [1]
Buch, Anders Glent [2]
Kruger, Norbert [2]
Affiliations
[1] Tampere Univ Technol, Vis Grp, FIN-33101 Tampere, Finland
[2] Univ Southern Denmark, CARO Grp, Odense, Denmark
[3] Nokia Res Ctr, Tampere, Finland
Source
COMPUTER VISION - ACCV 2014 WORKSHOPS, PT I | 2015 / Vol. 9008
Keywords
FEATURES;
DOI
10.1007/978-3-319-16628-5_20
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Object detection, recognition and pose estimation in 3D images have gained momentum due to the availability of 3D sensors (RGB-D) and the increase of large-scale 3D data, such as city maps. The most popular approach is to extract and match 3D shape descriptors that encode local scene structure but omit visual appearance. Visual appearance can be problematic due to imaging distortions, but the assumption that local shape structures are sufficient to recognise objects and scenes is largely invalid in practice, since objects may have similar shape but different texture (e.g., grocery packages). In this work, we propose an alternative appearance-driven approach which first extracts 2D primitives justified by Marr's primal sketch; these are "accumulated" over multiple views, and the most stable ones are "promoted" to 3D visual primitives. The promoted 3D primitives represent both structure and appearance. For recognition, we propose a fast and effective correspondence matching using random sampling. For quantitative evaluation, we construct a semisynthetic benchmark dataset using a public 3D model dataset of 119 kitchen objects and another benchmark of challenging street-view images from 4 different cities. In the experiments, our method utilises only a stereo view for training. As a result, with the kitchen objects dataset our method achieved an almost perfect recognition rate for +/- 10 degrees of camera viewpoint change and nearly 80% for +/- 20 degrees, and for the street-view benchmarks it achieved 75% accuracy for 160 street-view image pairs, 80% for 96 street-view image pairs, and 92% for 48 street-view image pairs.
Pages: 270-285
Page count: 16