Modeling 2D Appearance Evolution for 3D Object Categorization

被引:0
|
作者
Zaki, Hasan F. M. [1 ,3 ]
Shafait, Faisal [2 ]
Mian, Ajmal [1 ]
机构
[1] Univ Western Australia, Comp Sci & Software Engn, Crawley, WA, Australia
[2] Natl Univ Sci & Technol, Islamabad, Pakistan
[3] Int Islamic Univ Malaysia, Mechatron Engn, Kuala Lumpur, Selangor, Malaysia
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
3D object categorization is a non-trivial task in computer vision encompassing many real-world applications. We pose the problem of categorizing 3D polygon meshes as learning appearance evolution from multi-view 2D images. Given a corpus of 3D polygon meshes, we first render the corresponding RGB and depth images from multiple viewpoints on a uniform sphere. Using rank pooling, we propose two methods to learn the appearance evolution of the 2D views. Firstly, we train view-invariant models based on a deep convolutional neural network (CNN) using the rendered RGB-D images and learn to rank the first fully connected layer activations and, therefore, capture the evolution of these extracted features. The parameters learned during this process are used as the 3D shape representations. In the second method, we learn the aggregation of the views from the outset by employing the ranking machine to the rendered RGB-D images directly, which produces aggregated 2D images which we term as "3D shape images". We then learn CNN models on this novel shape representation for both RGB and depth which encode salient geometrical structure of the polygon. Experiments on the ModelNet40 and ModelNet10 datasets show that the proposed method consistently outperforms existing state-of-the-art algorithms in 3D shape recognition.
引用
收藏
页码:185 / 192
页数:8
相关论文
共 50 条
  • [1] Application of uncertainty modeling in 2D and 3D object detection
    Wang M.
    Zhu B.
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2023, 45 (08): : 2370 - 2376
  • [2] 3D Object Localization With 2D Object Detector and 2D Localization
    Staszak, Rafal
    Belter, Dominik
    2022 17TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV), 2022, : 715 - 720
  • [3] Continuous medial representations for geometric object modeling in 2D and 3D
    Yushkevich, P
    Fletcher, PT
    Joshi, S
    Thall, A
    Pizer, SM
    IMAGE AND VISION COMPUTING, 2003, 21 (01) : 17 - 27
  • [4] A hybrid 2D/3D user interface for immersive object modeling
    Coninx, K
    VanReeth, F
    Flerackers, E
    COMPUTER GRAPHICS INTERNATIONAL, PROCEEDINGS, 1997, : 47 - 57
  • [5] A Framework for Fusion of 3D Appearance and 2D Shape Cues for Generic Object Recognition
    Kalra, Manisha
    Sengupta, Sunando
    Das, Sukhendu
    JOURNAL OF PATTERN RECOGNITION RESEARCH, 2008, 3 (01): : 54 - 69
  • [6] A framework for fusion of 3D appearance and 2D shape cues for generic object recognition
    Kalra, Manisha
    Das, Sukhendu
    PROCEEDINGS OF THE SIXTH INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION, 2007, : 332 - +
  • [7] 2D/3D SEMANTIC CATEGORIZATION OF VISUAL OBJECTS
    Petre, Raluca Diana
    Zaharia, Titus
    2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 2387 - 2391
  • [8] 3D Object Recognition Using Fuzzy Mathematical Modeling of 2D Images
    Sheta, Alaa F.
    Baareh, Abdelkarim
    Ai-Batah, Mohammad
    2012 INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS (ICMCS), 2012, : 278 - 283
  • [9] Efficient categorization of 3D edges from 2D projections
    Mukerjee, A
    Sasmal, N
    Sastry, DS
    GRAPHICS RECOGNITION, RECENT ADVANCES, 2001, 1941 : 288 - 297
  • [10] 3D object understanding from 2D images
    Wang, PSP
    INTERNATIONAL SYMPOSIUM ON MULTISPECTRAL IMAGE PROCESSING, 1998, 3545 : 33 - 43