Modeling 2D Appearance Evolution for 3D Object Categorization

被引:0
|
作者
Zaki, Hasan F. M. [1 ,3 ]
Shafait, Faisal [2 ]
Mian, Ajmal [1 ]
机构
[1] Univ Western Australia, Comp Sci & Software Engn, Crawley, WA, Australia
[2] Natl Univ Sci & Technol, Islamabad, Pakistan
[3] Int Islamic Univ Malaysia, Mechatron Engn, Kuala Lumpur, Selangor, Malaysia
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
3D object categorization is a non-trivial task in computer vision encompassing many real-world applications. We pose the problem of categorizing 3D polygon meshes as learning appearance evolution from multi-view 2D images. Given a corpus of 3D polygon meshes, we first render the corresponding RGB and depth images from multiple viewpoints on a uniform sphere. Using rank pooling, we propose two methods to learn the appearance evolution of the 2D views. Firstly, we train view-invariant models based on a deep convolutional neural network (CNN) using the rendered RGB-D images and learn to rank the first fully connected layer activations and, therefore, capture the evolution of these extracted features. The parameters learned during this process are used as the 3D shape representations. In the second method, we learn the aggregation of the views from the outset by employing the ranking machine to the rendered RGB-D images directly, which produces aggregated 2D images which we term as "3D shape images". We then learn CNN models on this novel shape representation for both RGB and depth which encode salient geometrical structure of the polygon. Experiments on the ModelNet40 and ModelNet10 datasets show that the proposed method consistently outperforms existing state-of-the-art algorithms in 3D shape recognition.
引用
收藏
页码:185 / 192
页数:8
相关论文
共 50 条
  • [41] A unified approach to moving object detection in 2D and 3D scenes
    Irani, M
    Anandan, P
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1998, 20 (06) : 577 - 589
  • [42] From 2D Silhouettes to 3D Object Retrieval: Contributions and Benchmarking
    Napoleon, Thibault
    Sahbi, Hichem
    EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2010,
  • [43] SYMMETRICAL 3D OBJECTS ARE AN EASY CASE FOR 2D OBJECT RECOGNITION
    VETTER, T
    POGGIO, T
    SPATIAL VISION, 1994, 8 (04): : 443 - 453
  • [44] 2D/3D Sensor Exploitation and Fusion for Enhanced Object Detection
    Xu, Jiejun
    Kim, Kyungnam
    Zhang, Zhiqi
    Chen, Hai-wen
    Owechko, Yuri
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2014, : 778 - 784
  • [45] Watermark recovery from 2D views of a 3D video object
    Garcia, E
    Dugelay, JL
    SECURITY AND WATERMARKING OF MULTIMEDIA CONTENTS V, 2003, 5020 : 471 - 480
  • [46] 2D TO 3D LABEL PROPAGATION FOR OBJECT DETECTION IN POINT CLOUD
    Lertniphonphan, Kanokphan
    Komorita, Satoshi
    Tasaka, Kazuyuki
    Yanagihara, Hiromasa
    2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW 2018), 2018,
  • [47] 2D and 3D object detection algorithms from images: A Survey
    Chen, Wei
    Li, Yan
    Tian, Zijian
    Zhang, Fan
    ARRAY, 2023, 19
  • [48] Effects of HMD on 2D Object Interaction in 3D Virtual Environments
    Tunali, Mustafa Fatih
    Gokturk, Mehmet
    2015 23RD SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2015, : 1825 - 1828
  • [49] From 2D Silhouettes to 3D Object Retrieval: Contributions and Benchmarking
    Thibault Napoléon
    Hichem Sahbi
    EURASIP Journal on Image and Video Processing, 2010
  • [50] Fast 2D/3D object representation with growing neural gas
    Angelopoulou, Anastassia
    Rodriguez, Jose Garcia
    Orts-Escolano, Sergio
    Gupta, Gaurav
    Psarrou, Alexandra
    NEURAL COMPUTING & APPLICATIONS, 2018, 29 (10): : 903 - 919