Modeling 2D Appearance Evolution for 3D Object Categorization

被引：0

作者：

Zaki, Hasan F. M. ^{[1
,3
]}

Shafait, Faisal ^{[2
]}

Mian, Ajmal ^{[1
]}

机构：

[1] Univ Western Australia, Comp Sci & Software Engn, Crawley, WA, Australia

[2] Natl Univ Sci & Technol, Islamabad, Pakistan

[3] Int Islamic Univ Malaysia, Mechatron Engn, Kuala Lumpur, Selangor, Malaysia

来源：

2016 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA) | 2016年

关键词：

D O I：

暂无

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

3D object categorization is a non-trivial task in computer vision encompassing many real-world applications. We pose the problem of categorizing 3D polygon meshes as learning appearance evolution from multi-view 2D images. Given a corpus of 3D polygon meshes, we first render the corresponding RGB and depth images from multiple viewpoints on a uniform sphere. Using rank pooling, we propose two methods to learn the appearance evolution of the 2D views. Firstly, we train view-invariant models based on a deep convolutional neural network (CNN) using the rendered RGB-D images and learn to rank the first fully connected layer activations and, therefore, capture the evolution of these extracted features. The parameters learned during this process are used as the 3D shape representations. In the second method, we learn the aggregation of the views from the outset by employing the ranking machine to the rendered RGB-D images directly, which produces aggregated 2D images which we term as "3D shape images". We then learn CNN models on this novel shape representation for both RGB and depth which encode salient geometrical structure of the polygon. Experiments on the ModelNet40 and ModelNet10 datasets show that the proposed method consistently outperforms existing state-of-the-art algorithms in 3D shape recognition.

引用

页码：185 / 192

页数：8

共 50 条

[41] A unified approach to moving object detection in 2D and 3D scenes
Irani, M
Anandan, P
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1998, 20 (06) : 577 - 589
[42] From 2D Silhouettes to 3D Object Retrieval: Contributions and Benchmarking
Napoleon, Thibault
Sahbi, Hichem
EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2010,
[43] SYMMETRICAL 3D OBJECTS ARE AN EASY CASE FOR 2D OBJECT RECOGNITION
VETTER, T
POGGIO, T
SPATIAL VISION, 1994, 8 (04): : 443 - 453
[44] 2D/3D Sensor Exploitation and Fusion for Enhanced Object Detection
Xu, Jiejun
Kim, Kyungnam
Zhang, Zhiqi
Chen, Hai-wen
Owechko, Yuri
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2014, : 778 - 784
[45] Watermark recovery from 2D views of a 3D video object
Garcia, E
Dugelay, JL
SECURITY AND WATERMARKING OF MULTIMEDIA CONTENTS V, 2003, 5020 : 471 - 480
[46] 2D TO 3D LABEL PROPAGATION FOR OBJECT DETECTION IN POINT CLOUD
Lertniphonphan, Kanokphan
Komorita, Satoshi
Tasaka, Kazuyuki
Yanagihara, Hiromasa
2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW 2018), 2018,
[47] 2D and 3D object detection algorithms from images: A Survey
Chen, Wei
Li, Yan
Tian, Zijian
Zhang, Fan
ARRAY, 2023, 19
[48] Effects of HMD on 2D Object Interaction in 3D Virtual Environments
Tunali, Mustafa Fatih
Gokturk, Mehmet
2015 23RD SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2015, : 1825 - 1828
[49] From 2D Silhouettes to 3D Object Retrieval: Contributions and Benchmarking
Thibault Napoléon
Hichem Sahbi
EURASIP Journal on Image and Video Processing, 2010
[50] Fast 2D/3D object representation with growing neural gas
Angelopoulou, Anastassia
Rodriguez, Jose Garcia
Orts-Escolano, Sergio
Gupta, Gaurav
Psarrou, Alexandra
NEURAL COMPUTING & APPLICATIONS, 2018, 29 (10): : 903 - 919

← 1 2 3 4 5 →