Deeply Exploiting Long-Term View Dependency for 3D Shape Recognition

被引：4

作者：

Xu, Yong ^{[1
,2
,3
]}

Zheng, Chaoda ^{[1
]}

Xu, Ruotao ^{[1
]}

Quan, Yuhui ^{[1
,4
]}

机构：

[1] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510000, Guangdong, Peoples R China

[2] Peng Cheng Lab, Shenzhen 518000, Peoples R China

[3] Commun & Comp Network Lab Guangdong, Guangzhou 510000, Guangdong, Peoples R China

[4] Guangdong Prov Key Lab Computat Intelligence & Cy, Guangzhou 510000, Guangdong, Peoples R China

来源：

IEEE ACCESS | 2019年 / 7卷

基金：

中国国家自然科学基金;

关键词：

3D shape recognition; long-term dependency; multi-view deep learning; view aggregation; CONVOLUTIONAL NEURAL-NETWORKS; OBJECT RECOGNITION; POSE ESTIMATION;

D O I：

10.1109/ACCESS.2019.2934650

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Recognition of 3D shapes is a fundamental task in computer vision. In recent years, view-based deep learning has emerged as an effective approach for 3D shape recognition. Most existing view-based methods treat the views of an object as an unordered set, which ignores the dynamic relations among the views, e.g. sequential semantic dependencies. In this paper, modeling the views of an object by a sequence, we aim at exploiting the long-term dependencies among different views for shape recognition, which is done by constructing a sequence-aware view aggregation module based on the bi-directional Long Short-Term Memory network. It is shown that our view aggregation module not only captures the bi-directional dependencies in view sequences, but also enjoys the robustness to circular shifts of input sequences. Incorporating the aggregation module into a standard convolutional network architecture, we develop an effective method for 3D shape classification and retrieval. Our method was evaluated on the ModelNet40/10 and ShapeNetCore55 datasets. The results show the encouraging performance gain from exploiting long-term dependencies in view sequences, as well as the superior performance of our method compared to the existing ones.

引用

页码：111678 / 111691

页数：14

共 50 条

[1] Exploiting Long-Term Dependency for Topic Sentiment Analysis
Huang, Faliang
Yuan, Changan
Bi, Yingzhou
Lu, Jianbo
IEEE ACCESS, 2020, 8 : 221963 - 221974
[2] 3D action recognition and long-term prediction of human motion
Hahn, Markus
Krueger, Lars
Woehler, Christian
COMPUTER VISION SYSTEMS, PROCEEDINGS, 2008, 5008 : 23 - 32
[3] SVNET: A SINGLE VIEW NETWORK FOR 3D SHAPE RECOGNITION
Li, Shaoshuai
Liu, Fuyan
2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1648 - 1653
[4] Long-Term Dependency for 3D Reconstruction of Freehand Ultrasound Without External Tracker
Li, Qi
Shen, Ziyi
Li, Qian
Barratt, Dean C.
Dowrick, Thomas
Clarkson, Matthew J.
Vercauteren, Tom
Hu, Yipeng
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2024, 71 (03) : 1033 - 1042
[5] Dynamic View Aggregation for Multi-View 3D Shape Recognition
Zhou, Yuan
Sun, Zhongqi
Huo, Shuwei
Kung, Sun-Yuan
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 9163 - 9174
[6] An Improved 3D Shape Recognition Method Based on Panoramic View
Zheng, Qiang
Sun, Jian
Zhang, Le
Chen, Wei
Fan, Huanhuan
MATHEMATICAL PROBLEMS IN ENGINEERING, 2018, 2018
[7] Context Dependency in 3D Shape Perception
Todd, James
PERCEPTION, 2019, 48 : 194 - 194
[8] Multi-view Convolutional Neural Networks for 3D Shape Recognition
Su, Hang
Maji, Subhransu
Kalogerakis, Evangelos
Learned-Miller, Erik
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 945 - 953
[9] MVTN: Multi-View Transformation Network for 3D Shape Recognition
Hamdi, Abdullah
Giancola, Silvio
Ghanem, Bernard
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 1 - 11
[10] MVPN: Multi-View Prototype Network for 3D Shape Recognition
Wu, Zizhao
Yang, Ping
Wang, Yigang
IEEE ACCESS, 2019, 7 : 130363 - 130372

← 1 2 3 4 5 →