Deeply Exploiting Long-Term View Dependency for 3D Shape Recognition

被引:4
|
作者
Xu, Yong [1 ,2 ,3 ]
Zheng, Chaoda [1 ]
Xu, Ruotao [1 ]
Quan, Yuhui [1 ,4 ]
机构
[1] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510000, Guangdong, Peoples R China
[2] Peng Cheng Lab, Shenzhen 518000, Peoples R China
[3] Commun & Comp Network Lab Guangdong, Guangzhou 510000, Guangdong, Peoples R China
[4] Guangdong Prov Key Lab Computat Intelligence & Cy, Guangzhou 510000, Guangdong, Peoples R China
来源
IEEE ACCESS | 2019年 / 7卷
基金
中国国家自然科学基金;
关键词
3D shape recognition; long-term dependency; multi-view deep learning; view aggregation; CONVOLUTIONAL NEURAL-NETWORKS; OBJECT RECOGNITION; POSE ESTIMATION;
D O I
10.1109/ACCESS.2019.2934650
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recognition of 3D shapes is a fundamental task in computer vision. In recent years, view-based deep learning has emerged as an effective approach for 3D shape recognition. Most existing view-based methods treat the views of an object as an unordered set, which ignores the dynamic relations among the views, e.g. sequential semantic dependencies. In this paper, modeling the views of an object by a sequence, we aim at exploiting the long-term dependencies among different views for shape recognition, which is done by constructing a sequence-aware view aggregation module based on the bi-directional Long Short-Term Memory network. It is shown that our view aggregation module not only captures the bi-directional dependencies in view sequences, but also enjoys the robustness to circular shifts of input sequences. Incorporating the aggregation module into a standard convolutional network architecture, we develop an effective method for 3D shape classification and retrieval. Our method was evaluated on the ModelNet40/10 and ShapeNetCore55 datasets. The results show the encouraging performance gain from exploiting long-term dependencies in view sequences, as well as the superior performance of our method compared to the existing ones.
引用
收藏
页码:111678 / 111691
页数:14
相关论文
共 50 条
  • [1] Exploiting Long-Term Dependency for Topic Sentiment Analysis
    Huang, Faliang
    Yuan, Changan
    Bi, Yingzhou
    Lu, Jianbo
    IEEE ACCESS, 2020, 8 : 221963 - 221974
  • [2] 3D action recognition and long-term prediction of human motion
    Hahn, Markus
    Krueger, Lars
    Woehler, Christian
    COMPUTER VISION SYSTEMS, PROCEEDINGS, 2008, 5008 : 23 - 32
  • [3] SVNET: A SINGLE VIEW NETWORK FOR 3D SHAPE RECOGNITION
    Li, Shaoshuai
    Liu, Fuyan
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1648 - 1653
  • [4] Long-Term Dependency for 3D Reconstruction of Freehand Ultrasound Without External Tracker
    Li, Qi
    Shen, Ziyi
    Li, Qian
    Barratt, Dean C.
    Dowrick, Thomas
    Clarkson, Matthew J.
    Vercauteren, Tom
    Hu, Yipeng
    IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2024, 71 (03) : 1033 - 1042
  • [5] Dynamic View Aggregation for Multi-View 3D Shape Recognition
    Zhou, Yuan
    Sun, Zhongqi
    Huo, Shuwei
    Kung, Sun-Yuan
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 9163 - 9174
  • [6] An Improved 3D Shape Recognition Method Based on Panoramic View
    Zheng, Qiang
    Sun, Jian
    Zhang, Le
    Chen, Wei
    Fan, Huanhuan
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2018, 2018
  • [7] Context Dependency in 3D Shape Perception
    Todd, James
    PERCEPTION, 2019, 48 : 194 - 194
  • [8] Multi-view Convolutional Neural Networks for 3D Shape Recognition
    Su, Hang
    Maji, Subhransu
    Kalogerakis, Evangelos
    Learned-Miller, Erik
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 945 - 953
  • [9] MVTN: Multi-View Transformation Network for 3D Shape Recognition
    Hamdi, Abdullah
    Giancola, Silvio
    Ghanem, Bernard
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 1 - 11
  • [10] MVPN: Multi-View Prototype Network for 3D Shape Recognition
    Wu, Zizhao
    Yang, Ping
    Wang, Yigang
    IEEE ACCESS, 2019, 7 : 130363 - 130372