PVFAN: Point-view fusion attention network for 3D shape recognition

被引:0
|
作者
Cao, Jiangzhong [1 ]
Liao, Siyi [1 ]
机构
[1] Guangdong Univ Technol, Sch Informat Engn, Guangzhou, Peoples R China
关键词
3D Shape recognition; multimodal feature fusion; feature refinement; attention mechanism; CLASSIFICATION; RETRIEVAL; DEPTH;
D O I
10.3233/JIFS-232800
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D shape recognition is a critical research topic in the field of computer vision, attracting substantial attention. Existing approaches mainly focus on extracting distinctive 3D shape features; however, they often neglect the model's robustness and lack refinement in deep features. To address these limitations, we propose the point-view fusion attention network that aims to extract a concise, informative, and robust3Dshape descriptor. Initially, our approach combines multi-view features with point cloud features to obtain accurate and distinguishable fusion features. To effectively handle these fusion features, we design a dual-attention convolutional network which consists of a channel attention module and a spatial attention module. This dual-attention mechanism greatly enhances the generalization ability and robustness of 3D recognition models. Notably, we introduce a strip-pooling layer in the channel attention module to refine the features, resulting in improved fusion features that are more compact. Finally, a classification process is performed on the refined features to assign appropriate 3D shape labels. Our extensive experiments on the ModelNet10 and ModelNet40 datasets for 3D shape recognition and retrieval demonstrate the remarkable accuracy and robustness of the proposed method.
引用
收藏
页码:8119 / 8133
页数:15
相关论文
共 50 条
  • [1] PVFNet: Point-View Fusion Network for 3D Shape Recognition
    Yang, Jun
    Dang, Jisheng
    [J]. KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT (KSEM 2020), PT I, 2020, 12274 : 291 - 303
  • [2] MANet: Multimodal Attention Network based Point-View fusion for 3D Shape Recognition
    Zhao, Yaxin
    Jiao, Jichao
    Li, Ning
    Deng, Zhongliang
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 134 - 141
  • [3] PVRNet: Point-View Relation Neural Network for 3D Shape Recognition
    You, Haoxuan
    Feng, Yifan
    Zhao, Xibin
    Zou, Changqing
    Ji, Rongrong
    Gao, Yue
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 9119 - 9126
  • [4] PVCLN: Point-View Complementary Learning Network for 3D Shape Recognition
    Sun, Shanlin
    Li, Yun
    Ren, Minjie
    Li, Guo
    Yao, Xing
    [J]. IEEE ACCESS, 2021, 9 (09): : 3451 - 3460
  • [5] LATFormer: Locality-Aware Point-View Fusion Transformer for 3D shape recognition
    He, Xinwei
    Cheng, Silin
    Liang, Dingkang
    Bai, Song
    Wang, Xi
    Zhu, Yingying
    [J]. PATTERN RECOGNITION, 2024, 151
  • [6] PVRAR: POINT-VIEW RELATION NEURAL NETWORK EMBEDDED WITH BOTH ATTENTION MECHANISM AND RADON TRANSFORM FOR 3D SHAPE RECOGNITION
    Zhou, Jie
    Ma, Ziping
    Ma, Jinlin
    [J]. Computing and Informatics, 2021, 40 (06): : 1217 - 1243
  • [7] PVRAR: POINT-VIEW RELATION NEURAL NETWORK EMBEDDED WITH BOTH ATTENTION MECHANISM AND RADON TRANSFORM FOR 3D SHAPE RECOGNITION
    Zhou, Jie
    Ma, Ziping
    Ma, Jinlin
    [J]. COMPUTING AND INFORMATICS, 2021, 40 (06) : 1217 - 1243
  • [8] Attention-Guided Fusion Network of Point Cloud and Multiple Views for 3D Shape Recognition
    Peng, Bo
    Yu, Zengrui
    Lei, Jianjun
    Song, Jiahui
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2020, : 185 - 188
  • [9] SVHAN: Sequential View Based Hierarchical Attention Network for 3D Shape Recognition
    Zhao, Yue
    Nie, Weizhi
    Liu, An-An
    Gao, Zan
    Su, Yuting
    [J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 2130 - 2138
  • [10] SVNET: A SINGLE VIEW NETWORK FOR 3D SHAPE RECOGNITION
    Li, Shaoshuai
    Liu, Fuyan
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1648 - 1653