3D shape classification based on global and local features extraction with collaborative learning

Cited by: 1
Authors
Ding, Bo [1]
Zhang, Libao [1]
He, Yongjun [2]
Qin, Jian [1]
Affiliations
[1] Harbin Univ Sci & Technol, Sch Comp Sci & Technol, 52 Xuefu Rd, Harbin 150080, Peoples R China
[2] Harbin Inst Technol, Sch Comp Sci & Technol, 92 XiDaZhi St, Harbin 150006, Peoples R China
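Source
VISUAL COMPUTER | 2024, Vol. 40, Issue 06
Funding
National Natural Science Foundation of China;
Keywords
3D shape classification; Transformer encoder; Collaborative learning; Local features; Global features; NETWORK;
DOI
10.1007/s00371-023-03098-0
Chinese Library Classification number
TP31 [Computer software];
Discipline classification code
081202; 0835;
Abstract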
Extracting both global and local features is important for view-based 3D shape classification. We therefore propose a 3D shape classification method based on global and local feature extraction with collaborative learning. The method consists of a patch-level transformer sub-network (PTS) and a view-level transformer sub-network (VTS). In the PTS, each view is divided into multiple patches, and a multi-layer transformer encoder highlights discriminative patches and captures correlations among the patches of a view, filtering out meaningless information and enhancing meaningful information. The PTS then aggregates the patch features into a 3D shape representation with rich local details. In the VTS, a multi-layer transformer encoder assigns different attention to each view and captures the contextual relationships among views, highlighting the discriminative views of a 3D shape and aggregating the view features into a 3D shape representation. A collaborative loss encourages the two branches to learn collaboratively and teach each other during training. Experiments on two 3D benchmark datasets show that the proposed method outperforms current methods.
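The abstract does not give the exact form of the collaborative loss. As a minimal sketch, assuming a mutual-learning style objective in which each branch is supervised by the ground-truth label and is also pulled toward the other branch's prediction by a symmetric KL term, the idea could look like the following. The function names, the `weight` parameter, and the choice of KL divergence are illustrative assumptions, not the authors' formulation.

```python
import math


def softmax(logits):
    """Convert raw class scores into probabilities (numerically stable)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(probs, label):
    """Negative log-likelihood of the true class."""
    return -math.log(probs[label])


def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions over the same classes."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def collaborative_loss(pts_logits, vts_logits, label, weight=1.0):
    """Hypothetical collaborative loss for two branches (assumed form).

    pts_logits: class scores from the patch-level branch (PTS)
    vts_logits: class scores from the view-level branch (VTS)
    weight:     assumed trade-off between supervision and mutual teaching
    """
    p = softmax(pts_logits)
    v = softmax(vts_logits)
    # Each branch is trained against the ground-truth label.
    supervised = cross_entropy(p, label) + cross_entropy(v, label)
    # Each branch is also encouraged to match the other's prediction.
    mutual = kl_divergence(v, p) + kl_divergence(p, v)
    return supervised + weight * mutual
```

When both branches produce the same prediction, the mutual term vanishes and only the two supervised terms remain; the more the branches disagree, the larger the penalty, which is one way to read "teach each other" in training.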
Pages: 4539-4551
Page count: 13