Balanced Class-Incremental 3D Object Classification and Retrieval

被引:4
|
作者
Liu, An-An [1 ,2 ]
Lu, Haochun [1 ]
Zhou, Heyu [1 ]
Li, Tianbao [1 ]
Kankanhalli, Mohan [3 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
[2] Hefei Comprehens Natl Sci Ctr, Inst Artificial Intelligence, Hefei 230088, Anhui, Peoples R China
[3] Natl Univ Singapore, Sch Comp, Singapore 117543, Singapore
基金
中国国家自然科学基金;
关键词
3D representation learning; 3D object classification; 3D object retrieval; class-incremental learning;
D O I
10.1109/TKDE.2023.3284032
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most existing 3D object classification and retrieval algorithms rely on one-off supervised learning on closed 3D object sets and tend to provide rigid convolutional neural networks with little scalability. Such limitations substantially restrict their potential to learn newly emerged 3D object classes continually in the real world. Aiming to go beyond these limitations, we innovatively propose two new and challenging tasks: class-incremental 3D object classification (CI-3DOC) and class-incremental 3D object retrieval (CI-3DOR), the key to which is class-incremental 3D representation learning. It expects the network to update continually to learn new 3D class representations without forgetting the previously learned ones. To this end, we design a novel balanced distillation network (BDNet) that uses a dual supervision mechanism to balance between consolidating old knowledge (stability) and adapting to new 3D object classes (plasticity) carefully. On the one hand, we employ stability-based supervision to retain the stable and discriminative information of old classes that greatly benefit both classification and retrieval tasks. On the other hand, we use plasticity-based supervision to improve the network's generalization for learning new class 3D representations by transferring knowledge from a temporary teacher network to the current model. By properly handling the relationship between the two modules, we achieve a surprising performance improvement. Furthermore, considering there is no available dataset for evaluation, we build two 3D datasets, INOR-1 and INOR-2, to evaluate these two new tasks. Extensive experimental results demonstrate that our method can significantly outperform other state-of-the-art class-incremental learning methods. Even if we store 500-1000 fewer 3D objects than SOTA methods, BDNet still achieves comparable performance.
引用
收藏
页码:35 / 48
页数:14
相关论文
共 50 条
  • [21] 3D OBJECT RETRIEVAL BY 3D CURVE MATCHING
    Feinen, Christian
    Czajkowska, Joanna
    Grzegorzek, Marcin
    Latecki, Longin Jan
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 2749 - 2753
  • [22] BEFM: A balanced and efficient fine-tuning model in class-incremental learning
    Liu, Lize
    Ji, Jian
    Zhao, Lei
    KNOWLEDGE-BASED SYSTEMS, 2025, 315
  • [23] Dual Balanced Class-Incremental Learning With im-Softmax and Angular Rectification
    Zhi, Ruicong
    Meng, Yicheng
    Hou, Junyi
    Wan, Jun
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, : 1 - 11
  • [24] Dual Balanced Class-Incremental Learning With im-Softmax and Angular Rectification
    Zhi, Ruicong
    Meng, Yicheng
    Hou, Junyi
    Wan, Jun
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (03) : 4437 - 4447
  • [25] Take Goods from Shelves: A Dataset for Class-Incremental Object Detection
    Hao, Yu
    Fu, Yanwei
    Jiang, Yu-Gang
    ICMR'19: PROCEEDINGS OF THE 2019 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2019, : 271 - 278
  • [26] 3D ResNets for 3D Object Classification
    Ioannidou, Anastasia
    Chatzilari, Elisavet
    Nikolopoulos, Spiros
    Kompatsiaris, Ioannis
    MULTIMEDIA MODELING (MMM 2019), PT I, 2019, 11295 : 495 - 506
  • [27] 3D Object Retrieval: Inter-Class vs. Intra-Class
    Theoharis, Theoharis
    ARTIFICIAL INTELLIGENCE TECHNIQUES FOR COMPUTER GRAPHICS, 2008, 159 : 55 - 66
  • [28] NormalNet: A voxel-based CNN for 3D object classification and retrieval
    Wang, Cheng
    Cheng, Ming
    Sohel, Ferdous
    Bennamoun, Mohammed
    Li, Jonathan
    NEUROCOMPUTING, 2019, 323 : 139 - 147
  • [29] SPNet: Deep 3D Object Classification and Retrieval Using Stereographic Projection
    Yavartanoo, Mohsen
    Kim, Eu Young
    Lee, Kyoung Mu
    COMPUTER VISION - ACCV 2018, PT V, 2019, 11365 : 691 - 706
  • [30] BALANCED RANKING AND SORTING FOR CLASS INCREMENTAL OBJECT DETECTION
    Cui, Bo
    Qu, Hui
    Huang, Xuhui
    Yu, Shan
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 2010 - 2014