MultiCAD: Contrastive Representation Learning for Multi-modal 3D Computer-Aided Design Models

被引:5
|
作者
Ma, Weijian [1 ]
Xu, Minyang [1 ]
Li, Xueyang [1 ]
Zhou, Xiangdong [1 ]
机构
[1] Fudan Univ, Sch Comp Sci, Shanghai, Peoples R China
关键词
Multimodal Machine Learning; Representation Learning; Contrastive Learning; Computer Aided Design;
D O I
10.1145/3583780.3614982
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
CAD models are multimodal data where information and knowledge contained in construction sequences and shapes are complementary to each other and representation learning methods should consider both of them. Such traits have been neglected in previous methods learning unimodal representations. To leverage the information from both modalities, we develop a multimodal contrastive learning strategy where features from different modalities interact via contrastive learning paradigm, driven by a novel multimodal contrastive loss. Two pretext tasks on both geometry and sequence domains are designed along with a two-stage training strategy to make the representation focus on encoding geometric details and decoding representations into construction sequences, thus being more applicable to downstream tasks such as multimodal retrieval and CAD sequence reconstruction. Experimental results show that the performance of our multimodal representation learning scheme has surpassed the baselines and unimodal methods significantly.
引用
收藏
页码:1766 / 1776
页数:11
相关论文
共 50 条
  • [1] ContrastCAD: Contrastive Learning-Based Representation Learning for Computer-Aided Design Models
    Jung, Minseop
    Kim, Minseong
    Kim, Jibum
    IEEE ACCESS, 2024, 12 : 84830 - 84842
  • [2] Multi-modal image registration using local frequency representation and computer-aided design (CAD) models
    Elbakary, M. I.
    Sundareshan, M. K.
    IMAGE AND VISION COMPUTING, 2007, 25 (05) : 663 - 670
  • [3] Multi-Modal 3D Shape Clustering with Dual Contrastive Learning
    Lin, Guoting
    Zheng, Zexun
    Chen, Lin
    Qin, Tianyi
    Song, Jiahui
    APPLIED SCIENCES-BASEL, 2022, 12 (15):
  • [4] Deep contrastive representation learning for multi-modal clustering
    Lu, Yang
    Li, Qin
    Zhang, Xiangdong
    Gao, Quanxue
    NEUROCOMPUTING, 2024, 581
  • [5] Contrastive Multi-Modal Knowledge Graph Representation Learning
    Fang, Quan
    Zhang, Xiaowei
    Hu, Jun
    Wu, Xian
    Xu, Changsheng
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (09) : 8983 - 8996
  • [6] Multi-modal Relation Distillation for Unified 3D Representation Learning
    Wang, Huiqun
    Bao, Yiping
    Pan, Panwang
    Li, Zeming
    Liu, Xiao
    Yang, Ruijie
    Huang, Di
    COMPUTER VISION - ECCV 2024, PT XXXIII, 2025, 15091 : 364 - 381
  • [7] Automated Assessment Tool for 3D Computer-Aided Design Models
    Eltaief, Ameni
    Ben Amor, Sabrine
    Louhichi, Borhen
    Alrasheedi, Nashmi H.
    Seibi, Abdennour
    APPLIED SCIENCES-BASEL, 2024, 14 (11):
  • [8] 3D COMPUTER-AIDED MOLD DESIGN
    CAREN, S
    SCHUDER, D
    IMPROVING COMPETITIVENESS THROUGH PLASTICS INNOVATION ( PREPRINT ), 1988, : R1 - R11
  • [9] Graph Embedding Contrastive Multi-Modal Representation Learning for Clustering
    Xia, Wei
    Wang, Tianxiu
    Gao, Quanxue
    Yang, Ming
    Gao, Xinbo
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 1170 - 1183
  • [10] Computer-Aided Design of 3D Integrated Circuits
    Sapatnekar, Sachin S.
    GLSVLSI'07: PROCEEDINGS OF THE 2007 ACM GREAT LAKES SYMPOSIUM ON VLSI, 2007, : 317 - 317