MultiCAD: Contrastive Representation Learning for Multi-modal 3D Computer-Aided Design Models

被引:5
|
作者
Ma, Weijian [1 ]
Xu, Minyang [1 ]
Li, Xueyang [1 ]
Zhou, Xiangdong [1 ]
机构
[1] Fudan Univ, Sch Comp Sci, Shanghai, Peoples R China
关键词
Multimodal Machine Learning; Representation Learning; Contrastive Learning; Computer Aided Design;
D O I
10.1145/3583780.3614982
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
CAD models are multimodal data where information and knowledge contained in construction sequences and shapes are complementary to each other and representation learning methods should consider both of them. Such traits have been neglected in previous methods learning unimodal representations. To leverage the information from both modalities, we develop a multimodal contrastive learning strategy where features from different modalities interact via contrastive learning paradigm, driven by a novel multimodal contrastive loss. Two pretext tasks on both geometry and sequence domains are designed along with a two-stage training strategy to make the representation focus on encoding geometric details and decoding representations into construction sequences, thus being more applicable to downstream tasks such as multimodal retrieval and CAD sequence reconstruction. Experimental results show that the performance of our multimodal representation learning scheme has surpassed the baselines and unimodal methods significantly.
引用
收藏
页码:1766 / 1776
页数:11
相关论文
共 50 条
  • [41] Computer-aided fixation detection using retinal birefringence in multi-modal ophthalmic systems: Computer, electronics, algorithms
    Gramatikov, Boris, I
    COMPUTERS IN BIOLOGY AND MEDICINE, 2020, 119
  • [42] THE MANY USES OF COMPUTER-AIDED 3D MODELING
    CROSLEY, ML
    ARCHITECTURE-THE AIA JOURNAL, 1988, 77 (07): : 119 - 122
  • [43] OmniViewer: Multi-modal Monoscopic 3D DASH
    Gao, Zhenhuan
    Chen, Shannon
    Nahrstedt, Klara
    2015 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2015, : 449 - 452
  • [44] Multi-modal brain tumor segmentation via disentangled representation learning and region-aware contrastive learning
    Zhou, Tongxue
    PATTERN RECOGNITION, 2024, 149 (149)
  • [45] Multi-Modal Streaming 3D Object Detection
    Abdelfattah, Mazen
    Yuan, Kaiwen
    Wang, Z. Jane
    Ward, Rabab
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (10) : 6163 - 6170
  • [46] ActiveAnno3D-An Active Learning Framework for Multi-Modal 3D Object Detection
    Ghita, Ahmed
    Antoniussen, Bjork
    Zimmer, Walter
    Greer, Ross
    Cress, Christian
    Mogelmose, Andreas
    Trivedi, Mohan M.
    Knoll, Alois C.
    2024 35TH IEEE INTELLIGENT VEHICLES SYMPOSIUM, IEEE IV 2024, 2024, : 1699 - 1706
  • [47] A Java']Java-based computer-aided learning system for 3D geometry
    Chan, SCF
    Lui, KM
    Ng, VTY
    Chui, R
    ADVANCED RESEARCH IN COMPUTERS AND COMMUNICATIONS IN EDUCATION, VOL 1: NEW HUMAN ABILITIES FOR THE NETWORKED SOCIETY, 1999, 55 : 1010 - 1017
  • [48] 3D deep learning for computer-aided detection of serrated polyps in CT colonography
    Nappi, Janne J.
    Uemura, Tomoki
    Pickhardt, Perry
    Kim, David H.
    Yoshida, Hiroyuki
    MEDICAL IMAGING 2021: COMPUTER-AIDED DIAGNOSIS, 2021, 11597
  • [49] The multi-user computer-aided design collaborative learning framework
    Deng, Yuanzhe
    Mueller, Matthew
    Rogers, Chris
    Olechowski, Alison
    ADVANCED ENGINEERING INFORMATICS, 2022, 51
  • [50] Computer-aided design of resistance micro-fluidic circuits for 3D printing
    Tsur, Elishai Ezra
    Shamir, Ariel
    COMPUTER-AIDED DESIGN, 2018, 98 : 12 - 23