Attention-Enhanced Multimodal Learning for Conceptual Design Evaluations

被引:9
|
作者
Song, Binyang [1 ]
Miller, Scarlett [2 ]
Ahmed, Faez [1 ]
机构
[1] MIT, Dept Mech Engn, Cambridge, MA 02139 USA
[2] Penn State Univ, Sch Engn Design & Innovat, State Coll, PA 16802 USA
关键词
conceptual design; creativity and concept generation; design evaluation; machine learning; multimodal learning; CREATIVITY; NOVELTY;
D O I
10.1115/1.4056669
中图分类号
TH [机械、仪表工业];
学科分类号
0802 ;
摘要
Conceptual design evaluation is an indispensable component of innovation in the early stage of engineering design. Properly assessing the effectiveness of conceptual design requires a rigorous evaluation of the outputs. Traditional methods to evaluate conceptual designs are slow, expensive, and difficult to scale because they rely on human expert input. An alternative approach is to use computational methods to evaluate design concepts. However, most existing methods have limited utility because they are constrained to unimodal design representations (e.g., texts or sketches). To overcome these limitations, we propose an attention-enhanced multimodal learning (AEMML)-based machine learning (ML) model to predict five design metrics: drawing quality, uniqueness, elegance, usefulness, and creativity. The proposed model utilizes knowledge from large external datasets through transfer learning (TL), simultaneously processes text and sketch data from early-phase concepts, and effectively fuses the multimodal information through a mutual cross-attention mechanism. To study the efficacy of multimodal learning (MML) and attention-based information fusion, we compare (1) a baseline MML model and the unimodal models and (2) the attention-enhanced models with baseline models in terms of their explanatory power for the variability of the design metrics. The results show that MML improves the model explanatory power by 0.05-0.12 and the mutual cross-attention mechanism further increases the explanatory power of the approach by 0.05-0.09, leading to the highest explanatory power of 0.44 for drawing quality, 0.60 for uniqueness, 0.45 for elegance, 0.43 for usefulness, and 0.32 for creativity. Our findings highlight the benefit of using multimodal representations for design metric assessment.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Attention-Enhanced Voice Portrait Model Using Generative Adversarial Network
    Mao, Jingyi
    Zhou, Yuchen
    Wang, Yifan
    Li, Junyu
    Liu, Ziqing
    Bu, Fanliang
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 79 (01): : 837 - 855
  • [42] Target Speaker Extraction Using Attention-Enhanced Temporal Convolutional Network
    Wang, Jian-Hong
    Lai, Yen-Ting
    Tai, Tzu-Chiang
    Le, Phuong Thi
    Pham, Tuan
    Wang, Ze-Yu
    Li, Yung-Hui
    Wang, Jia-Ching
    Chang, Pao-Chi
    Botzheim, Janos
    [J]. ELECTRONICS, 2024, 13 (02)
  • [43] Attention-Enhanced Graph Neural Networks for Session-Based Recommendation
    Wang, Baocheng
    Cai, Wentao
    [J]. MATHEMATICS, 2020, 8 (09)
  • [44] A Knowledge-Aware Recommender with Attention-Enhanced Dynamic Convolutional Network
    Liu, Yi
    Li, Bohan
    Zang, Yalei
    Li, Aoran
    Yin, Hongzhi
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 1079 - 1088
  • [45] ATTENTION-ENHANCED AND MORE BALANCED R-CNN FOR OBJECT DETECTION
    Mei, Ruohong
    Wang, Haiying
    Men, Aidong
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 2136 - 2140
  • [46] An Attention-Enhanced End-to-End Discriminative Network With Multiscale Feature Learning for Remote Sensing Image Retrieval
    Hou, Dongyang
    Wang, Siyuan
    Tian, Xueqing
    Xing, Huaqiao
    [J]. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2022, 15 : 8245 - 8255
  • [47] Dual-Modal Attention-Enhanced Text-Video Retrieval with Triplet Partial Margin Contrastive Learning
    Jiang, Chen
    Liu, Hong
    Yu, Xuzheng
    Wang, Qing
    Cheng, Yuan
    Xu, Jia
    Liu, Zhongyi
    Guo, Qingpei
    Chu, Wei
    Yang, Ming
    Qi, Yuan
    [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 4626 - 4636
  • [48] An Attention-Enhanced End-to-End Discriminative Network With Multiscale Feature Learning for Remote Sensing Image Retrieval
    Hou, Dongyang
    Wang, Siyuan
    Tian, Xueqing
    Xing, Huaqiao
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 : 8245 - 8255
  • [49] Attention-Enhanced Guided Multimodal and Semi-Supervised Networks for Visual Acuity (VA) Prediction after Anti-VEGF Therapy
    Wang, Yizhen
    Wang, Yaqi
    Liu, Xianwen
    Cui, Weiwei
    Jin, Peng
    Cheng, Yuxia
    Jia, Gangyong
    [J]. ELECTRONICS, 2024, 13 (18)
  • [50] TransVCL: Attention-Enhanced Video Copy Localization Network with Flexible Supervision
    He, Sifeng
    He, Yue
    Lu, Minlong
    Jiang, Chen
    Yang, Xudong
    Qian, Feng
    Zhang, Xiaobo
    Yang, Lei
    Zhang, Jiandong
    [J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 1, 2023, : 799 - 807