Multimodal feature fusion for concreteness estimation

被引:0
|
作者
Incitti, Francesca [1 ]
Snidaro, Lauro [1 ]
机构
[1] Univ Udine, Dept Math Comp Sci & Phys, Udine, Italy
关键词
Word Embeddings; Feature fusion; NLP; Multimodal feature learning; ELMo; BERT; CLIP; Concreteness estimation; Autoencoders; Dimensionality reduction; AUTOENCODER;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years the idea of fusing diverse type of information has often been employed to solve various Deep Learning tasks. Whether these regard an NLP problem or a Machine Vision one, the concept of using more inputs of the same type has been the basis of many studies. Considering NLP problems, attempts of different word embeddings have already been tried, managing to make improvements to the most common benchmarks. Here we want to explore the combination not only of different types of input together, but also different data modalities. This is done by fusing two popular word embeddings together, mainly ELMo and BERT, with other inputs that embed a visual description of the analysed text. Doing so, different modalities -textual and visual- are both employed to solve a textual problem, a concreteness task. Multimodal feature fusion is here explored through several techniques: input redundancy, concatenation, average, dimensionality reduction and augmentation. By combining these techniques it is possible to generate different vector representations: the goal is to understand which feature fusion techniques allow to obtain more accurate embeddings.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Lithium-Ion Batteries SOH Estimation With Multimodal Multilinear Feature Fusion
    Lin, Mingqiang
    You, Yuqiang
    Meng, Jinhao
    Wang, Wei
    Wu, Ji
    Stroe, Daniel-Ioan
    IEEE TRANSACTIONS ON ENERGY CONVERSION, 2023, 38 (04) : 2959 - 2968
  • [2] A Multimodal Framework for Unsupervised Feature Fusion
    Li, Xiaoyi
    Gao, Jing
    Li, Hui
    Yang, Le
    Srihari, Rohini K.
    PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 897 - 902
  • [3] Adaptive Multimodal-Feature Fusion for 6D Object Position Estimation
    Zang, Chuanfang
    Dang, Jianwu
    Yong, Jiu
    LASER & OPTOELECTRONICS PROGRESS, 2025, 62 (04)
  • [4] MFCNet: Multimodal Feature Fusion Network for RGB-T Vehicle Density Estimation
    Qin, Ling-Xiao
    Sun, Hong-Mei
    Duan, Xiao-Meng
    Che, Cheng-Yue
    Jia, Rui-Sheng
    IEEE INTERNET OF THINGS JOURNAL, 2025, 12 (04): : 4207 - 4219
  • [5] Multimodal Biometric Person Recognition by Feature Fusion
    Huang, Lin
    Yu, Chenxi
    Cao, Xinzhe
    2018 5TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CONTROL ENGINEERING (ICISCE 2018), 2018, : 1158 - 1162
  • [6] Multimodal Emotion Recognition Based on Feature Fusion
    Xu, Yurui
    Wu, Xiao
    Su, Hang
    Liu, Xiaorui
    2022 INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2022), 2022, : 7 - 11
  • [7] Feature Level Fusion in Multimodal Biometric Identification
    Belhia, S.
    Gafour, A.
    2012 SECOND INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING TECHNOLOGY (INTECH), 2012, : 418 - 423
  • [8] Multimodal Biometrics using Cancelable Feature Fusion
    Paul, Padma Polash
    Gavrilova, Marina
    2014 INTERNATIONAL CONFERENCE ON CYBERWORLDS (CW), 2014, : 279 - 284
  • [9] Adaptable Feature Importance Estimation Framework for Fusion-based Multimodal Deep Neural Networks
    Azmat, Muneeza
    Fessler, Henry
    Alessio, Adam
    JOURNAL OF NUCLEAR MEDICINE, 2023, 64
  • [10] A novel multimodal image feature fusion mechanism: Application to rabbit liveweight estimation in commercial farms
    Song, Daoyi
    Lai, Zhenhao
    Yang, Shuqi
    Liu, Dongyu
    Yao, Jinxia
    Wang, Hongying
    Wang, Liangju
    SMART AGRICULTURAL TECHNOLOGY, 2024, 9