Multimodal feature fusion for concreteness estimation

Authors
Incitti, Francesca [1]
Snidaro, Lauro [1]
Affiliations
[1] Univ Udine, Dept Math Comp Sci & Phys, Udine, Italy
Keywords
Word Embeddings; Feature fusion; NLP; Multimodal feature learning; ELMo; BERT; CLIP; Concreteness estimation; Autoencoders; Dimensionality reduction
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In recent years, the idea of fusing diverse types of information has often been employed to solve various Deep Learning tasks. Whether the task is an NLP problem or a Machine Vision one, the concept of using more inputs of the same type has been the basis of many studies. In NLP, attempts to combine different word embeddings have already been made, yielding improvements on the most common benchmarks. Here we want to explore the combination not only of different types of input, but also of different data modalities. This is done by fusing two popular word embeddings, mainly ELMo and BERT, with other inputs that embed a visual description of the analysed text. In this way, different modalities (textual and visual) are both employed to solve a textual problem, a concreteness task. Multimodal feature fusion is explored here through several techniques: input redundancy, concatenation, average, dimensionality reduction and augmentation. Combining these techniques makes it possible to generate different vector representations; the goal is to understand which feature fusion techniques yield more accurate embeddings.
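Three of the fusion techniques named in the abstract (concatenation, average and input redundancy) can be illustrated with a minimal sketch. The abstract does not say how the authors implement them, so everything below is an assumption made for illustration: the toy vectors standing in for ELMo, BERT and CLIP embeddings, the helper names, the zero-padding used to average vectors of different lengths, and the reading of "input redundancy" as repeating one input before concatenation.

```python
from typing import List, Sequence

Vector = List[float]


def concatenate(vectors: Sequence[Sequence[float]]) -> Vector:
    """Fuse by joining the vectors end to end."""
    fused: Vector = []
    for vector in vectors:
        fused.extend(vector)
    return fused


def zero_pad(vector: Sequence[float], size: int) -> Vector:
    """Right-pad a vector with zeros up to `size` (an assumed alignment strategy)."""
    if len(vector) > size:
        raise ValueError("vector is longer than the requested size")
    return list(vector) + [0.0] * (size - len(vector))


def average(vectors: Sequence[Sequence[float]]) -> Vector:
    """Fuse by element-wise mean after zero-padding to the longest vector."""
    size = max(len(vector) for vector in vectors)
    padded = [zero_pad(vector, size) for vector in vectors]
    return [sum(column) / len(padded) for column in zip(*padded)]


def repeat(vector: Sequence[float], times: int) -> Vector:
    """Duplicate a vector `times` times (one possible reading of input redundancy)."""
    return list(vector) * times


# Toy stand-ins for real embeddings; the values and sizes are hypothetical.
elmo_like = [0.2, 0.4, 0.6, 0.8]
bert_like = [1.0, 0.0, 1.0]
clip_like = [0.5, 0.5]

fused_concat = concatenate([elmo_like, bert_like, clip_like])
fused_avg = average([elmo_like, bert_like, clip_like])
fused_redundant = concatenate([elmo_like, bert_like, repeat(clip_like, 2)])

print(len(fused_concat), len(fused_avg), len(fused_redundant))
```

Concatenation keeps every input dimension at the cost of a longer vector, while averaging keeps the vector short but mixes the inputs. Dimensionality reduction, for example with the autoencoders listed in the keywords, would be one way to shorten a concatenated vector; it is not shown here.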
Pages: 8