SELF-AUGMENTED MULTI-MODAL FEATURE EMBEDDING

被引:0
|
作者
Matsuo, Shinnosuke [1 ]
Uchida, Seiichi [1 ]
Iwana, Brian Kenji [1 ]
机构
[1] Kyushu Univ, Fukuoka, Japan
关键词
Self-augmented multi-modality; multi-modal embedding; gating neural networks;
D O I
10.1109/ICASSP39728.2021.9413974
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Oftentimes, patterns can be represented through different modalities. For example, leaf data can be in the form of images or contours. Handwritten characters can also be either online or offline. To exploit this fact, we propose the use of self-augmentation and combine it with multi-modal feature embedding. In order to take advantage of the complementary information from the different modalities, the self-augmented multi-modal feature embedding employs a shared feature space. Through experimental results on classification with online handwriting and leaf images, we demonstrate that the proposed method can create effective embeddings.
引用
收藏
页码:3995 / 3999
页数:5
相关论文
共 50 条
  • [41] Multi-modal recursive prompt learning with mixup embedding for generalization recognition
    Jia, Yunpeng
    Ye, Xiufen
    Liu, Yusong
    Guo, Shuxiang
    KNOWLEDGE-BASED SYSTEMS, 2024, 294
  • [42] Understanding Natural Language Sentences with Word Embedding and Multi-modal Interaction
    Zhong, Junpei
    Ogata, Tetsuya
    Cangelosi, Angelo
    Yang, Chenguang
    2017 THE SEVENTH JOINT IEEE INTERNATIONAL CONFERENCE ON DEVELOPMENT AND LEARNING AND EPIGENETIC ROBOTICS (ICDL-EPIROB), 2017, : 184 - 189
  • [43] Generalised Zero-shot Learning with Multi-modal Embedding Spaces
    Felix, Rafael
    Sasdelli, Michele
    Harwood, Ben
    Carneiro, Gustavo
    2020 DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2020,
  • [44] Multi-modal Emotion Recognition using Speech Features and Text Embedding
    Kim J.-H.
    Lee S.-P.
    Transactions of the Korean Institute of Electrical Engineers, 2021, 70 (01): : 108 - 113
  • [45] Efficient and Effective Multi-Modal Queries Through Heterogeneous Network Embedding
    Chi Thang Duong
    Thanh Tam Nguyen
    Yin, Hongzhi
    Weidlich, Matthias
    Mai, Thai Son
    Aberer, Karl
    Quoc Viet Hung Nguyen
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (11) : 5307 - 5320
  • [46] Prior Art Search Using Multi-Modal Embedding of Patent Documents
    Kang, Myungchul
    Lee, Suan
    Lee, Wookey
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP 2020), 2020, : 548 - 550
  • [47] Multi-Modal Entity Alignment Method Based on Feature Enhancement
    Wang, Huansha
    Liu, Qinrang
    Huang, Ruiyang
    Zhang, Jianpeng
    APPLIED SCIENCES-BASEL, 2023, 13 (11):
  • [48] Disease Classification Model Based on Multi-Modal Feature Fusion
    Wan, Zhengyu
    Shao, Xinhui
    IEEE ACCESS, 2023, 11 : 27536 - 27545
  • [49] Learning discriminative motion feature for enhancing multi-modal action
    Yang, Jianyu
    Huang, Yao
    Shao, Zhanpeng
    Liu, Chunping
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 79
  • [50] Multiscale structural feature transform for multi-modal image matching
    Hu, Maoqing
    Sun, Bin
    Kang, Xudong
    Li, Shutao
    INFORMATION FUSION, 2023, 95 : 341 - 354