SELF-AUGMENTED MULTI-MODAL FEATURE EMBEDDING

Cited by: 0
Authors
Matsuo, Shinnosuke [1]
Uchida, Seiichi [1]
Iwana, Brian Kenji [1]
Affiliation
[1] Kyushu Univ, Fukuoka, Japan
Keywords
Self-augmented multi-modality; multi-modal embedding; gating neural networks
DOI
10.1109/ICASSP39728.2021.9413974
Chinese Library Classification
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Oftentimes, patterns can be represented through different modalities. For example, leaf data can be in the form of images or contours. Handwritten characters can also be either online or offline. To exploit this fact, we propose the use of self-augmentation and combine it with multi-modal feature embedding. In order to take advantage of the complementary information from the different modalities, the self-augmented multi-modal feature embedding employs a shared feature space. Through experimental results on classification with online handwriting and leaf images, we demonstrate that the proposed method can create effective embeddings.
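
As a rough illustration of the idea in the abstract, the sketch below derives a second modality from a single sample and blends the two feature vectors with a gate. It is not the authors' architecture: reading "self-augmentation" as rendering an online stroke into an offline bitmap is an assumption, the helper names `rasterize` and `gated_fusion` are hypothetical, the "encoders" are hand-written stand-ins, and the gate logits are fixed here where a real model would learn them.

```python
import math


def rasterize(stroke, size=8):
    """Render an online stroke (points with x, y in [0, 1]) as a size x size bitmap."""
    grid = [[0.0] * size for _ in range(size)]
    for x, y in stroke:
        col = min(int(x * size), size - 1)
        row = min(int(y * size), size - 1)
        grid[row][col] = 1.0
    return grid


def sigmoid(v):
    """Numerically stable logistic function."""
    if v >= 0:
        return 1.0 / (1.0 + math.exp(-v))
    e = math.exp(v)
    return e / (1.0 + e)


def gated_fusion(feat_a, feat_b, gate_logits):
    """Blend two same-length feature vectors element-wise: g * a + (1 - g) * b."""
    if not (len(feat_a) == len(feat_b) == len(gate_logits)):
        raise ValueError("all inputs must have the same length")
    fused = []
    for a, b, logit in zip(feat_a, feat_b, gate_logits):
        g = sigmoid(logit)
        fused.append(g * a + (1.0 - g) * b)
    return fused


# Hypothetical online sample: a short diagonal stroke.
stroke = [(0.0, 0.0), (0.5, 0.5), (0.99, 0.99)]

# Assumed self-augmentation step: produce the offline modality from the online one.
bitmap = rasterize(stroke, size=4)

# Stand-in encoders mapping each modality to a 2-dimensional feature vector.
online_feat = [
    sum(x for x, _ in stroke) / len(stroke),
    sum(y for _, y in stroke) / len(stroke),
]
offline_feat = [sum(map(sum, bitmap)) / 16.0, bitmap[0][0]]

# With zero logits the gate is 0.5, so the embedding is the average of both modalities.
embedding = gated_fusion(online_feat, offline_feat, gate_logits=[0.0, 0.0])
```

Both feature vectors must share the same dimensionality for the element-wise gate to apply, which is one simple way to realize a shared feature space.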
Pages: 3995-3999
Number of pages: 5