SELF-AUGMENTED MULTI-MODAL FEATURE EMBEDDING

被引：0

作者：

Matsuo, Shinnosuke ^{[1
]}

Uchida, Seiichi ^{[1
]}

Iwana, Brian Kenji ^{[1
]}

机构：

[1] Kyushu Univ, Fukuoka, Japan

来源：

2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021) | 2021年

关键词：

Self-augmented multi-modality; multi-modal embedding; gating neural networks;

D O I：

10.1109/ICASSP39728.2021.9413974

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Oftentimes, patterns can be represented through different modalities. For example, leaf data can be in the form of images or contours. Handwritten characters can also be either online or offline. To exploit this fact, we propose the use of self-augmentation and combine it with multi-modal feature embedding. In order to take advantage of the complementary information from the different modalities, the self-augmented multi-modal feature embedding employs a shared feature space. Through experimental results on classification with online handwriting and leaf images, we demonstrate that the proposed method can create effective embeddings.

引用

页码：3995 / 3999

页数：5

共 50 条

[41] Multi-modal recursive prompt learning with mixup embedding for generalization recognition
Jia, Yunpeng
Ye, Xiufen
Liu, Yusong
Guo, Shuxiang
KNOWLEDGE-BASED SYSTEMS, 2024, 294
[42] Understanding Natural Language Sentences with Word Embedding and Multi-modal Interaction
Zhong, Junpei
Ogata, Tetsuya
Cangelosi, Angelo
Yang, Chenguang
2017 THE SEVENTH JOINT IEEE INTERNATIONAL CONFERENCE ON DEVELOPMENT AND LEARNING AND EPIGENETIC ROBOTICS (ICDL-EPIROB), 2017, : 184 - 189
[43] Generalised Zero-shot Learning with Multi-modal Embedding Spaces
Felix, Rafael
Sasdelli, Michele
Harwood, Ben
Carneiro, Gustavo
2020 DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2020,
[44] Multi-modal Emotion Recognition using Speech Features and Text Embedding
Kim J.-H.
Lee S.-P.
Transactions of the Korean Institute of Electrical Engineers, 2021, 70 (01): : 108 - 113
[45] Efficient and Effective Multi-Modal Queries Through Heterogeneous Network Embedding
Chi Thang Duong
Thanh Tam Nguyen
Yin, Hongzhi
Weidlich, Matthias
Mai, Thai Son
Aberer, Karl
Quoc Viet Hung Nguyen
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (11) : 5307 - 5320
[46] Prior Art Search Using Multi-Modal Embedding of Patent Documents
Kang, Myungchul
Lee, Suan
Lee, Wookey
2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP 2020), 2020, : 548 - 550
[47] Multi-Modal Entity Alignment Method Based on Feature Enhancement
Wang, Huansha
Liu, Qinrang
Huang, Ruiyang
Zhang, Jianpeng
APPLIED SCIENCES-BASEL, 2023, 13 (11):
[48] Disease Classification Model Based on Multi-Modal Feature Fusion
Wan, Zhengyu
Shao, Xinhui
IEEE ACCESS, 2023, 11 : 27536 - 27545
[49] Learning discriminative motion feature for enhancing multi-modal action
Yang, Jianyu
Huang, Yao
Shao, Zhanpeng
Liu, Chunping
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 79
[50] Multiscale structural feature transform for multi-modal image matching
Hu, Maoqing
Sun, Bin
Kang, Xudong
Li, Shutao
INFORMATION FUSION, 2023, 95 : 341 - 354

← 1 2 3 4 5 →