SELF-AUGMENTED MULTI-MODAL FEATURE EMBEDDING

被引：0

作者：

Matsuo, Shinnosuke ^{[1
]}

Uchida, Seiichi ^{[1
]}

Iwana, Brian Kenji ^{[1
]}

机构：

[1] Kyushu Univ, Fukuoka, Japan

来源：

2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021) | 2021年

关键词：

Self-augmented multi-modality; multi-modal embedding; gating neural networks;

D O I：

10.1109/ICASSP39728.2021.9413974

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Oftentimes, patterns can be represented through different modalities. For example, leaf data can be in the form of images or contours. Handwritten characters can also be either online or offline. To exploit this fact, we propose the use of self-augmentation and combine it with multi-modal feature embedding. In order to take advantage of the complementary information from the different modalities, the self-augmented multi-modal feature embedding employs a shared feature space. Through experimental results on classification with online handwriting and leaf images, we demonstrate that the proposed method can create effective embeddings.

引用

页码：3995 / 3999

页数：5

共 50 条

[21] Gesture recognition based on multi-modal feature weight
Duan, Haojie
Sun, Ying
Cheng, Wentao
Jiang, Du
Yun, Juntong
Liu, Ying
Liu, Yibo
Zhou, Dalin
Concurrency and Computation: Practice and Experience, 2021, 33 (05)
[22] Multi-modal feature fusion for geographic image annotation
Li, Ke
Zou, Changqing
Bu, Shuhui
Liang, Yun
Zhang, Jian
Gong, Minglun
PATTERN RECOGNITION, 2018, 73 : 1 - 14
[23] A Discriminative Vectorial Framework for Multi-Modal Feature Representation
Gao, Lei
Guan, Ling
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 1503 - 1514
[24] A fast multi-modal approach to facial feature detection
Boehnen, C
Russ, T
WACV 2005: SEVENTH IEEE WORKSHOP ON APPLICATIONS OF COMPUTER VISION, PROCEEDINGS, 2005, : 135 - 142
[25] Fusional Recognition for Depressive Tendency With Multi-Modal Feature
Wang, Hong
Zhou, Ying
Yu, Fengping
Zhao, Lili
Wang, Caiyu
Ren, Yanju
IEEE ACCESS, 2019, 7 : 38702 - 38713
[26] Adaptive Feature Fusion for Multi-modal Entity Alignment
Guo H.
Li X.-Y.
Tang J.-Y.
Guo Y.-M.
Zhao X.
Zidonghua Xuebao/Acta Automatica Sinica, 2024, 50 (04): : 758 - 770
[27] Landmark Classification With Hierarchical Multi-Modal Exemplar Feature
Zhu, Lei
Shen, Jialie
Jin, Hai
Xie, Liang
Zheng, Ran
IEEE TRANSACTIONS ON MULTIMEDIA, 2015, 17 (07) : 981 - 993
[28] Gesture recognition based on multi-modal feature weight
Duan, Haojie
Sun, Ying
Cheng, Wentao
Jiang, Du
Yun, Juntong
Liu, Ying
Liu, Yibo
Zhou, Dalin
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (05):
[29] Denoised Self-Augmented Learning for Social Recommendation
Wang, Tianle
Xia, Lianghao
Huang, Chao
PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 2324 - 2331
[30] Software infrastructure for interactive, multi-modal virtual and augmented realities
Martin, GA
Daly, J
Washburn, DA
Lazarus, T
Goldiez, B
ISAS/CITSA 2004: International Conference on Cybernetics and Information Technologies, Systems and Applications and 10th International Conference on Information Systems Analysis and Synthesis, Vol 4, Proceedings, 2004, : 13 - 18

← 1 2 3 4 5 →