Multi-Modal Deep Learning-Based Violin Bowing Action Recognition

Cited by: 1

Authors:

Liu, Bao-Yun [1]

Jen, Yi-Hsin [2, 3]

Sun, Shih-Wei [4]

Su, Li [2]

Chang, Pao-Chi [1]

Affiliations:

[1] Natl Cent Univ, Dept Commun Engn, Taoyuan, Taiwan

[2] Acad Sinica, Inst Informat Sci, Taipei, Taiwan

[3] Natl Tsing Hua Univ, Dept Comp Sci, Hsinchu, Taiwan

[4] Taipei Natl Univ Arts, Dept New Media Art, Taipei, Taiwan

Source:

2020 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN (ICCE-TAIWAN) | 2020

Keywords:

DOI:

10.1109/icce-taiwan49838.2020.9257995

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

This paper proposes a deep learning-based system for recognizing violin bowing actions. The system fuses sensing signals from a depth camera modality and inertial sensor modalities. A violinist's actions are captured by a depth camera and recorded by wearable sensors on the violinist's forearm. A 3D convolutional neural network (3D-CNN) and a long short-term memory (LSTM) network are adopted to generate action models from the depth camera and inertial sensor modalities. The features and models obtained from the multiple modalities are used to classify different violin bowing actions, and fusing the modalities achieves satisfactory recognition accuracy. A violin bowing action dataset is also generated for this preliminary study and for evaluating system performance.
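The abstract does not state which fusion rule is used, so the following is only a minimal sketch of one common option: score-level (late) fusion, in which each modality's classifier outputs per-class scores and the fused prediction is a weighted average of the resulting probabilities. The function names, the `depth_weight` parameter, and the class labels are all hypothetical placeholders, not taken from the paper.

```python
import math


def softmax(scores):
    """Convert raw class scores into probabilities that sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def late_fusion(depth_scores, inertial_scores, depth_weight=0.5):
    """Weighted average of per-modality class probabilities.

    depth_scores / inertial_scores: raw class scores from two separately
    trained models (hypothetical stand-ins for a depth-camera model and an
    inertial-sensor model). depth_weight is an assumed tuning parameter.
    """
    if len(depth_scores) != len(inertial_scores):
        raise ValueError("both modalities must score the same classes")
    if not 0.0 <= depth_weight <= 1.0:
        raise ValueError("depth_weight must lie in [0, 1]")
    p_depth = softmax(depth_scores)
    p_inertial = softmax(inertial_scores)
    return [
        depth_weight * d + (1.0 - depth_weight) * i
        for d, i in zip(p_depth, p_inertial)
    ]


def predict(fused_probs, labels):
    """Return the label with the highest fused probability."""
    best = max(range(len(fused_probs)), key=fused_probs.__getitem__)
    return labels[best]


# Placeholder class names; the abstract does not list the dataset's classes.
labels = ["bowing_action_a", "bowing_action_b", "bowing_action_c"]
fused = late_fusion([2.0, 0.5, 0.1], [1.5, 0.2, 0.3], depth_weight=0.6)
prediction = predict(fused, labels)
```

Averaging probabilities rather than raw scores keeps the two modalities on a comparable scale; feature-level fusion (concatenating intermediate features before a shared classifier) is an alternative that this sketch does not cover.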


Pages: 2
