Multi-Modal Deep Learning-Based Violin Bowing Action Recognition

Cited by: 1
Authors
Liu, Bao-Yun [1]
Jen, Yi-Hsin [2, 3]
Sun, Shih-Wei [4]
Su, Li [2]
Chang, Pao-Chi [1]
Affiliations
[1] Natl Cent Univ, Dept Commun Engn, Taoyuan, Taiwan
[2] Acad Sinica, Inst Informat Sci, Taipei, Taiwan
[3] Natl Tsing Hua Univ, Dept Comp Sci, Hsinchu, Taiwan
[4] Taipei Natl Univ Arts, Dept New Media Art, Taipei, Taiwan
DOI
10.1109/icce-taiwan49838.2020.9257995
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper proposes a deep learning-based system for recognizing violin bowing actions. A violinist's actions are captured by a depth camera and recorded by wearable inertial sensors on the forearm. A 3D convolutional neural network (3D-CNN) and long short-term memory (LSTM) networks are adopted to generate action models from the depth camera modality and the inertial sensor modalities. The features and models obtained from these modalities are used to classify different violin bowing actions, and fusing the modalities achieves satisfactory recognition accuracy. A violin bowing action dataset was also generated for this preliminary study and for evaluating system performance.
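The abstract does not state how the modalities are fused. As a minimal sketch, assuming decision-level (late) fusion by weighted averaging of per-class probabilities, the step could look like the following. The function names, the `depth_weight` parameter, the placeholder labels, and the example scores are all hypothetical and are not taken from the paper; the two score lists stand in for the outputs of a depth-camera model and an inertial-sensor model.

```python
import math


def softmax(scores):
    """Convert raw class scores into probabilities that sum to 1."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def late_fusion(depth_scores, inertial_scores, depth_weight=0.5):
    """Weighted average of the two modalities' class probabilities.

    Assumption: simple late fusion; the paper's actual fusion rule
    is not given in the abstract.
    """
    if len(depth_scores) != len(inertial_scores):
        raise ValueError("both modalities must score the same classes")
    p_depth = softmax(depth_scores)
    p_inertial = softmax(inertial_scores)
    return [
        depth_weight * d + (1.0 - depth_weight) * i
        for d, i in zip(p_depth, p_inertial)
    ]


def predict(depth_scores, inertial_scores, labels, depth_weight=0.5):
    """Return the label with the highest fused probability."""
    fused = late_fusion(depth_scores, inertial_scores, depth_weight)
    best = max(range(len(fused)), key=fused.__getitem__)
    return labels[best]


# Hypothetical placeholder labels and scores, for illustration only.
LABELS = ["action_a", "action_b", "action_c"]
DEPTH_SCORES = [2.0, 1.0, 0.1]     # stand-in for a depth-camera model output
INERTIAL_SCORES = [0.1, 3.0, 0.2]  # stand-in for an inertial-sensor model output

if __name__ == "__main__":
    print(predict(DEPTH_SCORES, INERTIAL_SCORES, LABELS))
```

Averaging probabilities rather than raw scores keeps the two modalities on a common scale, and `depth_weight` lets one modality be trusted more than the other when its model is more reliable.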
Pages: 2