An end-to-end lower limb activity recognition framework based on sEMG data augmentation and enhanced CapsNet

被引:8
|
作者
Zhang, Changhe [1 ]
Li, Yangan [2 ]
Yu, Zidong [1 ]
Huang, Xiaolin [2 ]
Xu, Jiang [2 ]
Deng, Chao [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Mech Sci & Engn, Wuhan 430074, Peoples R China
[2] Huazhong Univ Sci & Technol, Tongji Hosp, Tongji Med Coll, Dept Rehabil Med, Wuhan 430030, Peoples R China
基金
中国国家自然科学基金;
关键词
Lower limb activity recognition; Biomedical signal analysis; sEMG denoising; Class-imbalanced problem; Capsule network;
D O I
10.1016/j.eswa.2023.120257
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, lower limb activity recognition (LLAR) based on surface electromyography (sEMG) signal has attracted increasing attention, mainly due to its applications in the control of robots and prosthetics, medical rehabili-tation, etc. Traditional machine learning-based LLAR methods rely on expert experience for feature extraction. In addition, the noise interference and class-imbalanced problem can also affect the recognition effect. Aiming at these problems, a LLAR framework based on sEMG data augmentation (DA) and enhanced capsule network (ECN) is proposed in this paper. Firstly, a hybrid denoising technique combining variational mode decomposition and non-local means estimation is designed to effectively filter out noise components mixed in the sEMG. Then, K-Means synthetic minority oversampling technique is utilized to synthesize new samples for minority classes, thereby overcoming the influence of class imbalance on recognition reliability. Finally, an ECN model is con-structed to implement end-to-end LLAR, in which an efficient channel attention module is embedded to mine sensitive features, thus further improving the feature learning ability of the classifier. Experimental results indicate that the proposed framework is applicable to multiple types of individuals, including healthy subjects, patients with knee abnormalities, and patients with stroke, providing more satisfactory recognition performance and robustness than state-of-the-art methods..
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Data Augmentation for End-to-End Optical Music Recognition
    Lopez-Gutierrez, Juan C.
    Valero-Mas, Jose J.
    Castellanos, Francisco J.
    Calvo-Zaragoza, Jorge
    DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021 WORKSHOPS, PT I, 2021, 12916 : 59 - 73
  • [2] AUDITORY-BASED DATA AUGMENTATION FOR END-TO-END AUTOMATIC SPEECH RECOGNITION
    Tu, Zehai
    Deadman, Jack
    Ma, Ning
    Barker, Jon
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7447 - 7451
  • [3] Semantic Data Augmentation for End-to-End Mandarin Speech Recognition
    Sun, Jianwei
    Tang, Zhiyuan
    Yin, Hengxin
    Wang, Wei
    Zhao, Xi
    Zhao, Shuaijiang
    Lei, Xiaoning
    Zou, Wei
    Li, Xiangang
    INTERSPEECH 2021, 2021, : 1269 - 1273
  • [4] Data Augmentation for End-to-end Silent Speech Recognition for Laryngectomees
    Cao, Beiming
    Teplansky, Kristin
    Sebkhi, Nordine
    Bhaysar, Arpan
    Inan, Omer T.
    Samlan, Robin
    Mau, Ted
    Wang, Jun
    INTERSPEECH 2022, 2022, : 3653 - 3657
  • [5] DATA AUGMENTATION FOR END-TO-END CODE-SWITCHING SPEECH RECOGNITION
    Du, Chenpeng
    Li, Hao
    Lu, Yizhou
    Wang, Lan
    Qian, Yanmin
    2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT), 2021, : 194 - 200
  • [6] SpecSwap: A Simple Data Augmentation Method for End-to-End Speech Recognition
    Song, Xingchen
    Wu, Zhiyong
    Huang, Yiheng
    Su, Dan
    Meng, Helen
    INTERSPEECH 2020, 2020, : 581 - 585
  • [7] Data Augmentation Methods for End-to-end Speech Recognition on Distant-Talk Scenarios
    Tsunoo, Emiru
    Shibata, Kentaro
    Narisetty, Chaitanya
    Kashiwagi, Yosuke
    Watanabe, Shinji
    INTERSPEECH 2021, 2021, : 301 - 305
  • [8] STARGAN FOR EMOTIONAL SPEECH CONVERSION: VALIDATED BY DATA AUGMENTATION OF END-TO-END EMOTION RECOGNITION
    Rizos, Georgios
    Baird, Alice
    Elliott, Max
    Schuller, Bjorn
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 3502 - 3506
  • [9] CONVOLUTIONAL DROPOUT AND WORDPIECE AUGMENTATION FOR END-TO-END SPEECH RECOGNITION
    Xu, Hainan
    Huang, Yinghui
    Zhu, Yun
    Audhkhasi, Kartik
    Ramabhadran, Bhuvana
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 5984 - 5988
  • [10] Sample Fusion Network: An End-to-End Data Augmentation Network for Skeleton-Based Human Action Recognition
    Meng, Fanyang
    Liu, Hong
    Liang, Yongsheng
    Tu, Juanhui
    Liu, Mengyuan
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (11) : 5281 - 5295