Combining Data Augmentations for CNN-Based Voice Command Recognition

被引:7
|
作者
Azarang, Arian [1 ]
Hansen, John [1 ]
Kehtarnavaz, Nasser [1 ]
机构
[1] Univ Texas Dallas, Dept Elect & Comp Engn, Richardson, TX 75080 USA
关键词
Combining data augmentation methods for voice command recognition; CNN-based voice command recognition; voice command human interaction systems; CONVOLUTIONAL NEURAL-NETWORKS;
D O I
10.1109/hsi47298.2019.8942638
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents combining two data augmentation methods involving speed perturbation and room impulse response reverberation for the purpose of improving the generalization capability of convolutional neural networks when used for voice command recognition. Speed perturbation generates voice command variations caused by shorter or longer time durations of commands spoken by different speakers. Room impulse response reverberation generates voice command variations caused by reflected sound paths. The combination of these two augmentation methods is presented in this paper by examining a public domain dataset of voice commands. The experimental results based on the performance metric of word error rate indicate the improvement in voice command recognition rates when combining these data augmentation methods relative to using each augmentation method individually.
引用
收藏
页码:17 / 21
页数:5
相关论文
共 50 条
  • [41] Robust CNN-based Speech Recognition With Gabor Filter Kernels
    Chang, Shuo-Yiin
    Morgan, Nelson
    15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 905 - 909
  • [42] A CNN-based Vocal Recognition Algorithm for IARC Mission 8
    Lyu, Zibo
    Niu, Yinbao
    Lin, Zhaochen
    Liu, Xiyang
    PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 7611 - 7616
  • [43] A Method for Improving CNN-Based Image Recognition Using DCGAN
    Fang, Wei
    Zhang, Feihong
    Sheng, Victor S.
    Ding, Yewen
    CMC-COMPUTERS MATERIALS & CONTINUA, 2018, 57 (01): : 167 - 178
  • [44] CNN-based automatic modulation recognition for index modulation systems
    Leblebici, Merih
    Calhan, Ali
    Cicioglu, Murtaza
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 240
  • [45] CNN-based Transfer Learning in Intelligent Recognition of Scrap Bundles
    Zheng, Xiang
    Zhu, Zheng-hai
    Xiao, Zi-xuan
    Huang, Dong-jian
    Yang, Cheng-cheng
    He, Fei
    Zhou, Xiao-bin
    Zhao, Teng-fei
    ISIJ INTERNATIONAL, 2023, 63 (08) : 1383 - 1393
  • [46] A CNN-Based Animal Behavior Recognition Algorithm for Wearable Devices
    Pan, Zhixin
    Chen, Huihui
    Zhong, Weizhao
    Wang, Aiguo
    Zheng, Chundi
    IEEE SENSORS JOURNAL, 2023, 23 (05) : 5156 - 5164
  • [47] CNN-based off-angle iris segmentation and recognition
    Jalilian, Ehsaneddin
    Karakaya, Mahmut
    Uhl, Andreas
    IET BIOMETRICS, 2021, 10 (05) : 518 - 535
  • [48] Deep CNN-Based Recognition of JS']JSL Finger Spelling
    Nguen, Nam Tu
    Sako, Shinji
    Kwolek, Bogdan
    HYBRID ARTIFICIAL INTELLIGENT SYSTEMS, HAIS 2019, 2019, 11734 : 602 - 613
  • [49] CNN-BASED INITIAL LOCALIZATION IMPROVED BY DATA AUGMENTATION
    Mueller, M. S.
    Metzger, A.
    Jutzi, B.
    ISPRS TC I MID-TERM SYMPOSIUM INNOVATIVE SENSING - FROM SENSORS TO METHODS AND APPLICATIONS, 2018, 4-1 : 117 - 124
  • [50] Communication behavior recognition using CNN-based signal analysis
    Meng, Hao
    Lei, Yingke
    Teng, Fei
    Wang, Jin
    Liu, Changming
    Lou, Caiyi
    PEERJ COMPUTER SCIENCE, 2024, 10