Two-stage algorithm of spectral analysis for the automatic speech recognition systems

Authors
V. V. Savchenko [1]
L. V. Savchenko [1]
Affiliation
[1] National Research University, “Higher School of Economics,”
Keywords
Speech signal; Spectral analysis; Vocal tract; Autoregressive model; All-pole model; Artificial neural network; Data augmentation
DOI
10.1007/s11018-024-02376-0
Abstract
The spectral analysis of speech signals in automatic speech recognition systems is considered as part of a rapidly developing line of research in acoustic measurements. Under unfavorable conditions of speech production (noise and insufficient intelligibility of speech sounds), the efficiency of such systems is low compared with human perception of spoken language. To improve it, we propose a two-stage algorithm for the spectral analysis of speech signals. The first stage is a parametric spectral analysis based on an autoregressive model of the vocal tract of a conventional speaker. The second stage transforms (modifies) the resulting spectral estimate according to the principle of frequency-selective amplification of the amplitudes of the main formants of the intraperiod power spectrum. A software implementation of the proposed algorithm, built on the fast Fourier transform, is described. Using software developed by the authors, we performed full-scale experiments on an additive mixture of vowel sounds from the speech of a control speaker with white Gaussian noise. The experimental results show that the amplitudes of the main formants of the speech signals are amplified by 10–20 dB and, hence, that the intelligibility of speech sounds improves substantially. The developed algorithm can be used in automatic speech recognition systems that process speech signals in the frequency domain, including systems based on artificial neural networks.
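The two stages described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify the second-stage transformation, so the power-law sharpening in `emphasize_formants` (with its `gamma` parameter) is a stand-in assumption, and the Yule-Walker estimator, the model order, the function names, and the synthetic single-resonance test signal are likewise illustrative choices.

```python
import numpy as np


def ar_coefficients(frame, order):
    """Stage 1a: Yule-Walker (autocorrelation method) AR estimate."""
    x = frame - np.mean(frame)
    n = len(x)
    # Biased autocorrelation estimates r[0..order]
    r = np.array([np.dot(x[:n - k], x[k:]) / n for k in range(order + 1)])
    # Toeplitz normal equations R a = r[1:], with R[i, j] = r[|i - j|]
    R = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    a = np.linalg.solve(R, r[1:])
    noise_var = r[0] - np.dot(a, r[1:])  # prediction-error variance
    return a, noise_var


def ar_power_spectrum(a, noise_var, n_fft):
    """Stage 1b: all-pole power spectrum evaluated on an FFT grid."""
    # Denominator polynomial A(z) = 1 - sum_k a_k z^{-k}, zero-padded to n_fft
    poly = np.concatenate(([1.0], -a))
    return noise_var / np.abs(np.fft.rfft(poly, n_fft)) ** 2


def emphasize_formants(psd, gamma=2.0):
    """Stage 2 (ASSUMED form): power-law sharpening around the mean level.

    Bins above the mean power are amplified and bins below it are attenuated.
    This is a hypothetical stand-in, not the transformation from the paper.
    """
    ref = np.mean(psd)
    return ref * (psd / ref) ** gamma


def synthetic_vowel(n_samples, fs, formant_hz, radius=0.95, seed=0):
    """Toy test signal: white Gaussian noise through a two-pole resonator."""
    rng = np.random.default_rng(seed)
    theta = 2.0 * np.pi * formant_hz / fs
    a1, a2 = 2.0 * radius * np.cos(theta), -radius ** 2
    e = rng.standard_normal(n_samples)
    x = np.zeros(n_samples)
    for n in range(n_samples):
        x[n] = e[n]
        if n >= 1:
            x[n] += a1 * x[n - 1]
        if n >= 2:
            x[n] += a2 * x[n - 2]
    return x


fs, n_fft = 8000, 1024                      # assumed sampling rate and FFT size
x = synthetic_vowel(4096, fs, formant_hz=700.0)
a, noise_var = ar_coefficients(x, order=4)  # assumed model order
psd = ar_power_spectrum(a, noise_var, n_fft)
enhanced = emphasize_formants(psd)

peak_hz = np.argmax(psd) * fs / n_fft
gain_db = 10.0 * np.log10(enhanced.max() / psd.max())
print(f"peak near {peak_hz:.0f} Hz, peak gain {gain_db:.1f} dB")
```

With this stand-in, a spectral peak lying d dB above the mean level is raised by (gamma - 1) · d dB, which is one simple way to obtain a frequency-selective gain; the gains reported in the abstract refer to the authors' own transformation and experiments, not to this sketch.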
Pages: 553-563
Number of pages: 10