Two-stage algorithm of spectral analysis for the automatic speech recognition systems

被引:0
|
作者
V. V. Savchenko [1 ]
L. V. Savchenko [1 ]
机构
[1] National Research University,“Higher School of Economics,”
关键词
Speech signal; Spectral analysis; Vocal tract; Autoregressive model; All-pole model; Artificial neural network; Data augmentation;
D O I
10.1007/s11018-024-02376-0
中图分类号
学科分类号
摘要
The problem of the spectral analysis of speech signals in automatic speech recognition systems is considered within the framework of a dynamically developed direction of investigations in the field of acoustic measurements. We indicate that efficiency of the analyzed systems under unfavorable conditions of speech production (noise and insufficient intelligibility of speech sounds) is low as compared with human perception of oral speech. To improve the efficiency of automatic speech recognition systems, we propose to use a two-stage algorithm of spectral analysis of the speech signals. The first stage of processing of speech signals is their parametric spectral analysis performed by using an autoregressive model of the vocal tract of a conventional speaker. The second stage of processing is the transformation (modification) of the obtained spectral estimate according to the principle of frequency-selective amplification of the amplitude of main formants of the intraperiod power spectrum. The software implementation of the proposed algorithm is described on the basis of the computational procedure of fast Fourier transform. By using the software developed by the authors, we performed full-scale experiments and studied an additive mixture of vowel sounds in the speech of a control speaker with white Gaussian noise. The obtained experimental results enable us to conclude that the amplitudes of the main formants of speech signals are amplified by 10–20 dB and, hence, the intelligibility of speech sounds substantially improves. The developed algorithm can be used in the automatic speech recognition systems based on processing of the speech signals in the frequency domain, including the use of artificial neural networks.
引用
收藏
页码:553 / 563
页数:10
相关论文
共 50 条
  • [31] Automatic Speech Recognition Post-Processing for Readability: Task, Dataset and a Two-Stage Pre-Trained Approach
    Liao, Junwei
    Shi, Yu
    Xu, Yong
    IEEE ACCESS, 2022, 10 : 117053 - 117066
  • [32] Gait recognition by two-stage principal component analysis
    Das, Sandhitsu R.
    Wilson, Robert C.
    Lazarewicz, Maciej T.
    Finkel, Leif H.
    PROCEEDINGS OF THE SEVENTH INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION - PROCEEDINGS OF THE SEVENTH INTERNATIONAL CONFERENCE, 2006, : 579 - +
  • [33] Two-stage model-based feature compensation for robust speech recognition
    Shen, Haifeng
    Liu, Gang
    Guo, Jun
    COMPUTING, 2012, 94 (01) : 1 - 20
  • [34] Two-stage model-based feature compensation for robust speech recognition
    Haifeng Shen
    Gang Liu
    Jun Guo
    Computing, 2012, 94 : 1 - 20
  • [35] Chinese Traffic Sign Recognition Based on Two-stage Classification Algorithm
    Feng R.
    Jiang K.
    Yu W.
    Yang D.
    Qiche Gongcheng/Automotive Engineering, 2022, 44 (03): : 434 - 441
  • [36] Two-Stage Recognition Algorithm for Untrimmed Converter Steelmaking Flame Video
    Chen, Yi
    Liu, Jiyuan
    Xiong, Huilin
    PATTERN RECOGNITION AND COMPUTER VISION, PT I, 2021, 13019 : 268 - 279
  • [37] A Two-Stage Fall Recognition Algorithm Based on Human Posture Features
    Han, Kun
    Yang, Qiongqian
    Huang, Zefan
    SENSORS, 2020, 20 (23) : 1 - 21
  • [38] A two-stage cooperation modulation recognition algorithm based on the correlation clustering
    Zhu, Q. (zhuqi@njupt.edu.cn), 1600, Binary Information Press, P.O. Box 162, Bethel, CT 06801-0162, United States (09):
  • [39] A Comparison of Single-Stage and Two-Stage Modelling Approaches for Automatic Forensic Speaker Recognition
    Thiruvaran, Tharmarajah
    Ambikairajah, Eliathamby
    Epps, Julien
    Enzinger, Ewald
    2013 8TH IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL AND INFORMATION SYSTEMS (ICIIS), 2013, : 433 - 438
  • [40] A Two-Stage Automatic Container Code Recognition Method Considering Environmental Interference
    Yu, Meng
    Zhu, Shanglei
    Lu, Bao
    Chen, Qiang
    Wang, Tengfei
    APPLIED SCIENCES-BASEL, 2024, 14 (11):