Speech-Signal-Based Frequency Warping

Cited by: 13
Authors
Paliwal, Kuldip [1]
Shannon, Benjamin [1]
Lyons, James [1]
Wojcicki, Kamil [1]
Affiliations
[1] Griffith Univ, Signal Proc Lab, Nathan, Qld 4111, Australia
Keywords
Bark scale; mel scale; robust automatic speech recognition (ASR); speech-signal-based frequency cepstral coefficient (SFCC); speech-signal-based frequency warping
DOI
10.1109/LSP.2009.2014096
Chinese Library Classification
TM [Electrical engineering]; TN [Electronic and communication technology]
Discipline classification codes
0808; 0809
Abstract
The speech signal is used for transmission of linguistic information. High energy portions of the speech spectrum have higher signal-to-noise ratios than the low energy portions. As a result, these regions are more robust to noise. Since the speech signal is known to be very robust to noise, it is expected that the high energy regions of the speech spectrum carry the majority of the linguistic information. This letter tries to derive a frequency warping function directly from the speech signal by sampling the frequency axis nonuniformly with the high energy regions sampled more densely than the low energy regions. To achieve this, an ensemble average short-time power spectrum is computed from a large speech corpus. The speech-signal-based frequency warping is obtained by considering equal area portions of the log spectrum. The proposed frequency warping is shown to be similar to the frequency scales obtained through psycho-acoustic experiments, namely the mel and bark scales. The warping is then used in filterbank design for automatic speech recognition experiments. The results of these experiments show that cepstral features based on the proposed warping achieve performance under clean conditions comparable to that of mel-frequency cepstral coefficients, while outperforming them under noisy conditions.
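The equal-area idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `equal_area_band_edges`, the synthetic log spectrum standing in for a corpus average, and the choice to shift the log spectrum so that it is non-negative before integrating are all assumptions made for illustration.

```python
import numpy as np

def equal_area_band_edges(freqs, log_spectrum, n_bands):
    """Return n_bands + 1 frequencies that split the area under the
    (shifted) log spectrum into equal portions."""
    freqs = np.asarray(freqs, dtype=float)
    # Assumption: shift the log spectrum so its minimum is zero, which
    # keeps the area non-negative. The abstract does not specify this step.
    weights = np.asarray(log_spectrum, dtype=float) - np.min(log_spectrum)
    # Cumulative trapezoidal area, starting at 0 for the first frequency.
    segment_areas = 0.5 * (weights[1:] + weights[:-1]) * np.diff(freqs)
    cumulative = np.concatenate(([0.0], np.cumsum(segment_areas)))
    # Normalised cumulative area acts as the warping function on [0, 1].
    warped = cumulative / cumulative[-1]
    # Equal steps on the warped axis, mapped back to Hz by interpolation.
    targets = np.linspace(0.0, 1.0, n_bands + 1)
    return np.interp(targets, warped, freqs)

# Synthetic stand-in for an ensemble-average log power spectrum (in dB):
# energy falls off with frequency, loosely like long-term speech spectra.
freqs = np.linspace(0.0, 4000.0, 257)
log_spectrum = 10.0 * np.log10(1.0 / (1.0 + (freqs / 500.0) ** 2))

edges = equal_area_band_edges(freqs, log_spectrum, n_bands=8)
print(np.round(edges, 1))
```

Because the toy spectrum has more energy at low frequencies, the resulting bands are narrow at low frequencies and widen toward high frequencies, which is the qualitative behaviour shared by the mel and Bark scales. Edges of this kind could then serve as filterbank boundaries.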
Pages: 319-322
Number of pages: 4
Related papers
50 in total
  • [41] Segmented dynamic time warping based signal pattern classification
    Hong, Jae Yeol
    Park, Seung Hwan
    Baek, Jun-Geol
    2019 22ND IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (IEEE CSE 2019) AND 17TH IEEE INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (IEEE EUC 2019), 2019, : 260 - 262
  • [42] EMBEDDING TIME WARPING IN EXEMPLAR-BASED SPARSE REPRESENTATIONS OF SPEECH
    Yilmaz, Emre
    Gemmeke, Jort F.
    Van Hamme, Hugo
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 8076 - 8080
  • [43] Pre-processing and segmentation of speech signal in frequency domain for speech recognition
    Kolokolov, A.S.
    Avtomatika i Telemekhanika, 2003, (06) : 152 - 162
  • [44] Speech Dynamic Time Warping Based on Ant Colony Optimization Algorithm
    Wei, Xing
    Yang, Xiaojin
    2013 3RD INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS, COMMUNICATIONS AND NETWORKS (CECNET), 2013, : 602 - 604
  • [45] Dynamic Time Warping Based Speech Recognition for Isolated Sinhala Words
    Priyadarshani, P. G. N.
    Dias, N. G. J.
    Punchihewa, Amal
    2012 IEEE 55TH INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS (MWSCAS), 2012, : 892 - 895
  • [46] New speech/music discrimination approach based on warping transformation and ANFIS
    Munoz-Exposito, J. E.
    Ruiz-Reyes, N.
    Garcia-Galan, S.
    Vera-Candeas, P.
    JOURNAL OF NEW MUSIC RESEARCH, 2006, 35 (03) : 237 - 247
  • [47] The Effects of the Acute Hypoxia to the Fundamental Frequency of the Speech Signal
    Milivojevic, Zoran N.
    Milivojevic, Marina
    Brodic, Darko
    ADVANCES IN ELECTRICAL AND COMPUTER ENGINEERING, 2012, 12 (02) : 57 - 60
  • [48] DYNAMIC FREQUENCY WARPING, THE DUAL OF DYNAMIC TIME WARPING
    NEUBURG, EP
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1987, 81 : S94 - S94
  • [49] Template-Warping Based Speech Driven Head Motion Synthesis
    Braude, David Adam
    Shimodaira, Hiroshi
    Ben Youssef, Atef
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2762 - 2766
  • [50] Voice Conversion based on Continuous Frequency Warping and Magnitude Scaling
    Ye, Yuhang
    Lawlor, Bob
    2017 28TH IRISH SIGNALS AND SYSTEMS CONFERENCE (ISSC), 2017,