Mel-Frequency Cepstral Coefficient-Based Bandwidth Extension of Narrowband Speech

被引:0
|
作者
Nour-Eldin, Amr H. [1 ]
Kabal, Peter [1 ]
机构
[1] McGill Univ, Dept Elect & Comp Engn, Montreal, PQ, Canada
关键词
Bandwidth extension; high-resolution IDCT; highband certainty; mutual information; source-filter model;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a novel MFCC-based scheme for the Bandwidth Extension (BWE) of narrowband speech. BWE is based on the assumption that narrowband speech (0.3-3.4 kHz) correlates closely with the highband signal (3.4-7 kHz), enabling estimation of the highband frequency content given the narrow band. While BWE schemes have traditionally used LP-based parametrizations, our recent work has shown that MFCC parametrization results in higher correlation between both bands reaching twice that using LSFs. By employing high-resolution IDCT of highband MFCCs obtained from narrowband MFCCs by statistical estimation, we achieve high-quality highband power spectra from which the time-domain speech signal can be reconstructed. Implementing this scheme for BWE translates the higher correlation advantage of MFCCs into BWE performance superior to that obtained using LSFs, as shown by improvements in log-spectral distortion as well as Itakura-based measures (the latter improving by up to 13%).
引用
收藏
页码:53 / 56
页数:4
相关论文
共 50 条
  • [31] Mel-Frequency Cepstral Coefficient Features Based on Standard Deviation and Principal Component Analysis for Language Identification Systems
    Musatafa Abbas Abbood Albadr
    Sabrina Tiun
    Masri Ayob
    Manal Mohammed
    Fahad Taha AL-Dhief
    Cognitive Computation, 2021, 13 : 1136 - 1153
  • [32] Methodology for identifying the damage state of sandstone using Mel-frequency cepstral coefficient of acoustic emission
    He X.
    Yang F.
    Li Z.
    Li N.
    Song D.
    Wang H.
    Sobolev A.
    Rasskazov I.
    Meitan Xuebao/Journal of the China Coal Society, 2024, 49 (02): : 753 - 766
  • [33] Mel-Frequency Cepstral Coefficient (MFCC) for Music Feature Extraction for the Dancing Robot Movement Decision
    Sulistijono, Indra Adji
    Urrosyda, Renita Chulafa
    Darojah, Zaqiatud
    INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2016, PT II, 2016, 9835 : 283 - 294
  • [34] Applying Mel-frequency cepstral coefficient of acoustic emission for analyzing fracture and failure of sandstone specimens
    Li Z.
    Li N.
    Yang F.
    Song D.
    He X.
    Xue Y.
    Wang H.
    Yin S.
    Meitan Xuebao/Journal of the China Coal Society, 2023, 48 (02): : 714 - 729
  • [35] Low Bit-Rate Speech Coding Through Quantization of Mel-Frequency Cepstral Coefficients
    Boucheron, Laura E.
    De Leon, Phillip L.
    Sandoval, Steven
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (02): : 610 - 619
  • [36] Variants of Mel-frequency Cepstral Coefficients for Improved Whispered Speech Speaker Verification in Mismatched Conditions
    Sarria-Paja, Milton
    Falk, Tiago H.
    2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 91 - 95
  • [37] How many Mel-frequency cepstral coefficients to be utilized in speech recognition? A study with the Bengali language
    Hasan, Md. Rakibul
    Hasan, Md. Mahbub
    Hossain, Md Zakir
    JOURNAL OF ENGINEERING-JOE, 2021, 2021 (12): : 817 - 827
  • [38] Hidden Markov Model Neurons Classification based on Mel-frequency Cepstral Coefficients
    Haggag, Sherif
    Mohamed, Shady
    Haggag, Hussein
    Nahavandi, Saeid
    PROCEEDINGS OF THE 2014 9TH INTERNATIONAL CONFERENCE ON SYSTEM OF SYSTEMS ENGINEERING (SOSE 2014), 2014, : 166 - 170
  • [39] One Solution of Extension of Mel-Frequency Cepstral Coefficients Feature Vector for Automatic Speaker Recognition
    Jokic, Ivan D.
    Jokic, Stevan D.
    Delic, Vlado D.
    Peric, Zoran H.
    INFORMATION TECHNOLOGY AND CONTROL, 2020, 49 (02): : 224 - 236
  • [40] Clean speech reconstruction from noisy MEL-frequency cepstral coefficients using a sinusoidal model
    Shao, X
    Milner, B
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 704 - 707