Mel-Frequency Cepstral Coefficient-Based Bandwidth Extension of Narrowband Speech

被引:0
|
作者
Nour-Eldin, Amr H. [1 ]
Kabal, Peter [1 ]
机构
[1] McGill Univ, Dept Elect & Comp Engn, Montreal, PQ, Canada
关键词
Bandwidth extension; high-resolution IDCT; highband certainty; mutual information; source-filter model;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a novel MFCC-based scheme for the Bandwidth Extension (BWE) of narrowband speech. BWE is based on the assumption that narrowband speech (0.3-3.4 kHz) correlates closely with the highband signal (3.4-7 kHz), enabling estimation of the highband frequency content given the narrow band. While BWE schemes have traditionally used LP-based parametrizations, our recent work has shown that MFCC parametrization results in higher correlation between both bands reaching twice that using LSFs. By employing high-resolution IDCT of highband MFCCs obtained from narrowband MFCCs by statistical estimation, we achieve high-quality highband power spectra from which the time-domain speech signal can be reconstructed. Implementing this scheme for BWE translates the higher correlation advantage of MFCCs into BWE performance superior to that obtained using LSFs, as shown by improvements in log-spectral distortion as well as Itakura-based measures (the latter improving by up to 13%).
引用
收藏
页码:53 / 56
页数:4
相关论文
共 50 条
  • [1] Mel-Frequency Cepstral Coefficient Analysis in Speech Recognition
    On, Chin Kim
    Pandiyan, Paulraj M.
    Yaacob, Sazali
    Saudi, Azali
    2006 INTERNATIONAL CONFERENCE ON COMPUTING & INFORMATICS (ICOCI 2006), 2006, : 291 - +
  • [2] Modified Mel-Frequency cepstral coefficient
    Saha, G
    Yadhunandan, US
    Proceedings of the Sixth IASTED International Conference on Signal and Image Processing, 2004, : 215 - 219
  • [3] Research on Violin Audio Feature Recognition Based on Mel-Frequency Cepstral Coefficient-Based Feature Parameter Extraction
    Zeng, Ming
    Zeng, Huahong
    Informatica (Slovenia), 2024, 48 (19): : 1 - 6
  • [4] On the Inversion of Mel-Frequency Cepstral Coefficients for Speech Enhancement Applications
    Boucheron, Laura E.
    De Leon, Phillip L.
    ICSES 2008 INTERNATIONAL CONFERENCE ON SIGNALS AND ELECTRONIC SYSTEMS, CONFERENCE PROCEEDINGS, 2008, : 485 - 488
  • [5] Bandwidth extension of narrowband speech using cepstral analysis
    Soon, IY
    Yeo, CK
    PROCEEDINGS OF THE 2004 INTERNATIONAL SYMPOSIUM ON INTELLIGENT MULTIMEDIA, VIDEO AND SPEECH PROCESSING, 2004, : 242 - 245
  • [6] UNDERSTANDING SARCASM IN SPEECH USING MEL-FREQUENCY CEPSTRAL COEFFICENT
    Mathur, Abhinav
    Saxena, Vikas
    Singh, Sandeep K.
    PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE AND ENGINEERING (CONFLUENCE 2017), 2017, : 728 - 732
  • [7] Encrypted Domain Mel-Frequency Cepstral Coefficient and Fragile Audio Watermarking
    Chen, Jian
    Chen, Ziyang
    Zheng, Peijia
    Guo, Jianting
    Zhang, Wei
    Huang, Jiwu
    2018 17TH IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (IEEE TRUSTCOM) / 12TH IEEE INTERNATIONAL CONFERENCE ON BIG DATA SCIENCE AND ENGINEERING (IEEE BIGDATASE), 2018, : 68 - 73
  • [8] A mel-frequency cepstral coefficient-based approach for surface roughness diagnosis in hard turning using acoustic signals and gaussian mixture models
    Frigieri, Edielson P.
    Campos, Paulo H. S.
    Paiva, Anderson P.
    Balestrassi, Pedro P.
    Ferreira, Joao Roberto
    Ynoguti, Carlos A.
    APPLIED ACOUSTICS, 2016, 113 : 230 - 237
  • [9] Low-variance Multitaper Mel-frequency Cepstral Coefficient Features for Speech and Speaker Recognition Systems
    Md. Jahangir Alam
    Patrick Kenny
    Douglas O’Shaughnessy
    Cognitive Computation, 2013, 5 : 533 - 544
  • [10] Low-variance Multitaper Mel-frequency Cepstral Coefficient Features for Speech and Speaker Recognition Systems
    Alam, Md. Jahangir
    Kenny, Patrick
    O'Shaughnessy, Douglas
    COGNITIVE COMPUTATION, 2013, 5 (04) : 533 - 544