Artificial bandwidth extension using deep neural network-based spectral envelope estimation and enhanced excitation estimation

Cited by: 20
Authors
Li, Yaxing [1]
Kang, Sangwon [1]
Affiliations
[1] Hanyang Univ, Dept Elect & Commun Engn, Ansan 426791, South Korea
Keywords
speech synthesis; neural nets; filtering theory; speech coding; artificial bandwidth extension; deep neural network-based spectral envelope estimation; enhanced excitation estimation; narrowband speech signal quality; enhanced spectrum envelope; excitation estimation; whitening filter; adaptive spectral double shifting method; adaptive multirate codec; log spectral distortion; perceptual evaluation
DOI
10.1049/iet-spr.2015.0375
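Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract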
The authors propose a robust artificial bandwidth extension (ABE) technique to improve narrowband (NB) speech signal quality using an enhanced spectrum envelope and excitation estimation. For envelope estimation, they propose an enhanced envelope estimation method using a deep neural network with multiple layers. For excitation estimation, they use a whitened NB excitation signal that is generated by passing the excitation signal through a whitening filter. An adaptive spectral double shifting method is introduced to obtain an enhanced wideband (WB) excitation signal. The proposed ABE system is applied to the decoded output of an adaptive multi-rate (AMR) codec at 12.2 kbps. They evaluate its performance using log spectral distortion, a WB perceptual evaluation of speech quality, and a formal listening test. The objective and subjective evaluations confirm that the proposed ABE system provides better speech quality than AMR at the same bit rate.
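The abstract names log spectral distortion (LSD) as one of the objective measures but does not define it. As a rough illustration only, the sketch below uses one common formulation: the root-mean-square difference, in dB, between the log power spectra of a reference and an estimated signal, computed per frame and averaged over frames. The function name, frame length, hop size, Hann window, and `eps` floor are all assumptions made for this example; the paper's exact definition (for instance, whether the measure is restricted to the extended high band) is not stated in this record.

```python
import numpy as np


def log_spectral_distortion(ref, est, frame_len=512, hop=256, eps=1e-10):
    """Mean per-frame RMS difference (dB) between log power spectra.

    A common textbook formulation, not necessarily the one used in the paper.
    frame_len, hop and eps are illustrative defaults (assumptions).
    """
    ref = np.asarray(ref, dtype=float)
    est = np.asarray(est, dtype=float)
    n = min(len(ref), len(est))
    if n < frame_len:
        raise ValueError("signals must contain at least one full frame")

    window = np.hanning(frame_len)
    per_frame = []
    for start in range(0, n - frame_len + 1, hop):
        # Power spectra of the windowed frames (one-sided FFT).
        p_ref = np.abs(np.fft.rfft(window * ref[start:start + frame_len])) ** 2
        p_est = np.abs(np.fft.rfft(window * est[start:start + frame_len])) ** 2
        # Difference of log power spectra in dB; eps avoids log(0).
        diff_db = 10.0 * np.log10(p_ref + eps) - 10.0 * np.log10(p_est + eps)
        # RMS over frequency bins gives the distortion for this frame.
        per_frame.append(np.sqrt(np.mean(diff_db ** 2)))
    return float(np.mean(per_frame))
```

Under this definition, identical signals give a distortion of 0 dB, and a lower value means the estimated spectrum is closer to the reference.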
Pages: 422-427
Number of pages: 6