Sequential Deep Neural Networks Ensemble for Speech Bandwidth Extension

被引:7
|
作者
Lee, Bong-Ki [1 ]
Noh, Kyounjin [2 ]
Chang, Joon-Hyuk [3 ]
Choo, Kihyun [4 ]
Oh, Eunmi [4 ]
机构
[1] LG Elect Co Ltd, CTO Div, Seoul 06763, South Korea
[2] Hanyang Univ, Elect Engn, Seoul 04763, South Korea
[3] Hanyang Univ, Sch Elect Engn, Seoul 04763, South Korea
[4] Samsung Elect Co Ltd, Digital Media & Commun Res & Dev Ctr, Seoul 06734, South Korea
来源
IEEE ACCESS | 2018年 / 6卷
关键词
Bandwidth extension; sequential deep neural network; ensemble; log-power spectra; regression; voiced/unvoiced classification; BAND EXTENSION; NARROW-BAND;
D O I
10.1109/ACCESS.2018.2833890
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose a subband-based ensemble of sequential deep neural networks (DNNs) for bandwidth extension (BWE). First, the narrow-band spectra are folded into the high-band (HB) region to generate the high-band spectra, and then the energy levels of the HB spectra are adjusted using the DNN-based on the log-power spectra feature. For this, we basically build the multiple DNNs, which is responsible for each subband of the HB and the DNN ensemble is sequentially connected from lower to higher subbands. This sequential structure for the DNN ensemble carries out the denoising and HB regression to better estimate the HB energy levels. In addition, we use the voiced/unvoiced (V/UV) classification to differently apply the DNN ensemble depending on either V/UV sounds. To demonstrate the performance of the proposed BWE algorithm, we compare it with a speech production model-based BWE system and a DNN-based BWE system in which the log-power spectra in the HB are estimated directly. The experimental results show that the proposed approach provides better speech quality than conventional approaches.
引用
收藏
页码:27039 / 27047
页数:9
相关论文
共 50 条
  • [1] Speech Bandwidth Extension Using Bottleneck Features and Deep Recurrent Neural Networks
    Gu, Yu
    Ling, Zhen-Hua
    Dai, Li-Rong
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 297 - 301
  • [2] Mapping Neural Networks for Bandwidth Extension of Narrowband Speech
    Shahina, A.
    Yegnanarayana, B.
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1435 - 1438
  • [3] Audio bandwidth extension using ensemble of recurrent neural networks
    Xin Liu
    Chang-Chun Bao
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2016
  • [4] Audio bandwidth extension using ensemble of recurrent neural networks
    Liu, Xin
    Bao, Chang-Chun
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2016, : 1 - 12
  • [5] Artificial Speech Bandwidth Extension Using Deep Neural Networks for Wideband Spectral Envelope Estimation
    Abel, Johannes
    Fingscheidt, Tim
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (01) : 71 - 83
  • [6] Speech bandwidth expansion based on Deep Neural Networks
    Wang, Yingxue
    Zhao, Shenghui
    Liu, Wenbo
    Li, Ming
    Kuang, Jingming
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2593 - 2597
  • [7] Deep neural network ensemble for reducing artificial noise in bandwidth extension
    Noh, Kyoungjin
    Chang, Joon-Hyuk
    [J]. DIGITAL SIGNAL PROCESSING, 2020, 102
  • [8] BLIND BANDWIDTH EXTENSION BASED ON CONVOLUTIONAL AND RECURRENT DEEP NEURAL NETWORKS
    Schmidt, Konstantin
    Edler, Bernd
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5444 - 5448
  • [9] On Filter Generalization for Music Bandwidth Extension Using Deep Neural Networks
    Sulun, Serkan
    Davies, Matthew E. P.
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2021, 15 (01) : 132 - 142
  • [10] ARTIFICIAL BANDWIDTH EXTENSION USING DEEP NEURAL NETWORKS FOR SPECTRAL ENVELOPE ESTIMATION
    Abel, Johannes
    Strake, Maximilian
    Fingscheidt, Tim
    [J]. 2016 IEEE INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2016,