RESEARCH ON ENGLISH SPEECH ENHANCEMENT ALGORITHM BASED ON IMPROVED SPECTRAL SUBTRACTION AND DEEP NEURAL NETWORK

被引:1
|
作者
Zhou, Qiaoling [1 ]
机构
[1] Fujian Agr & Forestry Univ, Int Coll, 15 Shangxiadian Rd, Fuzhou 350002, Peoples R China
关键词
Improved spectrum subtraction; Deep neural network; Speech enhancement; Amplitude spectrum; English communication; NOISE;
D O I
10.24507/ijicic.16.05.1711
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In order to solve the introduced unstructured voiceless problems of conventional spectrum subtraction in English speech signals enhancement, this paper proposes a novel English speech signals enhancement algorithm. This algorithm uses an improved minimal controlled recursive averaging (IMCRA) method to estimate noise spectrum, and tracks the estimated noise spectrum in real time. Then, the deep neural network (DNN) is used to construct the nonlinear mapping function of log amplitude spectrum between speech with noises and ideal pure speech for English speech enhancement. To validate the feasibility and effectiveness of the proposed algorithm, the standard IEEE speech signals and Noise-91 noise signals are used for experiments. Experimental results have shown that the proposed IMCRA method has stronger ability to avoid noises in speech signals, and the DNN method can well recover the speech components and spectrum structure polluted by noises. To enhance English speech in daily international speech communication, the proposed combination method has strong robustness to various real noise environments, and brings significant improvement to interpersonal communication and human computer communication.
引用
收藏
页码:1711 / 1723
页数:13
相关论文
共 50 条
  • [21] Improved multi-band spectral subtraction method for speech enhancement
    Ghanbari, Y
    Karami-Mollaei, MR
    Amelifard, B
    PROCEEDINGS OF THE SIXTH IASTED INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING, 2004, : 225 - 230
  • [22] Speech dereverberation method based on spectral subtraction and spectral line enhancement
    Chen, Zhe
    Wang, Rui
    Yin, Fuliang
    Wang, Bingqian
    Peng, Wenwen
    APPLIED ACOUSTICS, 2016, 112 : 201 - 210
  • [23] CMAC spectral subtraction for speech enhancement
    Wahab, A
    Tan, EC
    Abut, H
    ISSPA 2001: SIXTH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOLS 1 AND 2, PROCEEDINGS, 2001, : 707 - 710
  • [24] Heart Sound Enhancement based on Improved Spectral Subtraction
    Zhang, Lu
    Wang, Lei
    2015 11TH INTERNATIONAL CONFERENCE ON SIGNAL-IMAGE TECHNOLOGY & INTERNET-BASED SYSTEMS (SITIS), 2015, : 546 - 553
  • [25] Speech Enhancement Using Modified Modulation Magnitude Estimation-Based Spectral Subtraction Algorithm
    Kalamani, M.
    Valarmathy, S.
    Krishnamoorthi, M.
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2014, 39 (12) : 8965 - 8978
  • [26] Speech Enhancement Using Modified Modulation Magnitude Estimation-Based Spectral Subtraction Algorithm
    M. Kalamani
    S. Valarmathy
    M. Krishnamoorthi
    Arabian Journal for Science and Engineering, 2014, 39 : 8965 - 8978
  • [27] A GSC algorithm based on null spectral subtraction for dual small microphone array speech enhancement
    Yang, Li-Chun
    Qian, Yun-Tao
    Wang, Wen-Hong
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2013, 47 (08): : 1493 - 1499
  • [28] A hybrid speech enhancement system based on HMM and spectral subtraction
    Ghoreishi, MH
    Sheikhzadeh, H
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1855 - 1858
  • [29] A real time spectral subtraction based speech enhancement scheme
    Flogeras, D
    Doraiswami, R
    Kaye, ME
    CCECE 2003: CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1-3, PROCEEDINGS: TOWARD A CARING AND HUMANE TECHNOLOGY, 2003, : 1071 - 1074
  • [30] EXEMPLAR-BASED SPEECH ENHANCEMENT FOR DEEP NEURAL NETWORK BASED AUTOMATIC SPEECH RECOGNITION
    Baby, Deepak
    Gemmeke, Jort F.
    Virtanen, Tuomas
    Van hamme, Hugo
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4485 - 4489