Design of activation function in speech enhanced deep neural network

被引:0
|
作者
Lu Xun [1 ]
Li Wei-Yong [2 ]
Yuan Min-Min [3 ]
Zuo Yi [2 ,4 ]
Hu Wenlin [5 ]
Wang Jie [6 ,7 ]
Yan Zhi-Hao [2 ]
机构
[1] Guangdong Power Grid Co, Power Grid Planning Ctr, Guangzhou, Guangdong, Peoples R China
[2] Guangzhou Univ, Sch Elect & Commun Engn, Guangzhou, Guangdong, Peoples R China
[3] Res Inst Highway Minist Transport, Natl Environm Protect Engn & Technol Ctr Rd Traff, Beijing, Peoples R China
[4] Natl Engn Lab Digital Construct & Evaluat Urban R, Tianjin, Peoples R China
[5] China Railway Design Corp, Natl Engn Lab Digital Construct & Evaluat Urban R, Tianjin, Peoples R China
[6] Guangzhou Univ, Res Ctr Urban Sustainable Dev, Sch Elect & Commun Engn, Guangzhou, Guangdong, Peoples R China
[7] Guangzhou Univ, Res Ctr Urban Sustainable Dev, Linkoping Univ, Guangzhou, Guangdong, Peoples R China
关键词
speech enhancement; deep neural networks; activation function; local linear controllability; NOISE;
D O I
10.1109/CMMNO53328.2021.9467614
中图分类号
TH [机械、仪表工业];
学科分类号
0802 ;
摘要
The activation functions have an impact on the performance of the neural networks. In the deep neural networks training procedure, the derivative of the standard Relu activation function is zero at the negative semi-axis so that it leads to the inactivation of some neurons, and it is slow for the Tanh activation function in networks training. In order to improve the performance of speech enhancement, we propose a locally linearly controllable PTanh activation function, which is a combination of Tanh, Relu and Taylor's series. Compared to Relu, the PTanh makes the derivative no longer constant to zero when the output value of the neuron is in the negative semi-axis, and the speed of learning is improved greatly. The experimental results show that the PTanh is more adaptable. And the convergence effect of the network and the performance of speech enhancement are improved better.
引用
收藏
页码:213 / 218
页数:6
相关论文
共 50 条
  • [1] A Novel Activation Function of Deep Neural Network
    Xiangyang, Lin
    Xing, Qinghua
    Han, Zhang
    Feng, Chen
    [J]. Scientific Programming, 2023, 2023
  • [2] ErfReLU: adaptive activation function for deep neural network
    Rajanand, Ashish
    Singh, Pradeep
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2024, 27 (02)
  • [3] Review of Adaptive Activation Function in Deep Neural Network
    Lau, Mian Mian
    Lim, King Hann
    [J]. 2018 IEEE-EMBS CONFERENCE ON BIOMEDICAL ENGINEERING AND SCIENCES (IECBES), 2018, : 686 - 690
  • [4] Design Space Exploration of Neural Network Activation Function Circuits
    Yang, Tao
    Wei, Yadong
    Tu, Zhijun
    Zeng, Haolun
    Kinsy, Michel A.
    Zheng, Nanning
    Ren, Pengju
    [J]. IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2019, 38 (10) : 1974 - 1978
  • [5] Weighted sigmoid gate unit for an activation function of deep neural network
    Tanaka, Masayuki
    [J]. PATTERN RECOGNITION LETTERS, 2020, 135 : 354 - 359
  • [6] Robust Speech Recognition with Speech Enhanced Deep Neural Networks
    Du, Jun
    Wang, Qing
    Gao, Tian
    Xu, Yong
    Dai, Lirong
    Lee, Chin-Hui
    [J]. 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 616 - 620
  • [7] Stimulated Deep Neural Network for Speech Recognition
    Wu, Chunyang
    Karanasou, Penny
    Gales, Mark J. F.
    Sim, Khe Chai
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 400 - 404
  • [8] Development of a Psychoacoustic Loss Function for the Deep Neural Network (DNN)-Based Speech Coder
    Byun, Joon
    Shin, Seungmin
    Park, Youngcheol
    Sung, Jongmo
    Bead, Seungkwon
    [J]. INTERSPEECH 2021, 2021, : 1694 - 1698
  • [9] Deep learning neural network for lung cancer classification: enhanced optimization function
    Bhoj Raj Pandit
    Abeer Alsadoon
    P. W. C. Prasad
    Sarmad Al Aloussi
    Tarik A. Rashid
    Omar Hisham Alsadoon
    Oday D. Jerew
    [J]. Multimedia Tools and Applications, 2023, 82 : 6605 - 6624
  • [10] Deep learning neural network for lung cancer classification: enhanced optimization function
    Pandit, Bhoj Raj
    Alsadoon, Abeer
    Prasad, P. W. C.
    Al Aloussi, Sarmad
    Rashid, Tarik A.
    Alsadoon, Omar Hisham
    Jerew, Oday D.
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (05) : 6605 - 6624