PHASE RECONSTRUCTION FROM AMPLITUDE SPECTROGRAMS BASED ON VON-MISES-DISTRIBUTION DEEP NEURAL NETWORK

被引:0
|
作者
Takamichi, Shinnosuke [1 ]
Saito, Yuki [1 ]
Takamune, Norihiro [1 ]
Kitamura, Daichi [2 ]
Saruwatari, Hiroshi [1 ]
机构
[1] Univ Tokyo, Grad Sch Informat Sci & Technol, Tokyo, Japan
[2] Kagawa Coll, Natl Inst Technol, Dept Elect & Comp Engn, Takamatsu, Kagawa, Japan
关键词
speech analysis; phase reconstruction; deep neural network; von Mises distribution; group delay;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a deep neural network (DNN)-based phase reconstruction from amplitude spectrograms. In audio signal and speech processing, the amplitude spectrogram is often used for processing, and the corresponding phase spectrogram is reconstructed from the amplitude spectrogram on the basis of the Griffin-Lim method. However, the Griffin-Lim method causes unnatural artifacts in synthetic speech. Addressing this problem, we introduce the von-Mises-distribution DNN for phase reconstruction. The DNN is a generative model having the von Mises distribution that can model distributions of a periodic variable such as a phase, and the model parameters of the DNN are estimated on the basis of the maximum likelihood criterion. Furthermore, we propose a group-delay loss for DNN training to make the predicted group delay close to a natural group delay. The experimental results demonstrate that 1) the trained DNN can predict group delay accurately more than phases themselves, and 2) our phase reconstruction methods achieve better speech quality than the conventional Griffin-Lim method.
引用
收藏
页码:286 / 290
页数:5
相关论文
共 50 条
  • [11] PHASE RECONSTRUCTION BASED ON RECURRENT PHASE UNWRAPPING WITH DEEP NEURAL NETWORKS
    Masuyama, Yoshiki
    Yatabe, Kohei
    Koizumi, Yuma
    Oikawa, Yasuhiro
    Harada, Noboru
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 826 - 830
  • [12] Deep Neural Network Based Shape Reconstruction for Application in Robotics
    Li, Mikhail
    Mutahira, Husna
    Ahmad, Bilal
    Muhammad, Mannan Saeed
    2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION IN INDUSTRY (ICRAI), 2019,
  • [13] Discrete Spatial Data Reconstruction based on Deep Neural Network
    Du, Yi
    Zhang, Ting
    Wang, Jiacun
    PROCEEDINGS OF THE 2019 IEEE 16TH INTERNATIONAL CONFERENCE ON NETWORKING, SENSING AND CONTROL (ICNSC 2019), 2019, : 403 - 408
  • [14] Neural-network-enabled holographic image reconstruction via amplitude and phase extraction
    Rymov, D. A.
    Starikov, R. S.
    Cheremkhin, P. A.
    JOURNAL OF OPTICAL TECHNOLOGY, 2022, 89 (09) : 511 - 516
  • [15] Distribution Network Connectivity Recognition Based on Ensemble Deep Neural Network
    Jiang W.
    Tang H.
    Qi H.
    Chen H.
    Chen J.
    Jiao H.
    Dianli Xitong Zidonghua/Automation of Electric Power Systems, 2020, 44 (01): : 101 - 108
  • [16] Deep Neural Network-Based Robust Spectrum Sensing: Exploiting Phase Difference Distribution
    Wang, Yang
    Xu, Wenjun
    Qin, Zhijin
    Zhang, Yimeng
    Gao, Hui
    Pan, Miao
    Lin, Jiaru
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,
  • [17] Amplitude based keyless optical encryption system using deep neural network
    Inoue, Kotaro
    Cho, Myungjin
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 79
  • [18] BP Neural Network Model Based on Phase Space Reconstruction
    Hu, Jie
    Zeng, Xiangjin
    PROCEEDINGS OF THE 2009 2ND INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS, VOLS 1-4, 2009, : 2183 - 2186
  • [19] Asphalt pavement macrotexture reconstruction from monocular image based on deep convolutional neural network
    Dong, Shihao
    Han, Sen
    Wu, Chi
    Xu, Ouming
    Kong, Haiyu
    COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2022, 37 (13) : 1754 - 1768
  • [20] SPECT Imaging Reconstruction Method Based on Deep Convolutional Neural Network
    Chrysostomou, Charalambos
    Koutsantonis, Loizos
    Lemesios, Christos
    Papanicolas, Costas N.
    2019 IEEE NUCLEAR SCIENCE SYMPOSIUM AND MEDICAL IMAGING CONFERENCE (NSS/MIC), 2019,