PHASE RECONSTRUCTION FROM AMPLITUDE SPECTROGRAMS BASED ON VON-MISES-DISTRIBUTION DEEP NEURAL NETWORK

被引:0
|
作者
Takamichi, Shinnosuke [1 ]
Saito, Yuki [1 ]
Takamune, Norihiro [1 ]
Kitamura, Daichi [2 ]
Saruwatari, Hiroshi [1 ]
机构
[1] Univ Tokyo, Grad Sch Informat Sci & Technol, Tokyo, Japan
[2] Kagawa Coll, Natl Inst Technol, Dept Elect & Comp Engn, Takamatsu, Kagawa, Japan
关键词
speech analysis; phase reconstruction; deep neural network; von Mises distribution; group delay;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a deep neural network (DNN)-based phase reconstruction from amplitude spectrograms. In audio signal and speech processing, the amplitude spectrogram is often used for processing, and the corresponding phase spectrogram is reconstructed from the amplitude spectrogram on the basis of the Griffin-Lim method. However, the Griffin-Lim method causes unnatural artifacts in synthetic speech. Addressing this problem, we introduce the von-Mises-distribution DNN for phase reconstruction. The DNN is a generative model having the von Mises distribution that can model distributions of a periodic variable such as a phase, and the model parameters of the DNN are estimated on the basis of the maximum likelihood criterion. Furthermore, we propose a group-delay loss for DNN training to make the predicted group delay close to a natural group delay. The experimental results demonstrate that 1) the trained DNN can predict group delay accurately more than phases themselves, and 2) our phase reconstruction methods achieve better speech quality than the conventional Griffin-Lim method.
引用
收藏
页码:286 / 290
页数:5
相关论文
共 50 条
  • [21] Deep Neural Network Based Frame Reconstruction for Optimized Video Coding
    Ding, Dandan
    Liu, Peng
    Chen, Yu
    Zhu, Zheng
    Liu, Zoe
    Bankoski, James
    ARTIFICIAL INTELLIGENCE AND MOBILE SERVICES - AIMS 2018, 2018, 10970 : 235 - 242
  • [22] DEEP NEURAL NETWORK-BASED DATA RECONSTRUCTION FOR LANDSLIDE DETECTION
    Utomo, Darmawan
    Hu, Liang-Cheng
    Hsiung, Pao-Ann
    IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 3119 - 3122
  • [23] A Reconstruction Method Based on Deep Convolutional Neural Network for SPECT Imaging
    Chrysostomou, Charalambos
    Koutsantonis, Loizos
    Lemesios, Christos
    Papanicolas, Costas N.
    2018 IEEE NUCLEAR SCIENCE SYMPOSIUM AND MEDICAL IMAGING CONFERENCE PROCEEDINGS (NSS/MIC), 2018,
  • [24] Converting amplitude holograms into complex and phase-only holograms using deep neural network-based converters
    Hirahara, Takuya
    Wang, Fan
    Ito, Tomoyoshi
    Shimobaba, Tomoyoshi
    OPTICS COMMUNICATIONS, 2025, 578
  • [25] An Ultrashort-Term Net Load Forecasting Model Based on Phase Space Reconstruction and Deep Neural Network
    Mei, Fei
    Wu, Qingliang
    Shi, Tian
    Lu, Jixiang
    Pan, Yi
    Zheng, Jianyong
    APPLIED SCIENCES-BASEL, 2019, 9 (07):
  • [27] RECURRENT PHASE RECONSTRUCTION USING ESTIMATED PHASE DERIVATIVES FROM DEEP NEURAL NETWORKS
    Thieling, Lars
    Wilhelm, Daniel
    Jax, Peter
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7088 - 7092
  • [28] Data Prediction of ECG Based on Phase Space Reconstruction and Neural Network
    Sun, ZhongGao
    Wang, QiaoLing
    Xue, QuanDe
    Liu, Qun
    Tan, QingQuan
    2018 8TH INTERNATIONAL CONFERENCE ON ELECTRONICS INFORMATION AND EMERGENCY COMMUNICATION (ICEIEC), 2018, : 162 - 165
  • [29] A Method to Determine the Parameters of Phase Space Reconstruction Based on the Neural Network
    Liu, Runjie
    Hou, Zhan
    Shen, Jinyuan
    2009 INTERNATIONAL WORKSHOP ON CHAOS-FRACTALS THEORIES AND APPLICATIONS (IWCFTA 2009), 2009, : 281 - 285
  • [30] AN IMAGE RECONSTRUCTION FRAMEWORK BASED ON DEEP NEURAL NETWORK FOR ELECTRICAL IMPEDANCE TOMOGRAPHY
    Li, Xiuyan
    Lu, Yang
    Wang, Jianming
    Dang, Xin
    Wang, Qi
    Duan, Xiaojie
    Sun, Yukuan
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 3585 - 3589