PHASE RECONSTRUCTION FROM AMPLITUDE SPECTROGRAMS BASED ON VON-MISES-DISTRIBUTION DEEP NEURAL NETWORK

Authors
Takamichi, Shinnosuke [1]
Saito, Yuki [1]
Takamune, Norihiro [1]
Kitamura, Daichi [2]
Saruwatari, Hiroshi [1]
Affiliations
[1] Univ Tokyo, Grad Sch Informat Sci & Technol, Tokyo, Japan
[2] Kagawa Coll, Natl Inst Technol, Dept Elect & Comp Engn, Takamatsu, Kagawa, Japan
Keywords
speech analysis; phase reconstruction; deep neural network; von Mises distribution; group delay
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This paper presents deep neural network (DNN)-based phase reconstruction from amplitude spectrograms. In audio signal and speech processing, the amplitude spectrogram is often the representation that is processed, and the corresponding phase spectrogram is then reconstructed from it using the Griffin-Lim method. However, the Griffin-Lim method causes unnatural artifacts in synthetic speech. To address this problem, we introduce the von-Mises-distribution DNN for phase reconstruction. The DNN is a generative model based on the von Mises distribution, which can model the distribution of a periodic variable such as phase, and its model parameters are estimated using the maximum likelihood criterion. Furthermore, we propose a group-delay loss for DNN training that brings the predicted group delay close to the natural group delay. The experimental results demonstrate that 1) the trained DNN predicts group delay more accurately than the phases themselves, and 2) our phase reconstruction methods achieve better speech quality than the conventional Griffin-Lim method.
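The two training criteria named in the abstract can be illustrated with a minimal NumPy sketch. The abstract does not give the exact formulas, so everything here is an assumption for illustration: the function names `von_mises_nll` and `group_delay_loss` are hypothetical, the concentration `kappa` is treated as a fixed constant, and the group-delay term is approximated by a cosine loss on frequency-direction phase differences (group delay being the negative derivative of phase with respect to frequency).

```python
import numpy as np


def von_mises_nll(phase, mu, kappa=1.0):
    """Mean negative log-likelihood of `phase` under a von Mises distribution.

    Uses log p(y) = kappa * cos(y - mu) - log(2 * pi * I0(kappa)).
    `mu` stands in for the network output; `kappa` is assumed fixed here.
    """
    log_norm = np.log(2.0 * np.pi * np.i0(kappa))
    return float(np.mean(log_norm - kappa * np.cos(phase - mu)))


def group_delay_loss(phase, mu):
    """Cosine loss between frequency-direction phase differences.

    Arrays are assumed to have shape (frames, frequency_bins). The difference
    along the last axis approximates the negative group delay up to a scale
    factor. This form of the loss is an assumption, not the paper's definition.
    """
    d_phase = np.diff(phase, axis=-1)
    d_mu = np.diff(mu, axis=-1)
    return float(np.mean(1.0 - np.cos(d_phase - d_mu)))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    target = rng.uniform(-np.pi, np.pi, size=(4, 8))  # toy "natural" phases
    shifted = target + 0.7                            # constant phase offset
    print("phase NLL, exact match:   ", von_mises_nll(target, target))
    print("phase NLL, offset match:  ", von_mises_nll(target, shifted))
    print("group-delay loss, offset: ", group_delay_loss(target, shifted))
```

Both losses depend on the phase only through a cosine, so they are unaffected by 2π wrapping. The group-delay term is also unchanged by a constant phase offset across frequency bins, whereas the phase likelihood penalizes such an offset.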
Pages: 286-290
Number of pages: 5