Advances in phase-aware signal processing in speech communication

被引:100
|
作者
Mowlaee, Pejman [1 ]
Saeidi, Rahim [2 ]
Stylianou, Yannis [3 ]
机构
[1] Graz Univ Technol, Signal Proc & Speech Commun Lab, Graz, Austria
[2] Aalto Univ, Dept Signal Proc & Acoust, Espoo, Finland
[3] Univ Crete, Dept Comp Sci, Iraklion, Greece
基金
奥地利科学基金会; 芬兰科学院;
关键词
Phase-aware speech processing; Phase-based features; Signal enhancement; Automatic speech recognition; Speaker recognition; Speech synthesis; Speech coding; Speech analysis; GROUP DELAY FUNCTIONS; SPECTRAL MAGNITUDE ESTIMATION; INTELLIGIBILITY PREDICTION; INSTANTANEOUS FREQUENCY; SOURCE SEPARATION; FOURIER SPECTRUM; MFCC FEATURES; ENHANCEMENT; RECONSTRUCTION; INFORMATION;
D O I
10.1016/j.specom.2016.04.002
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
During the past three decades, the issue of processing spectral phase has been largely neglected in speech applications. There is no doubt that the interest of speech processing community towards the use of phase information in a big spectrum of speech technologies, from automatic speech and speaker recognition to speech synthesis, from speech enhancement and source separation to speech coding, is constantly increasing. In this paper, we elaborate on why phase was believed to be unimportant in each application. We provide an overview of advancements in phase-aware signal processing with applications to speech, showing that considering phase-aware speech processing can be beneficial in many cases, while it can complement the possible solutions that magnitude-only methods suggest. Our goal is to show that phase-aware signal processing is an important emerging field with high potential in the current speech communication applications. The paper provides an extended and up-to-date bibliography on the topic of phase aware speech processing aiming at providing the necessary background to the interested readers for following the recent advancements in the area. Our review expands the step initiated by our organized special session and exemplifies the usefulness of spectral phase information in a wide range of speech processing applications. Finally, the overview will provide some future work directions. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:1 / 29
页数:29
相关论文
共 50 条
  • [21] Speech Enhancement Based on Fusion of Both Magnitude/Phase-Aware Features and Targets
    Lang, Haitao
    Yang, Jie
    ELECTRONICS, 2020, 9 (07) : 1 - 19
  • [22] Phase-Aware CPU Workload Forecasting
    Alcorta, Erika S.
    Rama, Pranav
    Ramachandran, Aswin
    Gerstlauer, Andreas
    EMBEDDED COMPUTER SYSTEMS: ARCHITECTURES, MODELING, AND SIMULATION, SAMOS 2021, 2022, 13227 : 195 - 209
  • [23] An evaluation of the perceptual quality of phase-aware single-channel speech enhancement
    Krawczyk-Becker, Martin
    Gerkmann, Timo
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2016, 140 (04): : EL364 - EL369
  • [24] FPGA Implementation of a Phase-Aware Single-Channel Speech Enhancement System
    Suman Samui
    Pragya Sahu
    Indrajit Chakrabarti
    Soumya K. Ghosh
    Circuits, Systems, and Signal Processing, 2017, 36 : 4688 - 4715
  • [25] Phase-aware deep speech enhancement: It's all about the frame length
    Peer, Tal
    Gerkmann, Timo
    JASA EXPRESS LETTERS, 2022, 2 (10):
  • [26] DCCRN: Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement
    Hu, Yanxin
    Liu, Yun
    Lv, Shubo
    Xing, Mengtao
    Zhang, Shimin
    Fu, Yihui
    Wu, Jian
    Zhang, Bihong
    Xie, Lei
    INTERSPEECH 2020, 2020, : 2472 - 2476
  • [27] Speech Communication and Signal Processing FOREWORD
    Yegnanarayana, B.
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2011, 36 (05): : 551 - 553
  • [28] Phase-Aware Optimization in Approximate Computing
    Mitra, Subrata
    Gupta, Manish K.
    Misailovic, Sasa
    Bagchi, Saurabh
    CGO'17: PROCEEDINGS OF THE 2017 INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION, 2017, : 185 - 196
  • [29] A Speech Enhancement Method Based on Dual-path Phase-Aware GAN Networks
    Cheng, Yunling
    Zhou, Lin
    Cao, Yanxiang
    Zhuang, Chenghao
    Wang, Qirui
    2024 13TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS, ICCCAS 2024, 2024, : 315 - 320
  • [30] Iterative Closed-Loop Phase-Aware Single-Channel Speech Enhancement
    Mowlaee, Pejman
    Saeidi, Rahim
    IEEE SIGNAL PROCESSING LETTERS, 2013, 20 (12) : 1235 - 1239