A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech

被引:31
|
作者
Valin, Jean-Marc [1 ]
Isik, Umut [2 ]
Phansalkar, Neerad [2 ]
Giri, Ritwik [2 ]
Helwani, Karim [2 ]
Krishnaswamy, Arvindh [2 ]
机构
[1] Amazon Web Serv, Toronto, ON, Canada
[2] Amazon Web Serv, Seattle, WA USA
来源
关键词
speech enhancement; pitch filtering; postfilter; MASKING; NOISE;
D O I
10.21437/Interspeech.2020-2730
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Over the past few years, speech enhancement methods based on deep learning have greatly surpassed traditional methods based on spectral subtraction and spectral estimation. Many of these new techniques operate directly in the the short-time Fourier transform (STFT) domain, resulting in a high computational complexity. In this work, we propose PercepNet, an efficient approach that relies on human perception of speech by focusing on the spectral envelope and on the periodicity of the speech. We demonstrate high-quality, real-time enhancement of fullband (48 kHz) speech with less than 5% of a CPU core.
引用
收藏
页码:2482 / 2486
页数:5
相关论文
共 50 条
  • [41] Low-complexity implementation of a real-time decorrelation algorithm for stereophonic acoustic echo cancellation
    Cecchi, Stefania
    Romoli, Laura
    Peretti, Paolo
    Piazza, Francesco
    [J]. SIGNAL PROCESSING, 2012, 92 (11) : 2668 - 2675
  • [42] Low-complexity algorithms for static cache locking in multitasking hard real-time systems
    Puaut, I
    Decotigny, D
    [J]. 23RD IEEE REAL-TIME SYSTEMS SYMPOSIUM, PROCEEDINGS, 2002, : 114 - 123
  • [43] Novel Real-Time Low-Complexity QRS Complex Detector Based on Adaptive Thresholding
    Gutierrez-Rivas, Raquel
    Jesus Garcia, J.
    Marnane, William P.
    Hernandez, Alvaro
    [J]. IEEE SENSORS JOURNAL, 2015, 15 (10) : 6036 - 6043
  • [44] A Robust, Low-Complexity Real-Time Vehicle Counting System For Automated Traffic Surveillance
    Varghese, Arun
    Sreelekha, G.
    [J]. 2020 TWENTY SIXTH NATIONAL CONFERENCE ON COMMUNICATIONS (NCC 2020), 2020,
  • [45] Two-microphone kepstrum approach to real-time speech enhancement methods
    Jeong, J.
    Moir, T. J.
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON ENGINEERING OF INTELLIGENT SYSTEMS, 2006, : 393 - +
  • [46] A Novel Low-Complexity Attention-Driven Composite Model for Speech Enhancement
    Hasannezhad, Mojtaba
    Zhu, Wei-Ping
    Champagne, Benoit
    [J]. 2021 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2021,
  • [47] A 30 Gbps Low-Complexity and Real-Time Digital Modem for Wireless Communications at 0.325 THz
    Zhang, Hao
    Huang, Xiaojing
    Zhang, Ting
    Zhang, Jian A.
    Guo, Y. Jay
    [J]. ISCIT 2019: PROCEEDINGS OF 2019 19TH INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES (ISCIT), 2019, : 260 - 264
  • [48] Low-complexity unequal packet loss protection for real-time video over ubiquitous networks
    Ha, Hojin
    Yim, Changhoon
    Kim, Young Yong
    [J]. COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2007, PT 1, PROCEEDINGS, 2007, 4705 : 622 - +
  • [49] Poster Abstract: Low-Complexity Multicarrier Physical Layer for Wireless Real-Time Control Networks
    Horvath, Peter
    Yampolskiy, Mark
    Xue, Yuan
    Koutsoukos, Xenofon
    [J]. 2013 ACM/IEEE INTERNATIONAL CONFERENCE ON CYBER-PHYSICAL SYSTEMS (ICCPS), 2013, : 257 - 257
  • [50] ReBeatICG: Real-time Low-Complexity Beat-to-beat Impedance Cardiogram Delineation Algorithm
    Pale, Una
    Muller, Nathan
    Arza, Adriana
    Atienza, David
    [J]. 2021 43RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY (EMBC), 2021, : 5618 - 5624