Speech enhancement based on auditory spectral change

被引:0
|
作者
Quatieri, TF [1 ]
Dunn, RB [1 ]
机构
[1] MIT, Lincoln Lab, Lexington, MA 02173 USA
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, an adaptive approach to the enhancement of speech signals is developed based on auditory spectral change, The algorithm is motivated by sensitivity of aural biologic systems to signal dynamics, by evidence that noise is aurally masked by rapid changes in a signal, and by analogies to these two aural phenomena in biologic visual processing. Emphasis is on preserving nonstationarity, i.e., speech transient and time-varying components, such as plosive bursts, formant transitions, and vowel onsets, while suppressing additive noise. The essence of the enhancement technique is a Wiener filter that uses a desired signal spectrum whose estimation adapts to stationarity of the measured signal. The degree of stationarity is derived from a signal change measurement, based on an auditory spectrum that accentuates change in spectral bands. The adaptive filter is applied in an unconventional overlap-add analysis/synthesis framework, using a very short 4-ms analysis window and a 1-ms frame interval. In informal listening, the reconstructions are judged to be "crisp" corresponding to good temporal resolution of transient and rapidly-moving speech events.
引用
收藏
页码:257 / 260
页数:4
相关论文
共 50 条
  • [1] Auditory-Based Spectral Amplitude Estimators for Speech Enhancement
    Plourde, Eric
    Champagne, Benoit
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (08): : 1614 - 1623
  • [2] Auditory enhancement and spectral contrast effects in speech perception
    Stilp, Christian E.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2019, 146 (02): : 1503 - 1517
  • [3] Speech Enhancement Based on Auditory Masking Properties and Log-Spectral Distance
    Wang, Peihe
    Wang, Yong
    Liu, Hao
    Sheng, Yanxiu
    Wang, Xi
    Wei, Zhiqiang
    [J]. 2013 3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), 2013, : 1060 - 1064
  • [4] AUDITORY CODING BASED SPEECH ENHANCEMENT
    Ren, Yao
    Johnson, Michael T.
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4685 - 4688
  • [5] SPEECH ENHANCEMENT BASED CONCEPTUALLY ON AUDITORY EVIDENCE
    CHENG, YM
    OSHAUGHNESSY, D
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1991, 39 (09) : 1943 - 1954
  • [6] RESEARCH OF SPEECH ENHANCEMENT BASED ON AUDITORY MODEL
    Tao, Zhi
    Zhang, Xiao-Jun
    Sun, Wen-Ye
    Zhu, Jun-Jie
    Zhao, He-Ming
    [J]. 2011 3RD INTERNATIONAL CONFERENCE ON COMPUTER TECHNOLOGY AND DEVELOPMENT (ICCTD 2011), VOL 3, 2012, : 689 - 694
  • [7] Enhancement of electrolarynx speech based on auditory masking
    Liu, HJ
    Zhao, Q
    Wan, MX
    Wang, SP
    [J]. IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2006, 53 (05) : 865 - 874
  • [8] A Modified Spectral Subtraction Method for Speech Enhancement Based on Masking Property of Human Auditory System
    Xia, Bing-yin
    Liang, Yan
    Bao, Chang-chun
    [J]. 2009 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP 2009), 2009, : 942 - 946
  • [9] Speech Enhancement Based on Spectral Subtraction for Speech Recognition System
    Han, Jung-woo
    Kim, Se-young
    Kim, Ki-man
    Jung, Ji-won
    Yun, Young
    [J]. IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE 2011), 2011, : 417 - 418
  • [10] Speech Enhancement Using Auditory-Based Transform
    Tank, Vanita Raj
    Mahajan, S. P.
    Khaparde, Arti
    Deshpande, Rahul
    [J]. 2015 10TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS AND SIGNAL PROCESSING (ICICS), 2015,