Cepstral compensation by polynomial approximation for environment-independent speech recognition

被引:0
|
作者
Raj, B
Gouvea, EB
Moreno, PJ
Stern, RM
机构
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Speech recognition systems perform poorly on speech degraded by even simple effects such as linear filtering and additive noise. One possible solution to this problem is to modify the probability density function (PDF) of clean speech to account for the effects of the degradation. However, even for the case of linear filtering and additive noise, it is extremely difficult to do this analytically. Previously attempted analytical solutions to the problem of noisy speech recognition have either used an overly-simplified mathematical description of the effects of noise on the statistics of speech, or they have relied on the availability of large environment-specific adaptation sets. Some of the previous methods required the use of adaptation data that consists of simultaneously-recorded or ''stereo'' recordings of clean and degraded speech. In this paper we introduce an approximation-based method to compute the effects of the environment on the parameters of the PDF of clean speech. In this work, we perform compensation by Vector Polynomial approximationS (VPS) for the effects of linear filtering and additive noise on the clean speech. We also estimate the parameters of the environment, namely the noise and the channel, by using piecewise-linear approximations of these effects. We evaluate the performance of this method (VPS) using the CMU SPHINX-II system and the 100-word alphanumeric CENSUS database. Performance is evaluated at several SNRs, with artificial white Gaussian noise added to the database. VPS provides improvements of up to 15 percent in relative recognition accuracy.
引用
收藏
页码:2340 / 2343
页数:4
相关论文
共 50 条
  • [21] mmASL: Environment-Independent ASL Gesture Recognition Using 60 GHz Millimeter-wave Signals
    Santhalingam, Panneer Selvam
    Hosain, Al Amin
    Zhang, Ding
    Pathak, Parth
    Rangwala, Huzefa
    Kushalnagar, Raja
    PROCEEDINGS OF THE ACM ON INTERACTIVE MOBILE WEARABLE AND UBIQUITOUS TECHNOLOGIES-IMWUT, 2020, 4 (01):
  • [22] Speech compression by polynomial approximation
    Dusan, Sorin
    Flanagan, James L.
    Karve, Amod
    Balaraman, Mridul
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (02): : 387 - 395
  • [23] A unified compensation approach for speech recognition in severely adverse environment
    Tian, B
    Sun, MG
    Sclabassi, RJ
    Yi, KC
    ISUMA 2003: FOURTH INTERNATIONAL SYMPOSIUM ON UNCERTAINTY MODELING AND ANALYSIS, 2003, : 256 - 261
  • [24] Environment Mismatch Compensation using Average Eigenspace for Speech Recognition
    Kumar, Abhishek
    Hansen, John H. L.
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1277 - 1280
  • [25] DeFall: Environment-Independent Passive Fall Detection Using WiFi
    Hu, Yuqian
    Zhang, Feng
    Wu, Chenshu
    Wang, Beibei
    Liu, K. J. Ray
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (11) : 8515 - 8530
  • [26] Environment-Independent Online Real-time Traffic Identification
    Tai, Masaki
    Ata, Shingo
    Oka, Ikuo
    FOURTH INTERNATIONAL CONFERENCE ON NETWORKING AND SERVICES (ICNS 2008), PROCEEDINGS, 2008, : 230 - 235
  • [27] A vector statistical piecewise polynomial approximation algorithm for environment compensation in telephone LVCSR
    Han, ZB
    Zhang, SW
    Zhang, HY
    Xu, B
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PROCEEDINGS: SPEECH II; INDUSTRY TECHNOLOGY TRACKS; DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS; NEURAL NETWORKS FOR SIGNAL PROCESSING, 2003, : 117 - 120
  • [28] Environment-independent Formation Flight for Micro Aerial Vehicles
    Nageli, Tobias
    Conte, Christian
    Domahidi, Alexander
    Morari, Manfred
    Hilliges, Otmar
    2014 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2014), 2014, : 1141 - 1146
  • [29] MEAN NORMALIZATION OF POWER FUNCTION BASED CEPSTRAL COEFFICIENTS FOR ROBUST SPEECH RECOGNITION IN NOISY ENVIRONMENT
    Baek, Soonho
    Kang, Hong-Goo
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [30] Feature compensation based on independent noise estimation for robust speech recognition
    Yong Lü
    Han Lin
    Pingping Wu
    Yitao Chen
    EURASIP Journal on Audio, Speech, and Music Processing, 2021