Auditory-based speech processing based on the average localized synchrony detection

被引:0
|
作者
Ali, AMA [1 ]
Van der Spiegel, J [1 ]
Mueller, P [1 ]
机构
[1] Univ Penn, Dept Elect Engn, Philadelphia, PA 19104 USA
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, a new auditory-based speech processing system based on the biologically rooted property of average localized synchrony detection (ALSD) is proposed. The system detects periodicity in the speech signal at Bark-scaled frequencies while reducing the response's spurious peaks and sensitivity to implementation mismatches, and hence presents a consistent and robust representation of the formants. The system is evaluated for its formant extraction ability while reducing spurious peaks. It is compared with other auditory-based front-end processing systems in the task of vowel recognition on clean speech from the TIMIT database and in the presence of noise. The results illustrate the advantage of the ALSD system in extracting the formants and reducing the spurious peaks. They also indicate the superiority of the synchrony measures over the mean-rate in the presence of noise.
引用
收藏
页码:1623 / 1626
页数:4
相关论文
共 50 条
  • [21] Robust Speech Recognition Based on Binaural Auditory Processing
    Menon, Anjali
    Kim, Chanwoo
    Stern, Richard M.
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3872 - 3876
  • [22] Speech Recognition Based on the Processing Solutions of Auditory Cortex
    May, Patrick J. C.
    Tiitinen, Hannu
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2011, PT II, 2011, 6792 : 421 - 428
  • [23] An auditory-based measure for improved phone segment concatenation
    Chappell, DT
    Hansen, JHL
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1639 - 1642
  • [24] A HIGHLY ROBUST AUDIO HASHING SYSTEM USING AUDITORY-BASED FRONT-END PROCESSING
    Ben Salem, Abderraouf
    Selouani, Sid-Ahmed
    Hamam, Habib
    Caelen, Jean
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 1413 - +
  • [25] ROBUST SPEAKER IDENTIFICATION USING AN AUDITORY-BASED FEATURE
    Li, Qi
    Huang, Yan
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4514 - 4517
  • [26] Music-based and auditory-based interventions for reading difficulties: A literature review
    Cancer, Alice
    Antonietti, Alessandro
    [J]. HELIYON, 2022, 8 (04)
  • [27] Native and non-native class discrimination using speech rhythm- and auditory-based cues
    Selouani, S. -A.
    Alotaibi, Y.
    Cichocki, W.
    Gharsellaoui, S.
    Kadi, K.
    [J]. COMPUTER SPEECH AND LANGUAGE, 2015, 31 (01): : 28 - 48
  • [28] A cochlear implant speech processing strategy based on an auditory model
    Grayden, DB
    Burkitt, AN
    Kenny, OP
    Clarey, JC
    Paolini, AG
    Clark, GM
    [J]. PROCEEDINGS OF THE 2004 INTELLIGENT SENSORS, SENSOR NETWORKS & INFORMATION PROCESSING CONFERENCE, 2004, : 491 - 496
  • [29] Towards an Immersive Auditory-based Journey Planner for the Visually Impaired
    McCarthy, Chris
    Lai, Tuan Dung
    Favilla, Stuart
    Sly, David
    [J]. PROCEEDINGS OF THE 31ST AUSTRALIAN CONFERENCE ON HUMAN-COMPUTER-INTERACTION (OZCHI'19), 2020, : 387 - 391
  • [30] Auditory-based Formant Estimation in Noise using a Probabilistic Framework
    Glaeser, Claudius
    Heckmann, Martin
    Joublin, Frank
    Goerick, Christian
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2606 - 2609