Time-frequency representations in speech perception

被引:12
|
作者
Gomez-Vilda, Pedro [1 ]
Ferrandez-Vicente, Jose M. [2 ]
Rodellar-Biarge, Victoria [1 ]
Fernandez-Baillo, Roberto [1 ]
机构
[1] Univ Politecn Madrid, Fac Informat, E-28660 Madrid, Spain
[2] Univ Politecn Cartagena, Cartagena 30202, Spain
关键词
Bio-inspired speech processing; Speech perception; Acoustic-phonetics; Phonetic boundaries and classes; Minimal semantic units; ORGANIZATION; INTEGRATION; DOMAIN;
D O I
10.1016/j.neucom.2008.04.056
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nowadays applications demand a comprehensive view of voice and speech perception to build more complex and competitive procedures amenable of extracting as much knowledge from sound-based human communication as possible. Many knowledge-extraction tasks from speech and voice may share signal treatment procedures which can be devised under the point of view of bio-inspiration. The present paper examines a hierarchy of sound processing functionalities at the auditory and perceptual levels on the Auditory Neural pathways which can be translated into bio-inspired speech-processing techniques, their fundamental characteristics being analyzed in relation with current tendencies in cognitive audio processing. The pathways linking the peripheral auditory system (cochlear complex) with the brain cortex are briefly examined, with special attention to the study of neuronal structures showing specific capabilities under the point of view of formant analysis and the build-up of a semantic hierarchy from the time-frequency structure of speech to explore their capability of conveying semantics to speech processing and understanding from the minimal acoustic clues with elementary meaning or "sematoms". The replication of known biological functionality by algorithmic methods through bio-inspiration is a secondary aim of the research. Examples extracted from speech processing tasks in the domain of acoustic-phonetics are presented. These may find applicability in speech recognition, speaker's characterization and biometry, emotion detection, and others related. (C) 2008 Elsevier B.V. All rights reserved.
引用
收藏
页码:820 / 830
页数:11
相关论文
共 50 条
  • [41] Are quadratic time-frequency representations really necessary?
    Marple, SL
    1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 2575 - 2578
  • [42] A new approach for time-frequency analysis of heart rate variability and assessment of time-frequency representations
    Chan, HL
    Huang, HH
    Wu, CP
    Lin, JL
    JOURNAL OF THE CHINESE INSTITUTE OF ENGINEERS, 1997, 20 (03) : 343 - 353
  • [43] On time-frequency representations for underwater acoustic signal
    Courmontagne, Philippe
    Ouelha, Samir
    Chaillan, Fabien
    2012 OCEANS, 2012,
  • [44] A NEW APPROACH FOR THE REASSIGNMENT OF TIME-FREQUENCY REPRESENTATIONS
    Sejdic, Ervin
    Ozertem, Umut
    Djurovic, Igor
    Erdogmus, Deniz
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 2997 - +
  • [45] TIME-FREQUENCY REPRESENTATIONS BASED ON COMPRESSIVE SAMPLES
    Sejdic, Ervin
    Chaparro, Luis E.
    2013 PROCEEDINGS OF THE 21ST EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2013,
  • [46] Time-Frequency MUSIC: An array signal processing method based on time-frequency signal representations
    Amin, MG
    Belouchrani, A
    RADAR PROCESSING, TECHNOLOGY, AND APPLICATIONS III, 1998, 3462 : 186 - 194
  • [47] Time-frequency and time-scale vector fields for deforming time-frequency and time-scale representations
    Daudet, L
    Morvidone, M
    Torrésani, B
    WAVELET APPLICATIONS IN SIGNAL AND IMAGE PROCESSING VII, 1999, 3813 : 2 - 15
  • [48] Methodology for the automated selection of time-frequency representations
    DeVol, Nathaniel
    Saldaña, Christopher
    Fu, Katherine
    Journal of Sound and Vibration, 2025, 596
  • [49] Adaptive time-frequency representations for multiple structures
    Papandreou-Suppappola, A
    Suppappola, SB
    PROCEEDINGS OF THE TENTH IEEE WORKSHOP ON STATISTICAL SIGNAL AND ARRAY PROCESSING, 2000, : 579 - 583
  • [50] Linear and quadratic time-frequency signal representations
    Hlawatsch, F.
    Boudreaux-Bartels, G. F.
    IEEE SIGNAL PROCESSING MAGAZINE, 1992, 9 (02) : 21 - 67