Towards a neurocomputational model of speech production and perception

被引:92
|
作者
Kroeger, Bernd J. [1 ]
Kannampuzha, Jim
Neuschaefer-Rube, Christiane
机构
[1] Univ Hosp Aachen, Dept Phoniatr Pedaudiol & Commun Disorders, Aachen, Germany
关键词
Speech; Speech production; Speech perception; Neurocomputational model; Artificial neural networks; Self-organizing networks; NEURAL-NETWORK MODEL; LANGUAGE PRODUCTION; LEXICAL ACCESS; TEMPORAL-LOBE; ACTIVATION; DISCRIMINATION; RECOGNITION; DYNAMICS; IDENTIFICATION; REPRESENTATION;
D O I
10.1016/j.specom.2008.08.002
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The limitation in performance of current speech synthesis and speech recognition systems may result from the fact that these systems are not designed with respect to the human neural processes of speech production and perception. A neurocomputational model of speech production and perception is introduced which is organized with respect to human neural processes of speech production and perception. The production-perception model comprises all artificial computer-implemented vocal tract as a front-end module, which is capable of generating articulatory speech movements and acoustic speech signals. The structure of the production-perception model comprises motor and sensory processing pathways. Speech knowledge is collected during training stages which imitate early stages of speech acquisition. This knowledge is stored in artificial self-organizing maps. The current neurocomputational model is capable of producing and perceiving vowels, VC-, and CV-syllables (V = vowels and C = voiced plosives). Basic features of natural speech production and perception are predicted from this model in a straight forward way: Production of speech items is feedforward and feedback controlled and phoneme realizations vary within perceptually defined regions. Perception is less categorical in the case of vowels in comparison to consonants. Due to its human-like production-perception processing the model should be discussed as a basic module for more technical relevant approaches for high-quality speech synthesis and for high performance speech recognition. (C) 2008 Elsevier B.V. All rights reserved.
引用
收藏
页码:793 / 809
页数:17
相关论文
共 50 条
  • [21] Towards a somatosensory theory of speech perception
    Franken, Matthias K.
    Liu, Brian C.
    Ostry, David J.
    JOURNAL OF NEUROPHYSIOLOGY, 2022, 128 (06) : 1683 - 1695
  • [22] Towards a functional neuroanatomy of speech perception
    Hickok, G
    Poeppel, ID
    JOURNAL OF COGNITIVE NEUROSCIENCE, 2000, : 45 - 45
  • [23] Towards a functional neuroanatomy of speech perception
    Hickok, G
    Poeppel, D
    TRENDS IN COGNITIVE SCIENCES, 2000, 4 (04) : 131 - 138
  • [24] Speech production and speech perception in children with speech sound disorder
    Berti, Larissa Cristina
    de Assis, Mayara Ferreira
    Cremasco, Elissa
    Vieira Cardoso, Ana Claudia
    CLINICAL LINGUISTICS & PHONETICS, 2022, 36 (2-3) : 183 - 202
  • [25] Neurocomputational model of speech recognition for pathological speech detection: a case study on Parkinson's disease speech detection
    Hovsepyan, Sevada
    Magimai-Doss, Mathew
    INTERSPEECH 2024, 2024, : 3590 - 3594
  • [26] Neurocomputational Mechanisms Contributing to Auditory Perception
    Cohen, Yale E.
    Banno, Taku
    Lee, Jaejin
    Rodriguez-Campos, Francisco
    Schaff, Matthew
    Suriya-Arunroj, Lalitta
    Tsunada, Joji
    ACTA ACUSTICA UNITED WITH ACUSTICA, 2018, 104 (05) : 870 - 873
  • [27] Depression recognition using a proposed speech chain model fusing speech production and perception features
    Du, Minghao
    Liu, Shuang
    Wang, Tao
    Zhang, Wenquan
    Ke, Yufeng
    Chen, Long
    Ming, Dong
    JOURNAL OF AFFECTIVE DISORDERS, 2023, 323 : 299 - 308
  • [28] Speech Planning and Dynamics (Speech Production and Perception, 1)
    Calamai, Silvia
    STUDI E SAGGI LINGUISTICI, 2016, 54 (02): : 135 - 141
  • [29] Speech face perception is locked to anticipation in speech production
    Troille, Emilie
    Cathiard, Marie-Agnes
    Abry, Christian
    SPEECH COMMUNICATION, 2010, 52 (06) : 513 - 524
  • [30] Speech perception and speech production as indicators of reading difficulty
    Post, YV
    Foorman, BR
    Hiscock, M
    ANNALS OF DYSLEXIA, 1997, 47 : 3 - 27