Towards a neurocomputational model of speech production and perception

Cited by: 92
Authors
Kroeger, Bernd J. [1]
Kannampuzha, Jim
Neuschaefer-Rube, Christiane
Affiliations
[1] Univ Hosp Aachen, Dept Phoniatr Pedaudiol & Commun Disorders, Aachen, Germany
Keywords
Speech; Speech production; Speech perception; Neurocomputational model; Artificial neural networks; Self-organizing networks; NEURAL-NETWORK MODEL; LANGUAGE PRODUCTION; LEXICAL ACCESS; TEMPORAL-LOBE; ACTIVATION; DISCRIMINATION; RECOGNITION; DYNAMICS; IDENTIFICATION; REPRESENTATION;
DOI
10.1016/j.specom.2008.08.002
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification codes
070206 ; 082403 ;
Abstract
The limitation in performance of current speech synthesis and speech recognition systems may result from the fact that these systems are not designed with respect to the human neural processes of speech production and perception. A neurocomputational model of speech production and perception is introduced which is organized with respect to human neural processes of speech production and perception. The production-perception model comprises an artificial computer-implemented vocal tract as a front-end module, which is capable of generating articulatory speech movements and acoustic speech signals. The structure of the production-perception model comprises motor and sensory processing pathways. Speech knowledge is collected during training stages which imitate early stages of speech acquisition. This knowledge is stored in artificial self-organizing maps. The current neurocomputational model is capable of producing and perceiving vowels, VC-, and CV-syllables (V = vowels and C = voiced plosives). Basic features of natural speech production and perception are predicted from this model in a straightforward way: production of speech items is feedforward and feedback controlled, and phoneme realizations vary within perceptually defined regions. Perception is less categorical in the case of vowels in comparison to consonants. Due to its human-like production-perception processing, the model should be discussed as a basic module for more technically relevant approaches for high-quality speech synthesis and for high-performance speech recognition. © 2008 Elsevier B.V. All rights reserved.
Pages: 793-809
Number of pages: 17
Related papers
50 in total
  • [1] A New Framework of Neurocomputational Model for Speech Production
    Yan, Han
    Dang, Jianwu
    Cao, Mengxue
    Kroeger, Bernd J.
    2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 294 - +
  • [2] Towards neurocomputational speech and sound processing
    Rouat, Jean
    Loiselle, Stephane
    Pichevar, Ramin
    PROGRESS IN NONLINEAR SPEECH PROCESSING, 2007, 4391 : 58 - +
  • [3] TOWARDS A PROBLEM OF PERCEPTION AND PRODUCTION OF ORAL SPEECH
    BELTYUKOV, VI
    PSIKHOLOGICHESKII ZHURNAL, 1988, 9 (03) : 53 - 59
  • [4] LaDIVA: A neurocomputational model providing laryngeal motor control for speech acquisition and production
    Weerathunge, Hasini
    Alzamendi, Gabriel
    Cler, Gabriel
    Guenther, Frank
    Stepp, Cara
    Zanartu, Matias
    PLOS COMPUTATIONAL BIOLOGY, 2022, 18 (06)
  • [5] A Neurocomputational Model of Automatic Sequence Production
    Helie, Sebastien
    Roeder, Jessica L.
    Vucovich, Lauren
    Ruenger, Dennis
    Ashby, F. Gregory
    JOURNAL OF COGNITIVE NEUROSCIENCE, 2015, 27 (07) : 1456 - 1469
  • [6] A neurocomputational view of the effects of Parkinson's disease on speech production
    Manes, Jordan L.
    Bullock, Latane
    Meier, Andrew M.
    Turner, Robert S.
    Richardson, R. Mark
    Guenther, Frank H.
    FRONTIERS IN HUMAN NEUROSCIENCE, 2024, 18
  • [7] The Organization of a Neurocomputational Control Model for Articulatory Speech Synthesis
    Kroeger, Bernd J.
    Lowit, Anja
    Schnitker, Ralph
    VERBAL AND NONVERBAL FEATURES OF HUMAN-HUMAN AND HUMAN-MACHINE INTERACTIONS, 2008, 5042 : 121 - +
  • [8] Individual Differences in Speech Production and Perception (Speech Production and Perception, 3)
    Calamai, Silvia
    STUDI E SAGGI LINGUISTICI, 2016, 54 (02) : 135 - 141
  • [9] Speech perception and production
    Casserly, Elizabeth D.
    Pisoni, David B.
    WILEY INTERDISCIPLINARY REVIEWS-COGNITIVE SCIENCE, 2010, 1 (05) : 629 - 647
  • [10] Speech production and perception
    Frisch, Stefan A.
    Wodzinski, Sylvie W.
    JOURNAL OF THE INTERNATIONAL PHONETIC ASSOCIATION, 2009, 39 (01) : 98 - 101