Towards a neurocomputational model of speech production and perception

被引：92

作者：

Kroeger, Bernd J. ^{[1
]}

Kannampuzha, Jim

Neuschaefer-Rube, Christiane

机构：

[1] Univ Hosp Aachen, Dept Phoniatr Pedaudiol & Commun Disorders, Aachen, Germany

来源：

SPEECH COMMUNICATION | 2009年 / 51卷 / 09期

关键词：

Speech; Speech production; Speech perception; Neurocomputational model; Artificial neural networks; Self-organizing networks; NEURAL-NETWORK MODEL; LANGUAGE PRODUCTION; LEXICAL ACCESS; TEMPORAL-LOBE; ACTIVATION; DISCRIMINATION; RECOGNITION; DYNAMICS; IDENTIFICATION; REPRESENTATION;

D O I：

10.1016/j.specom.2008.08.002

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

The limitation in performance of current speech synthesis and speech recognition systems may result from the fact that these systems are not designed with respect to the human neural processes of speech production and perception. A neurocomputational model of speech production and perception is introduced which is organized with respect to human neural processes of speech production and perception. The production-perception model comprises all artificial computer-implemented vocal tract as a front-end module, which is capable of generating articulatory speech movements and acoustic speech signals. The structure of the production-perception model comprises motor and sensory processing pathways. Speech knowledge is collected during training stages which imitate early stages of speech acquisition. This knowledge is stored in artificial self-organizing maps. The current neurocomputational model is capable of producing and perceiving vowels, VC-, and CV-syllables (V = vowels and C = voiced plosives). Basic features of natural speech production and perception are predicted from this model in a straight forward way: Production of speech items is feedforward and feedback controlled and phoneme realizations vary within perceptually defined regions. Perception is less categorical in the case of vowels in comparison to consonants. Due to its human-like production-perception processing the model should be discussed as a basic module for more technical relevant approaches for high-quality speech synthesis and for high performance speech recognition. (C) 2008 Elsevier B.V. All rights reserved.

引用

页码：793 / 809

页数：17

共 50 条

[31] The relationship of speech perception and speech production: It's complicated
Baese-Berk, Melissa M.
Kapnoula, Efthymia C.
Samuel, Arthur G.
PSYCHONOMIC BULLETIN & REVIEW, 2025, 32 (01) : 226 - 242
[32] IMPAIRMENTS OF SPEECH PRODUCTION AND SPEECH-PERCEPTION IN APHASIA
BLUMSTEIN, SE
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 1994, 346 (1315) : 29 - 36
[33] Speech perception and speech production as indicators of reading difficulty
Post Y.V.
Foorman B.R.
Hiscock M.
Annals of Dyslexia, 1997, 47 (1) : 3 - 27
[34] Does training in speech perception modify speech production?
AkahaneYamada, R
Tohkura, Y
Bradlow, AR
Pisoni, DB
ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 606 - 609
[35] Expressive speech: Production, perception and application to speech synthesis
Erickson, Donna
ACOUSTICAL SCIENCE AND TECHNOLOGY, 2005, 26 (04) : 317 - 325
[36] Neurocomputational modeling of speech motor development
Meier, Andrew M.
Guenther, Frank H.
JOURNAL OF CHILD LANGUAGE, 2023, 50 (06) : 1318 - 1335
[37] PERCEPTUAL CENTERS IN SPEECH PRODUCTION AND PERCEPTION
FOWLER, CA
PERCEPTION & PSYCHOPHYSICS, 1979, 25 (05): : 375 - 388
[38] Discussion: Early speech perception and production
Paul, R
JOURNAL OF COMMUNICATION DISORDERS, 1999, 32 (04) : 247 - 250
[39] PRODUCTION AND PERCEPTION OF SPEECH - RELATION AND DIFFERENCES
LANE, H
PHONETICA, 1971, 23 (02) : 94 - &
[40] The synergy between speech production and perception
Ru, PW
Chi, TS
Shamma, S
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2003, 113 (01): : 498 - 515

← 1 2 3 4 5 →