Continuous Speech Recognition from ECoG

被引:0
|
作者
Heger, Dominic [1 ]
Herff, Christian [1 ]
de Pesters, Adriana [2 ,4 ]
Telaar, Dominic [1 ]
Brunner, Peter [2 ,3 ]
Schalk, Gerwin [2 ,3 ,4 ]
Schultz, Tanja [1 ]
机构
[1] Karlsruhe Inst Technol, Cognit Syst Lab, Inst Anthropomat & Robot, Karlsruhe, Germany
[2] New York State Dept Hlth, Natl Ctr Adapt Neurotechnol, Wadsworth Ctr, Albany, NY USA
[3] Albany Med Coll, Dept Neurol, Albany, NY 12208 USA
[4] SUNY Albany, Dept Biomed Sci, Albany, NY USA
关键词
electrocorticography; ECoG; speech recognition; brain-computer interface; ELECTROCORTICOGRAPHIC GAMMA ACTIVITY; INTERFACE; CORTEX;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Continuous speech production is a highly complex process involving many parts of the human brain. To date, no fundamental representation that allows for decoding of continuous speech from neural signals has been presented. Here we show that techniques from automatic speech recognition can be applied to decode a textual representation of spoken words from neural signals. We model phones as the fundamental unit of the speech process in invasively measured brain activity (intracranial electrocorticographic (ECoG)) recordings. These phone models give insights into timings and locations of neural processes associated with the continuous production of speech and can be used in a speech recognizer to decode the neural data into their textual representations. When restricting the dictionary to small subsets, Word Error Rates as low as 25% can be achieved. As the brain activity data sets are fairly small, alternative approaches to Gaussian models are investigated by relying on robust, regularized discriminative models.
引用
收藏
页码:1131 / 1135
页数:5
相关论文
共 50 条
  • [1] CONTINUOUS SPEECH RECOGNITION FROM PHONETIC TRANSCRIPTION
    LEVINSON, SE
    LJOLJE, A
    [J]. SPEECH AND NATURAL LANGUAGE, 1989, : 292 - 292
  • [2] CONTINUOUS SPEECH RECOGNITION
    MORGAN, N
    BOURLARD, H
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 1995, 12 (03) : 25 - 42
  • [3] Influence of Emotional Speech on Continuous Speech Recognition
    Zgank, Andrej
    Maucec, Mirjam Sepesy
    [J]. 13TH INTERNATIONAL CONFERENCE ON ELEKTRO (ELEKTRO 2020), 2020,
  • [4] Continuous speech recognition for clinicians
    Zafar, A
    Overhage, JM
    McDonald, CJ
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 1999, 6 (03) : 195 - 204
  • [5] COMPUTER RECOGNITION OF CONTINUOUS SPEECH
    PURVES, RB
    STRONG, WJ
    [J]. ACUSTICA, 1976, 35 (02): : 111 - 121
  • [6] PRACTICAL AND CONTINUOUS SPEECH RECOGNITION
    ROSS, S
    MACALLISTER, J
    [J]. COMPUTER DESIGN, 1984, 23 (07): : 69 - &
  • [7] WORD RECOGNITION IN CONTINUOUS SPEECH
    TABOSSI, P
    SCOTT, D
    BURANI, C
    [J]. BULLETIN OF THE PSYCHONOMIC SOCIETY, 1991, 29 (06) : 529 - 529
  • [8] Hindi phoneme-viseme recognition from continuous speech
    Mishra, A. N.
    Chandra, Mahesh
    Biswas, Astik
    Sharan, S. N.
    [J]. INTERNATIONAL JOURNAL OF SIGNAL AND IMAGING SYSTEMS ENGINEERING, 2013, 6 (03) : 164 - 171
  • [9] CONTINUOUS VISUAL SPEECH RECOGNITION FOR AUDIO SPEECH ENHANCEMENT
    Benhaim, Eric
    Sahbi, Hichem
    Vitte, Guillaume
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 2244 - 2248
  • [10] A time continuous model for speech recognition
    Euler, S
    [J]. 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 889 - 892