Brain-Computer Interface: Applications to Speech Decoding and Synthesis to Augment Communication

Cited by: 0
Authors
Shiyu Luo
Qinwan Rabbani
Nathan E. Crone
Affiliations
[1] The Johns Hopkins University School of Medicine, Department of Biomedical Engineering
[2] The Johns Hopkins University, Department of Electrical and Computer Engineering
[3] The Johns Hopkins University School of Medicine, Department of Neurology
Source
Neurotherapeutics | 2022 / Vol. 19
Keywords
Speech synthesis; Brain-computer interface; Locked-in syndrome; Electrocorticography; ECoG
DOI
Not available
Abstract
Damage or degeneration of motor pathways necessary for speech and other movements, as in brainstem strokes or amyotrophic lateral sclerosis (ALS), can interfere with efficient communication without affecting brain structures responsible for language or cognition. In the worst-case scenario, this can result in locked-in syndrome (LIS), a condition in which individuals cannot initiate communication and can only express themselves by answering yes/no questions with eye blinks or other rudimentary movements. Existing augmentative and alternative communication (AAC) devices that rely on eye tracking can improve the quality of life for people with this condition, but brain-computer interfaces (BCIs) are also increasingly being investigated as AAC devices, particularly when eye tracking is too slow or unreliable. Moreover, with recent and ongoing advances in machine learning and neural recording technologies, BCIs may offer the only means to go beyond cursor control and text generation on a computer, to allow real-time synthesis of speech, which would arguably offer the most efficient and expressive channel for communication. The potential for BCI speech synthesis has only recently been realized because of seminal studies of the neuroanatomical and neurophysiological underpinnings of speech production using intracranial electrocorticographic (ECoG) recordings in patients undergoing epilepsy surgery. These studies have shown that cortical areas responsible for vocalization and articulation are distributed over a large area of ventral sensorimotor cortex, and that it is possible to decode speech and reconstruct its acoustics from ECoG if these areas are recorded with sufficiently dense and comprehensive electrode arrays. In this article, we review these advances, including the latest neural decoding strategies that range from deep learning models to the direct concatenation of speech units. We also discuss state-of-the-art vocoders that are integral to constructing natural-sounding audio waveforms for speech BCIs. Finally, this review outlines some of the challenges ahead in directly synthesizing speech for patients with LIS.
Pages: 263-273
Number of pages: 10
Related papers
50 in total
  • [1] Brain-Computer Interface: Applications to Speech Decoding and Synthesis to Augment Communication
    Luo, Shiyu
    Rabbani, Qinwan
    Crone, Nathan E.
    [J]. NEUROTHERAPEUTICS, 2022, 19 (01) : 263 - 273
  • [2] fMRI Brain Decoding and Its Applications in Brain-Computer Interface: A Survey
    Du, Bing
    Cheng, Xiaomu
    Duan, Yiping
    Ning, Huansheng
    [J]. BRAIN SCIENCES, 2022, 12 (02)
  • [3] Brain-computer interfaces for speech communication
    Brumberg, Jonathan S.
    Nieto-Castanon, Alfonso
    Kennedy, Philip R.
    Guenther, Frank H.
    [J]. SPEECH COMMUNICATION, 2010, 52 (04) : 367 - 379
  • [4] Preliminary study for intonation classification of imagined speech for brain-computer interface applications
    Casso, Isabel
    Rouillard, Jose
    Si-Mohammed, Hakim
    Betrouni, Nacim
    Cabestaing, Francois
    Basirat, Anahita
    [J]. 2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 1238 - 1242
  • [5] Advances in brain-computer interface for decoding speech imagery from EEG signals: a systematic review
    Rahman, Nimra
    Khan, Danish Mahmood
    Masroor, Komal
    Arshad, Mehak
    Rafiq, Amna
    Fahim, Syeda Maham
    [J]. COGNITIVE NEURODYNAMICS, 2024,
  • [6] P300 brain-computer interface design for communication and control applications
    Wang, Chuanchu
    Guan, Cuntai
    Zhang, Haihong
    [J]. 2005 27TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-7, 2005, : 5400 - 5403
  • [7] Decoding the Debate: A Comparative Study of Brain-Computer Interface and Neurofeedback
    Mohammad H. Mahrooz
    Farrokh Fattahzadeh
    Shahriar Gharibzadeh
    [J]. Applied Psychophysiology and Biofeedback, 2024, 49 : 47 - 53
  • [8] Editorial: Brain-computer interface and its applications
    Chen, Duo
    Liu, Ke
    Guo, Jiayang
    Bi, Luzheng
    Xiang, Jing
    [J]. FRONTIERS IN NEUROROBOTICS, 2023, 17
  • [9] An auditory brain-computer interface evoked by natural speech
    Lopez-Gordo, M. A.
    Fernandez, E.
    Romero, S.
    Pelayo, F.
    Prieto, Alberto
    [J]. JOURNAL OF NEURAL ENGINEERING, 2012, 9 (03)
  • [10] Artificial speech synthesizer control by brain-computer interface
    Brumberg, Jonathan S.
    Kennedy, Philip R.
    Guenther, Frank H.
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 652 - +