Real-time decoding of question-and-answer speech dialogue using human cortical activity

被引:0
|
作者
David A. Moses
Matthew K. Leonard
Joseph G. Makin
Edward F. Chang
机构
[1] Department of Neurological Surgery and the Center for Integrative Neuroscience at UC San Francisco,
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Natural communication often occurs in dialogue, differentially engaging auditory and sensorimotor brain regions during listening and speaking. However, previous attempts to decode speech directly from the human brain typically consider listening or speaking tasks in isolation. Here, human participants listened to questions and responded aloud with answers while we used high-density electrocorticography (ECoG) recordings to detect when they heard or said an utterance and to then decode the utterance’s identity. Because certain answers were only plausible responses to certain questions, we could dynamically update the prior probabilities of each answer using the decoded question likelihoods as context. We decode produced and perceived utterances with accuracy rates as high as 61% and 76%, respectively (chance is 7% and 20%). Contextual integration of decoded question likelihoods significantly improves answer decoding. These results demonstrate real-time decoding of speech in an interactive, conversational setting, which has important implications for patients who are unable to communicate.
引用
收藏
相关论文
共 50 条
  • [1] Real-time decoding of question-and-answer speech dialogue using human cortical activity
    Moses, David A.
    Leonard, Matthew K.
    Makin, Joseph G.
    Chang, Edward F.
    [J]. NATURE COMMUNICATIONS, 2019, 10 (1)
  • [2] mimir: A Market-Based Real-Time Question and Answer Service
    Hsieh, Gary
    Counts, Scott
    [J]. CHI2009: PROCEEDINGS OF THE 27TH ANNUAL CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, VOLS 1-4, 2009, : 769 - 778
  • [3] MultiQT: Multimodal Learning for Real-Time Question Tracking in Speech
    Havtorn, Jakob D.
    Latko, Jan
    Edin, Joakim
    Borgholt, Lasse
    Maaloe, Lars
    Belgrano, Lorenzo
    Jacobsen, Nicolai F.
    Sdun, Regitze
    Agic, Zeljko
    [J]. 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 2370 - 2380
  • [4] Neural speech recognition: continuous phoneme decoding using spatiotemporal representations of human cortical activity
    Moses, David A.
    Mesgarani, Nima
    Leonard, Matthew K.
    Chang, Edward F.
    [J]. JOURNAL OF NEURAL ENGINEERING, 2016, 13 (05)
  • [5] Integration of Speech and Text Processing Modules into a Real-Time Dialogue System
    Ptacek, Jan
    Ircing, Pavel
    Spousta, Miroslav
    Romportl, Jan
    Loose, Zdenek
    Cinkova, Silvie
    Relano Gil, Jose
    Santos, Raul
    [J]. TEXT, SPEECH AND DIALOGUE, 2010, 6231 : 552 - +
  • [6] Real-time imaging of human cortical activity evoked by painful esophageal stimulation
    Hobson, AR
    Furlong, PL
    Worthen, SF
    Hillebrand, A
    Barnes, GR
    Singh, KD
    Aziz, Q
    [J]. GASTROENTEROLOGY, 2005, 128 (03) : 610 - 619
  • [7] Real-time classification of auditory sentences using evoked cortical activity in humans
    Moses, David A.
    Leonard, Matthew K.
    Chang, Edward F.
    [J]. JOURNAL OF NEURAL ENGINEERING, 2018, 15 (03)
  • [8] Real-time decoding of nonstationary neural activity in motor cortex
    Wu, Wei
    Hatsopoulos, Nicholas G.
    [J]. IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2008, 16 (03) : 213 - 222
  • [9] REAL-TIME SPEECH CODING AND DECODING FOR GSM SYSTEM AND ITS IMPLEMENT IN VC
    Wan, Guojin
    Xu, Qingyi
    Xiao, Jing
    Lu, Sheng
    [J]. PROCEEDINGS OF 2011 INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY AND APPLICATION, ICCTA2011, 2011, : 848 - 852
  • [10] Real-time Human Activity Recognition
    Albukhary, N.
    Mustafah, Y. M.
    [J]. 6TH INTERNATIONAL CONFERENCE ON MECHATRONICS (ICOM'17), 2017, 260