Improving Named Entity Recognition in Spoken Dialog Systems by Context and Speech Pattern Modeling

被引:0
|
作者
Minh Nguyen [1 ]
Yu, Zhou [2 ]
机构
[1] Univ Calif Davis, Davis, CA 95616 USA
[2] Columbia Univ, New York, NY 10027 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
While named entity recognition (NER) from speech has been around as long as NER from written text has, the accuracy of NER from speech has generally been much lower than that of NER from text. The rise in popularity of spoken dialog systems such as Siri or Alexa highlights the need for more accurate NER from speech because NER is a core component for understanding what users said in dialogs. Deployed spoken dialog systems receive user input in the form of automatic speech recognition (ASR) transcripts, and simply applying NER model trained on written text to ASR transcripts often leads to low accuracy because compared to written text, ASR transcripts lack important cues such as punctuation and capitalization. Besides, errors in ASR transcripts also make NER from speech challenging. We propose two models that exploit dialog context and speech pattern clues to extract named entities more accurately from open-domain dialogs in spoken dialog systems. Our results show the benefit of modeling dialog context and speech patterns in two settings: a standard setting with random partition of data and a more realistic but also more difficult setting where many named entities encountered during deployment are unseen during training.
引用
收藏
页码:45 / 55
页数:11
相关论文
共 50 条
  • [1] Modeling Spoken Dialog Systems under the Interactive Pattern Recognition Framework
    Ines Torres, M.
    Miguel Benedi, Jose
    Justo, Raquel
    Ghigi, Fabrizio
    [J]. STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, 2012, 7626 : 519 - 528
  • [2] Speech recognition of a named entity
    Tomita, T
    Okimoto, Y
    Yamamoto, H
    Sagisaka, Y
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 1057 - 1060
  • [3] Improving Named Entity Recognition by External Context Retrieving and Cooperative Learning
    Wang, Xinyu
    Jiang, Yong
    Bach, Nguyen
    Wang, Tao
    Huang, Zhongqiang
    Huang, Fei
    Tu, Kewei
    [J]. 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 (ACL-IJCNLP 2021), 2021, : 1800 - 1812
  • [4] Jointly predicting dialog act and named entity for spoken language understanding
    Jeong, Minwoo
    Lee, Gary Geunbae
    [J]. 2006 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, 2006, : 66 - +
  • [5] On the Use of External Data for Spoken Named Entity Recognition
    Pasad, Ankita
    Wu, Felix
    Shon, Suwon
    Livescu, Karen
    Han, Kyu
    [J]. NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 724 - 737
  • [6] Pattern Mining for Named Entity Recognition
    Nouvel, Damien
    Antoine, Jean-Yves
    Friburger, Nathalie
    [J]. HUMAN LANGUAGE TECHNOLOGY CHALLENGES FOR COMPUTER SCIENCE AND LINGUISTICS, 2014, 8387 : 226 - 237
  • [7] A Proposal for Improving Spoken Dialog Systems using Context Information Fusion
    Chairi, I.
    Griol, D.
    Garcia, J.
    Molina, J. M.
    [J]. 2015 18TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2015, : 1121 - 1127
  • [8] Joint Speech Translation and Named Entity Recognition
    Gaido, Marco
    Papi, Sara
    Negri, Matteo
    Turchi, Marco
    [J]. INTERSPEECH 2023, 2023, : 47 - 51
  • [9] Combining data-driven systems for improving named entity recognition
    Kozareva, Z
    Ferrández, O
    Montoyo, A
    Muñoz, R
    Suárez, A
    [J]. NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, PROCEEDINGS, 2005, 3513 : 80 - 90
  • [10] Combining data-driven systems for improving named entity recognition
    Kozareva, Z.
    Ferrandez, O.
    Montoyo, A.
    Munoz, R.
    Suarez, A.
    Gomez, J.
    [J]. DATA & KNOWLEDGE ENGINEERING, 2007, 61 (03) : 449 - 466