Incorporating Class-based Language Model for Named Entity Recognition in Factorized Neural Transducer

被引:0
|
作者
Wang, Peng [2 ,3 ]
Yang, Yifan [1 ]
Bang, Zheng [1 ]
Tan, Tian [1 ]
Zhang, Shiliang [4 ]
Chen, Xie [1 ]
机构
[1] Shanghai Jiao Tong Univ, AI Inst, MoE Key Lab Artificial Intelligence, Shanghai, Peoples R China
[2] Chinese Acad Sci, Key Lab Speech Acoust & Content Understanding, Inst Acoust, Beijing, Peoples R China
[3] Univ Chinese Acad Sci, Beijing, Peoples R China
[4] Alibaba Grp, Hangzhou, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
named entity recognition; factorized neural Transducer; class-based language model; beam search; SPEECH RECOGNITION; ASR;
D O I
10.21437/Interspeech.2024-653
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite advancements of end-to-end (E2E) models in speech recognition, named entity recognition (NER) is still challenging but critical for semantic understanding. Previous studies mainly focus on various rule-based or attention-based contextual biasing algorithms. However, their performance might be sensitive to the biasing weight or degraded by excessive attention to the named entity list, along with a risk of false triggering. Inspired by the success of the class-based language model (LM) in NER in conventional hybrid systems and the effective decoupling of acoustic and linguistic information in the factorized neural Transducer (FNT), we propose C-FNT, a novel E2E model that incorporates class-based LMs into FNT. In C-FNT, the LM score of named entities can be associated with the name class instead of its surface form. The experimental results show that our proposed C-FNT significantly reduces error in named entities without hurting performance in general word recognition.
引用
收藏
页码:742 / 746
页数:5
相关论文
共 50 条
  • [1] Bacterial Named Entity Recognition Based on Language Model
    Li, Xusheng
    Fu, Chengcheng
    Zhong, Ran
    Zhong, Duo
    He, Tingling
    Jiang, Xingpeng
    2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 2715 - 2721
  • [2] Named entity recognition using neural language model and CRF for Hindi language
    Sharma, Richa
    Morwal, Sudha
    Agarwal, Basant
    COMPUTER SPEECH AND LANGUAGE, 2022, 74
  • [3] Thai Named-Entity Recognition Using Class-based Language Modeling on Multiple-sized Subword Units
    Saykhum, Kwanchiva
    Boonpiam, Vataya
    Thatphithakkul, Nattanun
    Wutiwiwatchai, Chai
    Natthee, Cholwich
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1586 - +
  • [4] A deep neural network-based model for named entity recognition for Hindi language
    Richa Sharma
    Sudha Morwal
    Basant Agarwal
    Ramesh Chandra
    Mohammad S. Khan
    Neural Computing and Applications, 2020, 32 : 16191 - 16203
  • [5] A deep neural network-based model for named entity recognition for Hindi language
    Sharma, Richa
    Morwal, Sudha
    Agarwal, Basant
    Chandra, Ramesh
    Khan, Mohammad S.
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (20): : 16191 - 16203
  • [6] FACTORIZED NEURAL TRANSDUCER FOR EFFICIENT LANGUAGE MODEL ADAPTATION
    Chen, Xie
    Meng, Zhong
    Parthasarathy, Sarangarajan
    Li, Jinyu
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 8132 - 8136
  • [7] Incorporating token-level dictionary feature into neural model for named entity recognition
    Mu Xiaofeng
    Wang Wei
    Xu Aiping
    NEUROCOMPUTING, 2020, 375 : 43 - 50
  • [8] CLASS-BASED NAMED ENTITY TRANSLATION IN A SPEECH TO SPEECH TRANSLATION SYSTEM
    Maskey, Sameer R.
    Cmejrek, Martin
    Zhou, Bowen
    Gao, Yuqing
    2008 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY: SLT 2008, PROCEEDINGS, 2008, : 253 - 256
  • [9] A Neural Span-Based Continual Named Entity Recognition Model
    Zhang, Yunan
    Chen, Qingcai
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 11, 2023, : 13993 - 14001
  • [10] Named entity Recognition Model for Punjabi Language: A Survey
    Kaur, Pawandeep
    Kaur, Amandeep
    PROCEEDINGS OF THE 2016 2ND INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING AND INFORMATICS (IC3I), 2016, : 887 - 891