Incorporating Class-based Language Model for Named Entity Recognition in Factorized Neural Transducer

被引:0
|
作者
Wang, Peng [2 ,3 ]
Yang, Yifan [1 ]
Bang, Zheng [1 ]
Tan, Tian [1 ]
Zhang, Shiliang [4 ]
Chen, Xie [1 ]
机构
[1] Shanghai Jiao Tong Univ, AI Inst, MoE Key Lab Artificial Intelligence, Shanghai, Peoples R China
[2] Chinese Acad Sci, Key Lab Speech Acoust & Content Understanding, Inst Acoust, Beijing, Peoples R China
[3] Univ Chinese Acad Sci, Beijing, Peoples R China
[4] Alibaba Grp, Hangzhou, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
named entity recognition; factorized neural Transducer; class-based language model; beam search; SPEECH RECOGNITION; ASR;
D O I
10.21437/Interspeech.2024-653
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite advancements of end-to-end (E2E) models in speech recognition, named entity recognition (NER) is still challenging but critical for semantic understanding. Previous studies mainly focus on various rule-based or attention-based contextual biasing algorithms. However, their performance might be sensitive to the biasing weight or degraded by excessive attention to the named entity list, along with a risk of false triggering. Inspired by the success of the class-based language model (LM) in NER in conventional hybrid systems and the effective decoupling of acoustic and linguistic information in the factorized neural Transducer (FNT), we propose C-FNT, a novel E2E model that incorporates class-based LMs into FNT. In C-FNT, the LM score of named entities can be associated with the name class instead of its surface form. The experimental results show that our proposed C-FNT significantly reduces error in named entities without hurting performance in general word recognition.
引用
收藏
页码:742 / 746
页数:5
相关论文
共 50 条
  • [31] A named entity recognition model based on ensemble learning
    Zhu, Xinghui
    Zou, Zhuoyang
    Qiao, Bo
    Fang, Kui
    Chen, Yiming
    JOURNAL OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING, 2021, 21 (02) : 475 - 486
  • [32] A Neural Transition-based Joint Model for Disease Named Entity Recognition and Normalization
    Ji, Zongcheng
    Xia, Tian
    Han, Mei
    Xiao, Jing
    59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 2819 - 2827
  • [33] Named Entity Recognition Model Based on Feature Fusion
    Sun, Zhen
    Li, Xinfu
    INFORMATION, 2023, 14 (02)
  • [34] Named entity recognition based on a machine learning model
    Wang, Jing
    Liu, Zhijing
    Zhao, Hui
    Research Journal of Applied Sciences, Engineering and Technology, 2012, 4 (20) : 3973 - 3980
  • [35] Chinese named entity recognition model based on BERT
    Liu, Hongshuai
    Jun, Ge
    Zheng, Yuanyuan
    2020 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE COMMUNICATION AND NETWORK SECURITY (CSCNS2020), 2021, 336
  • [36] A comprehensive dataset and neural network approach for named entity recognition in the Uzbek language
    Mengliev, Davlatyor
    Barakhnin, Vladimir
    Eshkulov, Mukhriddin
    Ibragimov, Bahodir
    Madirimov, Shohrux
    DATA IN BRIEF, 2025, 58
  • [37] Incorporating Named Entity Recognition into the Speech Transcription Process
    Hatmi, Mohamed
    Jacquin, Christine
    Morin, Emmanuel
    Meignier, Sylvain
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3699 - 3703
  • [38] CALM: Context Augmentation with Large Language Model for Named Entity Recognition
    Luiggi, Tristan
    Herserant, Tanguy
    Trani, Thong
    Soulier, Laure
    Guigue, Vincent
    LINKING THEORY AND PRACTICE OF DIGITAL LIBRARIES, PT I, TPDL 2024, 2024, 15177 : 273 - 291
  • [39] Supervised Named Entity Recognition in Assamese language
    Talukdar, Gitimoni
    Borah, Pranjal Protim
    Baruah, Arup
    2014 INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING AND INFORMATICS (IC3I), 2014, : 187 - 191
  • [40] Named Entity Recognition System for Sindhi Language
    Jumani, Awais Khan
    Memon, Mashooque Ahmed
    Khoso, Fida Hussain
    Sanjrani, Anwar Ali
    Soomro, Safeeullah
    EMERGING TECHNOLOGIES IN COMPUTING, ICETIC 2018, 2018, 200 : 237 - 246