Chinese Named Entity Recognition via Joint Identification and Categorization

被引:0
|
作者
Zhou Junsheng [1 ,2 ]
Qu Weiguang [1 ,2 ]
Zhang Fen [2 ]
机构
[1] Jiangsu Res Ctr Informat Secur & Privacy Technol, Nanjing 210046, Jiangsu, Peoples R China
[2] Nanjing Normal Univ, Sch Comp Sci & Technol, Nanjing 210046, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Named entity recognition; Entity-level features; Sequence labeling approach; Joint identification and categorization;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Chinese Named entity recognition (NER) is an important task for Chinese information processing. Traditional sequence labeling approaches to Chinese NER cannot treat globally a string of continuous characters as a named entity candidate so that the entity-level features cannot be exploited in a natural way. To deal with this problem, we formulate Chinese NER as a joint identification and categorization task that performs the two subtasks simultaneously: boundary identification and entity categorization, together with segmentation. The proposed approach provides a natural formulation to treats pieces of continuous characters as named entity candidates, which allows for more accurate prediction by examining both the internal evidence and contextual information of the candidates. Within this framework, we explored a variety of effective feature representations for Chinese NER. Closed tests on two quite different corpora from the third SIGHAN bakeoff show that our approach significantly outperforms the best in the literature, achieving state-of-the-art performance.
引用
收藏
页码:225 / 230
页数:6
相关论文
共 50 条
  • [21] Survey of Chinese Named Entity Recognition Research
    Zhao, Jigui
    Qian, Yurong
    Wang, Kui
    Hou, Shuxiang
    Chen, Jiaying
    [J]. Computer Engineering and Applications, 2024, 60 (01) : 15 - 27
  • [22] Joint segmentation and named entity recognition using dual decomposition in Chinese discharge summaries
    Xu, Yan
    Wang, Yining
    Liu, Tianren
    Liu, Jiahua
    Fan, Yubo
    Qian, Yi
    Tsujii, Junichi
    Chang, Eric I.
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2014, 21 (E1) : E84 - E92
  • [23] Joint Self-Attention and Multi-Embeddings for Chinese Named Entity Recognition
    Song, Cijian
    Xiong, Yan
    Huang, Wenchao
    Ma, Lu
    [J]. 2020 6TH INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING AND COMMUNICATIONS (BIGCOM 2020), 2020, : 76 - 80
  • [24] Joint Learning of Named Entity Recognition and Relation Extraction
    Xu, Qiuyan
    Li, Fang
    [J]. 2011 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), VOLS 1-4, 2012, : 1978 - 1982
  • [25] Chinese Named Entity Recognition Methods Combined with Entity Boundary Cues
    Huang, Rong
    Chen, Yanping
    Hu, Ying
    Huang, Ruizhang
    Qin, Yongbin
    [J]. Computer Engineering and Applications, 2024, 60 (06) : 199 - 206
  • [26] Enhancing Entity Boundary Detection for Better Chinese Named Entity Recognition
    Chen, Chun
    Kong, Fang
    [J]. ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 2, 2021, : 20 - 25
  • [27] A Chinese Named Entity Recognition System with Neural Networks
    Yi, Hui-Kang
    Huang, Jiu-Ming
    Yang, Shu-Qiang
    [J]. 4TH ANNUAL INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND APPLICATIONS (ITA 2017), 2017, 12
  • [28] Chinese Named Entity Recognition and Disambiguation Based on Wikipedia
    Yu Miao
    Lv Yajuan
    Liu Qun
    Su Jinsong
    Xiong Hao
    [J]. NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, 2012, 333 : 272 - 283
  • [29] Application of Data Encryption in Chinese Named Entity Recognition
    Dong, Jikun
    Long, Kaifang
    Yu, Hui
    Xu, Weizhi
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VIII, 2023, 14261 : 99 - 111
  • [30] Chinese Named Entity Recognition Augmented with Lexicon Memory
    Zhou, Yi
    Zheng, Xiao-Qing
    Huang, Xuan-Jing
    [J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2023, 38 (05) : 1021 - 1035