A novel feature integration method for named entity recognition model in product titles

Cited by: 3
Authors
Sun, Shiqi [1]
Li, Jingyuan [1]
Zhang, Kun [2]
Sun, Xinghang [4]
Cen, Jianhe [3]
Wang, Yuanzhuo [2]
Affiliations
[1] Beijing Technol & Business Univ, Sch Comp & Artificial Intelligence, Beijing, Peoples R China
[2] Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China
[3] Zhengzhou Univ, Henan Inst Adv Technol, Zhengzhou, Peoples R China
[4] Hebei Univ Engn, Coll Landscape & Ecol Engn, Handan, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
multitask learning; named entity recognition; natural language processing
DOI
10.1111/coin.12654
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Entity recognition in product titles is essential for retrieving and recommending product information. Product title text is irregular: sentence structure is informal, professional attribute words are numerous, and many unrelated, independent entities appear in varied combinations. As a result, existing general-purpose named entity recognition models are of limited use for product title entity recognition in the e-commerce domain. Most current studies address only one of the two challenges rather than considering both together. We propose a NEZHA-CNN-GlobalPointer architecture augmented with a label semantic network, which uses multigranularity contextual information and label semantic information to capture the internal structure and category information of words and texts, and thereby to improve entity recognition accuracy. In a series of experiments on a dataset of Chinese product titles from JD.com, our approach improves the F1 value by 5.98% compared with the BERT-LSTM-CRF model on the product title corpus.
Pages: 19