Fine-Grained Entity Typing via Label Noise Reduction and Data Augmentation

Cited by: 0
Authors
Li, Haoyang [1 ]
Lin, Xueling [1 ]
Chen, Lei [1 ]
Affiliations
[1] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
Keywords
Entity typing; Noise reduction; Data augmentation;
DOI
10.1007/978-3-030-73194-6_24
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Fine-grained entity typing aims to assign one or more types to each entity mention in a corpus. Recently, distant supervision has been used to generate training data, but it has two drawbacks. First, it assigns the same labels to every mention of an entity regardless of context, which introduces label noise. Some approaches alleviate this issue with hand-crafted features, but these require expert effort. Second, entity mentions that are not in the knowledge base (KB) are ignored and cannot be added to the training data, which reduces its size. Furthermore, existing entity typing systems neglect the types of other entity mentions in the same context, which provide evidence for inferring the types of the target mention. In this paper, we first propose graph-based and sampling-based approaches to reduce the label noise generated by distant supervision, and then augment the training data by finding potential entity mentions in the corpus and inferring their types. Moreover, we propose a hierarchical neural network that incorporates the types of other mentions in the context and satisfies type consistency when predicting types. Experiments on two datasets show that our system outperforms state-of-the-art entity typing systems.
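The two drawbacks of distant supervision named in the abstract can be illustrated with a toy sketch. This is not the authors' system: the knowledge base `TOY_KB`, the function `distant_labels`, the type names, and the example sentences are all hypothetical and chosen only to show context-agnostic labeling and the loss of out-of-KB mentions.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical toy knowledge base: entity name -> every type the KB lists for it.
TOY_KB: Dict[str, Set[str]] = {
    "Arnold Schwarzenegger": {"/person", "/person/actor", "/person/politician"},
}


def distant_labels(
    pairs: List[Tuple[str, str]],
) -> List[Tuple[str, str, Set[str]]]:
    """Label each (sentence, mention) pair by KB lookup alone, ignoring context."""
    labeled = []
    for sentence, mention in pairs:
        if mention not in TOY_KB:
            # Drawback 2: mentions absent from the KB are dropped entirely.
            continue
        # Drawback 1: every mention receives the full KB type set,
        # whether or not the sentence supports each type.
        labeled.append((sentence, mention, set(TOY_KB[mention])))
    return labeled


corpus = [
    ("Arnold Schwarzenegger starred in The Terminator.", "Arnold Schwarzenegger"),
    ("Arnold Schwarzenegger signed the bill into law.", "Arnold Schwarzenegger"),
    ("Jane Roe opened a bakery downtown.", "Jane Roe"),
]
training_data = distant_labels(corpus)
```

In this sketch both retained sentences carry the identical label set, so the first sentence is labeled `/person/politician` even though its context only supports the actor reading, and the out-of-KB mention "Jane Roe" never reaches the training data.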
Pages: 356 - 374
Number of pages: 19
Related papers
50 in total
  • [1] Hierarchical Modeling of Label Dependency and Label Noise in Fine-grained Entity Typing
    Wu, Junshuang
    Zhang, Richong
    Mao, Yongyi
    Shahrbabak, Masoumeh Soflaei
    Huai, Jinpeng
    [J]. PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 3950 - 3956
  • [2] Dealing With Hierarchical Types and Label Noise in Fine-Grained Entity Typing
    Wu, Junshuang
    Zhang, Richong
    Mao, Yongyi
    Huai, Jinpeng
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 1305 - 1318
  • [3] Multilingual Fine-Grained Entity Typing
    van Erp, Marieke
    Vossen, Piek
    [J]. LANGUAGE, DATA, AND KNOWLEDGE, LDK 2017, 2017, 10318 : 262 - 275
  • [4] Improving Fine-grained Entity Typing with Entity Linking
    Dai, Hongliang
    Du, Donghong
    Li, Xin
    Song, Yangqiu
    [J]. 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 6210 - 6215
  • [5] Fine-Grained Entity Typing in Hyperbolic Space
    Lopez, Federico
    Heinzerling, Benjamin
    Strube, Michael
    [J]. 4TH WORKSHOP ON REPRESENTATION LEARNING FOR NLP (REPL4NLP-2019), 2019, : 169 - 180
  • [6] Transfer learning for fine-grained entity typing
    Hou, Feng
    Wang, Ruili
    Zhou, Yi
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2021, 63 (04) : 845 - 866
  • [7] A Chinese Corpus for Fine-grained Entity Typing
    Lee, Chin
    Dai, Hongliang
    Song, Yangqiu
    Li, Xin
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 4451 - 4457
  • [8] Transfer learning for fine-grained entity typing
    Feng Hou
    Ruili Wang
    Yi Zhou
    [J]. Knowledge and Information Systems, 2021, 63 : 845 - 866
  • [9] Fine-Grained Entity Typing with Hierarchical Inference
    Ren, Quan
    [J]. PROCEEDINGS OF 2020 IEEE 4TH INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2020), 2020, : 2552 - 2558
  • [10] Learning with Noise: Improving Distantly-Supervised Fine-grained Entity Typing via Automatic Relabeling
    Zhang, Haoyu
    Long, Dingkun
    Xu, Guangwei
    Zhu, Muhua
    Xie, Pengjun
    Huang, Fei
    Wang, Ji
    [J]. PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 3808 - 3815