Using Crowdsourcing for Fine-Grained Entity Type Completion in Knowledge Bases

被引:1
|
作者
Dong, Zhaoan [1 ,2 ]
Fan, Ju [1 ,2 ]
Lu, Jiaheng [1 ,2 ,3 ]
Du, Xiaoyong [1 ,2 ]
Ling, Tok Wang [4 ]
机构
[1] Renmin Univ China, DEKE, MOE, Beijing, Peoples R China
[2] Renmin Univ China, Sch Informat, Beijing, Peoples R China
[3] Univ Helsinki, Dept Comp Sci, Helsinki, Finland
[4] Natl Univ Singapore, Sch Comp, Singapore, Singapore
来源
基金
芬兰科学院; 中国国家自然科学基金;
关键词
Crowdsourcing; Entity type completion; Knowledge base;
D O I
10.1007/978-3-319-96893-3_19
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recent years have witnessed the proliferation of large-scale Knowledge Bases (KBs). However, many entities in KBs have incomplete type information, and some are totally untyped. Even worse, fine-grained types (e.g., BasketballPlayer) containing rich semantic meanings are more likely to be incomplete, as they are more difficult to be obtained. Existing machine-based algorithms use predicates (e.g., birthPlace) of entities to infer their missing types, and they have limitations that the predicates may be insufficient to infer fine-grained types. In this paper, we utilize crowdsourcing to solve the problem, and address the challenge of controlling crowdsourcing cost. To this end, we propose a hybrid machine-crowdsourcing approach for fine-grained entity type completion. It firstly determines the types of some "representative" entities via crowdsourcing and then infers the types for remaining entities based on the crowdsourcing results. To support this approach, we first propose an embedding-based influence for type inference which considers not only the distance between entity embeddings but also the distances between entity and type embeddings. Second, we propose a new difficulty model for entity selection which can better capture the uncertainty of the machine algorithm when identifying the entity types. We demonstrate the effectiveness of our approach through experiments on real crowdsourcing platforms. The results show that our method outperforms the state-of-the-art algorithms by improving the effectiveness of fine-grained type completion at affordable crowdsourcing cost.
引用
收藏
页码:248 / 263
页数:16
相关论文
共 50 条
  • [31] Fine-Grained Tasks for Crowdsourced Entity Resolution
    Nie, Tiezheng
    Mao, Hanyu
    Liu, Xin
    Yu, Sining
    APPLIED SCIENCES-BASEL, 2025, 15 (01):
  • [32] Fine-grained Named Entity Recognition for Turkish
    Khudoyberdieva, Lola
    Diri, Banu
    32ND IEEE SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU 2024, 2024,
  • [33] A Chinese Corpus for Fine-grained Entity Typing
    Lee, Chin
    Dai, Hongliang
    Song, Yangqiu
    Li, Xin
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 4451 - 4457
  • [34] Transfer learning for fine-grained entity typing
    Feng Hou
    Ruili Wang
    Yi Zhou
    Knowledge and Information Systems, 2021, 63 : 845 - 866
  • [35] Fine-grained Multimodal Entity Linking for Videos
    Zhao H.-Q.
    Wang X.-W.
    Li J.-L.
    Li Z.-X.
    Xiao Y.-H.
    Ruan Jian Xue Bao/Journal of Software, 2024, 35 (03): : 1140 - 1153
  • [36] Fine-Grained Entity Typing for Domain Independent Entity Linking
    Onoe, Yasumasa
    Durrett, Greg
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 8576 - 8583
  • [37] Multilingual Fine-Grained Named Entity Recognition
    Lupancu, Viorica-Camelia
    Iftene, Adrian
    COMPUTER SCIENCE JOURNAL OF MOLDOVA, 2023, 31 (03) : 321 - 339
  • [38] Fine-grained Dutch named entity recognition
    Desmet, Bart
    Hoste, Veronique
    LANGUAGE RESOURCES AND EVALUATION, 2014, 48 (02) : 307 - 343
  • [39] Fine-grained Dutch named entity recognition
    Bart Desmet
    Véronique Hoste
    Language Resources and Evaluation, 2014, 48 : 307 - 343
  • [40] Fine-Grained Entity Typing with Hierarchical Inference
    Ren, Quan
    PROCEEDINGS OF 2020 IEEE 4TH INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2020), 2020, : 2552 - 2558