From language models to large-scale food and biomedical knowledge graphs

被引:0
|
作者
Gjorgjina Cenikj
Lidija Strojnik
Risto Angelski
Nives Ogrinc
Barbara Koroušić Seljak
Tome Eftimov
机构
[1] Jožef Stefan Institute,
[2] Jožef Stefan International Postgraduate School,undefined
[3] Clinic Doctor 24-hours,undefined
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Knowledge about the interactions between dietary and biomedical factors is scattered throughout uncountable research articles in an unstructured form (e.g., text, images, etc.) and requires automatic structuring so that it can be provided to medical professionals in a suitable format. Various biomedical knowledge graphs exist, however, they require further extension with relations between food and biomedical entities. In this study, we evaluate the performance of three state-of-the-art relation-mining pipelines (FooDis, FoodChem and ChemDis) which extract relations between food, chemical and disease entities from textual data. We perform two case studies, where relations were automatically extracted by the pipelines and validated by domain experts. The results show that the pipelines can extract relations with an average precision around 70%, making new discoveries available to domain experts with reduced human effort, since the domain experts should only evaluate the results, instead of finding, and reading all new scientific papers.
引用
收藏
相关论文
共 50 条
  • [1] From language models to large-scale food and biomedical knowledge graphs
    Cenikj, Gjorgjina
    Strojnik, Lidija
    Angelski, Risto
    Ogrinc, Nives
    Seljak, Barbara Korousic
    Eftimov, Tome
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [2] Large Language Models as Commonsense Knowledge for Large-Scale Task Planning
    Zhao, Zirui
    Lee, Wee Sun
    Hsu, David
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [3] Hallucination Mitigation in Natural Language Generation from Large-Scale Open-Domain Knowledge Graphs
    Shi, Xiao
    Zhu, Zhengyuan
    Zhang, Zeyu
    Li, Chengkai
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 12506 - 12521
  • [4] Enriching Biomedical Knowledge for Low-resource Language Through Large-Scale Translation
    Phan, Long
    Dang, Tai
    Tran, Hieu
    Trinh, Trieu H.
    Phan, Vy
    Chau, Lam D.
    Luong, Minh-Thang
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 3131 - 3142
  • [5] Benchmarking Biomedical Relation Knowledge in Large Language Models
    Zhang, Fenghui
    Yang, Kuo
    Zhao, Chenqian
    Li, Haixu
    Dong, Xin
    Tian, Haoyu
    Zhou, Xuezhong
    BIOINFORMATICS RESEARCH AND APPLICATIONS, PT II, ISBRA 2024, 2024, 14955 : 482 - 495
  • [6] Unifying Large Language Models and Knowledge Graphs: A Roadmap
    Pan, Shirui
    Luo, Linhao
    Wang, Yufei
    Chen, Chen
    Wang, Jiapu
    Wu, Xindong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (07) : 3580 - 3599
  • [7] LargeEA: Aligning Entities for Large-scale Knowledge Graphs
    Ge, Congcong
    Liu, Xiaoze
    Chen, Lu
    Gao, Yunjun
    Zheng, Baihua
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2021, 15 (02): : 237 - 245
  • [8] Linking Surface Facts to Large-Scale Knowledge Graphs
    Radevski, Gorjan
    Gashteovski, Kiril
    Hung, Chia-Chien
    Lawrence, Carolin
    Glavas, Goran
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 7189 - 7207
  • [9] KNOWLEDGE TRANSFER FROM LARGE-SCALE PRETRAINED LANGUAGE MODELS TO END-TO-END SPEECH RECOGNIZERS
    Kubo, Yotaro
    Karita, Shigeki
    Bacchiani, Michiel
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 8512 - 8516
  • [10] Cross-Lingual Entity Query from Large-Scale Knowledge Graphs
    Su, Yonghao
    Zhang, Chi
    Li, Jinyang
    Wang, Chengyu
    Qian, Weining
    Zhou, Aoying
    WEB TECHNOLOGIES AND APPLICATIONS, APWEB 2015 WORKSHOPS, 2015, 9461 : 139 - 150