A multistage retrieval system for health-related misinformation detection

被引:0
|
作者
Fernandez-Pichel, Marcos [1 ]
Losada, David E. [1 ]
Pichel, Juan C. [1 ]
机构
[1] Univ Santiago de Compostela, Ctr Singular Invest Tecnol Intelixentes CiTIUS, Rua Jenaro de la Fuente, Santiago De Compostela 15782, Spain
关键词
Engineering applications; Web search; Health misinformation; Information retrieval; Natural language processing; Artificial intelligence; Deep learning for natural language processing;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Web search is widely used to find online medical advice. As such, health-related information access requires retrieval algorithms capable of promoting reliable documents and filtering out unreliable ones. To this end, different types of components, such as query-document matching features, passage relevance estimation and AI-based reliability estimators, need to be combined. In this paper, we propose an entire pipeline for misinformation detection, based on the fusion of multiple content-based features. We present experiments which study the influence of each pipeline stage for the target task.Our technological solution incorporates signals from technologies derived from diverse research fields, including search, deep learning for natural language processing, as well as advanced supervised and unsuper-vised learning. To combine evidence, different score fusion strategies are compared, including unsupervised rank fusion techniques and learning-to-rank methods. The reference framework for empirically validating our solution is the TREC Health Misinformation Track, which provides several challenging subtasks that foster research on the identification of reliable and correct information for health-related decision making tasks. More specifically, we address a total recall task, the goal of which is to identify all the documents conveying incorrect information for a specific set of topics, and an ad-hoc retrieval task, aiming to rank credible and correct information over incorrect information. All variants are evaluated with an assorted set of effectiveness metrics, which includes standard search measures, such as R-Precision, Average Precision or Normalised Discounted Cumulative Gain, and innovative metrics based on the compatibility between the ranked output and two reference rankings composed of helpful and harmful documents, respectively.Our experiments demonstrate the effectiveness of the proposed pipeline stages and indicate that sophisti-cated supervised fusion methods do not fare better than simpler fusion alternatives. Additionally, for reliability estimation, unsupervised textual similarity performs better than textual classification based on supervised learning. The results also show that the presented approach is highly competitive when compared with state-of-the-art solutions for the same problem.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] A multistage retrieval system for health-related misinformation detection
    Fernandez-Pichel, Marcos
    Losada, David E.
    Pichel, Juan C.
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 115
  • [2] Analysis and Detection of Health-Related Misinformation on Chinese Social Media
    Liu, Yue
    Yu, Ke
    Wu, Xiaofei
    Qing, Linbo
    Peng, Yonghong
    [J]. IEEE ACCESS, 2019, 7 : 154480 - 154489
  • [3] Addressing Health-Related Misinformation on Social Media
    Chou, Wen-Ying Sylvia
    Oh, April
    Klein, William M. P.
    [J]. JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2018, 320 (23): : 2417 - 2418
  • [4] Defining Misinformation and Related Terms in Health-Related Literature: Scoping Review
    El Mikati, Ibrahim K.
    Hoteit, Reem
    Harb, Tarek
    El Zein, Ola
    Piggott, Thomas
    Melki, Jad
    Mustafa, Reem A.
    Akl, Elie A.
    [J]. JOURNAL OF MEDICAL INTERNET RESEARCH, 2023, 25
  • [5] The social media Infodemic of health-related misinformation and technical solutions
    Rodrigues, Flinta
    Newell, Richard
    Babu, Giridhara Rathnaiah
    Chatterjee, Tulika
    Sandhu, Nimrat Kaur
    Gupta, Latika
    [J]. HEALTH POLICY AND TECHNOLOGY, 2024, 13 (02)
  • [6] Systematic Literature Review on the Spread of Health-related Misinformation on Social Media
    Wang, Yuxi
    McKee, Martin
    Torbica, Aleksandra
    Stuckler, David
    [J]. SOCIAL SCIENCE & MEDICINE, 2019, 240
  • [7] Addressing the spread of health-related misinformation on social networks: an opinion article
    Polyzou, Maria
    Kiefer, David
    Baraliakos, Xenofon
    Sewerin, Philipp
    [J]. FRONTIERS IN MEDICINE, 2023, 10
  • [8] Medical and Health-Related Misinformation on Social Media: Bibliometric Study of the Scientific Literature
    Yeung, Andy Wai Kan
    Tosevska, Anela
    Klager, Elisabeth
    Eibensteiner, Fabian
    Tsagkaris, Christos
    Parvanov, Emil D.
    Nawaz, Faisal A.
    Voelkl-Kernstock, Sabine
    Schaden, Eva
    Kletecka-Pulker, Maria
    Willschke, Harald
    Atanasov, Atanas G.
    [J]. JOURNAL OF MEDICAL INTERNET RESEARCH, 2022, 24 (01)
  • [9] Health-Related Rumour Detection On Twitter
    Sicilia, Rosa
    Lo Giudice, Stella
    Pei, Yulong
    Pechenizkiy, Mykola
    Soda, Paolo
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2017, : 1599 - 1606
  • [10] Do you trust the rumors? Examining the determinants of health-related misinformation in India
    Kapoor, Hansika
    Gurjar, Swanaya
    Mahadeshwar, Hreem
    Mehta, Nikita
    Puthillam, Arathy
    [J]. ASIAN JOURNAL OF SOCIAL PSYCHOLOGY, 2024, 27 (02) : 144 - 160