Automated Credibility Assessment of Web-Based Health Information Considering Health on the Net Foundation Code of Conduct (HONcode): Model Development and Validation Study

被引:0
|
作者
Bayani, Azadeh [1 ,2 ,3 ]
Ayotte, Alexandre [1 ,2 ,3 ]
Nikiema, Jean Noel [1 ,2 ,3 ,4 ]
机构
[1] Univ Montreal, Ctr Rech Sante Publ, Montreal, PQ H3C 3J7, Canada
[2] Ctr Integre Univ Sante & Serv Sociaux Ctr Sud Ile, Montreal, PQ H3C 3J7, Canada
[3] Lab Transformat Numer Sante, Montreal, PQ, Canada
[4] Univ Montreal, Sch Publ Hlth, Dept Management Evaluat & Hlth Policy, Montreal, PQ, Canada
关键词
HONcode; infodemic; natural language processing; web-based health information; machine learning; CONFORMITY; INTERNET;
D O I
10.2196/52995
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: An increasing number of users are turning to web-based sources as an important source of health care guidance information. Thus, trustworthy sources of information should be automatically identifiable using objective criteria.Objective: The purpose of this study was to automate the assessment of the Health on the Net Foundation Code of Conduct (HONcode) criteria, enhancing our ability to pinpoint trustworthy health information sources.Methods: A data set of 538 web pages displaying health content was collected from 43 health-related websites. HONcode criteria have been considered as web page and website levels. For the website-level criteria (confidentiality, transparency, financial disclosure, and advertising policy), a bag of keywords has been identified to assess the criteria using a rule-based model. For the web page-level criteria (authority, complementarity, justifiability, and attribution) several machine learning (ML) approaches were used. In total, 200 web pages were manually annotated until achieving a balanced representation in terms of frequency. In total, 3 ML models-random forest, support vector machines (SVM), and Bidirectional Encoder Representations from Transformers (BERT)-were trained on the initial annotated data. A second step of training was implemented for the complementarity criterion using the BERT model for multiclass classification of the complementarity sentences obtained by annotation and data augmentation (positive, negative, and noncommittal sentences). Finally, the remaining web pages were classified using the selected model and 100 sentences were randomly selected for manual review.Results: For web page-level criteria, the random forest model showed a good performance for the attribution criterion while displaying subpar performance in the others. BERT and SVM had a stable performance across all the criteria. BERT had a better area under the curve (AUC) of 0.96, 0.98, and 1.00 for neutral sentences, justifiability, and attribution, respectively. SVM had the overall better performance for the classification of complementarity with the AUC equal to 0.98. Finally, SVM and BERT had an equal AUC of 0.98 for the authority criterion. For the website level criteria, the rule-based model was able to retrieve web pages with an accuracy of 0.97 for confidentiality, 0.82 for transparency, and 0.51 for both financial disclosure and advertising policy. The final evaluation of the sentences determined 0.88 of precision and the agreement level of reviewers was computed at 0.82.Conclusions: Our results showed the potential power of automating the HONcode criteria assessment using ML approaches. This approach could be used with different types of pretrained models to accelerate the text annotation, and classification and to improve the performance in low-resource cases. Further work needs to be conducted to determine how to assign different weights to the criteria, as well as to identify additional characteristics that should be considered for consolidating these criteria into a comprehensive reliability score.(JMIR Form Res 2023;7:e52995) doi: 10.2196/52995
引用
收藏
页数:14
相关论文
共 50 条
  • [41] Development, Validation, and Evaluation of Web-Based Iranian Diabetic Personal Health Record: Rationale for and Protocol of a Randomized Controlled Trial
    Azizi, Amirabbas
    Aboutorabi, Robab
    Mazloum-Khorasani, Zahra
    Afzal-Aghaea, Monavar
    Tara, Mahmood
    JMIR RESEARCH PROTOCOLS, 2016, 5 (01):
  • [42] Development and validation of a web-based questionnaire for surveying the health and working conditions of high-performance marine craft populations
    de Alwis, Manudul Pahansen
    Lo Martire, Riccardo
    Ang, Bjorn O.
    Garme, Karl
    BMJ OPEN, 2016, 6 (06):
  • [43] Users' Experiences With Web-Based Health Care Information: Qualitative Study About Diabetes and Dementia Information Presented on a Governmental Website
    Wiegers, Therese Agnes
    Hendriks, Michelle
    Malanda, Uriell
    de Boer, Dolf
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2019, 21 (07)
  • [44] Health Information Seeking From an Intelligent Web-Based Symptom Checker: Cross-sectional Questionnaire Study
    Carmona, Kimberly Arellano
    Chittamuru, Deepti
    Kravitz, Richard L.
    Ramondt, Steven
    Ramirez, A. Susana
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2022, 24 (08)
  • [45] Older Cancer Patients' User Experiences With Web-Based Health Information Tools: A Think-Aloud Study
    Bolle, Sifra
    Romijn, Geke
    Smets, Ellen M. A.
    Loos, Eugene F.
    Kunneman, Marleen
    van Weert, Julia C. M.
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2016, 18 (07)
  • [46] A Comparison of Women's and Men's Web-Based Information-Seeking Behaviors About Gender-Related Health Information: Web-Based Survey Study of a Stratified German Sample
    Link, Elena
    Baumann, Eva
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2023, 25
  • [47] Enabling Health Information Recommendation Using Crowdsourced Refinement in Web-Based Health Information Applications: User-Centered Design Approach and EndoZone Informatics Case Study
    Li, Wenhao
    O'Hara, Rebecca
    Hull, Louise
    Slater, Helen
    Sirohi, Diksha
    Parker, Melissa A.
    Bidargaddi, Niranjan
    JMIR HUMAN FACTORS, 2024, 11
  • [48] Effectiveness of a web-based health risk assessment with individually-tailored feedback on lifestyle behaviour: study protocol
    Eva K Laan
    Roderik A Kraaijenhagen
    Niels Peek
    Wim B Busschers
    Marije Deutekom
    Patrick M Bossuyt
    Karien Stronks
    Marie-Louise Essink-Bot
    BMC Public Health, 12
  • [49] Effectiveness of a web-based health risk assessment with individually-tailored feedback on lifestyle behaviour: study protocol
    Laan, Eva K.
    Kraaijenhagen, Roderik A.
    Peek, Niels
    Busschers, Wim B.
    Deutekom, Marije
    Bossuyt, Patrick M.
    Stronks, Karien
    Essink-Bot, Marie-Louise
    BMC PUBLIC HEALTH, 2012, 12
  • [50] Using Web-Based Questionnaires and Obstetric Records to Assess General Health Characteristics Among Pregnant Women: A Validation Study
    van Gelder, Marleen M. H. J.
    Schouten, Naomi P. E.
    Merkus, Peter J. F. M.
    Verhaak, Chris M.
    Roeleveld, Nel
    Roukema, Jolt
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2015, 17 (06) : e149