Biomedical text mining for research rigor and integrity: tasks, challenges, directions

Cited by: 34
Authors
Kilicoglu, Halil [1]
Affiliations
[1] US Natl Lib Med, Lister Hill Natl Ctr Biomed Commun, Bethesda, MD 20894 USA
Funding
National Institutes of Health (USA);
Keywords
biomedical research waste; biomedical text mining; natural language processing; research rigor; research integrity; reproducibility; AUTOMATIC RECOGNITION; PLAGIARISM; ARTICLES; CITATION; KNOWLEDGE; REPRODUCIBILITY; CLASSIFICATION; EXTRACTION; SENTENCES; MEDICINE;
DOI
10.1093/bib/bbx057
Chinese Library Classification (CLC) number
Q5 [Biochemistry];
Subject classification codes
071010 ; 081704 ;
Abstract
An estimated quarter of a trillion US dollars is invested in the biomedical research enterprise annually. There is growing alarm that a significant portion of this investment is wasted because of problems in reproducibility of research findings and in the rigor and integrity of research conduct and reporting. Recent years have seen a flurry of activities focusing on standardization and guideline development to enhance the reproducibility and rigor of biomedical research. Research activity is primarily communicated via textual artifacts, ranging from grant applications to journal publications. These artifacts can be both the source and the manifestation of practices leading to research waste. For example, an article may describe a poorly designed experiment, or the authors may reach conclusions not supported by the evidence presented. In this article, we pose the question of whether biomedical text mining techniques can assist the stakeholders in the biomedical research enterprise in doing their part toward enhancing research integrity and rigor. In particular, we identify four key areas in which text mining techniques can make a significant contribution: plagiarism/fraud detection, ensuring adherence to reporting guidelines, managing information overload and accurate citation/enhanced bibliometrics. We review the existing methods and tools for specific tasks, if they exist, or discuss relevant research that can provide guidance for future work. With the exponential increase in biomedical research output and the ability of text mining approaches to perform automatic tasks at large scale, we propose that such approaches can support tools that promote responsible research practices, providing significant benefits for the biomedical research enterprise.
Pages: 1400 - 1414
Page count: 15
Related papers
50 in total
  • [21] Biomedical Text Mining and Its Applications
    Rodriguez-Esteban, Raul
    PLOS COMPUTATIONAL BIOLOGY, 2009, 5 (12)
  • [22] Text mining patents for biomedical knowledge
    Rodriguez-Esteban, Raul
    Bundschus, Markus
    DRUG DISCOVERY TODAY, 2016, 21 (06) : 997 - 1002
  • [23] New frontiers in biomedical text mining
    Zweigenbaum, Pierre
    Demner-Fushman, Dina
    Yu, Hong
    Cohen, K. Bretonnel
    Pacific Symposium on Biocomputing 2007, 2007, : 205 - 208
  • [24] TEXT AND DATA MINING FOR BIOMEDICAL DISCOVERY
    Gonzalez, Graciela
    Cohen, Kevin Bretonnel
    Leaman, Robert
    Greene, Casey S.
    Shah, Nigam
    Kann, Maricel G.
    Ye, Jieping
    PACIFIC SYMPOSIUM ON BIOCOMPUTING 2014, 2014, : 312 - 315
  • [25] Genescene: Biomedical text and data mining
    Leroy, G
    Chen, H
    Martinez, JD
    Eggers, S
    Falsey, RR
    Kislin, KL
    Huang, Z
    Li, JX
    Xu, J
    McDonald, DM
    Ng, G
    2003 JOINT CONFERENCE ON DIGITAL LIBRARIES, PROCEEDINGS, 2003, : 116 - 118
  • [26] Emerging directions in predictive text mining
    Indurkhya, Nitin
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2015, 5 (04) : 155 - 164
  • [27] Application of text mining in the biomedical domain
    Fleuren, Wilco W. M.
    Alkema, Wynand
    METHODS, 2015, 74 : 97 - 106
  • [28] Community challenges in biomedical text mining over 10 years: success, failure and the future
    Huang, Chung-Chi
    Lu, Zhiyong
    BRIEFINGS IN BIOINFORMATICS, 2016, 17 (01) : 132 - 144
  • [29] Rigor and ethics: challenges in qualitative research
    Angelo, Margareth
    CIENCIA & SAUDE COLETIVA, 2008, 13 (02) : 318 - 320
  • [30] Integrity and misconduct in biomedical research
    Schonhaut B, Luisa
    REVISTA CHILENA DE PEDIATRIA-CHILE, 2019, 90 (02) : 217 - 221