Biomedical text mining for research rigor and integrity: tasks, challenges, directions

被引:34
|
作者
Kilicoglu, Halil [1 ]
机构
[1] US Natl Lib Med, Lister Hill Natl Ctr Biomed Commun, Bethesda, MD 20894 USA
基金
美国国家卫生研究院;
关键词
biomedical research waste; biomedical text mining; natural language processing; research rigor; research integrity; reproducibility; AUTOMATIC RECOGNITION; PLAGIARISM; ARTICLES; CITATION; KNOWLEDGE; REPRODUCIBILITY; CLASSIFICATION; EXTRACTION; SENTENCES; MEDICINE;
D O I
10.1093/bib/bbx057
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
An estimated quarter of a trillion US dollars is invested in the biomedical research enterprise annually. There is growing alarm that a significant portion of this investment is wasted because of problems in reproducibility of research findings and in the rigor and integrity of research conduct and reporting. Recent years have seen a flurry of activities focusing on standardization and guideline development to enhance the reproducibility and rigor of biomedical research. Research activity is primarily communicated via textual artifacts, ranging from grant applications to journal publications. These artifacts can be both the source and the manifestation of practices leading to research waste. For example, an article may describe a poorly designed experiment, or the authors may reach conclusions not supported by the evidence presented. In this article, we pose the question of whether biomedical text mining techniques can assist the stakeholders in the biomedical research enterprise in doing their part toward enhancing research integrity and rigor. In particular, we identify four key areas in which text mining techniques can make a significant contribution: plagiarism/fraud detection, ensuring adherence to reporting guidelines, managing information overload and accurate citation/enhanced bibliometrics. We review the existing methods and tools for specific tasks, if they exist, or discuss relevant research that can provide guidance for future work. With the exponential increase in biomedical research output and the ability of text mining approaches to perform automatic tasks at large scale, we propose that such approaches can support tools that promote responsible research practices, providing significant benefits for the biomedical research enterprise.
引用
收藏
页码:1400 / 1414
页数:15
相关论文
共 50 条
  • [31] Status of text-mining techniques applied to biomedical text
    Erhardt, RAA
    Schneider, R
    Blaschke, C
    DRUG DISCOVERY TODAY, 2006, 11 (7-8) : 315 - 325
  • [32] Experimental Evaluations of MapReduce in Biomedical Text Mining
    Ji, Yanqing
    Tian, Yun
    Shen, Fangyang
    Tran, John
    INFORMATION TECHNOLOGY: NEW GENERATIONS, 2016, 448 : 665 - 675
  • [33] OntoGene web services for biomedical text mining
    Rinaldi, Fabio
    Clematide, Simon
    Marques, Hernani
    Ellendorff, Tilia
    Romacker, Martin
    Rodriguez-Esteban, Raul
    BMC BIOINFORMATICS, 2014, 15
  • [34] An effective extension to Okapi for biomedical text mining
    Zhong, Ming
    Huang, Xiangji
    2006 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, 2006, : 615 - +
  • [35] OntoGene web services for biomedical text mining
    Fabio Rinaldi
    Simon Clematide
    Hernani Marques
    Tilia Ellendorff
    Martin Romacker
    Raul Rodriguez-Esteban
    BMC Bioinformatics, 15
  • [36] A survey of current work in biomedical text mining
    Cohen, AM
    Hersh, WR
    BRIEFINGS IN BIOINFORMATICS, 2005, 6 (01) : 57 - 71
  • [37] Frontiers of biomedical text mining: current progress
    Zweigenbaum, Pierre
    Demner-Fushman, Dina
    Yu, Hong
    Cohen, Kevin B.
    BRIEFINGS IN BIOINFORMATICS, 2007, 8 (05) : 358 - 375
  • [38] Biomedical text data mining: Recent patents
    Crangle, Colleen E.
    Recent Patents on Computer Science, 2009, 2 (01): : 59 - 67
  • [39] Adversarial Constraint Evaluation on Biomedical Text Mining
    Wang, Yashen
    Zhang, Huanhuan
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT III, 2021, 12817 : 249 - 261
  • [40] Biomedical Text Mining: Experience and Practical Approach
    Ryu, Keun Ho
    3RD INTERNATIONAL CONFERENCE ON APPLIED COMPUTING AND INFORMATION TECHNOLOGY (ACIT 2015) 2ND INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND INTELLIGENCE (CSI 2015), 2015, : 1 - 1