Unveiling scientific articles from paper mills with provenance analysis

被引:0
|
作者
Cardenuto, Joao Phillipe [1 ]
Moreira, Daniel [2 ]
Rocha, Anderson [1 ]
机构
[1] Univ Estadual Campinas, Inst Comp, Artificial Intelligence Lab Recod Ai, Campinas, SP, Brazil
[2] Loyola Univ Chicago, Dept Comp Sci, Chicago, IL USA
来源
PLOS ONE | 2024年 / 19卷 / 10期
基金
巴西圣保罗研究基金会;
关键词
D O I
10.1371/journal.pone.0312666
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The increasing prevalence of fake publications created by paper mills poses a significant challenge to maintaining scientific integrity. While integrity analysts typically rely on textual and visual clues to identify fake articles, determining which papers merit further investigation can be akin to searching for a needle in a haystack, as these fake publications have non-related authors and are published on non-related venues. To address this challenge, we developed a new methodology for provenance analysis, which automatically tracks and groups suspicious figures and documents. Our approach groups manuscripts from the same paper mill by analyzing their figures and identifying duplicated and manipulated regions. These regions are linked and organized in a provenance graph, providing evidence of systematic production. We tested our solution on a paper mill dataset of hundreds of documents and also on a larger version of the dataset that deliberately included thousands of documents intentionally selected to distract our method. Our approach successfully identified and linked systematically produced articles on both datasets by pinpointing the figures they reused and manipulated from one another. The technique herein proposed offers a promising solution to identify fraudulent manuscripts, and it could be a valuable tool for supporting scientific integrity.
引用
收藏
页数:28
相关论文
共 50 条
  • [41] Argument Structure Mining in Scientific Articles: A Comparative Analysis
    Song, Ningyuan
    Cheng, Hanghang
    Zhou, Huimin
    Wang, Xiaoguang
    2019 ACM/IEEE JOINT CONFERENCE ON DIGITAL LIBRARIES (JCDL 2019), 2019, : 339 - 340
  • [42] Scientific discourse in Education: Rhetorical and structural analysis of research articles from Colombia and Venezuela
    Blanco, Carlos E.
    REVISTA COMUNICACION, 2019, 28 (02): : 90 - 109
  • [43] Social attention and scientific articles on stroke: Altmetric analysis of top-50 articles
    Kim, Yerim
    Kim, Jee-Eun
    Kim, Yoo Hwan
    Yoon, Dae Young
    Kim, Yeo Jin
    Bae, Jong Seok
    CLINICAL NEUROLOGY AND NEUROSURGERY, 2019, 183
  • [44] DfAnalyzer: Runtime Dataflow Analysis of Scientific Applications using Provenance
    Silva, Vitor
    de Oliveira, Daniel
    Valduriez, Patrick
    Mattoso, Marta
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2018, 11 (12): : 2082 - 2085
  • [45] Quality Analysis for Scientific Workflow Provenance Access Control Policies
    Bhuyan, Fahima Amin
    Lu, Shiyong
    Reynolds, Robert
    Ahmed, Ishtiaq
    Zhang, Jia
    2018 IEEE INTERNATIONAL CONFERENCE ON SERVICES COMPUTING (IEEE SCC 2018), 2018, : 261 - 264
  • [46] Provenance Explorer-a graphical interface for constructing scientific publication packages from provenance trails
    Hunter, Jane
    Cheung, Kwok
    INTERNATIONAL JOURNAL ON DIGITAL LIBRARIES, 2007, 7 (1-2) : 99 - 107
  • [47] Retracted papers originating from paper mills: a cross-sectional analysis of references and citations
    Candal-Pedreira, Cristina
    Guerra-Tort, Carla
    Ruano-Ravina, Alberto
    Freijedo-Farinasa, Fabian
    Rey-Brandariz, Julia
    Rossd, Joseph S.
    Perez-Rios, Monica
    JOURNAL OF CLINICAL EPIDEMIOLOGY, 2024, 172
  • [48] From the Impact Factor to DORA and the Scientific Content of Articles
    Ylae-Herttuala, Seppo
    MOLECULAR THERAPY, 2015, 23 (04) : 609 - 609
  • [49] Recommendation system of scientific articles from discharge summaries
    Barriuso, Adrian Alonso
    Fernandez-Isabel, Alberto
    de Diego, Isaac Martin
    Ardoiz, Alfonso
    Pinheiro, J. F. J. Viseu
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 136
  • [50] Quality Analysis of Papermaking Water from Hanji Paper Mills and a Proposal for Water Quality Standards
    Kim M.N.
    Jeong S.H.
    Palpu Chongi Gisul/Journal of Korea Technical Association of the Pulp and Paper Industry, 2023, 55 (06): : 21 - 32