Unveiling scientific articles from paper mills with provenance analysis

被引:0
|
作者
Cardenuto, Joao Phillipe [1 ]
Moreira, Daniel [2 ]
Rocha, Anderson [1 ]
机构
[1] Univ Estadual Campinas, Inst Comp, Artificial Intelligence Lab Recod Ai, Campinas, SP, Brazil
[2] Loyola Univ Chicago, Dept Comp Sci, Chicago, IL USA
来源
PLOS ONE | 2024年 / 19卷 / 10期
基金
巴西圣保罗研究基金会;
关键词
D O I
10.1371/journal.pone.0312666
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The increasing prevalence of fake publications created by paper mills poses a significant challenge to maintaining scientific integrity. While integrity analysts typically rely on textual and visual clues to identify fake articles, determining which papers merit further investigation can be akin to searching for a needle in a haystack, as these fake publications have non-related authors and are published on non-related venues. To address this challenge, we developed a new methodology for provenance analysis, which automatically tracks and groups suspicious figures and documents. Our approach groups manuscripts from the same paper mill by analyzing their figures and identifying duplicated and manipulated regions. These regions are linked and organized in a provenance graph, providing evidence of systematic production. We tested our solution on a paper mill dataset of hundreds of documents and also on a larger version of the dataset that deliberately included thousands of documents intentionally selected to distract our method. Our approach successfully identified and linked systematically produced articles on both datasets by pinpointing the figures they reused and manipulated from one another. The technique herein proposed offers a promising solution to identify fraudulent manuscripts, and it could be a valuable tool for supporting scientific integrity.
引用
收藏
页数:28
相关论文
共 50 条
  • [1] From plagiarism to scientific paper mills: a profile of retracted articles within the SciELO Brazil collection
    Santos-d'Amorim, Karen
    Wang, Ting
    Lund, Brady
    Macedo Dos Santos, Raimundo Nonato
    ETHICS & BEHAVIOR, 2024, 34 (01) : 40 - 57
  • [2] SCIENTIFIC MISCONDUCTS: PAPER MILLS IN PERU
    Mayta-Tristan, Percy
    Borja-Garcia, Ruben
    REVISTA PERUANA DE MEDICINA EXPERIMENTAL Y SALUD PUBLICA, 2022, 39 (04): : 388 - 391
  • [3] Commercialization of scientific misconduct: the challenge of paper mills
    Abalkina, A.
    EUROPEAN JOURNAL OF PUBLIC HEALTH, 2024, 34
  • [4] PAPER-MILLS, POLLUTION AND SCIENTIFIC BIAS
    ROY, D
    ECONOMIC AND POLITICAL WEEKLY, 1987, 22 (51) : 2195 - 2196
  • [5] How to Write a Good Scientific Paper: Review Articles
    Mack, Chris
    JOURNAL OF MICRO-NANOLITHOGRAPHY MEMS AND MOEMS, 2016, 15 (02):
  • [6] THE PROVENANCE OF DONKEY MILLS FROM ROMAN BRITAIN
    WILLIAMSTHORPE, O
    THORPE, RS
    ARCHAEOMETRY, 1988, 30 : 275 - 289
  • [7] EFFLUENTS FROM PAPER MILLS
    ROBERTS, CA
    EFFLUENT & WATER TREATMENT JOURNAL, 1972, 12 (12): : 659 - 662
  • [8] Nanoparticles from paper mills: A seasonal, numerical and morphological analysis
    Alderighi, Michele
    Carrai, Patrizio
    Nobili, Carla
    Lopez, Francesco
    Cuomo, Francesca
    Ambrosone, Luigi
    COLLOIDS AND SURFACES A-PHYSICOCHEMICAL AND ENGINEERING ASPECTS, 2017, 532 : 102 - 107
  • [9] Unmasking the fraud: How paper mills are undermining scientific publishing
    Chambers, Hank
    DEVELOPMENTAL MEDICINE AND CHILD NEUROLOGY, 2024, 66 (10): : 1262 - 1263
  • [10] PAPER MILLS AS A REFLECTION OF THE DECLINE OF SCIENTIFIC PRACTICES IN OUR COUNTRY
    Solari, Lely
    Cabezas, Cesar
    REVISTA PERUANA DE MEDICINA EXPERIMENTAL Y SALUD PUBLICA, 2023, 40 (04): : 390 - 391