Can the quality of published academic journal articles be assessed with machine learning?

Cited by: 8
Authors
Thelwall, Mike [1]
Affiliation
[1] Univ Wolverhampton, Wolverhampton, England
Source
QUANTITATIVE SCIENCE STUDIES | 2022 / Volume 3 / Issue 01
Keywords
citation analysis; machine learning; research evaluation; text mining; CITATION COUNTS; IMPACT; SCIENCE; READABILITY; JUDGMENT; FEATURES; MODELS; FIELD;
DOI
10.1162/qss_a_00185
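Chinese Library Classification number
G25 [Library science, librarianship]; G35 [Information science, information work];
Discipline classification code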
1205 ; 120501 ;
Abstract
Formal assessments of the quality of the research produced by departments and universities are now conducted by many countries to monitor achievements and allocate performance-related funding. These evaluations are hugely time consuming if conducted by postpublication peer review and are simplistic if based on citations or journal impact factors. I investigate whether machine learning could help reduce the burden of peer review by using citations and metadata to learn how to score articles from a sample assessed by peer review. An experiment is used to underpin the discussion, attempting to predict journal citation thirds, as a proxy for article quality scores, for all Scopus narrow fields from 2014 to 2020. The results show that these proxy quality thirds can be predicted with above baseline accuracy in all 326 narrow fields, with Gradient Boosting Classifier, Random Forest Classifier, or Multinomial Naive Bayes being the most accurate in nearly all cases. Nevertheless, the results partly leverage journal writing styles and topics, which are unwanted for some practical applications and cause substantial shifts in average scores between countries and between institutions within a country. There may be scope for predicting articles' scores when the predictions have the highest probability.
Pages: 208-226
Page count: 19