Enhancing Academic Literature Review through Relevance Recommendation Using Bibliometric and Text-based Features for Classification

被引:0
|
作者
Rubio, Thiago R. P. M. [1 ]
Gulo, Carlos A. S. J. [2 ]
机构
[1] Univ Porto, Fac Engn, DEI, LIACC Artificial Intelligence & Comp Sci Lab, Oporto, Portugal
[2] Univ Porto, Fac Engn, UNEMAT Brazil, PIXEL Res Grp, Oporto, Portugal
关键词
Systematic Literature Review (SLR); Machine Learning; Classification; Text Mining; Bibliometric;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The growing number of scientific publications and the availability of information in online repositories enable researchers to discover, analyze and maintain an updated state of the art bibliography. Indeed, few works explore this scenario in order to support researchers on the literature review step. Literature reviewing comprises a fundamental part of the scientific writing, in which publications are evaluated and selected by relevance. Different approaches for relevance are possible, whether a more qualitative (semantic) approach with text-based techniques either more quantitative (numerical) approaches that use article's metadata, such as bibliometric measures. Bibliometrics provide direct evidences of relevance and could represent good attributes for automatic classification. Our insight is that if a bibliometric-based cannot outperform text-based approaches, a hybrid model using both could benefit from it enhancing the classification performance (in terms of accuracy, precision and recall). In this paper we presented a novel approach, using Machine Learning (ML), namely the ID3 algorithm for a classification model that learn from specialist annotated data and recommend relevant papers for a specific research. Experiments showed good results on learning performance when using a hybrid approach, increasing testing performance in 12%, achieving 89.05% in accuracy when classifying a paper as relevant.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] A Review of Text-Based Recommendation Systems
    Kanwal, Safia
    Nawaz, Sidra
    Malik, Muhammad Kamran
    Nawaz, Zubair
    [J]. IEEE ACCESS, 2021, 9 : 31638 - 31661
  • [2] Enhancing Text-Based Analysis Using Neurophysiological Measures
    Behneman, Adrienne
    Kintz, Natalie
    Johnson, Robin
    Berka, Chris
    Hale, Kelly
    Fuchs, Sven
    Axelsson, Par
    Baskin, Angela
    [J]. FOUNDATIONS OF AUGMENTED COGNITION, PROCEEDINGS: NEUROERGONOMICS AND OPERATIONAL NEUROSCIENCE, 2009, 5638 : 449 - +
  • [3] Enhancing the Generalization for Text Classification through Fusion of Backward Features
    Seng, Dewen
    Wu, Xin
    [J]. SENSORS, 2023, 23 (03)
  • [4] Text-Based Chatbot in Financial Sector: A Systematic Literature Review
    Wube, Hana Demma
    Esubalew, Sintayehu Zekarias
    Weldesellasie, Firesew Fayiso
    Debelee, Taye Girma
    [J]. DATA SCIENCE IN FINANCE AND ECONOMICS, 2022, 2 (03): : 209 - 236
  • [5] ENHANCING COLLABORATIVE FILTERING RECOMMENDATION USING REVIEW TEXT CLUSTERING
    Ghabayen, Ayman S.
    Ahmed, Basem H.
    [J]. JORDANIAN JOURNAL OF COMPUTERS AND INFORMATION TECHNOLOGY, 2021, 7 (02): : 152 - 165
  • [6] Knowledge discovery through text-based similarity searches for astronomy literature
    Kerzendorf, Wolfgang E.
    [J]. JOURNAL OF ASTROPHYSICS AND ASTRONOMY, 2019, 40 (03)
  • [7] Text-based emotion classification using emotion cause extraction
    Li, Weiyuan
    Xu, Hua
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (04) : 1742 - 1749
  • [8] Knowledge discovery through text-based similarity searches for astronomy literature
    Wolfgang E. Kerzendorf
    [J]. Journal of Astrophysics and Astronomy, 2019, 40
  • [9] Understanding text-based studies of linguistic development through goals for academic writing
    Lim, Jungmin
    Tigchelaar, Magda
    Polio, Charlene
    [J]. LANGUAGE AWARENESS, 2022, 31 (01) : 117 - 136
  • [10] Protein Function Prediction using Text-based Features extracted from the Biomedical Literature: The CAFA Challenge
    Wong, Andrew
    Shatkay, Hagit
    [J]. BMC BIOINFORMATICS, 2013, 14