Novel approach for quantitative and qualitative authors research profiling using feature fusion and tree-based learning approach

Cited by: 0
Authors
Umer M. [1]
Aljrees T. [2]
Ullah S. [1]
Bashir A.K. [3]
Affiliations
[1] Department of Computer Science, Khwaja Fareed University of Engineering & IT, Punjab, Rahim Yar Khan
[2] Department of Computer Science and Engineering, University of Hafr Al-Batin, Hafar Al-Batin
[3] Department of Computing and Mathematics, The Manchester Metropolitan University, Manchester
Keywords
Authors research profiling; Citation sentiment analysis; Ensemble learning; Feature engineering; Feature fusion; Intelligent recommendation and text analysis; Self citation analysis;
DOI
10.7717/PEERJ-CS.1752
Abstract
Article citation creates a link between the cited and citing articles and serves as the basis for several measures of scientific achievement, such as author and journal impact factor, H-index, and i10 index. Citations also include self-citation, in which authors cite their own articles. Self-citation is important for evaluating an author’s research profile and has recently gained attention. Although the literature offers differing criteria for what counts as appropriate self-citation, self-citation has a large impact on a researcher’s scientific profile. This study examines two cases. In case 1, the qualitative aspect of the author’s profile is analyzed using hand-crafted feature engineering techniques. The sentiments conveyed through citations are integral to assessing research quality, as they can signify appreciation or critique, or serve as a foundation for further research. Analyzing sentiments within in-text citations remains a formidable challenge, even with automated sentiment annotation. For this purpose, this study employs machine learning models using term frequency (TF) and term frequency-inverse document frequency (TF-IDF) features. Random forest using TF with the Synthetic Minority Oversampling Technique (SMOTE) achieved an accuracy of 0.9727. Case 2 deals with quantitative analysis and investigates direct and indirect self-citation. The top 2% of researchers in 2020 are taken as the baseline. The data of the top 25 Pakistani researchers are manually retrieved from this dataset, together with citation information from the Web of Science (WoS). Self-citation is estimated using the proposed model, and the results are compared with those obtained from WoS. Experimental results show a substantial difference between the two, as the ratio of self-citation from the proposed approach is higher than that reported by WoS. It is observed that the citation counts reported by WoS for authors are overstated. For a comprehensive evaluation of a researcher’s profile, both direct and indirect self-citation must be included. © 2023 Umer et al.
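As an illustration of the TF and TF-IDF features named in the abstract, the following is a minimal sketch and not the authors' implementation. It assumes whitespace tokenization and the plain log(N / df) form of IDF (libraries often apply smoothing). The function names and the example sentences are hypothetical and invented for this illustration.

```python
import math
from collections import Counter


def term_frequency(tokens):
    """Relative frequency of each term within one document."""
    counts = Counter(tokens)
    total = len(tokens)
    return {term: count / total for term, count in counts.items()}


def inverse_document_frequency(documents):
    """Plain log(N / df) weighting; an assumed variant without smoothing."""
    n_docs = len(documents)
    doc_freq = Counter()
    for tokens in documents:
        # Count each term once per document it appears in.
        doc_freq.update(set(tokens))
    return {term: math.log(n_docs / df) for term, df in doc_freq.items()}


def tf_idf(documents):
    """TF-IDF weight of every term in every document."""
    idf = inverse_document_frequency(documents)
    return [
        {term: tf * idf[term] for term, tf in term_frequency(tokens).items()}
        for tokens in documents
    ]


# Hypothetical citation sentences, invented for illustration only.
sentences = [
    "the proposed method outperforms the baseline",
    "the baseline fails on noisy data",
    "we extend the proposed method",
]
documents = [sentence.split() for sentence in sentences]
tf_vectors = [term_frequency(tokens) for tokens in documents]
tfidf_vectors = tf_idf(documents)
```

In this sketch a term that occurs in every sentence, such as "the", keeps a non-zero TF value but receives a TF-IDF weight of zero, which is the practical difference between the two feature sets.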
Related papers
  • [1] Novel approach for quantitative and qualitative authors research profiling using feature fusion and tree-based learning approach
    Umer, Muhammad
    Aljrees, Turki
    Ullah, Saleem
    Bashir, Ali Kashif
    PEERJ COMPUTER SCIENCE, 2023, 9
  • [2] Tree-Based Morse Regions: A Topological Approach to Local Feature Detection
    Xu, Yongchao
    Monasse, Pascal
    Geraud, Thierry
    Najman, Laurent
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2014, 23 (12) : 5612 - 5625
  • [3] Comparing Image Objects Using Tree-Based Approach
    Zielinski, Bartlomiej
    Iwanowski, Marcin
    COMPUTER VISION AND GRAPHICS, 2012, 7594 : 702 - 709
  • [4] Classifying Familial Hypercholesterolaemia: A Tree-based Machine Learning Approach
    Rosli, Marshima Mohd
    Edward, Jafhate
    Onn, Marcella
    Chua, Yung-An
    Kasim, Noor Alicezah Mohd
    Nawawi, Hapizah
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (09) : 66 - 73
  • [5] A tree-based approach for visible and thermal sensor fusion in winter autonomous driving
    Boisclair, Jonathan
    Amamou, Ali
    Kelouwani, Sousso
    Alam, M. Zeshan
    Oueslati, Hedi
    Zeghmi, Lotfi
    Agbossou, Kodjo
    MACHINE VISION AND APPLICATIONS, 2024, 35 (04)
  • [6] VR-Tree: A novel tree-based approach for modeling Web Query Interfaces
    Marin-Castro, Heidy M.
    Sosa Sosa, Victor J.
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2017, 49 (03) : 367 - 390
  • [7] VR-Tree: A novel tree-based approach for modeling Web Query Interfaces
    Heidy M. Marin-Castro
    Victor J. Sosa Sosa
    Journal of Intelligent Information Systems, 2017, 49 : 367 - 390
  • [8] Classification Prediction of PM10 Concentration Using a Tree-Based Machine Learning Approach
    Shaziayani, Wan Nur
    Ul-Saufie, Ahmad Zia
    Mutalib, Sofianita
    Noor, Norazian Mohamad
    Zainordin, Nazatul Syadia
    ATMOSPHERE, 2022, 13 (04)
  • [9] A feature fusion sequence learning approach for quantitative analysis of tremor symptoms based on digital handwriting
    Ma, Chenbin
    Zhang, Peng
    Pan, Longsheng
    Li, Xuemei
    Yin, Chunyu
    Li, Ailing
    Zong, Rui
    Zhang, Zhengbo
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 203
  • [10] A novel approach to research on feature extraction of acoustic targets based on manifold learning
    Liu Hui
    Yang Jun-An
    Wang Yi
    ACTA PHYSICA SINICA, 2011, 60 (07)