Novel approach for quantitative and qualitative authors research profiling using feature fusion and tree-based learning approach

Cited by: 0
Authors
Umer, Muhammad [1]
Aljrees, Turki [2]
Ullah, Saleem [1]
Bashir, Ali Kashif [3]
Affiliations
[1] Khwaja Fareed Univ Engn & IT, Dept Comp Sci, Rahim Yar Khan, Punjab, Pakistan
[2] Univ Hafr Al Batin, Dept Comp Sci & Engn, Hafar Al Batin, Saudi Arabia
[3] Manchester Metropolitan Univ, Dept Comp & Math, Manchester, England
Keywords
Citation sentiment analysis; Ensemble learning; Feature engineering; Feature fusion; Intelligent recommendation and text analysis; Authors research profiling; Self citation analysis; SELF-CITATION RATES; H-INDEX; IMPACT; CLASSIFICATION; PATTERNS; MACRO; SMOTE;
DOI
10.7717/peerj-cs.1752
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Article citation creates a link between the cited and citing articles and is used as a basis for several measures of scientific achievement, such as author and journal impact factor, H-index, and i10 index. Citations also include self-citation, in which authors cite their own articles. Self-citation is important for evaluating an author's research profile and has gained attention recently. Although the literature offers different criteria for appropriate self-citation, self-citation does have a large impact on a researcher's scientific profile. This study examines two cases in this regard. In case 1, the qualitative aspect of the author's profile is analyzed using hand-crafted feature engineering techniques. The sentiments conveyed through citations are integral to assessing research quality, as they can signify appreciation or critique, or serve as a foundation for further research. Analyzing sentiments within in-text citations remains a formidable challenge, even with automated sentiment annotations. For this purpose, this study employs machine learning models using term frequency (TF) and term frequency-inverse document frequency (TF-IDF) features. Random forest using TF with the Synthetic Minority Oversampling Technique (SMOTE) achieved an accuracy of 0.9727. Case 2 deals with quantitative analysis and investigates direct and indirect self-citation. In this study, the top 2% of researchers in 2020 are considered as a baseline. For this purpose, data on the top 25 Pakistani researchers are manually retrieved from this dataset, along with citation information from the Web of Science (WoS). Self-citation is estimated using the proposed model, and the results are compared with those obtained from WoS. Experimental results show a substantial difference between the two, as the ratio of self-citation from the proposed approach is higher than that from WoS. It is observed that the citations from the WoS for authors are overstated. For a comprehensive evaluation of a researcher's profile, both direct and indirect self-citation must be included.
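The abstract names TF and TF-IDF as the text features for citation sentiment analysis but does not define them. The following is a minimal sketch of how the two weightings differ, not the authors' implementation. The toy sentences, the whitespace tokenizer, the function names, and the smoothed IDF variant ln((1 + N) / (1 + df)) + 1 are all assumptions made for illustration; the record does not say which variant or tooling the study used.

```python
import math
from collections import Counter


def term_frequency(tokens):
    """TF: raw count of each term within one tokenized document."""
    return Counter(tokens)


def inverse_document_frequency(corpus):
    """Smoothed IDF, ln((1 + N) / (1 + df)) + 1 (one common variant, assumed here)."""
    n_docs = len(corpus)
    # Document frequency: number of documents in which each term appears.
    doc_freq = Counter(term for doc in corpus for term in set(doc))
    return {
        term: math.log((1 + n_docs) / (1 + df)) + 1.0
        for term, df in doc_freq.items()
    }


def tf_idf(corpus):
    """TF-IDF: each term count scaled by that term's corpus-wide IDF."""
    idf = inverse_document_frequency(corpus)
    return [
        {term: count * idf[term] for term, count in term_frequency(doc).items()}
        for doc in corpus
    ]


# Invented citation sentences, for illustration only.
sentences = [
    "the proposed method outperforms the baseline",
    "the baseline fails on noisy data",
    "we build on the proposed method",
]
corpus = [s.split() for s in sentences]  # naive whitespace tokenizer (assumption)
weights = tf_idf(corpus)
```

Under TF alone, "the" is the heaviest term in the first sentence because it occurs twice. TF-IDF leaves that count undiscounted only because "the" appears in every sentence, while a term found in a single sentence, such as "outperforms", receives a higher per-occurrence weight. Vectors like these would then be passed to a classifier such as the random forest mentioned above.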
Pages: 26