Effective Use of Evaluation Measures for the Validation of Best Classifier in Urdu Sentiment Analysis

Cited by: 19
Authors
Mukhtar, Neelam [1]
Khan, Mohammad Abid [1]
Chiragh, Nadia [2]
Affiliations
[1] Univ Peshawar, Dept Comp Sci, Peshawar, Khyber Pakhtunk, Pakistan
[2] Univ Agr, Peshawar, Pakistan
Keywords
Urdu sentiment analysis; Classifiers; Evaluation measures; Best classifier; AGREEMENT; ACCURACY; ERROR;
DOI
10.1007/s12559-017-9481-5
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Sentiment analysis (SA) can help in making decisions, drawing conclusions, or recommending appropriate solutions for business, political, or other problems. At the same time, reliable ways are required to verify the results achieved by SA. In the frame of biologically inspired approaches for machine learning, getting reliable results is challenging but important. Properly verified and validated results are always appreciated and preferred by the research community. This research pursues reliable results by using three standard evaluation measures. First, SA of Urdu is performed. After collection and annotation of data, five classifiers, i.e., PART, Naive Bayes Multinomial Text, LibSVM (support vector machine), decision tree (J48), and k-nearest neighbor (KNN, IBK), are employed using Weka. After 10-fold cross-validation, the top three classifiers, i.e., LibSVM, J48, and IBK, are selected on the basis of high accuracy, precision, recall, and F-measure. Of these three, IBK emerges as the best classifier. To verify this result, labels of the sentences (positive, negative, or neutral) are predicted using training and test data, followed by the application of three standard evaluation measures, i.e., McNemar's test, the kappa statistic, and root mean squared error. IBK performs much better than the other two classifiers. The main contribution of this research is the set of steps taken to make this result more reliable, including the use of three evaluation measures to obtain a confirmed and validated result. It is concluded with confidence that IBK is the best classifier in this case.
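The three evaluation measures named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' Weka workflow: the function names, the toy labels, and the two hypothetical classifiers' predictions are all assumptions made for illustration. McNemar's test is shown in its common continuity-corrected chi-squared form, and the root mean squared error is computed from hard one-hot labels, whereas a toolkit may compute it from predicted class probabilities instead.

```python
import math
from collections import Counter


def cohen_kappa(y_true, y_pred):
    """Kappa statistic: agreement between truth and prediction beyond chance."""
    n = len(y_true)
    observed = sum(t == p for t, p in zip(y_true, y_pred)) / n
    true_counts = Counter(y_true)
    pred_counts = Counter(y_pred)
    # Chance agreement from the marginal label frequencies.
    expected = sum(true_counts[c] * pred_counts[c] for c in true_counts) / (n * n)
    return (observed - expected) / (1 - expected)


def mcnemar_statistic(y_true, pred_a, pred_b):
    """Continuity-corrected McNemar chi-squared statistic for two classifiers."""
    # b: A correct and B wrong; c: A wrong and B correct.
    b = sum(a == t and p != t for t, a, p in zip(y_true, pred_a, pred_b))
    c = sum(a != t and p == t for t, a, p in zip(y_true, pred_a, pred_b))
    if b + c == 0:
        return 0.0  # the classifiers never disagree on correctness
    return (abs(b - c) - 1) ** 2 / (b + c)


def rmse_one_hot(y_true, y_pred, labels):
    """RMSE between one-hot encodings of true and predicted labels (assumption)."""
    squared_error = 0
    for t, p in zip(y_true, y_pred):
        for c in labels:
            squared_error += ((c == t) - (c == p)) ** 2
    return math.sqrt(squared_error / (len(y_true) * len(labels)))


# Toy data (hypothetical): eight sentences, two hypothetical classifiers.
labels = ["pos", "neg", "neu"]
y_true = ["pos", "neg", "neu", "pos", "neg", "neu", "pos", "neg"]
pred_a = ["pos", "neg", "neu", "pos", "neg", "neu", "pos", "pos"]
pred_b = ["pos", "pos", "neu", "neg", "neg", "pos", "pos", "pos"]

kappa_a = cohen_kappa(y_true, pred_a)
kappa_b = cohen_kappa(y_true, pred_b)
mcnemar_ab = mcnemar_statistic(y_true, pred_a, pred_b)
rmse_a = rmse_one_hot(y_true, pred_a, labels)
rmse_b = rmse_one_hot(y_true, pred_b, labels)

print("kappa:", kappa_a, kappa_b)
print("McNemar statistic (A vs. B):", mcnemar_ab)
print("RMSE:", rmse_a, rmse_b)
```

Under this reading, a higher kappa and a lower root mean squared error favor a classifier, while the McNemar statistic indicates whether the difference between two classifiers' error patterns is larger than chance would suggest.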
Pages: 446-456
Number of pages: 11
Related papers
26 in total
  • [1] Effective Use of Evaluation Measures for the Validation of Best Classifier in Urdu Sentiment Analysis
    Neelam Mukhtar
    Mohammad Abid Khan
    Nadia Chiragh
    [J]. Cognitive Computation, 2017, 9 : 446 - 456
  • [3] Effective lexicon-based approach for Urdu sentiment analysis
    Mukhtar, Neelam
    Khan, Mohammad Abid
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2020, 53 (04) : 2521 - 2548
  • [4] Resource Creation and Evaluation of Aspect Based Sentiment Analysis in Urdu
    Rani, Sadaf
    Anwar, Muhammad Waqas
    [J]. AACL-IJCNLP 2020: THE 1ST CONFERENCE OF THE ASIA-PACIFIC CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 10TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING: PROCEEDINGS OF THE STUDENT RESEARCH WORKSHOP, 2020, : 72 - 77
  • [6] Exploiting Linguistic Features for Effective Sentence-Level Sentiment Analysis in Urdu Language
    Altaf, Amna
    Anwar, Muhammad Waqas
    Jamal, Muhammad Hasan
    Bajwa, Usama Ijaz
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (27) : 41813 - 41839
  • [7] Aspect-based sentiment analysis in Urdu language: resource creation and evaluation
    Altaf, Amna
    Anwar, Muhammad Waqas
    Jamal, Muhammad Hasan
    Bajwa, Usama Ijaz
    Rani, Sadaf
    [J]. Neural Computing and Applications, 2024, 36 (34) : 21365 - 21381
  • [8] Effective Use of Linguistic Features for Sentiment Analysis of Korean
    Jang, Hayeon
    Shin, Hyopil
    [J]. PROCEEDINGS OF THE 24TH PACIFIC ASIA CONFERENCE ON LANGUAGE, INFORMATION AND COMPUTATION, 2010, : 173 - 182
  • [9] Sentiment Analysis and the Use of Extrinsic Datasets in Evaluation
    Devitt, Ann
    Ahmad, Khurshid
    [J]. SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 1063 - 1066
  • [10] Effective Sentiment Analysis based on Term Evaluation by Bayesian Model Selection Criteria
    Kang, Dae-Ki
    [J]. 2013 SECOND IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR 2013), 2013, : 887 - 891