Improving Text Classification Performance Using PCA and Recall-Precision Criteria

Cited by: 0
Authors
M. Zahedi
A. Ghanbari Sorkhi
Affiliation
[1] Shahrood University of Technology,
Keywords
Text classification; Term frequency and category relevancy factor; Principal component analysis; Recall and precision criteria
DOI
Not available
Abstract
Persian text usually carries a wide range of features, some important and some useless. This is the main reason why feature extraction is one of the difficult tasks in Persian text analysis and understanding. Because few research works have focused on this problem, this paper introduces a novel approach for extracting the most relevant features and classifying Persian text. Experimental results show that combining principal component analysis with recall and precision criteria, together with the term frequency and category relevancy factor, considerably improves the running time of the classification process, while accuracy and precision either improve slightly or do not decrease enough to affect classification performance.
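The abstract does not say exactly how the recall and precision criteria enter the feature-extraction step, so the following is only a minimal sketch of the standard per-category definitions of the two measures, not the authors' procedure. The function name `precision_recall`, the category labels, and the predictions are all hypothetical and chosen purely for illustration.

```python
from collections import Counter


def precision_recall(y_true, y_pred):
    """Return {label: (precision, recall)} for every label seen in either list.

    precision = correct predictions of a label / all predictions of that label
    recall    = correct predictions of a label / all true instances of that label
    """
    true_positive = Counter()
    predicted_count = Counter(y_pred)
    actual_count = Counter(y_true)

    for actual, predicted in zip(y_true, y_pred):
        if actual == predicted:
            true_positive[actual] += 1

    scores = {}
    for label in sorted(set(y_true) | set(y_pred)):
        # Guard against division by zero when a label is never predicted
        # or never occurs in the ground truth.
        precision = (true_positive[label] / predicted_count[label]
                     if predicted_count[label] else 0.0)
        recall = (true_positive[label] / actual_count[label]
                  if actual_count[label] else 0.0)
        scores[label] = (precision, recall)
    return scores


# Hypothetical ground-truth categories and classifier output (illustration only).
y_true = ["sport", "sport", "politics", "politics", "economy", "economy"]
y_pred = ["sport", "politics", "politics", "politics", "economy", "sport"]

scores = precision_recall(y_true, y_pred)
for label, (precision, recall) in scores.items():
    print(f"{label}: precision={precision:.2f} recall={recall:.2f}")
```

Under this sketch, a category with high precision but low recall is one the classifier labels reliably but often misses; comparing such scores before and after a dimensionality-reduction step is one plausible way to check that classification quality has not degraded.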
Pages: 2095 - 2102
Page count: 7
Related papers
50 in total
  • [1] Improving Text Classification Performance Using PCA and Recall-Precision Criteria
    Zahedi, M.
    Sorkhi, A. Ghanbari
    [J]. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2013, 38 (08) : 2095 - 2102
  • [2] Text Classification using Clustering Techniques and PCA
    Kaur, Manpreet
    Bansal, Meenakshi
    [J]. 2016 FOURTH INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (PDGC), 2016 : 642 - 646
  • [3] Study on measure criteria in evaluating classification performance: Lift charts, ROC and precision-recall curves
    Gu, Qiong
    Zhu, Li
    Cai, Zhihua
    [J]. PROGRESS IN INTELLIGENCE COMPUTATION AND APPLICATIONS, PROCEEDINGS, 2007 : 488 - 492
  • [4] Improving the performance of radial basis function (RBF) classification using information criteria
    Liu, ZQ
    Bozdogan, H
    [J]. STATISTICAL DATA MINING AND KNOWLEDGE DISCOVERY, 2004 : 193 - 216
  • [5] Improving text classification by using encyclopedia knowledge
    Wang, Pu
    Hu, Han
    Zeng, Hua-Jun
    Chen, Lijun
    Chen, Zheng
    [J]. ICDM 2007: PROCEEDINGS OF THE SEVENTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2007 : 332 - 341
  • [6] Improving Text Classification Using Knowledge in Labels
    Zhang, Cheng
    Yamana, Hayato
    [J]. 2021 IEEE 6TH INTERNATIONAL CONFERENCE ON BIG DATA ANALYTICS (ICBDA 2021), 2021 : 193 - 197
  • [7] A Technique for Improving the Performance of Naive Bayes Text Classification
    Jiang, Yuqian
    Lin, Huaizhong
    Wang, Xuesong
    Lu, Dongming
    [J]. WEB INFORMATION SYSTEMS AND MINING, PT II, 2011, 6988 : 196 - 203
  • [8] Improving Text Classification Performance with Incremental Background Knowledge
    Silva, Catarina
    Ribeiro, Bernardete
    [J]. ARTIFICIAL NEURAL NETWORKS - ICANN 2009, PT I, 2009, 5768 : 923 - +
  • [9] Techniques for improving the performance of naive Bayes for text classification
    Schneider, KM
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2005, 3406 : 682 - 693
  • [10] Improving the precision-recall trade-off in undersampling-based binary text categorization using unanimity rule
    Erenel, Zafer
    Altınçay, Hakan
    [J]. Neural Computing and Applications, 2013, 22 : 83 - 100