Improve Abstract Data with Feature Selection for Classification Techniques

被引:0
|
作者
Nuipian, Vatinee [1 ,2 ]
Meesad, Phayung [3 ]
Boonrawd, Pudsadee [2 ]
机构
[1] King Mongkuts Univ Technol North Bangkok, Inst Comp & Informat Technol, Bangkok, Thailand
[2] King Mongkuts Univ Technol North Bangkok, Fac Informat Technol, Dept Informat Technol, Bangkok, Thailand
[3] King Mongkuts Univ Technol North Bangkok, Fac Tech Educ, Dept Teacher Training Elect Engn, Bangkok, Thailand
来源
关键词
digital library; text classification; feature selection; support vector machine; SVM Attribute;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
A universal problem with text classification has a problem due to the high dimensionality of feature space, e.g. word frequency vectors. To overcome this problem, this paper proposed a feature selection which focuses on statistical pattern based on SVM Attribute. Experiments have shown that the determination of word importance may increase the speed of the classification algorithm and save their resource used significantly. The proposed method was studied by comparing classification performance among Decision Tree, Naive Bayes, and Support Vector Machine. The results showed that Support Vector Machine was found to be the best algorithm with F-measure 93.6%. It is found that the feature selection can reduce dimensionality of data significantly.
引用
下载
收藏
页码:213 / 217
页数:5
相关论文
共 50 条
  • [21] Automatic feature selection for classification of health data
    He, HX
    Jin, HD
    Chen, J
    AI 2005: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2005, 3809 : 910 - 913
  • [22] Online feature selection and classification with incomplete data
    Kalkan, Habil
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2014, 22 (06) : 1625 - 1636
  • [23] Feature Selection for Classification of Hyperspectral Data by SVM
    Pal, Mahesh
    Foody, Giles M.
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2010, 48 (05): : 2297 - 2307
  • [24] Feature Selection for EEG Data Classification with Weka
    Murtazina, Marina
    Avdeenko, Tatiana
    ADVANCES IN SWARM INTELLIGENCE, ICSI 2022, PT II, 2022, : 279 - 288
  • [25] A Projected Feature Selection Algorithm for Data Classification
    Yin, Zhiwu
    Huang, Shangteng
    2007 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-15, 2007, : 3665 - 3668
  • [26] CLASSIFICATION AND FEATURE SELECTION WITH HUMAN PERFORMANCE DATA
    Pavlopoulou, Christina
    Yu, Stella X.
    2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 1557 - 1560
  • [27] Logic classification and feature selection for biomedical data
    Bertolazzi, P.
    Felici, G.
    Festa, P.
    Lancia, G.
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2008, 55 (05) : 889 - 899
  • [28] Efficient feature selection and classification for microarray data
    Li, Zifa
    Xie, Weibo
    Liu, Tao
    PLOS ONE, 2018, 13 (08):
  • [29] Bagging and Feature Selection for Classification with Incomplete Data
    Cao Truong Tran
    Zhang, Mengjie
    Andreae, Peter
    Xue, Bing
    APPLICATIONS OF EVOLUTIONARY COMPUTATION, EVOAPPLICATIONS 2017, PT I, 2017, 10199 : 471 - 486
  • [30] Distributed feature selection (DFS) strategy for microarray gene expression data to improve the classification performance
    Potharaju, Sai Prasad
    Sreedevi, M.
    CLINICAL EPIDEMIOLOGY AND GLOBAL HEALTH, 2019, 7 (02): : 171 - 176