Improve Abstract Data with Feature Selection for Classification Techniques

被引:0
|
作者
Nuipian, Vatinee [1 ,2 ]
Meesad, Phayung [3 ]
Boonrawd, Pudsadee [2 ]
机构
[1] King Mongkuts Univ Technol North Bangkok, Inst Comp & Informat Technol, Bangkok, Thailand
[2] King Mongkuts Univ Technol North Bangkok, Fac Informat Technol, Dept Informat Technol, Bangkok, Thailand
[3] King Mongkuts Univ Technol North Bangkok, Fac Tech Educ, Dept Teacher Training Elect Engn, Bangkok, Thailand
来源
关键词
digital library; text classification; feature selection; support vector machine; SVM Attribute;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
A universal problem with text classification has a problem due to the high dimensionality of feature space, e.g. word frequency vectors. To overcome this problem, this paper proposed a feature selection which focuses on statistical pattern based on SVM Attribute. Experiments have shown that the determination of word importance may increase the speed of the classification algorithm and save their resource used significantly. The proposed method was studied by comparing classification performance among Decision Tree, Naive Bayes, and Support Vector Machine. The results showed that Support Vector Machine was found to be the best algorithm with F-measure 93.6%. It is found that the feature selection can reduce dimensionality of data significantly.
引用
收藏
页码:213 / 217
页数:5
相关论文
共 50 条
  • [1] Improve Abstract Data with Feature Selection for Classification Techniques
    Nuipian, Vatinee
    Meesad, Phayung
    Boonrawd, Pudsadee
    [J]. MEMS, NANO AND SMART SYSTEMS, PTS 1-6, 2012, 403-408 : 3699 - +
  • [2] Imbalanced Data Classification Based on Feature Selection Techniques
    Ksieniewicz, Pawel
    Wozniak, Michal
    [J]. INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING (IDEAL 2018), PT II, 2018, 11315 : 296 - 303
  • [3] Using classification techniques to improve replica selection in data grid
    Jin, Hai
    Huang, Jin
    Xie, Xia
    Zhang, Qin
    [J]. ON THE MOVE TO MEANINGFUL INTERNET SYSTEMS 2006: COOPIS, DOA, GADA, AND ODBASE PT 2, PROCEEDINGS, 2006, 4276 : 1376 - 1387
  • [4] An Approach Based on Resampling and Feature Selection to Improve the Classification of Microarray Data
    Soleymani, Nafiseh
    Moattar, Mohammad Hussein
    [J]. 2018 6TH IRANIAN JOINT CONGRESS ON FUZZY AND INTELLIGENT SYSTEMS (CFIS), 2018, : 61 - 64
  • [5] A Comparative Study of Various Feature Selection Techniques in High-Dimensional data set to Improve Classification Accuracy
    Shroff, Kandarp P.
    Maheta, Hardik H.
    [J]. 2015 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI), 2015,
  • [6] Function to Flatten Gesture Data for Specific Feature Selection Methods to Improve Classification
    Cervantes Salgado, Marilu
    Pinto Elias, Raul
    Magadan Salazar, Andrea
    [J]. TRAITEMENT DU SIGNAL, 2021, 38 (04) : 929 - 935
  • [7] Analysis of Feature Selection Techniques for Classification Problems
    Adamov, Abzetdin Z.
    [J]. 2021 IEEE 15TH INTERNATIONAL CONFERENCE ON APPLICATION OF INFORMATION AND COMMUNICATION TECHNOLOGIES (AICT2021), 2021,
  • [8] Optimizing feature selection techniques for sentiment classification
    Uribe, Diego
    [J]. 2011 IEEE ELECTRONICS, ROBOTICS AND AUTOMOTIVE MECHANICS CONFERENCE (CERMA 2011), 2011, : 103 - 107
  • [9] Local Feature Selection for Data Classification
    Armanfard, Narges
    Reilly, James P.
    Komeili, Majid
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (06) : 1217 - 1227
  • [10] Using Feature Selection in Combination with Ensemble Learning Techniques to Improve Tweet Sentiment Classification Performance
    Prusa, Joseph D.
    Khoshgoftaar, Taghi M.
    Napolitano, Amri
    [J]. 2015 IEEE 27TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2015), 2015, : 186 - 193