Feature selection for support vector machines in text categorization

被引:0
|
作者
Liu, Y [1 ]
Lu, HM [1 ]
Lu, ZX [1 ]
Wang, P [1 ]
机构
[1] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China
关键词
text categorization; feature selection; SVM;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Support vector machine (SVM) has been successfully applied to text categorization. We find that the decision hyperplane of SVM classifier can also be used for feature selection. When linear kernel is used, the decision rule can be seen as a combination of weighted values of each dimension of the vector to be categorized, each weight indicating the importance of corresponding dimension in the categorization decision. Hence we can only preserve those dimensions with bigger weights. In this way we combine the feature selection into the learning process of SVM training. Empirical experiment result is given to verify our new method.
引用
收藏
页码:129 / 134
页数:6
相关论文
共 50 条
  • [1] Feature selection for scene categorization using support vector machines
    Devendran, V
    Thiagarajan, Hemalatha
    Santra, A. K.
    Wahi, Amitabh
    [J]. CISP 2008: FIRST INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOL 1, PROCEEDINGS, 2008, : 588 - +
  • [2] Exploring Feature Selection and Support Vector Machine in Text Categorization
    Abdul-Rahman, Shuzlina
    Mutalib, Sofianita
    Khanafi, Nur Amira
    Ali, Azliza Mohd
    [J]. 2013 IEEE 16TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (CSE 2013), 2013, : 1101 - 1104
  • [3] A wrapper approach with support vector machines for text categorization
    Montanés, E
    Quevedo, JR
    Díaz, I
    [J]. COMPUTATIONAL METHODS IN NEURAL MODELING, PT 1, 2003, 2686 : 230 - 237
  • [4] Feature selection for support vector machines
    Hermes, L
    Buhmann, JM
    [J]. 15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, PROCEEDINGS: PATTERN RECOGNITION AND NEURAL NETWORKS, 2000, : 712 - 715
  • [5] Virtual relevant documents in text categorization with support vector machines
    Lee, Kyung-Soon
    Kageura, Kyo
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2007, 43 (04) : 902 - 913
  • [6] Support Vector Machines based on a semantic kernel for text categorization
    Siolas, G
    d'Alché-Buc, F
    [J]. IJCNN 2000: PROCEEDINGS OF THE IEEE-INNS-ENNS INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOL V, 2000, : 205 - 209
  • [7] Least Squares Twin Support Vector Machines for Text Categorization
    Kumar, M. Arun
    Gopal, M.
    [J]. PROCEEDINGS OF THE 2015 39TH NATIONAL SYSTEMS CONFERENCE (NSC), 2015,
  • [8] Support vector machines for text categorization in Chinese question classification
    Lin, Xu-Dong
    Peng, Hong
    Liu, Bo
    [J]. 2006 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE, (WI 2006 MAIN CONFERENCE PROCEEDINGS), 2006, : 334 - +
  • [9] Feature selection for bagging of support vector machines
    Li, Guo-Zheng
    Liu, Tian-Yu
    [J]. PRICAI 2006: TRENDS IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4099 : 271 - 277
  • [10] Feature selection for multiclass support vector machines
    Aazi, F. Z.
    Abdesselam, R.
    Achchab, B.
    Elouardighi, A.
    [J]. AI COMMUNICATIONS, 2016, 29 (05) : 583 - 593