Feature Selection Based on Semantics

被引:2
|
作者
Chua, Stephanie [1 ]
Kulathuramaiyer, Narayanan [1 ]
机构
[1] Univ Malaysia Sarawak, Fac Comp Sci & Informat Technol, Kota Samarahan 94300, Sarawak, Malaysia
关键词
D O I
10.1007/978-1-4020-8735-6_88
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The need for an automated text categorization system is spurred on by the extensive increase of digital documents. This paper looks into feature selection, one of the main processes in text categorization. The feature selection approach is based on semantics by employing WordNet [1]. The proposed WordNet-based feature selection approach makes use of synonymous nouns and dominant senses in selecting terms that are reflective of a category's content. Experiments are carried out using the top ten most populated categories of the Reuters-21578 dataset. Results have shown that statistical feature selection approaches, Chi-Square and Information Gain, are able to produce better results when used with the WordNet-basecl feature selection approach. The use of the WordNet-based feature selection approach with statistical weighting results in it set of terms that is more meaningful compared to the terms chosen by the statistical approaches. In addition, there is also nit effective dimensionality reduction of the feature space when the WordNet-based feature selection method is used.
引用
收藏
页码:471 / 476
页数:6
相关论文
共 50 条
  • [1] UNSUPERVISED FEATURE SELECTION BASED ON FEATURE RELEVANCE
    Zhang, Feng
    Zhao, Ya-Jun
    Chen, Jun-Fen
    PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-6, 2009, : 487 - +
  • [2] ICA based Feature Learning and Feature Selection
    Ibrahim, Marwa Farouk Ibrahim
    Al-Jumaily, Adel Ali
    2016 5TH INTERNATIONAL CONFERENCE ON ELECTRONIC DEVICES, SYSTEMS AND APPLICATIONS (ICEDSA), 2016,
  • [3] Incremental feature selection by sample selection and feature-based accelerator
    Yang, Yanyan
    Chen, Degang
    Zhang, Xiao
    Ji, Zhenyan
    Zhang, Yingjun
    APPLIED SOFT COMPUTING, 2022, 121
  • [4] Intelligent Feature Selection Using Hybrid Based Feature Selection Method
    Nisar, Shibli
    Tariq, Muhammad
    2016 SIXTH INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING TECHNOLOGY (INTECH), 2016, : 168 - 172
  • [5] Event driven semantics based ad selection
    Thawani, A
    Gopalan, S
    Sridhar, V
    2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXP (ICME), VOLS 1-3, 2004, : 1875 - 1878
  • [6] Improving Feature-based Visual SLAM by Semantics
    Wang, Ya
    Zell, Andreas
    2018 IEEE THIRD INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, APPLICATIONS AND SYSTEMS (IPAS), 2018, : 7 - 12
  • [7] Visual Exploration of Neural Document Embedding in Information Retrieval: Semantics and Feature Selection
    Ji, Xiaonan
    Shen, Han-Wei
    Ritter, Alan
    Machiraju, Raghu
    Yen, Po-Yin
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2019, 25 (06) : 2181 - 2192
  • [8] A feature-based semantics model of reusable component
    Li, J
    Zhan, DC
    Wang, ZJ
    ADVANCED WEB AND NETWORK TECHNOLOGIES, AND APPLICATIONS, PROCEEDINGS, 2006, 3842 : 955 - 964
  • [9] Feature selection based on bootstrapping
    Diaz-Diaz, Norberto
    Aguilar-Ruiz, Jesus S.
    Nepomuceno, Juan A.
    Garcia, Jorge
    2005 ICSC CONGRESS ON COMPUTATIONAL INTELLIGENCE METHODS AND APPLICATIONS (CIMA 2005), 2005, : 217 - 222
  • [10] Feature selection based on similarity
    Lazzerini, B
    Marcelloni, F
    ELECTRONICS LETTERS, 2002, 38 (03) : 121 - 122