Feature Selection Based on Semantics

被引:2
|
作者
Chua, Stephanie [1 ]
Kulathuramaiyer, Narayanan [1 ]
机构
[1] Univ Malaysia Sarawak, Fac Comp Sci & Informat Technol, Kota Samarahan 94300, Sarawak, Malaysia
关键词
D O I
10.1007/978-1-4020-8735-6_88
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The need for an automated text categorization system is spurred on by the extensive increase of digital documents. This paper looks into feature selection, one of the main processes in text categorization. The feature selection approach is based on semantics by employing WordNet [1]. The proposed WordNet-based feature selection approach makes use of synonymous nouns and dominant senses in selecting terms that are reflective of a category's content. Experiments are carried out using the top ten most populated categories of the Reuters-21578 dataset. Results have shown that statistical feature selection approaches, Chi-Square and Information Gain, are able to produce better results when used with the WordNet-basecl feature selection approach. The use of the WordNet-based feature selection approach with statistical weighting results in it set of terms that is more meaningful compared to the terms chosen by the statistical approaches. In addition, there is also nit effective dimensionality reduction of the feature space when the WordNet-based feature selection method is used.
引用
收藏
页码:471 / 476
页数:6
相关论文
共 50 条
  • [21] A Feature Selection Method Based on Feature Correlation Networks
    Savic, Milos
    Kurbalija, Vladimir
    Ivanovic, Mirjana
    Bosnic, Zoran
    MODEL AND DATA ENGINEERING (MEDI 2017), 2017, 10563 : 248 - 261
  • [22] Feature selection based on feature curve of subclass problem
    Liu, Lei
    Zhang, Bing
    Wang, Shidong
    Li, Shuangjie
    Zhang, Kaixiang
    Wang, Shuqin
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [23] A feature cluster taxonomy based feature selection technique
    Goswami, Saptarsi
    Das, Amit Kumar
    Chakrabarti, Amlan
    Chakraborty, Basabi
    EXPERT SYSTEMS WITH APPLICATIONS, 2017, 79 : 76 - 89
  • [24] A grouping feature selection method based on feature interaction
    Zhou, Hongfang
    An, Lei
    Zhu, Rourou
    INTELLIGENT DATA ANALYSIS, 2023, 27 (02) : 361 - 377
  • [25] Classification Algorithm Based on Feature Selection and Samples Selection
    Xu, Yitian
    Zhen, Ling
    Yang, Liming
    Wang, Laisheng
    ADVANCES IN NEURAL NETWORKS - ISNN 2009, PT 2, PROCEEDINGS, 2009, 5552 : 631 - 638
  • [26] Composite service template selection algorithm based on semantics
    Institute of Meteorology, PLA Univ. of Sci. and Tech., Nanjing 211101, China
    Jiefangjun Ligong Daxue Xuebao, 2008, 6 (667-670):
  • [27] Automatic machining feature recognition based on MBD and process semantics
    Xu, Tongming
    Li, Jianxun
    Chen, Zhuoning
    COMPUTERS IN INDUSTRY, 2022, 142
  • [28] Semantics and Feature Discovery via Confidence-Based Ensemble
    Goh, Kingshy
    Li, Beitao
    Chang, Edward Y.
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2005, 1 (02)
  • [29] Feature model and component semantics based conceptual architecture design
    Peng, Xin
    Zhao, Wen-Yun
    Liu, Yi-Ming
    Ruan Jian Xue Bao/Journal of Software, 2006, 17 (06): : 1307 - 1317
  • [30] CPCA: A Feature Semantics Based Crowd Dimension Reduction Framework
    Zhang, Yuanyuan
    Gao, Dawei
    Luo, Jie
    Xu, Ke
    IEEE ACCESS, 2018, 6 : 73191 - 73199