Feature subset selection in SOM based text categorization

被引:0
|
作者
Bassiouny, S [1 ]
Nagi, M [1 ]
Hussein, MF [1 ]
机构
[1] Univ Alexandria, Comp Sci & Automat Control Dept, Fac Engn, Alexandria, Egypt
关键词
feature selection; text categorization; self-organizing map (SOM);
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection, as a preprocessing step to machine learning, is effective in reducing dimensionality, removing irrelevant data, increasing learning accuracy, and improving result comprehensibility. In this paper we describes several methods for feature subset selection on large text data. The experimental comparison of the described methods will be given on the collected data from the Web, focusing on aggressive dimensionality reduction. In our experiments the well known self-organizing map (SOM) neural net-work was used as text categorization algorithm. We will also, investigates the effect of dimensionality reduction by this methods on the performance of text categorization using both entropy and f-measure as evaluation measures.
引用
收藏
页码:860 / 866
页数:7
相关论文
共 50 条
  • [31] Temporal-based Feature Selection and Transfer Learning for Text Categorization
    Fukumoto, Fumiyo
    Suzuki, Yoshimi
    [J]. 2015 7TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT (IC3K), 2015, : 17 - 26
  • [32] Study On Feature Selection And Weighting Based On Synonym Merge In Text Categorization
    Lu, Zhenyu
    Lin, Yongmin
    Zhao, Shuang
    Chen, Xuebin
    [J]. SECOND INTERNATIONAL CONFERENCE ON FUTURE NETWORKS: ICFN 2010, 2010, : 105 - 109
  • [33] An empirical study of feature selection for text categorization based on term weightage
    How, BC
    Narayanan, K
    [J]. IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2004), PROCEEDINGS, 2004, : 599 - 602
  • [34] A Feature Selection Methods Based on Concept Extraction and SOM Text Clustering Analysis
    Wang, Lin
    Jiang, Minghu
    Liao, Shasha
    Lu, Yinghua
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2006, 6 (1A): : 20 - 28
  • [35] Applying cascaded feature selection to SVM text categorization
    Masuyama, T
    Nakagawa, H
    [J]. 13TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2002, : 241 - 245
  • [36] Enhancement of DTP feature selection method for text categorization
    Moyotl-Hernández, E
    Jiménez-Salazar, H
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2005, 3406 : 719 - 722
  • [37] Feature selection for support vector machines in text categorization
    Liu, Y
    Lu, HM
    Lu, ZX
    Wang, P
    [J]. MLMTA'03: INTERNATIONAL CONFERENCE ON MACHINE LEARNING; MODELS, TECHNOLOGIES AND APPLICATIONS, 2003, : 129 - 134
  • [38] A discriminative and semantic feature selection method for text categorization
    Zong, Wei
    Wu, Feng
    Chu, Lap-Keung
    Sculli, Domenic
    [J]. INTERNATIONAL JOURNAL OF PRODUCTION ECONOMICS, 2015, 165 : 215 - 222
  • [39] Using typical testors for feature selection in text categorization
    Pons-Porratal, Aurora
    Gil-Garcia, Reynaldo
    Berlanga-Liavori, Rafael
    [J]. PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS AND APPLICATIONS, PROCEEDINGS, 2007, 4756 : 643 - +
  • [40] Feature Selection with Structural Sparse Mode for Text Categorization
    Zheng, Wenbin
    Tang, Dan
    Zhang, Haiqing
    Tang, Hong
    [J]. 2017 NINTH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS (IHMSC 2017), VOL 1, 2017, : 359 - 362