A Genetic-Fuzzy Approach for Automatic Text Categorization

被引:0
|
作者
Kumbhar, Pradnya [1 ]
Mali, Manisha [1 ]
Atique, Mohammad [2 ]
机构
[1] VIIT, Dept Comp Engn, Pune, Maharashtra, India
[2] SGBAU, Dept Comp Sci, Amravati, India
关键词
Feature selection; text classification; genetic algorithm; fuzzy-rule based system; FEATURE-SELECTION; MACHINE;
D O I
10.1109/IACC.2017.114
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The rapid growth of World Wide Web has resulted in massive information from varied sources rising at an exponential rate. The high availability of such disparate information has precipitated the need of automatic text categorization for managing, organizing huge data and knowledge discovery. Main challenges of text classification include high dimensionality of feature space and classification accuracy. Thus, to make classifiers more accurate and efficient, there arises the need of Feature Selection. Genetic algorithms have gained much attention over traditional methods due to its simplicity and robustness to solve the optimization problem and high exponential search ability. Thus, the paper focuses on using Genetic Algorithm (GA) for Feature Selection to obtain optimal features for classifying unstructured data. We build a fuzzy rule-based classifier that automatically generates fuzzy rules for classification. The experiments are conducted on two-datasets namely 20-Newsgroup and Reuters-21578 and the results indicate that GA outperforms Principal Component Analysis (PCA).
引用
收藏
页码:572 / 578
页数:7
相关论文
共 50 条
  • [31] Automatic text categorization of news articles
    Amasyali, MF
    Yildirim, T
    [J]. PROCEEDINGS OF THE IEEE 12TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, 2004, : 224 - 226
  • [32] Automatic text categorization with learning logic
    Al-Mubaid, H
    Siddiqui, MS
    [J]. COMPUTER APPLICATIONS IN INDUSTRY AND ENGINEERING, 2003, : 178 - 183
  • [33] Text Classifiers for Automatic Articles Categorization
    Westa, Mateusz
    Szymanski, Julian
    Krawczyk, Henryk
    [J]. ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, PT II, 2012, 7268 : 196 - 204
  • [34] Fuzzy clustering and categorization of text documents
    Ayeldeen, Heba
    Rassanien, Aboul Ella
    Fahmy, Aly Aly
    [J]. 2013 13TH INTERNATIONAL CONFERENCE ON HYBRID INTELLIGENT SYSTEMS (HIS), 2013, : 262 - 266
  • [35] Text Categorization by Fuzzy Domain Adaptation
    Behbood, Vahid
    Lu, Jie
    Zhang, Guangquan
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ - IEEE 2013), 2013,
  • [36] Learning to avoid moving obstacles optimally for mobile robots using a genetic-fuzzy approach
    Deb, K
    Pratihar, DK
    Ghosh, A
    [J]. PARALLEL PROBLEM SOLVING FROM NATURE - PPSN V, 1998, 1498 : 583 - 592
  • [37] A Divide-and-Conquer Genetic-Fuzzy Mining Approach for Items with Multiple Minimum Supports
    Chen, Chun-Hao
    Hong, Tzung-Pei
    Tseng, Vincent S.
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-5, 2008, : 1233 - +
  • [38] A Cluster-based Genetic-Fuzzy mining approach for items with Multiple Minimum Supports
    Chen, Chun-Hao
    Hong, Tzung-Pei
    Tseng, Vincent S.
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2008, 5012 : 864 - +
  • [39] Time-series-dynamics Modeling and Forecasting - An Accurate and Interpretable Genetic-Fuzzy Approach
    Gorzalczany, Marian B.
    Rudzinski, Filip
    [J]. ADVANCES IN FUZZY LOGIC AND TECHNOLOGY 2017, VOL 2, 2018, 642 : 165 - 175
  • [40] Genetic-fuzzy model of diesel engine working cycle
    Kekez, M.
    Radziszewski, L.
    [J]. BULLETIN OF THE POLISH ACADEMY OF SCIENCES-TECHNICAL SCIENCES, 2010, 58 (04) : 665 - 671