A Novel Fuzzy Based Clustering Algorithm for Text Classification

被引:0
|
作者
Mohan, A. Krishna [1 ]
Rao, V. V. Narasimha [2 ]
Prasad, M. H. M. Krishna [3 ]
机构
[1] JNTU Kakinada, Dept Comp & Engn, Kakinada, India
[2] JNTU Kakinada, Dept Comp Sci & Engn, Kakinada, India
[3] JNTU Vijayanagaram, Dept Comp Sci Sci & Engn Sci & Engn, Vizianagaram, India
关键词
Dimensionality reduction; Skewness; feature extraction; fuzzy clustering; split normal distribution;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Due to the flourish of World Wide Web and the rapid development of the Internet technology, the increasing volume of digital textual data become more and more unmanageable, therefore the importance of text classification has gained significant attention. Text classification pose some specific challenges such as high dimensionality with each document (data point) having only a very small subset of them and representing multiple labels at the same time. Feature clustering is a powerful method to reduce the dimensionality of feature vectors for text classification. Many researchers worked on Feature Clustering for efficient text classification. Recently a Fuzzy based feature clustering was proposed in which Gaussian distribution is used for fuzzy membership function for clustering. But the problem of skewness may occur with this distribution. To overcome that we propose an efficient Fuzzy similarity based membership function for efficient clustering and with this proposed algorithm satisfactory results obtained.
引用
收藏
页码:100 / 107
页数:8
相关论文
共 50 条
  • [1] Text information classification method based on secondly fuzzy clustering algorithm
    Zhang, Yuan
    Zhang, Yanping
    Zhang, Runmei
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 38 (06) : 7743 - 7754
  • [2] A text fuzzy clustering method based on genetic algorithm
    Xu, ZJ
    He, ZS
    Xuan, J
    [J]. Proceedings of the 11th Joint International Computer Conference, 2005, : 876 - 879
  • [3] Fuzzy Set Based Clustering Algorithm of Web Text
    Wan, Hongxin
    Peng, Yun
    [J]. ADVANCES IN MECHATRONICS AND CONTROL ENGINEERING III, 2014, 678 : 19 - +
  • [4] A Fuzzy Self-Constructing Feature Clustering Algorithm for Text Classification
    Jiang, Jung-Yi
    Liou, Ren-Jia
    Lee, Shie-Jue
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2011, 23 (03) : 335 - 349
  • [5] An Improved KNN Text Classification Algorithm Based on Clustering
    Zhou Yong
    Li Youwen
    Xia Shixiong
    [J]. JOURNAL OF COMPUTERS, 2009, 4 (03) : 230 - 237
  • [6] A fuzzy support vector machine algorithm for classification based on a novel PIM fuzzy clustering method
    Wu, Zhenning
    Zhang, Huaguang
    Liu, Jinhai
    [J]. NEUROCOMPUTING, 2014, 125 : 119 - 124
  • [7] A Text Classification Algorithm Based on Rocchio and Hierarchical Clustering
    Zeng, Anping
    Huang, Yongping
    [J]. ADVANCED INTELLIGENT COMPUTING, 2011, 6838 : 432 - +
  • [8] Novel Fuzzy Clustering Algorithm Based on Fireflies
    Dan, Li
    Ke, Luo
    Zhen, Sun
    [J]. PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON APPLIED SCIENCE AND ENGINEERING INNOVATION, 2015, 12 : 112 - 116
  • [9] Clustering Sentence-Level Text Using a Novel Fuzzy Relational Clustering Algorithm
    Skabar, Andrew
    Abdalgader, Khaled
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2013, 25 (01) : 62 - 75
  • [10] Algorithm on Moving Targets Classification Based on Fuzzy Clustering
    Zeng, Ruili
    Xiao, Yunkui
    Li, Gang
    Zhang, Lingling
    He, Guoben
    [J]. PROCEEDINGS OF 2010 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY (ICCSIT 2010), VOL 6, 2010, : 288 - 291