A Modified Support Vector Clustering Method for Document Categorization

被引:0
|
作者
Harish, B. S. [1 ]
Revanasiddappa, M. B. [1 ]
Kumar, S. V. Aruna [1 ]
机构
[1] JSS Sci & Technol Univ, Dept Informat Sci & Engn, Mysuru, Karnataka, India
关键词
text categorization; support vector clustering; juzzy C-Means; term document matrix;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this paper, we propose a novel text categorization method based on modified Support Vector Clustering (SVC). SVC is a density based clustering approach, which handles the arbitrary shape clusters effectively. The main drawback of traditional SVC is that it treats unclassified documents as outliers. To overcome this problem, we employed Fuzzy C-Means (FCM) to cluster unclassified documents. The modified SVC (SVC-FCM) is applied to categorize text documents. The proposed method consists of three steps: In the first step, Regularized Locality Preserving Indexing (RLPI) is applied on Term Document Matrix (TDM) to reduce dimensionality of features. In second step, we use SVC to find base-cluster centers of documents. Finally, we use FCM to cluster unclassified documents. To evaluate the performance of the proposed method, we conducted experiments on standard 20-NewsGroup dataset.
引用
收藏
页码:1 / 5
页数:5
相关论文
共 50 条
  • [31] Fuzzy support vector clustering
    Zheng, En-Hui
    Yang, Min
    Li, Ping
    Song, Zhi-Huan
    [J]. ADVANCES IN NEURAL NETWORKS - ISNN 2006, PT 1, 2006, 3971 : 1050 - 1056
  • [32] Shrunk Support Vector Clustering
    Ling, Ping
    Rong, Xiangsheng
    Hao, Guosheng
    Dong, Yongquan
    [J]. PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 438 - 445
  • [33] Reduced support vector clustering
    Ling, Ping
    Wang, Zhe
    Zhou, Chunguang
    Huang, Lan
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2010, 47 (08): : 1372 - 1381
  • [34] Improved support vector clustering
    Ling Ping
    Zhou Chun-Guang
    Zhou Xu
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2010, 23 (04) : 552 - 559
  • [35] Rough support vector clustering
    Asharaf, S
    Shevade, SK
    Murty, MN
    [J]. PATTERN RECOGNITION, 2005, 38 (10) : 1779 - 1783
  • [36] Support Vector Motion Clustering
    Lawal, Isah A.
    Poiesi, Fabio
    Anguita, Davide
    Cavallaro, Andrea
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2017, 27 (11) : 2395 - 2408
  • [37] Multiscale support vector clustering
    Hansen, Michael Sass
    Holm, David Alberg
    Sjostrand, Karl
    Ley, Carsten Dan
    Rowland, Ian John
    Larsen, Rasmus
    [J]. MEDICAL IMAGING 2008: IMAGE PROCESSING, PTS 1-3, 2008, 6914
  • [38] Method of image database clustering and categorization
    Department of Computer Science and Engineering, Changshu Institute of Technology, Changshu 215500, China
    不详
    不详
    [J]. Kongzhi yu Juece Control Decis, 2008, 6 (701-704):
  • [39] Predicting the Possibilistic Score of OWL Axioms through Modified Support Vector Clustering
    Malchiodi, Dario
    Tettamanzi, Andrea G. B.
    [J]. 33RD ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, 2018, : 1984 - 1991
  • [40] A Model of Extended Paragraph Vector for Document Categorization and Trend Analysis
    Liu, Pengfei
    Wu, King Keung
    Meng, Helen
    [J]. 2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 2400 - 2406