SISC: A Text Classification Approach Using Semi Supervised Subspace Clustering

被引:0
|
作者
Ahmed, Mohammad Salim [1 ]
Khan, Latifur [1 ]
机构
[1] Univ Texas Dallas, Dept Comp Sci, Richardson, TX 75083 USA
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Text classification poses some specific challenges. One such challenge is its high dimensionality where each document (data point) contains only a small subset of them. In this paper, we propose Semi-supervised Impurity based Subspace Clustering (SISC) in conjunction with,c-Nearest Neighbor approach, based on semi-supervised subspace clustering that considers the high dimensionality as well as the sparse nature of them in text data. S/SC finds clusters in the subspaces of the high dimensional text data where each text document has fuzzy cluster membership. This fuzzy clustering exploits two factors - chi square statistic of the dimensions and the impurity measure within each cluster. Empirical evaluation on real world data sets reveals the effectiveness of our approach as it significantly outperforms other state-of-the-art text classification and subspace clustering algorithms.
引用
收藏
页码:1 / 6
页数:6
相关论文
共 50 条
  • [1] TESC: An approach to TExt classification using Semi-supervised Clustering
    Zhang, Wen
    Tang, Xijin
    Yoshida, Taketoshi
    [J]. KNOWLEDGE-BASED SYSTEMS, 2015, 75 : 152 - 160
  • [2] Text Classification Using Semi-Supervised Clustering
    Zhang, Wen
    Yoshida, Taketoshi
    Tang, Xijin
    [J]. 2009 INTERNATIONAL CONFERENCE ON BUSINESS INTELLIGENCE AND FINANCIAL ENGINEERING, PROCEEDINGS, 2009, : 197 - 200
  • [3] Semi supervised approach towards subspace clustering
    Harikumar, Sandhya
    Akhil, A. S.
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2018, 34 (03) : 1619 - 1629
  • [4] A genetic semi-supervised fuzzy clustering approach to text classification
    Liu, H
    Huang, ST
    [J]. ADVANCES IN WEB-AGE INFORMATION MANAGEMENT, PROCEEDINGS, 2003, 2762 : 173 - 180
  • [5] A novel semi supervised approach for text classification
    Barman D.
    Chowdhury N.
    [J]. International Journal of Information Technology, 2020, 12 (4) : 1147 - 1157
  • [6] Text classification with enhanced semi-supervised fuzzy clustering
    Keswani, G
    Hall, LO
    [J]. PROCEEDINGS OF THE 2002 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOL 1 & 2, 2002, : 621 - 626
  • [7] Use of Distributed Semi-Supervised Clustering for Text Classification
    Li, Pei
    Deng, Ze
    [J]. JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2019, 28 (08)
  • [8] Text Classification using Semi-supervised Approach for Multi Domain
    Deshmukh, Jyoti S.
    Tripathy, Amiya Kumar
    [J]. 2017 INTERNATIONAL CONFERENCE ON NASCENT TECHNOLOGIES IN ENGINEERING (ICNTE-2017), 2017,
  • [9] An Approach for Classification of Network Traffic on Semi - Supervised Data using Clustering Techniques
    Shukla, Dheeraj Basant
    Chandel, Gajendra Singh
    [J]. 2013 4TH NIRMA UNIVERSITY INTERNATIONAL CONFERENCE ON ENGINEERING (NUICONE 2013), 2013,
  • [10] Improving Semi-Supervised Classification using Clustering
    Arora, J.
    Tushir, M.
    Kashyap, R.
    [J]. EAI ENDORSED TRANSACTIONS ON SCALABLE INFORMATION SYSTEMS, 2020, 7 (25) : 1 - 9