Fuzzy co-clustering of web documents

被引:0
|
作者
William-Chandra, T [1 ]
Chen, L [1 ]
机构
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Div Informat Engn, Singapore, Singapore
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Web is the largest information repository in the history of mankind. Due to its huge size however, finding relevant information without any appropriate tool can be virtually impossible. Web document clustering is one possible technique to improve the efficiency in information finding process. In this paper, we are looking into fuzzy co-clustering, which is known to be robust for clustering standard text documents. In our opinion, its robustness can also be extended to web documents because it can generate descriptive clusters in high dimension and it is able to discover data clusters with overlaps. We consider two existing fuzzy co-clustering algorithms, FCCM and Fuzzy Codok. In addition, we propose a new algorithm, FCC-STF, as an alternative to the existing ones. Empirical study of these algorithms on benchmark datasets is presented, together with the performance comparison with a standard fuzzy clustering algorithm HFCM. The results show that fuzzy co-clustering is generally superior to standard fuzzy clustering in the Web environment, making it a technique with great potential to assist internet user in discovering relevant information effectively.
引用
收藏
页码:545 / 551
页数:7
相关论文
共 50 条
  • [1] Fuzzy co-clustering of documents and keywords
    Kurnmamuru, K
    Dhawale, A
    Krishnapuram, R
    [J]. PROCEEDINGS OF THE 12TH IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1 AND 2, 2003, : 772 - 777
  • [2] Fuzzy semi-supervised co-clustering for text documents
    Yan, Yang
    Chen, Lihui
    Tjhi, William-Chandra
    [J]. FUZZY SETS AND SYSTEMS, 2013, 215 : 74 - 89
  • [3] Constrained Co-Clustering for Textual Documents
    Song, Yangqiu
    Pan, Shimei
    Liu, Shixia
    Wei, Furu
    Zhou, Michelle X.
    Qian, Weihong
    [J]. PROCEEDINGS OF THE TWENTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-10), 2010, : 581 - 586
  • [4] Co-clustering of fuzzy lagged data
    Shaham, Eran
    Sarne, David
    Ben-Moshe, Boaz
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2015, 44 (01) : 217 - 252
  • [5] Robust fuzzy co-clustering algorithm
    Tjhi, William-Chandra
    Chen, Lihui
    [J]. 2007 6TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS & SIGNAL PROCESSING, VOLS 1-4, 2007, : 1591 - 1595
  • [6] Spectral co-clustering documents and words using fuzzy K-harmonic means
    Na Liu
    Fei Chen
    Mingyu Lu
    [J]. International Journal of Machine Learning and Cybernetics, 2013, 4 : 75 - 83
  • [7] Spectral co-clustering documents and words using fuzzy K-harmonic means
    Liu, Na
    Chen, Fei
    Lu, Mingyu
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2013, 4 (01) : 75 - 83
  • [8] Co-clustering of fuzzy lagged data
    Eran Shaham
    David Sarne
    Boaz Ben-Moshe
    [J]. Knowledge and Information Systems, 2015, 44 : 217 - 252
  • [9] Co-clustering WSDL Documents to Bootstrap Service Discovery
    Liang, Tingting
    Chen, Liang
    Ying, Haochao
    Wu, Jian
    [J]. 2014 IEEE 7TH INTERNATIONAL CONFERENCE ON SERVICE-ORIENTED COMPUTING AND APPLICATIONS (SOCA), 2014, : 215 - 222
  • [10] Fuzzy Co-clustering with Automated Variable Weighting
    Laclau, Charlotte
    de Carvalho, Francisco de A. T.
    Nadif, Mohamed
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE 2015), 2015,