Web Document Clustering Approach using WordNet Lexical Categories and Fuzzy Clustering

被引:0
|
作者
Gharib, Tarek F. [1 ]
Fouad, Mohammed M. [2 ]
Aref, Mostafa M. [1 ]
机构
[1] Ain Shams Univ, Fac Comp & Informat Sci, Cairo, Egypt
[2] Akhbar EI Yom Acad, Dept Comp Sci, October, Egypt
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Web mining is defined as applying data mining techniques to the content, structure, and usage of Web resources. The three areas of Web mining are commonly distinguished: content mining, structure mining, and usage mining. In all these areas, a wide range of general data mining techniques, in particular association rule discovery, clustering, classification, and sequence mining, are employed and developed further to reflect the specific structures of Web resources and the specific questions posed in Web mining. In this paper, we introduced a web document clustering approach that uses WordNet lexical categories and fuzzy c-means algorithm to improve the performance of clustering problem for web document. Experiments show that Fuzzy c-means algorithm achieves great performance optimization with comparison with the recent algorithms for document clustering.
引用
收藏
页码:55 / +
页数:3
相关论文
共 50 条
  • [21] Phrase Based Web Document Clustering: An Indexing Approach
    Singh, Amit Prakash
    Srivastava, Shalini
    Sahu, Sanjib Kumar
    [J]. COMPUTER COMMUNICATION, NETWORKING AND INTERNET SECURITY, 2017, 5 : 481 - 492
  • [22] A maximal frequent itemset approach for web document clustering
    Zhuang, L
    Dai, HH
    [J]. FOURTH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY, PROCEEDINGS, 2004, : 970 - 977
  • [23] A Novel Modified Apriori Approach for Web Document Clustering
    Roul, Rajendra Kumar
    Varshneya, Saransh
    Kalra, Ashu
    Sahay, Sanjay Kumar
    [J]. COMPUTATIONAL INTELLIGENCE IN DATA MINING, VOL 3, 2015, 33
  • [24] Lexical and semantic clustering by web links
    Menczer, F
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2004, 55 (14): : 1261 - 1269
  • [25] Hybrid neural document clustering using guided self-organization and wordnet
    Hung, CL
    Wermter, S
    Smith, P
    [J]. IEEE INTELLIGENT SYSTEMS, 2004, 19 (02) : 68 - 77
  • [26] Application of fuzzy clustering algorithm in Chinese document clustering
    Li, Jiafu
    Zhang, Yafei
    Lu, Jianjiang
    [J]. Jisuanji Gongcheng/Computer Engineering, 2002, 28 (04):
  • [27] MMCDM Based Approach for Efficient Web Document Clustering in Web Search
    Siva, R.
    Thandapani, T.
    Ramesh, R.
    Balamurali, R.
    [J]. BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS, 2020, 13 (03): : 82 - 88
  • [28] Clustering Web pages into hierarchial categories
    Louisiana Tech University, Ruston, LA, United States
    [J]. Int. J. Intell. Inf. Technologies, 2007, 2 (17-35):
  • [29] Clustering Web Pages into Hierarchical Categories
    Yao, Zhongmei
    Choi, Ben
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT INFORMATION TECHNOLOGIES, 2007, 3 (02) : 17 - 35
  • [30] Web document clustering using semantic link analysis
    Arch-int, Somjit
    [J]. INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE FOR MODELLING, CONTROL & AUTOMATION JOINTLY WITH INTERNATIONAL CONFERENCE ON INTELLIGENT AGENTS, WEB TECHNOLOGIES & INTERNET COMMERCE, VOL 2, PROCEEDINGS, 2006, : 13 - 18