Performance Evaluation of Semantic Based and Ontology Based Text Document Clustering Techniques

被引:10
|
作者
Punitha, S. C. [1 ]
Punithavalli, M. [1 ]
机构
[1] PSGR Krishnammal Coll Women, Dept Comp Sci, Coimbatore, Tamil Nadu, India
关键词
Dataming; Document clustering; HSTC; Feature Selection; TCFSmethod;
D O I
10.1016/j.proeng.2012.01.839
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The amount of digital information is created and used is steadily growing along with the development of sophisticated hardware and software. This has increased the need for powerful algorithms that can interpret and extract interesting knowledge from these data. Data mining is a technique that has been successfully exploited for this purpose. Text mining, a category of data mining, considers only digital documents or text. Text Clustering is the process of grouping text or documents such that the document in the same cluster are similar and are dissimilar from the one in other clusters. This paper studies the working of two sophisticated algorithms. The first work is a hybrid method that combines pattern recognition process with semantic driven methods for clustering documents, while the second uses an ontology-based approach to cluster documents. Through experiments, the performance of both the selected algorithms is analyzed in terms of clustering efficiency and speed of clustering. (C) 2011 Published by Elsevier Ltd. Selection and/or peer-review under responsibility of ICCTSD 2011
引用
收藏
页码:100 / 106
页数:7
相关论文
共 50 条
  • [1] Semantic document clustering based on ontology
    Wang, Ying
    Peng, Tao
    Zuo, Wanli
    He, Fengling
    Wang, Dong
    [J]. Journal of Computational Information Systems, 2009, 5 (03): : 1437 - 1444
  • [2] Ontology-based text document clustering
    Staab, S
    Hotho, A
    [J]. INTELLIGENT INFORMATION PROCESSING AND WEB MINING, 2003, : 451 - 452
  • [3] A Text Document Clustering Method Based on Ontology
    Ding, Yi
    Fu, Xian
    [J]. ADVANCES IN NEURAL NETWORKS - ISNN 2011, PT II, 2011, 6676 : 199 - 206
  • [4] Text Clustering Based on Domain Ontology and Latent Semantic Analysis
    Li Yaxiong
    Pan Deng
    [J]. MECHATRONICS ENGINEERING, COMPUTING AND INFORMATION TECHNOLOGY, 2014, 556-562 : 3536 - +
  • [5] An Ontology-based Semantic Clustering Algorithm for Accounting Text
    Jiang, Yanhui
    Li, Mo
    Yao, Kaohua
    [J]. INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS & STATISTICS, 2013, 43 (13): : 59 - 67
  • [6] Performance of Ontology-Based Semantic Similarities in Clustering
    Batet, Montserrat
    Valls, Aida
    Gibert, Karina
    [J]. ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, PT I, 2010, 6113 : 281 - +
  • [7] An Ontology Based Model for Document Clustering
    Sridevi, U.
    Nagaveni, N.
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT INFORMATION TECHNOLOGIES, 2011, 7 (03) : 54 - 69
  • [8] Ontology-based semantic clustering
    Batet, Montserrat
    [J]. AI COMMUNICATIONS, 2011, 24 (03) : 291 - 292
  • [9] Semantic-Based Text Document Clustering Using Cognitive Semantic Learning and Graph Theory
    Ali, Ismael
    Melton, Austin
    [J]. 2018 IEEE 12TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC), 2018, : 243 - 247
  • [10] Text document clustering based on neighbors
    Luo, Congnan
    Li, Yanjun
    Chung, Soon M.
    [J]. DATA & KNOWLEDGE ENGINEERING, 2009, 68 (11) : 1271 - 1288