Validation of text clustering based on document contents

被引:0
|
作者
Toivonen, J
Visa, A
Vesanen, T
Back, B
Vanharanta, H
机构
[1] Tampere Univ Technol, FIN-33101 Tampere, Finland
[2] Abo Akad Univ, FIN-20520 Turku, Finland
[3] Pori Sch Technol & Econ, FIN-28101 Pori, Finland
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper some results of a new text clustering methodology axe presented. A prototype is an interesting document or a part of an extracted, interesting text. The given prototype is matched with the existing document database or the monitored document flow. Our claim is that the new methodology is capable of automatic content-based clustering using the information of the document. To verify this hypothesis an experiment was designed with the Bible. Four different translations, one Greek, one Latin, and two Finnish translations from years 1933/38 and 1992 were selected as test text material. Validation experiments were performed with a designed prototype version of the software application.
引用
收藏
页码:184 / 195
页数:12
相关论文
共 50 条
  • [1] Text document clustering based on neighbors
    Luo, Congnan
    Li, Yanjun
    Chung, Soon M.
    [J]. DATA & KNOWLEDGE ENGINEERING, 2009, 68 (11) : 1271 - 1288
  • [2] Ontology-based text document clustering
    Staab, S
    Hotho, A
    [J]. INTELLIGENT INFORMATION PROCESSING AND WEB MINING, 2003, : 451 - 452
  • [3] A Text Document Clustering Method Based on Ontology
    Ding, Yi
    Fu, Xian
    [J]. ADVANCES IN NEURAL NETWORKS - ISNN 2011, PT II, 2011, 6676 : 199 - 206
  • [4] A Text Document Clustering Method Based on Topical Concept
    Ding, Yi
    Fu, Xian
    [J]. ADVANCES IN ELECTRONIC COMMERCE, WEB APPLICATION AND COMMUNICATION, VOL 1, 2012, 148 : 547 - 552
  • [5] A parallel text document clustering algorithm based on neighbors
    Li, Yanjun
    Luo, Congnan
    Chung, Soon M.
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2015, 18 (02): : 933 - 948
  • [6] A parallel text document clustering algorithm based on neighbors
    Yanjun Li
    Congnan Luo
    Soon M. Chung
    [J]. Cluster Computing, 2015, 18 : 933 - 948
  • [7] Text Document Preprocessing and Dimension Reduction Techniques for Text Document Clustering
    Kadhim, Ammar Ismael
    Cheah, Yu-N
    Ahamed, Nurul Hashimah
    [J]. PROCEEDINGS 2014 4TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE WITH APPLICATIONS IN ENGINEERING AND TECHNOLOGY ICAIET 2014, 2014, : 69 - 73
  • [8] Text document clustering and the space of concept on text document automatically generated
    Fu, WP
    Wu, B
    He, Q
    Shi, ZZ
    [J]. 2001 INTERNATIONAL CONFERENCES ON INFO-TECH AND INFO-NET PROCEEDINGS, CONFERENCE A-G: INFO-TECH & INFO-NET: A KEY TO BETTER LIFE, 2001, : C107 - C112
  • [9] Text Document Clustering Based on Neural K-Mean Clustering Technique
    Kaur, Daljeet
    Bajwa, Jagpuneet Kaur
    [J]. ADVANCES IN COMPUTING AND DATA SCIENCES, ICACDS 2016, 2017, 721 : 336 - 344
  • [10] Efficient prediction-based validation for document clustering
    Greene, Derek
    Cunningham, Padraig
    [J]. MACHINE LEARNING: ECML 2006, PROCEEDINGS, 2006, 4212 : 663 - 670