Web document clustering using a hybrid neural network

被引:11
|
作者
Khan, MS [1 ]
Khor, SW [1 ]
机构
[1] Murdoch Univ, Sch Informat Technol, Murdoch, WA 6150, Australia
关键词
hybrid neural network; PCA; ART; Web document clustering; information retrieval; document features extraction;
D O I
10.1016/j.asoc.2004.02.003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The list of documents returned by Internet search engines in response to a query these days can be quite overwhelming. There is an increasing need for organising this information and presenting it in a more compact and efficient manner. This paper describes a method developed for the automatic clustering of World Wide Web documents, according to their relevance to the user's information needs, by using a hybrid neural network. The objective is to reduce the time and effort the user has to spend to find the information sought after. Clustering documents by features representative of their contents - in this case, key words and phrases - increases the effectiveness and efficiency of the search process. It is shown that a two-dimensional visual presentation of information on retrieved documents, instead of the traditional linear listing, can create a more user-friendly interface between a search engine and the user. (C) 2004 Elsevier B.V. All rights reserved.
引用
收藏
页码:423 / 432
页数:10
相关论文
共 50 条
  • [41] Diagnosis of Cancer Using Hybrid Clustering and Convolution Neural Network from Breast Thermal Image
    Lakshminarayanan, Aarthy Seshadri
    Radhakrishnan, Sujatha
    Pandiasankar, Gopinath Masila
    Ramu, Swarnapriya
    JOURNAL OF TESTING AND EVALUATION, 2019, 47 (06) : 3975 - 3987
  • [42] Comparison of neural models for document clustering
    Guerrero-Bote, VP
    López-Pujalte, C
    de Moya-Anegón, F
    Herrero-Solana, V
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2003, 34 (2-3) : 287 - 305
  • [43] A New Hybrid Approach for Document Clustering
    Ismael, Osama
    2017 13TH INTERNATIONAL COMPUTER ENGINEERING CONFERENCE (ICENCO), 2017, : 291 - 296
  • [44] Automatic Document Classification Using Convolutional Neural Network
    Sun, Xingping
    Li, Yibing
    Kang, Hongwei
    Shen, Yong
    2018 INTERNATIONAL SEMINAR ON COMPUTER SCIENCE AND ENGINEERING TECHNOLOGY (SCSET 2018), 2019, 1176
  • [45] Global Binarization of Document Images Using a Neural Network
    Khashman, Adnan
    Sekeroglu, Boran
    SITIS 2007: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON SIGNAL IMAGE TECHNOLOGIES & INTERNET BASED SYSTEMS, 2008, : 665 - 672
  • [46] DOCUMENT IMAGE BINARISATION USING A SUPERVISED NEURAL NETWORK
    Khashman, Adnan
    Sekeroglu, Boran
    INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2008, 18 (05) : 405 - 418
  • [47] Web search result refinement by document clustering
    Tsui, Ming Hei
    Lim, Bresley
    Shi, Daming
    2007 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOLS 1-8, 2007, : 2224 - 2229
  • [48] A probabilistic relational approach for web document clustering
    Fersini, E.
    Messina, E.
    Archetti, F.
    INFORMATION PROCESSING & MANAGEMENT, 2010, 46 (02) : 117 - 130
  • [49] Digital Web Library of a Website with Document Clustering
    Mahecha-Nieto, Isabel
    Leon, Elizabeth
    ADVANCES IN ARTIFICIAL INTELLIGENCE - IBERAMIA 2010, 2010, 6433 : 214 - 223
  • [50] Unsupervised clustering for nontextual web document classification
    Chan, SWK
    Chong, MWC
    DECISION SUPPORT SYSTEMS, 2004, 37 (03) : 377 - 396