Web document clustering using a hybrid neural network

Cited by: 11
Authors
Khan, MS [1 ]
Khor, SW [1 ]
Affiliations
[1] Murdoch Univ, Sch Informat Technol, Murdoch, WA 6150, Australia
Keywords
hybrid neural network; PCA; ART; Web document clustering; information retrieval; document features extraction
DOI
10.1016/j.asoc.2004.02.003
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The list of documents returned by Internet search engines in response to a query these days can be quite overwhelming. There is an increasing need for organising this information and presenting it in a more compact and efficient manner. This paper describes a method developed for the automatic clustering of World Wide Web documents, according to their relevance to the user's information needs, by using a hybrid neural network. The objective is to reduce the time and effort the user has to spend to find the information sought after. Clustering documents by features representative of their contents - in this case, key words and phrases - increases the effectiveness and efficiency of the search process. It is shown that a two-dimensional visual presentation of information on retrieved documents, instead of the traditional linear listing, can create a more user-friendly interface between a search engine and the user. (C) 2004 Elsevier B.V. All rights reserved.
Pages: 423-432
Page count: 10