Web document clustering using a hybrid neural network

Cited by: 11
Authors
Khan, MS [1 ]
Khor, SW [1 ]
Affiliation
[1] Murdoch Univ, Sch Informat Technol, Murdoch, WA 6150, Australia
Keywords
hybrid neural network; PCA; ART; Web document clustering; information retrieval; document features extraction
DOI
10.1016/j.asoc.2004.02.003
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
The list of documents returned by Internet search engines in response to a query these days can be quite overwhelming. There is an increasing need for organising this information and presenting it in a more compact and efficient manner. This paper describes a method developed for the automatic clustering of World Wide Web documents, according to their relevance to the user's information needs, by using a hybrid neural network. The objective is to reduce the time and effort the user has to spend to find the information sought after. Clustering documents by features representative of their contents - in this case, key words and phrases - increases the effectiveness and efficiency of the search process. It is shown that a two-dimensional visual presentation of information on retrieved documents, instead of the traditional linear listing, can create a more user-friendly interface between a search engine and the user. (C) 2004 Elsevier B.V. All rights reserved.
Pages: 423-432
Page count: 10