Using information filtering in web data mining process

被引:5
|
作者
Zhou, Xujuan [1 ]
Li, Yuefeng [1 ]
Bruza, Peter [1 ]
Wu, Sheng-Tang [1 ]
Xu, Yue [1 ]
Lau, Raymond Y. K. [2 ]
机构
[1] Queensland Univ Technol, Fac Informat Technol, Brisbane, Qld 4000, Australia
[2] City Univ Hong Kong, Dept Informat Syst, Kowloon, Peoples R China
基金
澳大利亚研究理事会;
关键词
D O I
10.1109/WI.2007.24
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The amount of Web information is growing rapidly, improving the efficiency and accuracy of Web information retrieval is uphill battle. There are two fundamental issues regarding the effectiveness of Web information gathering: information mismatch and overload. To tackle these difficult issues, an integrated information filtering and sophisticated data processing model has been presented in this paper In the first phase of the proposed scheme, an information filter that based on user search intents was incorporated in Web search process to quickly filter out irrelevant data. In the second data processing phase, a pattern taxonomy model (PTM) was carried out using the reduced data. PTM rationalizes the data relevance by applying data mining techniques that involves more rigorous computations. Several experiments have been conducted and the results show that more effective and efficient access Web information has been achieved using the new scheme.
引用
收藏
页码:163 / +
页数:2
相关论文
共 50 条
  • [1] Big Data and the Web Discovering Meaningful Information from Web Data using Data Mining Techniques
    Abd Wahab, Mohd Helmy
    [J]. 2015 4TH INTERNATIONAL CONFERENCE ON RELIABILITY, INFOCOM TECHNOLOGIES AND OPTIMIZATION (ICRITO) (TRENDS AND FUTURE DIRECTIONS), 2015,
  • [2] Information management and process improvement using data mining techniques
    Gibbons, WM
    Ranta, M
    Scott, TM
    Mantyla, M
    [J]. INTELLIGENT PROBLEM SOLVING: METHODOLOGIES AND APPROACHES, PROCEEDINGS, 2000, 1821 : 93 - 98
  • [3] Applications of an web information mining model to data mining and information retrieval tasks
    Pereira, AR
    Baeza-Yates, R
    [J]. SIXTEENTH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2005, : 1031 - 1035
  • [4] Elimination of redundant information for web data mining
    Taib, SM
    Yeom, SJ
    Kang, BH
    [J]. ITCC 2005: International Conference on Information Technology: Coding and Computing, Vol 1, 2005, : 200 - 205
  • [5] Study on Data Preprocessing Process in Web Mining
    Peng, Sumian
    Zhou, Xingmei
    [J]. PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON INFORMATION, ELECTRONIC AND COMPUTER SCIENCE, VOLS I AND II, 2009, : 19 - 22
  • [6] A Hybrid Information Filtering Algorithm Based on Distributed Web log Mining
    Ling Yun
    Wang Xun
    Gu Huamao
    [J]. THIRD 2008 INTERNATIONAL CONFERENCE ON CONVERGENCE AND HYBRID INFORMATION TECHNOLOGY, VOL 1, PROCEEDINGS, 2008, : 1086 - 1091
  • [7] Filtering and sophisticated data processing for web information gathering
    Li, Yuefeng
    Zhong, Ning
    Zhou, Xujuan
    Wu, Sheng-Tang
    [J]. ROUGH SETS AND INTELLIGENT SYSTEMS PARADIGMS, PROCEEDINGS, 2007, 4585 : 813 - +
  • [8] Web + Data Mining = Web Mining
    Kilian Stoffel
    [J]. HMD Praxis der Wirtschaftsinformatik, 2009, 46 (4) : 6 - 20
  • [9] Information Intelligent System based on Web Data Mining
    Zhong, Shaobo
    [J]. PROCEEDINGS OF THE INTERNATIONAL SYMPOSIUM ON ELECTRONIC COMMERCE AND SECURITY, 2008, : 514 - 517
  • [10] Personalized Web Information Recommendation Based on Data Mining
    He, Bo
    [J]. ADVANCED RESEARCH ON AUTOMATION, COMMUNICATION, ARCHITECTONICS AND MATERIALS, PTS 1 AND 2, 2011, 225-226 (1-2): : 546 - 549