Information Mining System Design and Implementation Based on Web Crawler

被引:0
|
作者
Lin, Shan [1 ]
Li, You-meng [1 ]
Li, Qing-cheng [1 ]
机构
[1] Nankai Univ, Coll Informat Tech Sci, Tianjin 300072, Peoples R China
关键词
Crawler; information mining; RSS; low cost;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the information explosion causing by the World Wide Web in recent years, the issue of how to execute the enormous information efficiently at a reasonable lost has become the concern of information providers, service agencies and end users. When many research focus on how to design an efficient web crawler, we pay our attention to how to make the best of the result of web crawler. In this paper, we describe the design and implementation of an information mining system running on the results of web crawler to gain more metadata from unstructured documents for focused search (such as RSS search). We present the software architecture of the system, describe efficient techniques for achieving high performance and report preliminary experimental results to prove that this system can address the issue of robustness, flexibility and accuracy at a low cost.
引用
收藏
页码:100 / 104
页数:5
相关论文
共 50 条
  • [21] Design and Implementation of Logistics Information Management System Based On Web Service
    Jiang, Hua
    Li, Yuman
    Fang, Hua
    [J]. 14TH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND APPLICATIONS FOR BUSINESS, ENGINEERING AND SCIENCE (DCABES 2015), 2015, : 130 - 133
  • [22] Design and Implementation of Bank Client Information Management System Based on Web
    Qiu, Dehai
    Yang, Ziyan
    [J]. PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON ELECTRONIC, MECHANICAL, INFORMATION AND MANAGEMENT SOCIETY (EMIM), 2016, 40 : 915 - 918
  • [23] Research on the product design information filtering system based on web and the implementation
    Xiao, P
    He, B
    [J]. ISTM/2005: 6TH INTERNATIONAL SYMPOSIUM ON TEST AND MEASUREMENT, VOLS 1-9, CONFERENCE PROCEEDINGS, 2005, : 5743 - 5746
  • [24] Design and Implementation of Project Cost Management Information System Based on Web
    Li, Xiaoran
    [J]. 2021 6TH INTERNATIONAL CONFERENCE ON SMART GRID AND ELECTRICAL AUTOMATION (ICSGEA 2021), 2021, : 288 - 292
  • [25] Design and Implementation of University Student Information Management System Based on Web
    Jiang, W-J.
    Jin, Z-G.
    Yang, Z-Y.
    [J]. 2011 SECOND INTERNATIONAL CONFERENCE ON EDUCATION AND SPORTS EDUCATION (ESE 2011), VOL II, 2011, : 257 - 259
  • [26] Design and Implementation of an Aggregation-based Tourism Web Information System
    Idris, Ainie Zeinaida
    Yahaya, Nor Adnan
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2009, 9 (12): : 143 - 148
  • [27] The Design and Implementation of Tourist Perception Information Service System based on Web
    Liu, Yan
    [J]. PROCEEDINGS OF THE 2016 4TH INTERNATIONAL CONFERENCE ON MACHINERY, MATERIALS AND COMPUTING TECHNOLOGY, 2016, 60 : 382 - 385
  • [28] Design and Application of Intelligent Dynamic Crawler for Web Data Mining
    Zheng Guojun
    Jia Wenchao
    Shi Jihui
    Shi Fan
    Zhu Hao
    Liu Jiang
    [J]. 2017 32ND YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION (YAC), 2017, : 1098 - 1105
  • [29] Mining Techniques of XSS Vulnerabilities Based on Web Crawler
    Wan Fangfang
    Xie Xusheng
    [J]. MECHATRONICS ENGINEERING, COMPUTING AND INFORMATION TECHNOLOGY, 2014, 556-562 : 6290 - 6293
  • [30] Research on Web Data Mining Based on Topic Crawler
    Guo, Hongjian
    [J]. JOURNAL OF WEB ENGINEERING, 2021, 20 (04): : 1131 - 1143