DATA PREPROCESSING IN WEB TEXT MINING

被引:0
|
作者
Jiang Yongbo [1 ]
机构
[1] Qingdao Technol Univ, Sch Business, Qingdao, Peoples R China
关键词
Data preprocessing; Web text mining; Search engine;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The development of highly efficient and effective search engines is accelerated by the abundant WWW information and people's need for high quality information. Web text mining is one of the key techniques for search engines. But Web data is much complex which enlarges the difficulty in web text mining. To get good mining results, Web page preprocessing is necessary before any text mining starting. Here given the pages set collected from the Robot of search engines, we discussed some essential work to present pages in vectors, such as the term selection, weights presentation, etc. The purpose is to make preparation for the following Web text mining task.
引用
收藏
页码:573 / 581
页数:9
相关论文
共 50 条
  • [1] Data Preprocessing for Web Data Mining
    Zhang, Wei
    Chen, Tinggui
    [J]. ADVANCES IN ELECTRONIC COMMERCE, WEB APPLICATION AND COMMUNICATION, VOL 2, 2012, 149 : 303 - +
  • [2] Preprocessing and mining web log data for web personalization
    Baglioni, M
    Ferrara, U
    Romei, A
    Ruggieri, S
    Turini, F
    [J]. AI(ASTERISK)IA 2003: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2003, 2829 : 237 - 249
  • [3] An overview of data preprocessing in data and web usage mining
    Suresh, R. M.
    Padmajavalli, R.
    [J]. 2006 1ST INTERNATIONAL CONFERENCE ON DIGITAL INFORMATION MANAGEMENT, 2006, : 193 - +
  • [4] Study on Data Preprocessing Process in Web Mining
    Peng, Sumian
    Zhou, Xingmei
    [J]. PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON INFORMATION, ELECTRONIC AND COMPUTER SCIENCE, VOLS I AND II, 2009, : 19 - 22
  • [5] Data Preprocessing Algorithm for Web Structure Mining
    Sharma, Suvarn
    Bhagat, Amit
    [J]. 2016 FIFTH INTERNATIONAL CONFERENCE ON ECO-FRIENDLY COMPUTING AND COMMUNICATION SYSTEMS (ICECCS), 2016, : 94 - 98
  • [6] Interdependence of Text Mining Quality and the Input Data Preprocessing
    Darena, Frantisek
    Zizka, Jan
    [J]. ARTIFICIAL INTELLIGENCE PERSPECTIVES AND APPLICATIONS (CSOC2015), 2015, 347 : 141 - 150
  • [7] Data Preprocessing Method on Data Mining of Web Log File
    Li, Jia
    [J]. INTERNATIONAL CONFERENCE ON COMPUTATIONAL AND INFORMATION SCIENCES (ICCIS 2014), 2014, : 712 - 717
  • [8] Research and development of data preprocessing in Web Usage Mining
    Li Chaofeng
    [J]. PROCEEDINGS OF THE 2006 INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE AND ENGINEERING, 2006, : 1311 - 1315
  • [9] An effective Data Preprocessing method for Web Usage Mining
    Reddy, K. Sudheer
    Reddy, M. Kantha
    Sitaramulu, V.
    [J]. 2013 INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND EMBEDDED SYSTEMS (ICICES), 2013, : 7 - 10
  • [10] Advanced data preprocessing for intersites web usage mining
    Tanasa, D
    Trousse, B
    [J]. IEEE INTELLIGENT SYSTEMS, 2004, 19 (02) : 59 - 65