Review on Modern Data Preprocessing Techniques in Web Usage Mining (WUM)

被引:0
|
作者
Sukumar, P. [1 ]
Robert, L. [1 ]
Yuvaraj, S. [1 ]
机构
[1] Govt Arts Coll, Dept CS, Coimbatore, Tamil Nadu, India
关键词
WUM; Web mining; Web usage mining; Web log mining; Data Preprocessing; Data cleaning algorithms; User Identification algorithms; Session Identification algorithms;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The web contains huge amount of data that is increasing in volume and dimension day by day. Data mining applications that make use of Web data is referred as Web mining. Web mining is one of the hot topics in the field of data mining. Web mining is classified into three types based on extracting knowledge. They are Web Structure mining, Web content mining the Web usage mining. Web usage mining process can be divided into three interdependent stages: data preprocessing, pattern discovery and pattern analysis. This paper is mainly related to web usage mining. The contribution of this paper is based on the investigation of data preprocessing and is used to determine the effectiveness of the algorithms, its limitations, and their stands are verified. Various preprocessing algorithms and its heuristics are applied and examined by implemented using programming languages. Data preprocessing algorithms are used to parse the raw log files that involve splitting of the log files and then cleansed to obtain superior quality of data. Based on this data, the unique users are identified which in turn helps to identify user sessions.
引用
下载
收藏
页码:64 / 69
页数:6
相关论文
共 50 条
  • [1] An overview of data preprocessing in data and web usage mining
    Suresh, R. M.
    Padmajavalli, R.
    2006 1ST INTERNATIONAL CONFERENCE ON DIGITAL INFORMATION MANAGEMENT, 2006, : 193 - +
  • [2] Research and development of data preprocessing in Web Usage Mining
    Li Chaofeng
    PROCEEDINGS OF THE 2006 INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE AND ENGINEERING, 2006, : 1311 - 1315
  • [3] Advanced data preprocessing for intersites web usage mining
    Tanasa, D
    Trousse, B
    IEEE INTELLIGENT SYSTEMS, 2004, 19 (02) : 59 - 65
  • [4] An effective Data Preprocessing method for Web Usage Mining
    Reddy, K. Sudheer
    Reddy, M. Kantha
    Sitaramulu, V.
    2013 INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND EMBEDDED SYSTEMS (ICICES), 2013, : 7 - 10
  • [5] A Review Paper on Data Preprocessing: A Critical Phase in Web Usage Mining Process
    Dwivedi, Sanjay Kumar
    Rawat, Bhupesh
    2015 INTERNATIONAL CONFERENCE ON GREEN COMPUTING AND INTERNET OF THINGS (ICGCIOT), 2015, : 506 - 510
  • [6] Web Usage Mining Data Preprocessing and Multi Level Analysis on Moodle
    Sael, Nawal
    Marzak, Abdelaziz
    Behja, Hicham
    2013 ACS INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2013,
  • [7] An Inclusive Survey on Data Preprocessing Methods Used in Web Usage Mining
    Bakariya, Brijesh
    Mohbey, Krishna K.
    Thakur, G. S.
    PROCEEDINGS OF SEVENTH INTERNATIONAL CONFERENCE ON BIO-INSPIRED COMPUTING: THEORIES AND APPLICATIONS (BIC-TA 2012), VOL 2, 2013, 202 : 407 - 416
  • [8] The Integrating Between Web Usage Mining and Data Mining Techniques
    Nassar, Omer Adel
    Al Saiyd, Nedhal A.
    2013 5TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY (CSIT), 2013, : 243 - 247
  • [9] Data Preprocessing for Web Data Mining
    Zhang, Wei
    Chen, Tinggui
    ADVANCES IN ELECTRONIC COMMERCE, WEB APPLICATION AND COMMUNICATION, VOL 2, 2012, 149 : 303 - +
  • [10] Web Usage Mining: A Review on Process, Methods and Techniques
    Varnagar, Chintan R.
    Madhak, Nirali N.
    Kodinariya, Trupti M.
    Rathod, Jayesh N.
    2013 INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND EMBEDDED SYSTEMS (ICICES), 2013, : 40 - 46