Review on Modern Data Preprocessing Techniques in Web Usage Mining (WUM)

Cited by: 0
Authors
Sukumar, P. [1]
Robert, L. [1]
Yuvaraj, S. [1]
Affiliation
[1] Govt Arts Coll, Dept CS, Coimbatore, Tamil Nadu, India
Keywords
WUM; Web mining; Web usage mining; Web log mining; Data Preprocessing; Data cleaning algorithms; User Identification algorithms; Session Identification algorithms;
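DOI
Not available
Chinese Library Classification (CLC)
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Subject classification code
0808; 0809
Abstract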
The web contains a huge amount of data that grows in volume and dimension day by day. Data mining applications that make use of Web data are referred to as Web mining, one of the hot topics in the field of data mining. Web mining is classified into three types according to the kind of knowledge extracted: Web structure mining, Web content mining, and Web usage mining. The Web usage mining process can be divided into three interdependent stages: data preprocessing, pattern discovery, and pattern analysis. This paper is mainly concerned with Web usage mining. Its contribution is an investigation of data preprocessing that assesses the effectiveness and limitations of the algorithms and verifies their standing. Various preprocessing algorithms and their heuristics are implemented in programming languages, applied, and examined. Data preprocessing algorithms parse the raw log files, which involves splitting the log files, and then cleanse them to obtain higher-quality data. From this data, the unique users are identified, which in turn helps to identify user sessions.
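The stages named in the abstract (parsing, cleaning, user identification, session identification) can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes log lines in the Apache combined log format, treats each distinct (IP address, user agent) pair as one user, and splits sessions with a 30-minute inactivity timeout. These are common heuristics in the WUM literature, chosen here only for illustration. All names (`LOG_RE`, `parse`, `clean`, `identify_users`, `sessionize`) are hypothetical.

```python
import re
from collections import defaultdict
from datetime import datetime, timedelta

# Assumed input: Apache combined log format (not specified in the abstract).
LOG_RE = re.compile(
    r'(?P<ip>\S+) \S+ \S+ \[(?P<ts>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<url>\S+) [^"]*" '
    r'(?P<status>\d{3}) \S+ "(?P<ref>[^"]*)" "(?P<agent>[^"]*)"'
)
# Assumed cleaning rules: embedded resources are not page views.
IGNORED_SUFFIXES = (".gif", ".jpg", ".jpeg", ".png", ".css", ".js", ".ico")
# Assumed session heuristic: 30 minutes of inactivity ends a session.
SESSION_TIMEOUT = timedelta(minutes=30)


def parse(lines):
    """Split each raw log line into named fields; skip malformed lines."""
    for line in lines:
        m = LOG_RE.match(line)
        if m:
            rec = m.groupdict()
            rec["ts"] = datetime.strptime(rec["ts"], "%d/%b/%Y:%H:%M:%S %z")
            yield rec


def clean(records):
    """Keep successful GET page requests that do not look like robot traffic."""
    for r in records:
        path = r["url"].split("?", 1)[0].lower()
        if r["method"] != "GET":
            continue
        if not r["status"].startswith("2"):
            continue
        if path.endswith(IGNORED_SUFFIXES):
            continue
        if "bot" in r["agent"].lower():
            continue
        yield r


def identify_users(records):
    """Group records by (IP, user agent), a simple user-identification heuristic."""
    users = defaultdict(list)
    for r in records:
        users[(r["ip"], r["agent"])].append(r)
    return users


def sessionize(users):
    """Split each user's requests into sessions using the inactivity timeout."""
    sessions = []
    for user, recs in users.items():
        recs.sort(key=lambda r: r["ts"])
        current = [recs[0]]
        for r in recs[1:]:
            if r["ts"] - current[-1]["ts"] > SESSION_TIMEOUT:
                sessions.append((user, current))
                current = []
            current.append(r)
        sessions.append((user, current))
    return sessions
```

The stages compose as `sessionize(identify_users(clean(parse(lines))))`, where `lines` is any iterable of raw log lines. Proxy servers and shared IP addresses weaken the (IP, user agent) heuristic, which is one reason the literature compares several user-identification algorithms.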
Pages: 64 - 69
Page count: 6
Related papers
50 in total
  • [21] A Unified Model for Preprocessing and Clustering Technique for Web Usage Mining
    Pandian, P. Senthil
    Srinivasan, S.
    JOURNAL OF MULTIPLE-VALUED LOGIC AND SOFT COMPUTING, 2016, 26 (3-5) : 205 - 220
  • [22] Active user-based and ontology-based web log data preprocessing for web usage mining
    Khasawneh, Natheer
    Chan, Chien-Chung
    2006 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE, (WI 2006 MAIN CONFERENCE PROCEEDINGS), 2006, : 325 - +
  • [23] Applying data mining techniques in intrusion detection system on web and analysis of web usage
    Al-Ahliyya Amman University, Amman, Jordan
    Unknown
    Inf. Technol. J., 2006, 1 (57-63):
  • [24] Study on Data Preprocessing Process in Web Mining
    Peng, Sumian
    Zhou, Xingmei
    PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON INFORMATION, ELECTRONIC AND COMPUTER SCIENCE, VOLS I AND II, 2009, : 19 - 22
  • [25] Data Preprocessing Algorithm for Web Structure Mining
    Sharma, Suvarn
    Bhagat, Amit
    2016 FIFTH INTERNATIONAL CONFERENCE ON ECO-FRIENDLY COMPUTING AND COMMUNICATION SYSTEMS (ICECCS), 2016, : 94 - 98
  • [26] User identification in the process of web usage data preprocessing
    Kapusta J.
    Munk M.
    Halvoník D.
    Drlík M.
    International Journal of Emerging Technologies in Learning, 2019, 14 (09) : 21 - 33
  • [27] User Identification in the Process of Web Usage Data Preprocessing
    Kapusta, Jozef
    Munk, Michal
    Halvonik, Dominik
    Drlik, Martin
    INTERNATIONAL JOURNAL OF EMERGING TECHNOLOGIES IN LEARNING, 2019, 14 (09) : 21 - 33
  • [28] A Survey on Web Usage Mining Techniques and Applications
    Suadaa, Lya Hulliyyatus
    2014 INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY SYSTEMS AND INNOVATION (ICITSI), 2014, : 39 - 43
  • [29] A Memory Efficient Algorithm with Enhance Preprocessing Technique for Web Usage Mining
    Pathak, Nisarg
    Shah, Viral
    Ajmeera, Chandramohan
    EMERGING ICT FOR BRIDGING THE FUTURE, VOL 2, 2015, 338 : 601 - 608
  • [30] Web usage data mining agent
    Madiraju, P
    Zhang, YQ
    DATA MINING AND KNOWLEDGE DISCOVERY: THEORY, TOOLS AND TECHNOLOGY IV, 2002, 4730 : 224 - 228