Automatic discovery of the sequential accesses from web log data files via a genetic algorithm

Cited by: 11
Authors
Tug, Emine [1]
Sakiroglu, Merve [1]
Arslan, Ahmet [1]
Affiliations
[1] Selcuk Univ, Dept Comp Sci, Konya 42300, Turkey
Keywords
web mining; genetic algorithm; knowledge discovery; sequential access
DOI
10.1016/j.knosys.2005.10.008
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper is concerned with finding sequential accesses in web log files using a genetic algorithm (GA). Web log files are independent of the server and are stored in ASCII format. Every transaction, whether completed or not, is recorded in the web log files, and these files are unstructured from the standpoint of knowledge discovery in databases techniques. The data stored in web logs has become important for discovering user behavior as internet use has increased rapidly. Analyzing these log files is one of the important research areas of web mining. In particular, with the advent of CRM (Customer Relationship Management) issues in business circles, most modern firms operating web sites for various purposes are now adopting web mining as a strategic way of capturing knowledge about the potential needs of target customers, future market trends, and other management factors. Our work, ALMG (Automatic Log Mining via Genetic), mines web log files via a genetic algorithm. A survey of the web mining literature shows that GA is generally used in web content mining and web structure mining. ALMG, by contrast, is a study in web usage mining, and this is what distinguishes it from similar works in the literature. In another work that we encountered, GA is used to process the data between HTML tags located on the client PC, whereas ALMG extracts information from data located on the server. We consider using log files an advantage for our purpose, because we characterize the requests made to the server rather than detect a single person's behavior. We developed an application for this purpose: it first analyzes web log files and then automatically finds groups of sequentially accessed pages. (c) 2005 Elsevier B.V. All rights reserved.
Pages: 180-186
Number of pages: 7