Automatic discovery of the sequential accesses from web log data files via a genetic algorithm

被引:11
|
作者
Tug, Emine [1 ]
Sakiroglu, Merve [1 ]
Arslan, Ahmet [1 ]
机构
[1] Selcuk Univ, Dept Comp Sci, Konya 42300, Turkey
关键词
web mining; genetic algorithm; knowledge discovery; sequential access;
D O I
10.1016/j.knosys.2005.10.008
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper is concerned with finding sequential accesses from web log files, using 'Genetic Algorithm' (GA). Web log files are independent from servers, and they are ASCII format. Each transaction, whether completed or not, is recorded in the web log files and these files are unstructured for knowledge discovery in database techniques. Data which is stored in web logs have become important for discovering of user behaviors since the using of internet increased rapidly. Analyzing of these log files is one of the important research area of web mining. Especially, with the advent of CRM (Customer Resource Management) issues in business circle, most of the modem firms operating web sites for several purposes are now adopting web-mining as a strategic way of capturing knowledge about potential needs of target customers, future trends in the market and other management factors. Our work (ALMG-Automatic Log Mining via Genetic) has mined web log files via genetic algorithm. When we search the studies about web mining in literature, it can be seen that, GA is generally used in web content and web structure mining. On the other hand, ALMG is a study about web mining usage. The difference between ALMG and other similar works at literature is this point. As for in another work that we are encountering, GA is used for processing the data between HTML tags which are placed at client PC. But ALMG extracts information from data which is placed at server. It is thought to use log files is an advantage for our purpose. Because, we find the character of requests which is made to the server than detect a single person's behavior. We developed an application with this purpose. Firstly, the application is analyzed web log files, than found sequential accessed page groups automatically. (c) 2005 Elsevier B.V. All rights reserved.
引用
收藏
页码:180 / 186
页数:7
相关论文
共 50 条
  • [1] Web User Navigation Patterns Discovery from WWW Server Log Files
    Weichbroth, Pawel
    Owoc, Mieczyslaw
    Pleszkun, Michal
    2012 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (FEDCSIS), 2012, : 1171 - 1176
  • [2] The Research of Preprocessing and Pattern Discovery Techniques on Web Log files
    Dhanalakshmi, P.
    Ramani, K.
    Reddy, B. Eswara
    2016 IEEE 6TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING (IACC), 2016, : 139 - 145
  • [3] Data Mining Algorithms for Knowledge Extraction from Web Log Files
    El Alami, Anass Abdelhamid
    Ezzikouri, Hanane
    Erritali, Mohammed
    ADVANCED INTELLIGENT SYSTEMS FOR SUSTAINABLE DEVELOPMENT (AI2SD'2019): VOL 1 - ADVANCED INTELLIGENT SYSTEMS FOR EDUCATION AND INTELLIGENT LEARNING SYSTEM, 2020, 1102 : 118 - 128
  • [4] Data Mining in the SIMBAD Database Web Log Files
    Wenger, Marc
    Oberto, Anais
    ASTRONOMICAL DATA ANALYSIS SOFTWARE AND SYSTEMS XIX, 2010, 434 : 453 - 456
  • [5] A wavelet-based approach for business protocol discovery of web services from log files
    Moudjari, A.
    Kezzouli, I
    Talbi, H.
    Draa, A.
    BULLETIN OF THE POLISH ACADEMY OF SCIENCES-TECHNICAL SCIENCES, 2019, 67 (03) : 535 - 546
  • [6] Data preparation of web log files for marketing aspects analyses
    Reichle, Meike
    Perner, Petra
    Althoff, Klaus-Dieter
    ADVANCES IN DATA MINING: APPLICATIONS IN MEDICINE, WEB MINING, MARKETING, IMAGE AND SIGNAL MINING, 2006, 4065 : 131 - 145
  • [7] Algorithm Research on User Interests Extracting via Web Log Data
    Wang, Shuqing
    She, Li
    Liu, Zhen
    Fu, Yan
    WISM: 2009 INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS AND MINING, PROCEEDINGS, 2009, : 93 - 97
  • [8] Gathering and mining information from Web log files
    Agosti, Maristella
    Di Nunzio, Giorgio Maria
    DIGITAL LIBRARIES: RESEARCH AND DEVELOPMENT, 2007, 4877 : 104 - 113
  • [9] Discovery of interesting association rules from Livelink web log data
    Huang, XG
    An, AJ
    Cercone, N
    Promhouse, G
    2002 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2002, : 763 - 766
  • [10] Automatic Attribute Discovery and Characterization from Noisy Web Data
    Berg, Tamara L.
    Berg, Alexander C.
    Shih, Jonathan
    COMPUTER VISION-ECCV 2010, PT I, 2010, 6311 : 663 - 676