AUTOMATIC MAINTENANCE OF WEB DIRECTORIES BY MINING WEB BROWSING DATA

被引:0
|
作者
Hurtado, Carlos [1 ]
Mendoza, Marcelo [2 ]
机构
[1] Univ Adolfo Ibanez, Fac Sci & Engn, Santiago, Chile
[2] Univ Tecn Federico Santa Maria, Dept Comp Sci, Santiago, Chile
来源
JOURNAL OF WEB ENGINEERING | 2011年 / 10卷 / 02期
关键词
Web directories; Web Mining; Query Logs;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Web directories allow Web users to browse a hierarchy of categories, under which different types of resources are classified. We study the problem of maintaining a Web directory, that is, the problem of continually discovering and ranking resources that are relevant to the categories of the directory. We propose an unsupervised computational method that conducts the maintenance of the directory by analyses of user browsing data. The method is based on the extraction and classification of user sessions (sequences of resources selected by users) into the categories of the directory. In addition, we show that the directory maintenance method can be slightly modified to find queries that are useful to find relevant resources allowing users to switch from directory browsing to query formulation. Experimental results allow for affirmation that the proposed methods are effective, that they attain identification of new pages in each category and also recommend related queries with high precision, without; needing labeled data to conduct traditional web page and query classification tasks.
引用
收藏
页码:153 / 173
页数:21
相关论文
共 50 条
  • [1] A text mining approach on automatic generation of web directories and hierarchies
    Yang, HC
    Lee, CH
    [J]. IEEE/WIC INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE, PROCEEDINGS, 2003, : 625 - 628
  • [2] A text mining approach on automatic generation of web directories and hierarchies
    Yang, HC
    Lee, CH
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2004, 27 (04) : 645 - 663
  • [3] Web usage mining with intentional browsing data
    Tao, Yu-Hu
    Hong, Tzung-Pe
    Su, Yu-Ming
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2008, 34 (03) : 1893 - 1904
  • [4] Managing web repositories in emerging economies: Case studies of browsing web directories
    Chung, Wingyan
    [J]. INTERNATIONAL JOURNAL OF INFORMATION MANAGEMENT, 2012, 32 (03) : 232 - 238
  • [5] NEAR-Miner: Mining Evolution Associations of Web Site Directories for Efficient Maintenance of Web Archives
    Chen, Ling
    Bhowmick, Sourav S.
    Nejdl, Wolfgang
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2009, 2 (01):
  • [6] Web mining for browsing patterns
    Hong, TP
    Lin, KY
    Wang, SL
    [J]. KNOWLEDGE-BASED INTELLIGENT INFORMATION ENGINEERING SYSTEMS & ALLIED TECHNOLOGIES, PTS 1 AND 2, 2001, 69 : 495 - 499
  • [7] Personalizing Web Directories with the Aid of Web Usage Data
    Pierrakos, Dimitrios
    Paliouras, Georgios
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2010, 22 (09) : 1331 - 1344
  • [8] Automatic association of Web directories with word senses
    Santamaría, C
    Gonzalo, J
    Verdejo, F
    [J]. COMPUTATIONAL LINGUISTICS, 2003, 29 (03) : 485 - 502
  • [9] Data Preparation for Mining World Wide Web Browsing Patterns
    Robert Cooley
    Bamshad Mobasher
    Jaideep Srivastava
    [J]. Knowledge and Information Systems, 1999, 1 (1) : 5 - 32
  • [10] Mining of generalized web browsing patterns
    Wang, SL
    Lo, WS
    Hong, TP
    [J]. 7TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL II, PROCEEDINGS: COMPUTER SCIENCE AND ENGINEERING, 2003, : 267 - 271