Determining the titles of Web pages using anchor text and link analysis

被引:6
|
作者
Jeong, Ok-Ran [1 ]
Oh, Jehwan [2 ]
Kim, Dong-Jin
Lyu, Heetae [3 ]
Kim, Won [1 ]
机构
[1] Gachon Univ, Dept Software Design & Management, Songnam, South Korea
[2] Univ Minnesota, Dept Comp Sci, Minneapolis, MN 55455 USA
[3] Naver Corp, Songnam, South Korea
基金
新加坡国家研究基金会;
关键词
Anchor text; Link analysis; Title extraction; Web page;
D O I
10.1016/j.eswa.2013.12.033
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Determining the titles of Web pages is an important element in characterizing and categorizing the vast number of Web pages. There are a few approaches to automatically determining the titles of Web pages. As an R&D project for Naver, the operator of Naver (Korea's largest portal site), we developed a new method that makes use of anchor texts and analysis of links among Web pages. In this paper, we describe our method and show experiment results of its performance. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:4322 / 4329
页数:8
相关论文
共 50 条
  • [31] Using anchor text to improve web page title in process of search engine optimization
    Matosevic, Goran
    [J]. CENTRAL EUROPEAN CONFERENCE ON INFORMATION AND INTELLIGENT SYSTEMS, 2015, 2015, : 173 - 176
  • [32] Collecting topic-related web pages for link structure analysis by using a potential hub and authority first approach
    Wang, LH
    Lee, TW
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2005, 3518 : 832 - 837
  • [33] Local and global topics in text modeling of web pages nested in web sites*
    Wang, Jason
    Weiss, Robert E.
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2022, 173
  • [34] Multiresolution Web Link Analysis Using Generalized Link Relations
    Park, Laurence A. F.
    Ramamohanarao, Kotagiri
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2011, 23 (11) : 1691 - 1703
  • [35] Using the web infrastructure to preserve web pages
    Nelson, Michael L.
    McCown, Frank
    Smith, Joan A.
    Klein, Martin
    [J]. INTERNATIONAL JOURNAL ON DIGITAL LIBRARIES, 2007, 6 (04) : 327 - 349
  • [36] Extracting Topic Maps from Web Pages by Web Link Structure and Content
    Mase, Motohiro
    Yamada, Seiji
    Nitta, Katsumi
    [J]. 2008 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-8, 2008, : 1232 - +
  • [37] How people recognise previously seen Web pages from titles, URLs and thumbnails
    Kaasten, S
    Greenberg, S
    Edwards, C
    [J]. PEOPLE AND COMPUTERS XVI- MEMORABLE YET INVISIBLE, PROCEEDINGS, 2002, : 247 - 265
  • [38] Dictionary-based text categorization of chemical web pages
    Liang, CY
    Guo, L
    Xia, ZH
    Nie, FG
    Li, XX
    Su, LA
    Yang, ZY
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2006, 42 (04) : 1017 - 1029
  • [39] Individual differences in searching and reading Chinese text on web pages
    Xuan, YM
    Fu, XL
    [J]. INTERNATIONAL JOURNAL OF PSYCHOLOGY, 2000, 35 (3-4) : 151 - 151
  • [40] TEXT: Automatic Template Extraction from Heterogeneous Web Pages
    Kim, Chulyun
    Shim, Kyuseok
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2011, 23 (04) : 612 - 626