Determining the titles of Web pages using anchor text and link analysis

Cited by: 6

Authors:

Jeong, Ok-Ran [1]

Oh, Jehwan [2]

Kim, Dong-Jin

Lyu, Heetae [3]

Kim, Won [1]

Affiliations:

[1] Gachon Univ, Dept Software Design & Management, Songnam, South Korea

[2] Univ Minnesota, Dept Comp Sci, Minneapolis, MN 55455 USA

[3] Naver Corp, Songnam, South Korea

Source:

EXPERT SYSTEMS WITH APPLICATIONS | 2014, Vol. 41, Issue 09

Funding:

National Research Foundation of Singapore;

Keywords:

Anchor text; Link analysis; Title extraction; Web page;

DOI:

10.1016/j.eswa.2013.12.033

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Determining the titles of Web pages is an important element in characterizing and categorizing the vast number of Web pages. There are a few approaches to automatically determining the titles of Web pages. As an R&D project for Naver Corp., the operator of Naver (Korea's largest portal site), we developed a new method that makes use of anchor texts and analysis of links among Web pages. In this paper, we describe our method and show experimental results of its performance. © 2014 Elsevier Ltd. All rights reserved.


Pages: 4322-4329

Page count: 8

50 items in total

[31] Using anchor text to improve web page title in process of search engine optimization
Matosevic, Goran
[J]. CENTRAL EUROPEAN CONFERENCE ON INFORMATION AND INTELLIGENT SYSTEMS, 2015, 2015, : 173 - 176
[32] Collecting topic-related web pages for link structure analysis by using a potential hub and authority first approach
Wang, LH
Lee, TW
[J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2005, 3518 : 832 - 837
[33] Local and global topics in text modeling of web pages nested in web sites
Wang, Jason
Weiss, Robert E.
[J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2022, 173
[34] Multiresolution Web Link Analysis Using Generalized Link Relations
Park, Laurence A. F.
Ramamohanarao, Kotagiri
[J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2011, 23 (11) : 1691 - 1703
[35] Using the web infrastructure to preserve web pages
Nelson, Michael L.
McCown, Frank
Smith, Joan A.
Klein, Martin
[J]. INTERNATIONAL JOURNAL ON DIGITAL LIBRARIES, 2007, 6 (04) : 327 - 349
[36] Extracting Topic Maps from Web Pages by Web Link Structure and Content
Mase, Motohiro
Yamada, Seiji
Nitta, Katsumi
[J]. 2008 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-8, 2008, : 1232 - +
[37] How people recognise previously seen Web pages from titles, URLs and thumbnails
Kaasten, S
Greenberg, S
Edwards, C
[J]. PEOPLE AND COMPUTERS XVI- MEMORABLE YET INVISIBLE, PROCEEDINGS, 2002, : 247 - 265
[38] Dictionary-based text categorization of chemical web pages
Liang, CY
Guo, L
Xia, ZH
Nie, FG
Li, XX
Su, LA
Yang, ZY
[J]. INFORMATION PROCESSING & MANAGEMENT, 2006, 42 (04) : 1017 - 1029
[39] Individual differences in searching and reading Chinese text on web pages
Xuan, YM
Fu, XL
[J]. INTERNATIONAL JOURNAL OF PSYCHOLOGY, 2000, 35 (3-4) : 151 - 151
[40] TEXT: Automatic Template Extraction from Heterogeneous Web Pages
Kim, Chulyun
Shim, Kyuseok
[J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2011, 23 (04) : 612 - 626
