Using Anchor Text Refined by Page Importance to Improve Web Retrieval

被引:0
|
作者
Zhang, Yonggang [1 ]
Lei, Kai [1 ]
Huang, Lian'en [1 ]
机构
[1] Peking Univ, Shenzhen Grad Sch, Shenzhen Key Lab Cloud Comp Technol & Applicat SP, Shenzhen 518055, Guangdong, Peoples R China
关键词
component; Web Retrieval; Anchor Text; Page Importance;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
As important part of web page contents, anchor texts have been widely used and proved to be useful in web information retrieval systems, especially for the navigational queries. But previous work only focused on how to determine the importance of anchor texts for a given destination page. From global perspective, web link-structure is very useful to determine page importance, and this paper proposes a method that combines page importance and relevance between anchor texts and their destination pages together to build new anchor-based retrieval models. Experimental results show that the combined models are better than the models which only consider the relevance between anchor texts and their destination pages.
引用
收藏
页码:1200 / 1203
页数:4
相关论文
共 50 条
  • [31] Using scatterplots to understand and improve probabilistic models for text categorization and retrieval
    Di Nunzio, Giorgio Maria
    [J]. INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2009, 50 (07) : 945 - 956
  • [32] THE OPEN REVOLUTION: USING CITATION ANALYSIS TO IMPROVE LEGAL TEXT RETRIEVAL
    Geist, Anton
    [J]. EUROPEAN JOURNAL OF LEGAL STUDIES, 2010, 2 (03): : 137 - 145
  • [33] A novel web page text information extraction method
    Wang, Chongjun
    Wei, Peng
    [J]. PROCEEDINGS OF 2019 IEEE 3RD INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2019), 2019, : 2213 - 2218
  • [34] Chinese web page classification based on text contents
    Liang, JZ
    [J]. ISTM/2003: 5TH INTERNATIONAL SYMPOSIUM ON TEST AND MEASUREMENT, VOLS 1-6, CONFERENCE PROCEEDINGS, 2003, : 4733 - 4736
  • [35] Presenting a Way to Improve Web Page Ranking Algorithm Using Firefly Algorithm
    Karimi, Fariba
    Iran, Damavand
    Harounabadi, Ali
    Mirabedini, Seyed Javad
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2016, 16 (12): : 37 - 42
  • [36] Improving Web Page Retrieval using Search Context from Clicked Domain Names
    Li, Rongmei
    [J]. PROCEEDINGS OF THE 20TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATION, 2009, : 393 - 397
  • [37] Detecting Website Defacement Attacks using Web-page Text and Image Features
    Trong Hung Nguyen
    Xuan Dau Hoang
    Duc Dung Nguyen
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (07) : 215 - 222
  • [38] Web Forum Retrieval and Text Analytics: a Survey
    Hoogeveen, Doris
    Wang, Li
    Baldwin, Timothy
    Verspoor, Karin M.
    [J]. FOUNDATIONS AND TRENDS IN INFORMATION RETRIEVAL, 2018, 12 (01): : 2 - +
  • [39] A COMPARATIVE ANALYSIS OF CLICKSTREAM AS WEB PAGE IMPORTANCE METRIC
    Surya, Anupama
    Sharma, Dilip Kumar
    [J]. 2013 IEEE CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES (ICT 2013), 2013, : 776 - 781
  • [40] Development of Web Robot using the full-text retrieval software JTOPIC
    Imada, H
    Araki, Y
    Aoki, N
    [J]. NEC RESEARCH & DEVELOPMENT, 1998, 39 (01): : 55 - 60