EMACrawler: web search engine database freshness optimization

被引:0
|
作者
Alanoglu, Zuelfue [1 ]
Akcayol, M. Ali [2 ]
机构
[1] Hatay Mustafa Kemal Univ, Antakya Meslek Yuksek Okulu, Bilgisayar Teknolojileri Bolumu, Antakya, Turkiye
[2] Gazi Univ, Muhendislik Fak, Bilgisayar Muhendisligi Bolumu, Ankara, Turkiye
关键词
Web crawler; update module; data collection; data indexing;
D O I
10.2339/politeknik.1347054
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In today's information and technology age, search engines have become an important part of our lives. However, search engines are the first to be used to access information, old and unnecessary information is included in the content offered to users. Regarding providing up-to-date data, today's search engines often cannot offer the desired success. In order to keep the data presented by web browsers up-to-date, the time of return visits must be accurately estimated. In this study, EMACrawler based on exponential moving averages is proposed to determine the revisit times, which is the most important feature that affects the performance of search engines. The proposed method is tested using precision, total coverage, and efficiency metrics. It has been seen that EMACrawler obtains the current data on the web pages accurately and quickly. As a result of the experimental studies, it has been seen that EMACrawler is more successful than other methods in obtaining up-to-date data and maintaining the freshness of the browser database.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] The freshness of web search engine databases
    Lewandowski, D
    Wahlig, H
    Meyer-Bautor, G
    [J]. JOURNAL OF INFORMATION SCIENCE, 2006, 32 (02) : 131 - 148
  • [2] Internet search engine freshness by web server help
    Gupta, V
    Campbell, R
    [J]. 2001 SYMPOSIUM ON APPLICATIONS AND THE INTERNET, PROCEEDINGS, 2001, : 113 - 119
  • [3] Analysis of Web freshness strategies and its improvement in search engine
    Wen, Kunmei
    Lu, Zhengding
    Ye, Weiguo
    Jin, Li
    [J]. Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2002, 30 (12):
  • [4] A cooperative schema between web sever and search engine for improving freshness of web repository
    College of Computer Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China
    [J]. Wuhan Univ J Nat Sci, 2006, 1 (11-14):
  • [6] A three-year study on the freshness of web search engine databases
    Lewandowski, Dirk
    [J]. JOURNAL OF INFORMATION SCIENCE, 2008, 34 (06) : 817 - 831
  • [7] Optimization of Web Search Engine and Its Application to Web Mining
    CHEN Hao1
    2. Software School
    3. Department of Computer Science and Technology
    [J]. Wuhan University Journal of Natural Sciences, 2009, 14 (02) : 115 - 118
  • [8] Search Engine Optimization: An Analysis of Rhinoplasty Web sites
    Rayess, Hani M.
    Gupta, Amar
    Nissan, Michael
    Carron, Michael A.
    Zuliani, Giancarlo F.
    [J]. FACIAL PLASTIC SURGERY, 2017, 33 (06) : 665 - 669
  • [9] Overlapping factors in search engine optimization and web accessibility
    Moreno, Lourdes
    Martinez, Paloma
    [J]. ONLINE INFORMATION REVIEW, 2013, 37 (04) : 564 - 580
  • [10] Analyzing and Classifying User Search Histories for Web Search Engine Optimization
    Kurian, Archana
    Jayasree, M.
    [J]. 2014 3RD INTERNATIONAL CONFERENCE ON ECO-FRIENDLY COMPUTING AND COMMUNICATION SYSTEMS (ICECCS 2014), 2014, : 39 - 44