SMARTCRAWLER: A PERSONALIZED WEB SEARCH FOR RELEVANT WEB PAGES

Authors
Wardekar, Arati Anilrao [1]
Gupta, Poonam [1]
Affiliations
[1] GH Raisoni Coll Engn & Management, Pune 412207, Maharashtra, India
Keywords
Web Crawler; Inner web; URL Feature selection; IP; Site frequency; Two-stage crawler; Site Ranking; Personalized web search
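DOI
Not available
Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract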
On the web, crawling does not index pages quickly, and many crawlers have been developed to locate inner-web (deep-web) interfaces efficiently. Because of the large volume of resources on the network and the dynamic nature of the deep web, obtaining good results remains a challenging problem. To address it, we propose a two-stage framework, SmartCrawler, for finding relevant deep-web pages. SmartCrawler takes its seeds from a seed database. In the first stage, SmartCrawler performs a "reverse search" that matches the user's query against URLs. In the second stage, "Incremental Site Prioritize" is performed, in which the query is matched against the content of forms. Pages are then sorted into relevant and irrelevant according to match frequency and ranked, and high-ranking pages are displayed on the results page. The proposed crawler efficiently retrieves deep-web interfaces from large databases and achieves better results than other existing crawlers. We also propose a comprehensive, personalized search that improves performance by taking into account how long the log file is kept. Pre-query results are shown when the search box receives focus, before the user enters a query.
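
The two stages described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names (`tokenize`, `stage_one_url_match`, `stage_two_rank`), the in-memory seed list, the sample URLs and the simple term-count score are all assumptions made for illustration, and the abstract does not specify how "reverse search" or "Incremental Site Prioritize" are actually computed.

```python
import re
from collections import Counter


def tokenize(text):
    """Split text into lowercase alphanumeric terms (an assumed tokenizer)."""
    return re.findall(r"[a-z0-9]+", text.lower())


def stage_one_url_match(query, seed_urls):
    """Stage 1 (assumed reading of "reverse search"):
    keep the seed URLs that contain at least one query term."""
    terms = set(tokenize(query))
    return [url for url in seed_urls if terms & set(tokenize(url))]


def stage_two_rank(query, pages):
    """Stage 2 (assumed reading of frequency matching):
    score each page by how often the query terms occur in its content,
    split pages into relevant (score > 0) and irrelevant (score == 0),
    and rank the relevant ones by descending score."""
    terms = set(tokenize(query))
    scored = []
    for url, content in pages.items():
        counts = Counter(tokenize(content))
        scored.append((url, sum(counts[t] for t in terms)))
    relevant = sorted((p for p in scored if p[1] > 0),
                      key=lambda p: (-p[1], p[0]))
    irrelevant = sorted(url for url, score in scored if score == 0)
    return relevant, irrelevant


# Hypothetical seed database and page contents, for illustration only.
seed_urls = [
    "http://example.com/books/search",
    "http://example.com/cars",
    "http://example.org/book-store",
]
candidates = stage_one_url_match("book search", seed_urls)
pages = {
    "http://example.com/books/search": "Search our book catalog. Book search by title.",
    "http://example.org/book-store": "Store hours and directions.",
}
relevant, irrelevant = stage_two_rank("book search", pages)
print(candidates)
print(relevant, irrelevant)
```

In this sketch a real crawler would fetch the page contents for the stage-one candidates before ranking them; the fetch step is omitted so the example stays self-contained.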
Pages: 4