SMARTCRAWLER: A PERSONALIZED WEB SEARCH FOR RELEVANT WEB PAGES

被引:0
|
作者
Wardekar, Arati Anilrao [1 ]
Gupta, Poonam [1 ]
机构
[1] GH Raisoni Coll Engn & Management, Pune 412207, Maharashtra, India
关键词
Web Crawler; Inner web; URL Feature selection; IP; Site frequency; Two-stage crawler; Site Ranking; Personalized web search;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
On web we can see that web pages are not indexed by crawling in speed, it was developed many crawlers to efficiently locate inner web interfaces, due to the large amount of resources in the network and the dynamic nature of the deep web, the better result is a challenging problem. To solve this problem, we propose a two-stage framework, mainly SmartCrawler, to relevantly finding a deep web. Smart-crawler gets the seed from the seed database. First stage, Smart Crawler performs the "reverse search" that matches the user's query in the URLs. In the second step, the "Incremental Site Prioritize" is perform in which the content of the query in the form matches. Then, according to frequency matching, sort relevant and irrelevant pages and rank this page. High-ranking pages are displayed on the results page. Our proposed crawler efficiently recovers deep interfaces from large databases and achieves a higher result than other developed crawlers. We have propose a comprehensive and customized search to improve performance by considering how long we keep the log file. Before viewing the query before entering the query in the search box that is the focus, enter the search box.
引用
收藏
页数:4
相关论文
共 50 条
  • [21] On Having Search Engines Deliver Hierarchies of Web Pages
    Jeong, Ok-Ran
    Han, Jiawei
    Kim, Won
    Lee, Eunseok
    JOURNAL OF OBJECT TECHNOLOGY, 2008, 7 (04): : 33 - 41
  • [22] Supporting Privacy Protection in Personalized Web Search
    Shou, Lidan
    Bai, He
    Chen, Ke
    Chen, Gang
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (02) : 453 - 467
  • [23] Personalized web search results with profile comparisons
    Lai, J
    Soh, B
    THIRD INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND APPLICATIONS, VOL 1, PROCEEDINGS, 2005, : 573 - 576
  • [24] Discovering Web Pages Censored by Search Engines in Japan
    Moroi, Takanori
    Yoshiura, Noriaki
    2008 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE FOR MODELLING CONTROL & AUTOMATION, VOLS 1 AND 2, 2008, : 1171 - 1176
  • [25] Personalized web search for improving retrieval effectiveness
    Liu, F
    Yu, C
    Meng, WY
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2004, 16 (01) : 28 - 40
  • [26] Personalized Search Strategies for Spatial Information on the Web
    Yang, Yanwu
    IEEE INTELLIGENT SYSTEMS, 2012, 27 (01) : 12 - 20
  • [27] Factic: Personalized Exploratory Search in the Semantic Web
    Tvarozek, Michal
    Bielikova, Maria
    WEB ENGINEERING, 2010, 6189 : 527 - 530
  • [28] Personalized web search using user profile
    Xu, Jingqiu
    Zhu, Zhengyu
    Ren, Xiang
    Tian, Yunyan
    Luo, Ying
    CIS: 2007 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY, PROCEEDINGS, 2007, : 222 - +
  • [29] Personalized Web Search Using Information Scent
    Chawla, Suruchi
    Bedi, Punam
    INNOVATIONS AND ADVANCED TECHNIQUES IN SYSTEMS, COMPUTING SCIENCES AND SOFTWARE ENGINEERING, 2008, : 483 - 488
  • [30] Effectively finding relevant Web pages from linkage information
    Hou, JY
    Zhang, YC
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2003, 15 (04) : 940 - 951