Hybrid Algorithm on Semantic Web Crawler for Search Engine to Improve Memory Space and Time

被引:1
|
作者
Lambhate, Poonam [1 ]
Hambarde, Aparna [2 ]
Emmanuel, M. [3 ]
Hambarde, Shailesh [1 ]
机构
[1] Savitribai Phule Pune Univ, JSPMs JSCOE, Handewadi Rd, Pune 28, Maharashtra, India
[2] Savitribai Phule Pune Univ, KJCOEMR, Yewalewadi Rd, Pune 48, Maharashtra, India
[3] Savitribai Phule Pune Univ, PICT, Pune 43, Maharashtra, India
关键词
Internet; Search Engine; Crawler; Semantics;
D O I
10.1109/I2CT51068.2021.9418139
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
World Wide Web is a colossal reservoir of hyperlink documents, these hyperlinks documents lay the foundation for communication on this omnipresent computing world. An acute need has arisen to develop and modify or design search algorithms that helps in efficiently and competently searching the specific required data from the huge repository available. A variety of search engines employ diverse web crawlers for obtaining search results efficiently. A variety of search engines use diverse web crawlers for obtaining search results proficiently. Some search engines use focused web crawler that collects different web pages that usually satisfy some specific property, by effectively prioritizing the crawler frontier and managing the exploration process for hyperlink. In this paper the main objective of this research is identifying the bottlenecks in the conventional framework. It conceives the experiment architecture which will showcase an improved technique to speed up the crawling process. This will lay the foundation for a future generation of current web. It will expand the horizon of crawling to diversify its application to the industry specific crawling mechanism. Shannon gain algorithm is used to determine the Threshold value of dynamic dataset. Experiments have been conducted on Apriori, Eclat, Declat Algorithms, Proposed hybrid algorithm. The comparative assessment of memory usage reflects the minimal consumption by the hybrid architecture. One of the main features of our proposed Method is its ability to tunnel through pages with a low score.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] An Enhanced Semantic Focused Web Crawler Based on Hybrid String Matching Algorithm
    Prabha, K. S. Sakunthala
    Mahesh, C.
    Raja, S. P.
    CYBERNETICS AND INFORMATION TECHNOLOGIES, 2021, 21 (02) : 105 - 120
  • [2] An Efficient Hybrid User Profile Based Web Search Personalization Through Semantic Crawler
    Jaytrilok Choudhary
    Deepak Singh Tomar
    Dhirendra Pratap Singh
    National Academy Science Letters, 2019, 42 : 105 - 108
  • [3] An Efficient Hybrid User Profile Based Web Search Personalization Through Semantic Crawler
    Choudhary, Jaytrilok
    Tomar, Deepak Singh
    Singh, Dhirendra Pratap
    NATIONAL ACADEMY SCIENCE LETTERS-INDIA, 2019, 42 (02): : 105 - 108
  • [4] An Improved Search Algorithm of Focused Crawler in Vertical Search Engine
    Zuo, Xiao-jun
    Zhang, Kai-tuo
    ASIA-PACIFIC YOUTH CONFERENCE ON COMMUNICATION TECHNOLOGY 2010 (APYCCT 2010), 2010, : 509 - +
  • [5] SemSearch: A search engine for the semantic web
    Lei, Yuangui
    Uren, Victoria
    Motta, Enrico
    MANAGING KNOWLEDGE IN A WORLD OF NETWORKS, PROCEEDINGS, 2006, 4248 : 238 - 245
  • [6] Research on Tibetan News Sites' Web Crawler and Search Engine
    Han Zhiqiang
    Xu Guixian
    Sun Wei
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON LOGISTICS, ENGINEERING, MANAGEMENT AND COMPUTER SCIENCE (LEMCS 2015), 2015, 117 : 607 - 611
  • [7] Semantic Web Service Discovery and Integration using Service Search Crawler
    Kaewmarin, Viriya
    Arch-int, Ngamnij
    Arch-int, Somjit
    2008 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE FOR MODELLING CONTROL & AUTOMATION, VOLS 1 AND 2, 2008, : 884 - 888
  • [8] The Research of Search Engine Based on Semantic Web
    Jin, Yi
    Lin, Zhuying
    Lin, Hongwei
    2008 INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION WORKSHOP: IITA 2008 WORKSHOPS, PROCEEDINGS, 2008, : 360 - 363
  • [9] Querying the semantic web with Corese search engine
    Corby, O
    Dieng-Kuntz, R
    Faron-Zucker, C
    ECAI 2004: 16TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2004, 110 : 705 - 709
  • [10] Interface Features of Semantic Web Search Engine
    Azizan, Azilawati
    Abu Bakar, Zainab
    Ismail, Normaly Kamal
    Amran, Mohd Firdaus
    2013 IEEE CONFERENCE ON E-LEARNING, E-MANAGEMENT AND E-SERVICES (IC3E), 2013, : 142 - 147