Crawler by Contextual Inference

被引:0
|
作者
Hegade P. [1 ]
Lingadhal N. [1 ]
Jain S. [1 ]
Khan U. [1 ]
Vijeth K.L. [1 ]
机构
[1] School of Computer Science and Engineering, KLE Technological University, Vidyanagar, Hubli
关键词
Contextual; Crawlers; Inferences; Web;
D O I
10.1007/s42979-021-00574-z
中图分类号
学科分类号
摘要
With a million new pages getting added every single day, the already gigantic web is growing exponentially. While it challenges the search engine and traditional information retrieval methods in producing the relevant results, so does the crawler, which does the background job of traversing the web with hyperlink structure to obtain the web snapshot. The traditional crawlers put forth the challenges of maintaining an appropriate traversal data structure and tracking the already visited pages. Contemporary applications require context and domain-specific crawlers that harvest the right set of pages and data. A focused crawler needs to have domain-specific evaluation parameters to evaluate and crawl the right set of pages based on relevance. This paper proposes a novel model—Crawler by Contextual Inference to achieve the said objectives using semantic similarity, paradigmatic similarity, and inference rules. The proposed methodology prioritizes the links based on the number of new rules built or discovered using a similarity matrix to generate inference rules. The model proposes an efficient data structure—an intelligent queue, which holds the links on a priority basis. The paper also presents the results in comparison with the traditional crawler, crawler by inference, and our model—crawler by contextual inference. The model promises to produce better results by avoiding the crawl of irrelevant pages. © 2021, The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd.
引用
收藏
相关论文
共 50 条
  • [1] Contextual Tag Inference
    Mandel, Michael I.
    Pascanu, Razvan
    Eck, Douglas
    Bengio, Yoshua
    Aiello, Luca M.
    Schifanella, Rossano
    Menczer, Filippo
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2011, 7 (01)
  • [2] A Proposal for Contextual Grammatical Inference
    Becerra-Bonache, Leonor
    Galvan, Maria
    Jacquenet, Francois
    [J]. ADVANCES IN COMPUTATIONAL INTELLIGENCE, PT I (IWANN 2015), 2015, 9094 : 18 - 28
  • [3] Object Detection with Contextual Inference
    Kalaycilar, Firat
    Aksoy, Selim
    [J]. 2009 IEEE 17TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, VOLS 1 AND 2, 2009, : 539 - 542
  • [4] Mutual boosting for contextual inference
    Fink, M
    Perona, P
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 16, 2004, 16 : 1515 - 1522
  • [5] Contextual inference in learning and memory
    Heald, James B.
    Lengyel, Mate
    Wolpert, Daniel M.
    [J]. TRENDS IN COGNITIVE SCIENCES, 2023, 27 (01) : 43 - 64
  • [6] Post-Contextual-Bandit Inference
    Bibaut, Aurelien
    Dimakopoulou, Maria
    Kallus, Nathan
    Chambaz, Antoine
    van der Laan, Mark
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [7] Contextual perception under active inference
    Mirza, M. Berk
    Cullen, Maell
    Parr, Thomas
    Shergill, Sukhi
    Moran, Rosalyn J.
    [J]. SCIENTIFIC REPORTS, 2021, 11 (01)
  • [8] Exploiting contextual independence in probabilistic inference
    Poole, D
    Zhang, NLW
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2003, 18 : 263 - 313
  • [9] STATISTICAL-INFERENCE IN CONTEXTUAL ANALYSIS
    STIPAK, B
    HENSLER, C
    [J]. AMERICAN JOURNAL OF POLITICAL SCIENCE, 1982, 26 (01) : 151 - 175
  • [10] Contextual perception under active inference
    M. Berk Mirza
    Maell Cullen
    Thomas Parr
    Sukhi Shergill
    Rosalyn J. Moran
    [J]. Scientific Reports, 11