Context Based Relevance Evaluation of Web Documents

被引:0
|
作者
Gupta, Pooja [1 ]
机构
[1] GGSIPU, Maharaja Agrasen Inst Technol, New Delhi, India
来源
CONTEMPORARY COMPUTING | 2012年 / 306卷
关键词
Context; Web documents; Back-links; WWW; Relevance; Search Engine; Contextual Senses; Query Response;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Focused crawling is considered to be an important strategy to reduce search space and give more relevant links to a user, based on search queries. Existing web crawlers work only on the basis of full string matching of query keywords with words present in various tags or fields in the web pages. But a particular keyword can have different meanings in different contexts depending on its usage as verb, noun etc. For example fly refers to an insect if used as a noun and refers to an act of moving in the air if used as a verb. Most of the existing search engines work on semantic context, based on string matching of keywords but not based on contextual senses of keywords. Further, general crawling strategy of various crawlers is forward oriented, giving less consideration to the backward links of the page. There is a strong need to work on a crawling strategy that overcomes these gaps. In this paper a mechanism that evaluates the web document on the basis of contextual senses (verb, noun etc.) of the keywords contained in the downloaded page is being proposed. Moreover back-link to a web page has also been analyzed with reference to a specific page providing links related to the page. Consequently, more number of relevant links related to one topic is displayed to the user.
引用
收藏
页码:201 / 212
页数:12
相关论文
共 50 条
  • [1] Relevance of web documents: Ghosts consensus method
    Gorbunov, AL
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2002, 53 (10): : 783 - 788
  • [2] Predicting the Situational Relevance of Health Web Documents
    Oroszlanyova, Melinda
    Lopes, Carla Teixeira
    Nunes, Sergio
    Ribeiro, Cristina
    [J]. 2017 12TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI), 2017,
  • [3] The influence of documents, users and tasks on the relevance and comprehension of health web documents
    Oroszlanyova, Melinda
    Ribeiro, Cristina
    Nunes, Sergio
    Lopes, Carla Teixeira
    [J]. CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS/INTERNATIONAL CONFERENCE ON PROJECT MANAGEMENT/CONFERENCE ON HEALTH AND SOCIAL CARE INFORMATION SYSTEMS AND TECHNOLOGIES, CENTERIS/PROJMAN / HCIST 2015, 2015, 64 : 771 - 778
  • [4] Relevance Assessments for Web Search Evaluation: Should We Randomise or Prioritise the Pooled Documents?
    Sakai, Tetsuya
    Tao, Sijie
    Zeng, Zhaohao
    [J]. ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2022, 40 (04)
  • [5] Automated Subject Classification of Textual Documents in the Context of Web-Based Hierarchical Browsing
    Golub, Koraljka
    [J]. KNOWLEDGE ORGANIZATION, 2011, 38 (03): : 230 - 244
  • [6] An algorithm to cluster documents based on relevance
    Desai, M
    Spink, A
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2005, 41 (05) : 1035 - 1049
  • [7] Searching documents based on relevance and type
    Xu, Jun
    Cao, Yunbo
    Li, Hang
    Craswell, Nick
    Huang, Yalou
    [J]. ADVANCES IN INFORMATION RETRIEVAL, 2007, 4425 : 629 - +
  • [8] Evaluation of Triple Indices in Retrieving Web Documents
    Zulkefli, Nurul Syeilla Syazhween
    Abd Rahman, Nurazzah
    Abu Bakar, Zainab
    Nordin, Sharifalillah
    Sembok, Tengku Mohd Tengku
    Teo, Noor Hasimah Ibrahim
    [J]. 2013 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE APPLICATIONS AND TECHNOLOGIES (ACSAT), 2014, : 525 - 529
  • [9] What is a context of utterance? (Evaluation, relevance)
    Gauker, C
    [J]. PHILOSOPHICAL STUDIES, 1998, 91 (02) : 149 - 172
  • [10] Semantic based clustering of web documents
    Lin, TY
    Chiang, IJ
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, VOLS 1 AND 2, 2005, : 189 - 192