Context Based Relevance Evaluation of Web Documents

被引:0
|
作者
Gupta, Pooja [1 ]
机构
[1] GGSIPU, Maharaja Agrasen Inst Technol, New Delhi, India
来源
CONTEMPORARY COMPUTING | 2012年 / 306卷
关键词
Context; Web documents; Back-links; WWW; Relevance; Search Engine; Contextual Senses; Query Response;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Focused crawling is considered to be an important strategy to reduce search space and give more relevant links to a user, based on search queries. Existing web crawlers work only on the basis of full string matching of query keywords with words present in various tags or fields in the web pages. But a particular keyword can have different meanings in different contexts depending on its usage as verb, noun etc. For example fly refers to an insect if used as a noun and refers to an act of moving in the air if used as a verb. Most of the existing search engines work on semantic context, based on string matching of keywords but not based on contextual senses of keywords. Further, general crawling strategy of various crawlers is forward oriented, giving less consideration to the backward links of the page. There is a strong need to work on a crawling strategy that overcomes these gaps. In this paper a mechanism that evaluates the web document on the basis of contextual senses (verb, noun etc.) of the keywords contained in the downloaded page is being proposed. Moreover back-link to a web page has also been analyzed with reference to a specific page providing links related to the page. Consequently, more number of relevant links related to one topic is displayed to the user.
引用
收藏
页码:201 / 212
页数:12
相关论文
共 50 条
  • [41] Ontology-based automatic classification of web documents
    Song, MuHee
    Lim, SooYeon
    Kang, DongJin
    Lee, SangJo
    [J]. COMPUTATIONAL INTELLIGENCE, PT 2, PROCEEDINGS, 2006, 4114 : 690 - 700
  • [42] Classification of news web documents based on structural features
    Tongchim, Shisanu
    Sornlertlamvanich, Virach
    Isahara, Hitoshi
    [J]. ADVANCES IN NATURAL LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4139 : 153 - 160
  • [43] A Summarizer System Based on a Semantic Analysis of Web Documents
    Florence, Angelin
    Padmadas, Vijaya
    [J]. 2015 INTERNATIONAL CONFERENCE ON TECHNOLOGY FOR SUSTAINABLE DEVELOPMENT (ICTSD-2015), 2015,
  • [44] Hadoop Based Parallel Deduplication Method for Web Documents
    Song, Junjie
    Liu, Jin
    Zheng, Yuhui
    [J]. ADVANCES IN COMPUTER SCIENCE AND UBIQUITOUS COMPUTING, 2018, 474 : 499 - 504
  • [45] Entity-based keyword search in web documents
    Sartori E.
    Velegrakis Y.
    Guerra F.
    [J]. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2016, 9630 : 21 - 49
  • [46] Web Service Ranking based on Context
    Zhang, Rong
    Zettsu, Koji
    Kidawara, Yutaka
    Kiyoki, Yasushi
    [J]. SECOND INTERNATIONAL CONFERENCE ON CLOUD AND GREEN COMPUTING / SECOND INTERNATIONAL CONFERENCE ON SOCIAL COMPUTING AND ITS APPLICATIONS (CGC/SCA 2012), 2012, : 375 - 382
  • [47] Evidence-based medicine: Context and relevance
    Raspe, H
    Stange, EF
    [J]. ZEITSCHRIFT FUR GASTROENTEROLOGIE, 1999, 37 (06): : 525 - 533
  • [48] Qualitative Evaluation of the Relevance and Acceptability of a Web-Based HIV Prevention Game for Rural Adolescents
    Enah, Comfort
    Piper, Kendra
    Moneyham, Linda
    [J]. JOURNAL OF PEDIATRIC NURSING-NURSING CARE OF CHILDREN & FAMILIES, 2015, 30 (02): : 321 - 328
  • [49] Document Word Clouds: Visualising Web Documents as Tag Clouds to Aid Users in Relevance Decisions
    Gottron, Thomas
    [J]. RESEARCH AND ADVANCED TECHNOLOGY FOR DIGITAL LIBRARIES, PROCEEDINGS, 2009, 5714 : 94 - 105
  • [50] The Effects of Annotated Web Documents, Using Context Highlighting, on Quiz Performance and Preparation Time
    Zucker, Ron
    [J]. PROCEEDINGS OF THE 48TH ANNUAL SOUTHEAST REGIONAL CONFERENCE (ACM SE 10), 2010, : 191 - 195