Enhancing Accuracy of Topic Sensitive PageRank using Jaccard Index and Cosine Similarity

被引:6
|
作者
Rezvani, Mojtaba [1 ]
Hashemi, S. Mehdi [1 ]
机构
[1] Amirkabir Univ Technol, Tehran Polytech, Dept Comp Sci, Tehran, Iran
关键词
Information Retrieval; Web Ranking; PageRank; Topic Sensitive PageRank; Cosine Similarity; Jaccard Similarity; RETRIEVAL; WEB;
D O I
10.1109/WI-IAT.2012.166
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The importance of online information retrieval systems has dramatically increased through considerable growth in the size of the web, and the challenges beyond this topic have become a center of attention for many researchers. This remarkable growth in the size of the web has led to in-depth studies on every element of information retrieval systems such as web page ranking algorithms. The accuracy of these algorithms plays a critical role in the search engines, whereas the ranker is responsible for accuracy. Thus, the ranker is a principal module of each search engine. In this paper, a new framework based on web graph and content similarity is presented in order to improve the accuracy of PageRank. This framework is implemented using Jaccard index and cosine similarity measures, and as a result of our empirical analysis, we shall show that putting page content similarity in action increases the accuracy of web ranking in some candidate ranking algorithms. In addition, time complexity and implementation issues are discussed to achieve a practical result.
引用
收藏
页码:620 / 624
页数:5
相关论文
共 26 条
  • [1] Matching Scientific Article Titles using Cosine Similarity and Jaccard Similarity Algorithm
    Rinjeni, Tri Puspa
    Indriawan, Ade
    Rakhmawati, Nur Aini
    Procedia Computer Science, 2024, 234 : 553 - 560
  • [2] SIMILARITY MEASURES IN SCIENTOMETRIC RESEARCH - THE JACCARD INDEX VERSUS SALTON COSINE FORMULA
    HAMERS, L
    HEMERYCK, Y
    HERWEYERS, G
    JANSSEN, M
    KETERS, H
    ROUSSEAU, R
    VANHOUTTE, A
    INFORMATION PROCESSING & MANAGEMENT, 1989, 25 (03) : 315 - 318
  • [3] Examining Bitcoin mempools Resemblance Using Jaccard Similarity Index
    Dae-Yong, Kim
    Meryam, Essaid
    Hongtaek, Ju
    APNOMS 2020: 2020 21ST ASIA-PACIFIC NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM (APNOMS), 2020, : 287 - 290
  • [4] An Automatic Thai Text Summarization Using Topic Sensitive PageRank
    Chongsuntornsri, Aekkasit
    Sornil, Ohm
    2006 INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES,VOLS 1-3, 2006, : 597 - +
  • [5] Mathematical properties of soft cardinality: Enhancing Jaccard, Dice and cosine similarity measures with element-wise distance
    Jimenez, Sergio
    Gonzalez, Fabio A.
    Gelbukh, Alexander
    INFORMATION SCIENCES, 2016, 367 : 373 - 389
  • [6] Evaluating single-cell cluster stability using the Jaccard similarity index
    Tang, Ming
    Kaymaz, Yasin
    Logeman, Brandon L.
    Eichhorn, Stephen
    Liang, Zhengzheng S.
    Dulac, Catherine
    Sackton, Timothy B.
    BIOINFORMATICS, 2021, 37 (15) : 2212 - 2214
  • [7] Android Malware Similarity Clustering using Method based Opcode Sequence and Jaccard Index
    Lee, Shinho
    Jung, Wookhyun
    Kim, Sangwon
    Kim, Eui Tak
    2019 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC): ICT CONVERGENCE LEADING THE AUTONOMOUS FUTURE, 2019, : 178 - 183
  • [8] New Similarity Measures Between Generalized Trapezoidal Fuzzy Numbers Using the Jaccard Index
    Hwang, Chao-Ming
    Yang, Miin-Shen
    INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2014, 22 (06) : 831 - 844
  • [9] Behavioral Analysis of System Call Sequences using LSTM Seq-Seq, Cosine Similarity and Jaccard Similarity for Real-time Anomaly Detection
    Soni, Jayesh
    Prabakar, Nagarajan
    Upadhyay, Himanshu
    2019 6TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI 2019), 2019, : 214 - 219
  • [10] A modification of the Jaccard-Tanimoto similarity index for diverse selection of chemical compounds using binary strings
    Fligner, MA
    Verducci, JS
    Blower, PE
    TECHNOMETRICS, 2002, 44 (02) : 110 - 119