Probabilistic Ranking of Documents Using Vectors in Information Retrieval

被引:0
|
作者
Saini, Balwinder [1 ]
Singh, Vikram [1 ]
机构
[1] NIT Kurukshetra, Dept Comp Engn, Chandigarh, Haryana, India
关键词
Information retrieval (IR); Ranking/indexing; Tokenization; Clustering;
D O I
10.1007/978-81-322-2205-7_57
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
On the web, electronic form of information is increasing exponentially with the passage of past few years. Also, this advancement creates its own uncertainties. The overload information result is progressive while finding the relevant data with a chance of HIT or Miss Exposure. For improving this, Information Retrieval Ranking, Tokenization and Clustering techniques are suggestive as probable solutions. In this paper, Probabilistic Ranking using Vectors (PRUV) algorithm is proposed, in which tokenization and Clustering of a given documents are used to create more precisely and efficient rank gratify user's information need to execute sharply reduced search, is believed to be a part of IR. Tokenization involves pre-processing of the given documents and generates its respective tokens and then based on probability score cluster are created. Performance of some of existing clustering techniques (K-Means and DB-Scan) is compared with proposed algorithm PRUV, using various parameters, e.g. Time, Accuracy and Number of Tokens Generated.
引用
收藏
页码:613 / 624
页数:12
相关论文
共 50 条
  • [1] A probabilistic information retrieval model by document ranking using term dependencies
    You, Hyun-Jo
    Lee, Jung-Jin
    [J]. KOREAN JOURNAL OF APPLIED STATISTICS, 2019, 32 (05) : 763 - 782
  • [2] A Probabilistic Method for Ranking Refinement in Geographic Information Retrieval
    Villatoro-Tello, Esau
    Omar Chavez-Garcia, R.
    Montes-y-Gomez, Manuel
    Villasenor-Pineda, Luis
    Enrique Sucar, L.
    [J]. PROCESAMIENTO DEL LENGUAJE NATURAL, 2010, (44): : 123 - 130
  • [4] Feature location using probabilistic ranking of methods based on execution scenarios and information retrieval
    Poshyvanyk, Denys
    Gueheneuc, Yann-Gael
    Marcus, Andrian
    Antoniol, Giuliano
    Rajlich, Vaclav
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2007, 33 (06) : 420 - 432
  • [5] Probabilistic information retrieval approach for ranking of database query results
    Chaudhuri, Surajit
    Das, Gautam
    Hristidis, Vagelis
    Weikum, Gerhard
    [J]. ACM TRANSACTIONS ON DATABASE SYSTEMS, 2006, 31 (03): : 1134 - 1168
  • [6] Documents ranking based on a hybrid language model for Chinese information retrieval
    Zheng, Dequan
    Yu, Feng
    Zhao, Tiejun
    Li, Sheng
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON INFORMATION ACQUISITION, VOLS 1 AND 2, CONFERENCE PROCEEDINGS, 2006, : 279 - 283
  • [7] Information Retrieval Meets Game Theory: The Ranking Competition Between Documents' Authors
    Raifer, Nimrod
    Raiber, Fiana
    Tennenholtz, Moshe
    Kurland, Oren
    [J]. SIGIR'17: PROCEEDINGS OF THE 40TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2017, : 465 - 474
  • [8] Retrieval of multimedia documents using low cost image ranking algorithms
    Haque, N
    Chowdhury, MU
    [J]. COMPUTER APPLICATIONS IN INDUSTRY AND ENGINEERING, 2001, : 41 - 44
  • [9] Information retrieval for OCR documents: A content-based probabilistic correction model
    Jin, R
    Zhai, CX
    Hauptmann, AG
    [J]. DOCUMENT RECOGNITION AND RETRIEVAL X, 2003, 5010 : 128 - 135
  • [10] Information Retrieval Ranking Using Machine Learning Techniques
    Pandey, Shweta
    Mathur, Iti
    Joshi, Nisheeth
    [J]. PROCEEDINGS 2019 AMITY INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AICAI), 2019, : 86 - 92