Improving Information Retrieval Through a Global Term Weighting Scheme

被引:0
|
作者
Cuellar, Daniel [1 ]
Diaz, Elva [1 ]
Ponce-de-Leon-Senti, Eunice [1 ]
机构
[1] UAA, Basic Sci Ctr, Dept Comp Sci, Aguascalientes 20131, Aguascalientes, Mexico
来源
关键词
Information retrieval; Indexing; Vector space model; Term weighting; Marginal distribution; Weighting scheme;
D O I
10.1007/978-3-319-19264-2_24
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The output of an information retrieval system is an ordered list of documents corresponding to the user query, represented by an input list of terms. This output relies on the estimated similarity between each document and the query. This similarity depends in turn on the weighting scheme used for the terms of the document index. Term weighting then plays a big role in the estimation of the aforementioned similarity. This paper proposes a new term weighting approach for information retrieval based on the marginal frequencies. Consisting of the global count of term frequencies over the corpus of documents, while conventional term weighting schemes such as the normalized term frequency takes into account the term frequencies for particular documents. The presented experiment shows the advantages and disadvantages of the proposed retrieval scheme. Performance measures such as precision and recall and F-Score are used over classical benchmarks such as CACM to validate the experimental results.
引用
收藏
页码:246 / 257
页数:12
相关论文
共 50 条
  • [1] A Part-Of-Speech term weighting scheme for biomedical information retrieval
    Wang, Yanshan
    Wu, Stephen
    Li, Dingcheng
    Mehrabi, Saeed
    Liu, Hongfang
    JOURNAL OF BIOMEDICAL INFORMATICS, 2016, 63 : 379 - 389
  • [2] Term frequency - function of document frequency: a new term weighting scheme for enterprise information retrieval
    Zhang, Hui
    Wang, Deqing
    Wu, Wenjun
    Hu, Hongping
    ENTERPRISE INFORMATION SYSTEMS, 2012, 6 (04) : 433 - 444
  • [3] RF*IPF: A weighting scheme for multimedia information retrieval
    Wang, JZ
    Du, YP
    11TH INTERNATIONAL CONFERENCE ON IMAGE ANALYSIS AND PROCESSING, PROCEEDINGS, 2001, : 380 - 385
  • [4] Evolving local and global weighting schemes in information retrieval
    Ronan Cummins
    Colm O’Riordan
    Information Retrieval, 2006, 9 : 311 - 330
  • [5] Evolving local and global weighting schemes in information retrieval
    Cummins, Ronan
    O'Riordan, Colm
    INFORMATION RETRIEVAL, 2006, 9 (03): : 311 - 330
  • [6] N-gram IDF: A Global Term Weighting Scheme Based on Information Distance
    Shirakawa, Masumi
    Hara, Takahiro
    Nishio, Shojiro
    PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW 2015), 2015, : 960 - 970
  • [7] Graph-based term weighting for information retrieval
    Blanco, Roi
    Lioma, Christina
    INFORMATION RETRIEVAL, 2012, 15 (01): : 54 - 92
  • [8] Part of Speech Based Term Weighting for Information Retrieval
    Lioma, Christina
    Blanco, Roi
    ADVANCES IN INFORMATION RETRIEVAL, PROCEEDINGS, 2009, 5478 : 412 - +
  • [9] Graph-based term weighting for information retrieval
    Roi Blanco
    Christina Lioma
    Information Retrieval, 2012, 15 : 54 - 92
  • [10] Orbit Weighting Scheme in the Context of Vector Space Information Retrieval
    Ababneh, Ahmad
    Sanjalawe, Yousef
    Fraihat, Salam
    Al-E'mari, Salam
    Alqudah, Hamzah
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 80 (01): : 1347 - 1379