Static pruning of terms in inverted files

Authors
Blanco, Roi [1 ]
Barreiro, Alvaro [1 ]
Affiliation
[1] Univ A Coruna, Dept Comp Sci, IRLab, La Coruna, Spain
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper addresses the problem of identifying collection-dependent stop-words in order to reduce the size of inverted files. We present four methods to automatically recognise stop-words, analyse the tradeoff between efficiency and effectiveness, and compare them with a previous pruning approach. The experiments allow us to conclude that, in some situations, stop-word pruning is competitive with other inverted file reduction techniques.
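
The general idea of pruning collection-dependent stop-words from an inverted file can be illustrated with a minimal sketch. This record does not describe the four methods presented in the paper, so the sketch substitutes a plain document-frequency threshold as a stand-in heuristic. The function names, the `max_df_ratio` parameter, and the toy collection are all hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def build_inverted_index(docs):
    """Map each term to the sorted list of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in enumerate(docs):
        for term in text.lower().split():
            index[term].add(doc_id)
    return {term: sorted(ids) for term, ids in index.items()}


def prune_stop_words(index, num_docs, max_df_ratio=0.8):
    """Drop every term whose document frequency ratio exceeds max_df_ratio.

    Assumption: a high document frequency is used here as a simple proxy
    for "collection-dependent stop-word"; it is not the paper's criterion.
    """
    kept, removed = {}, []
    for term, postings in index.items():
        if len(postings) / num_docs > max_df_ratio:
            removed.append(term)
        else:
            kept[term] = postings
    return kept, sorted(removed)


# Toy collection (hypothetical data).
docs = [
    "the index stores the postings",
    "the query reads the index",
    "pruning shrinks the index",
    "the stop words carry little content",
]

index = build_inverted_index(docs)
pruned, stop_words = prune_stop_words(index, len(docs), max_df_ratio=0.7)

# Index size measured as the total number of postings.
size_before = sum(len(postings) for postings in index.values())
size_after = sum(len(postings) for postings in pruned.values())
print(stop_words, size_before, size_after)
```

In this toy collection the term "index" is pruned along with "the", which shows why such stop-words are collection dependent: a term that is informative in general text can be near-useless for retrieval within a specific collection. The efficiency side of the tradeoff appears as the drop in total postings; the effectiveness side would have to be measured separately with retrieval experiments.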
Pages: 64 / +
Page count: 2