Static pruning of terms in inverted files

被引:0
|
作者
Blanco, Roi [1 ]
Barreiro, Alvaro [1 ]
机构
[1] Univ A Coruna, Dept Comp Sci, IRLab, La Coruna, Spain
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper addresses the problem of identifying collection dependent stop-words in order to reduce the size of inverted files. We present four methods to automatically recognise stop-words, analyse the tradeoff between efficiency and effectiveness, and compare them with a previous pruning approach. The experiments allow us to conclude that in some situations stop-words pruning is competitive with respect to other inverted file reduction techniques.
引用
收藏
页码:64 / +
页数:2
相关论文
共 50 条
  • [21] Parallel search using partitioned inverted files
    MacFarlane, A
    McCann, JA
    Robertson, SE
    SPIRE 2000: SEVENTH INTERNATIONAL SYMPOSIUM ON STRING PROCESSING AND INFORMATION RETRIEVAL - PROCEEDINGS, 2000, : 209 - 220
  • [22] Parallel methods for the update of partitioned inverted files
    MacFarlane, A.
    McCann, J. A.
    Robertson, S. E.
    ASLIB PROCEEDINGS, 2007, 59 (4-5): : 367 - 396
  • [23] Compression of boolean inverted files by document ordering
    Gelbukh, A
    Han, SY
    Sidorov, G
    2003 INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, PROCEEDINGS, 2003, : 244 - 249
  • [24] IN-SITU GENERATION OF COMPRESSED INVERTED FILES
    MOFFAT, A
    BELL, TAH
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1995, 46 (07): : 537 - 550
  • [25] Fast concurrency control for distributed inverted files
    Marín, M
    COMPUTATIONAL SCIENCE - ICCS 2005, PT 1, PROCEEDINGS, 2005, 3514 : 411 - 418
  • [26] Parallel methods for the generation of partitioned inverted files
    MacFarlane, A
    McCann, JA
    Robertson, SE
    ASLIB PROCEEDINGS, 2005, 57 (05): : 434 - 459
  • [27] Compressing Inverted Files using Modified LZW
    Iosifidis, Vasileios
    Makris, Christos
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS AND TECHNOLOGIES, VOL 1 (WEBIST), 2016, : 156 - 163
  • [28] Two-Dimensional Distributed Inverted Files
    Feuerstein, Esteban
    Marin, Mauricio
    Mizrahi, Michel
    Gil-Costa, Veronica
    Baeza-Yates, Ricardo
    STRING PROCESSING AND INFORMATION RETRIEVAL, PROCEEDINGS, 2009, 5721 : 206 - +
  • [29] Pruning terms for principal type assignment
    Graduate School of Informatics, Kyoto Univ., 606-8501, Japan
    Electronic Notes in Theoretical Computer Science, 2000, 31 : 144 - 159
  • [30] ALGORITHMS FOR MULTIDIMENSIONAL PARTITIONING OF STATIC FILES
    ROTEM, D
    SEGEV, A
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1988, 14 (11) : 1700 - 1710