Static pruning of terms in inverted files

Authors
Blanco, Roi [1]
Barreiro, Alvaro [1]
Affiliation
[1] Univ A Coruna, Dept Comp Sci, IRLab, La Coruna, Spain
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper addresses the problem of identifying collection-dependent stop-words in order to reduce the size of inverted files. We present four methods to automatically recognise stop-words, analyse the tradeoff between efficiency and effectiveness, and compare them with a previous pruning approach. The experiments allow us to conclude that in some situations stop-word pruning is competitive with other inverted file reduction techniques.
Pages: 64 / +
Page count: 2