Compaction techniques for nextword indexes

被引:8
|
作者
Bahle, D [1 ]
Williams, HE [1 ]
Zobel, J [1 ]
机构
[1] RMIT Univ, Sch Comp & Informat Technol, Melbourne, Vic 3001, Australia
关键词
D O I
10.1109/SPIRE.2001.989735
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Most queries to text search engines are ranked or Boolean. Phrase querying is a powerful technique for refining searches, but is expensive to implement on conventional indexes. In previous work we introduced the nextword index, a structure specifically designed for phrase queries, which however is relatively large. In this paper we introduce new compaction techniques for nextword indexes. In contrast to most index compression schemes, these techniques are lossy, yet as we show allow fill resolution of phrase queries without false match checking. We show experimentally that our novel techniques lead to significant savings in index size.
引用
收藏
页码:33 / 45
页数:13
相关论文
共 50 条
  • [1] COMPACTION EFFECT ON FLOW PROPERTY INDEXES FOR POWDERS
    HARWOOD, CF
    JOURNAL OF PHARMACEUTICAL SCIENCES, 1971, 60 (01) : 161 - &
  • [2] THE USE OF TABLETING INDEXES TO STUDY THE COMPACTION PROPERTIES OF POWDERS
    WILLIAMS, RO
    MCGINITY, JW
    DRUG DEVELOPMENT AND INDUSTRIAL PHARMACY, 1988, 14 (13) : 1823 - 1844
  • [3] Compiler techniques for code compaction
    Debray, SK
    Evans, W
    Muth, R
    De Sutter, B
    ACM TRANSACTIONS ON PROGRAMMING LANGUAGES AND SYSTEMS, 2000, 22 (02): : 378 - 415
  • [4] A COMPARISON OF FOUNDATION COMPACTION TECHNIQUES
    SOLYMAR, ZV
    REED, DJ
    CANADIAN GEOTECHNICAL JOURNAL, 1986, 23 (03) : 271 - 280
  • [5] SOIL COMPACTION - DEFINITION AND TECHNIQUES
    ABEELS, P
    DECLERCQ, D
    REVUE DE L AGRICULTURE, 1977, 30 (01): : 131 - 150
  • [6] LOCAL MICROCODE COMPACTION TECHNIQUES
    LANDSKOV, D
    DAVIDSON, S
    SHRIVER, B
    MALLETT, PW
    COMPUTING SURVEYS, 1980, 12 (03) : 261 - 294
  • [7] Software techniques for program compaction
    De Sutter, B
    De Bosschere, K
    COMMUNICATIONS OF THE ACM, 2003, 46 (08) : 33 - 34
  • [8] Post-pass compaction techniques
    De Bus, B
    Kästner, D
    Chanet, D
    Van Put, L
    De Sutter, B
    COMMUNICATIONS OF THE ACM, 2003, 46 (08) : 41 - 46
  • [9] THE NEED FOR BOOK INDEXES, INDEXERS, TYPES OF INDEXES AND SOME SEARCH TECHNIQUES
    RAPER, R
    ASLIB PROCEEDINGS, 1990, 42 (7-8): : 207 - 212
  • [10] Center Selection Techniques for Metric Indexes
    Mendoza Alric, Cristian
    Edith Herrera, Norma
    JOURNAL OF COMPUTER SCIENCE & TECHNOLOGY, 2007, 7 (01): : 98 - 104