DATA-COMPRESSION IN FULL-TEXT RETRIEVAL-SYSTEMS

被引:0
|
作者
BELL, TC
MOFFAT, A
NEVILLMANNING, CG
WITTEN, IH
ZOBEL, J
机构
[1] UNIV MELBOURNE, PARKVILLE, VIC 3052, AUSTRALIA
[2] UNIV WAIKATO, HAMILTON, NEW ZEALAND
[3] ROYAL MELBOURNE INST TECHNOL, MELBOURNE, VIC 3001, AUSTRALIA
关键词
D O I
10.1002/(SICI)1097-4571(199310)44:9<508::AID-ASI2>3.0.CO;2-A
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
When data compression is applied to full-text retrieval systems, intricate relationships emerge between the amount of compression, access speed, and computing resources required. We propose compression methods, and explore corresponding tradeoffs, for all components of static full-text systems such as text databases on CD-ROM. These components include lexical indexes, inverted files, bitmaps, signature files, and the main text itself. Results are reported on the application of the methods to several substantial full-text databases, and show that a large, unindexed text can be stored, along with indexes that facilitate fast searching, in less than half its original size-at some appreciable cost in primary memory requirements.
引用
收藏
页码:508 / 531
页数:24
相关论文
共 50 条
  • [1] SPECIALIZED HARDWARE FOR IMPLEMENTING FULL-TEXT RETRIEVAL-SYSTEMS
    HOLLAAR, LA
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1982, 183 (MAR): : 25 - CINF
  • [2] PROBABILISTIC DESIGN PRINCIPLES FOR CONVENTIONAL AND FULL-TEXT RETRIEVAL-SYSTEMS
    MARON, ME
    [J]. INFORMATION PROCESSING & MANAGEMENT, 1988, 24 (03) : 249 - 255
  • [3] EXPERIMENTS IN LOCAL METRICAL FEEDBACK IN FULL-TEXT RETRIEVAL-SYSTEMS
    ATTAR, R
    FRAENKEL, AS
    [J]. INFORMATION PROCESSING & MANAGEMENT, 1981, 17 (03) : 115 - 126
  • [4] DATA CACHING STRATEGIES FOR DISTRIBUTED FULL TEXT RETRIEVAL-SYSTEMS
    MARTIN, TP
    RUSSELL, JI
    [J]. INFORMATION SYSTEMS, 1991, 16 (01) : 1 - 11
  • [5] ADDING COMPRESSION TO A FULL-TEXT RETRIEVAL-SYSTEM
    ZOBEL, J
    MOFFAT, A
    [J]. SOFTWARE-PRACTICE & EXPERIENCE, 1995, 25 (08): : 891 - 903
  • [6] LOCAL FEEDBACK IN FULL-TEXT RETRIEVAL SYSTEMS
    ATTAR, R
    FRAENKEL, AS
    [J]. JOURNAL OF THE ACM, 1977, 24 (03) : 397 - 417
  • [7] FULL-TEXT INFORMATION RETRIEVAL
    FAY, RJ
    [J]. LAW LIBRARY JOURNAL, 1971, 64 (02): : 167 - 175
  • [8] Harvesting for full-text retrieval
    Simeoni, F
    Yakici, M
    Neely, S
    Crestani, F
    [J]. DIGITAL LIBRARIES: IMPLEMENTING STRATEGIES AND SHARING EXPERIENCES, PROCEEDINGS, 2005, 3815 : 204 - 213
  • [9] FULL-TEXT ONLINE RETRIEVAL
    COLBERT, AW
    [J]. ONLINE, 1988, 12 (02): : 91 - 91
  • [10] RESEARCH INTO FULL-TEXT RETRIEVAL
    OJALA, M
    [J]. DATABASE, 1990, 13 (04): : 78 - 80