Efficient Online Index Maintenance for SSD-based Information Retrieval Systems

被引:5
|
作者
Li, Ruixuan [1 ]
Chen, Xuefan [1 ]
Li, Chengzhou [1 ]
Gu, Xiwu [1 ]
Wen, Kunmei [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Intelligent & Distributed Comp Lab, Wuhan 430074, Peoples R China
关键词
D O I
10.1109/HPCC.2012.43
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Solid state disks (SSDs) can potentially eliminate the I/O bottleneck for many conventional applications. However, they have a very unique characteristic of erase-before-write, which probably makes existing index maintenance methods inapplicable to SSDs. In this paper, we propose Hybrid Merge, a new online index maintenance strategy for information retrieval systems, which applies SSDs instead of hard disk drives (HDDs) to store inverted indexes. We analyze the existing indexing methods through experiments, and design a new merge-based indexing method with no random writes. We try to take the full advantage of the SSD's fast random reads to overcome the defects of existing methods. Experimental results show that the proposed method improves indexing and query performance with extremely low write traffic compare to existing approaches.
引用
收藏
页码:262 / 269
页数:8
相关论文
共 50 条
  • [1] Fast Online Reconstruction for SSD-Based RAID-5 Storage Systems
    Lin, Haodong
    Luo, Junhao
    Li, Jun
    Sha, Zhibing
    Cai, Zhigang
    Shi, Yuanquan
    Liao, Jianwei
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2024, 43 (06) : 1886 - 1899
  • [2] Design Patterns for Tunable and Efficient SSD-based Indexes
    Anand, Ashok
    Gember-Jacobson, Aaron
    Engstrom, Collin
    Akella, Aditya
    TENTH 2014 ACM/IEEE SYMPOSIUM ON ARCHITECTURES FOR NETWORKING AND COMMUNICATIONS SYSTEMS (ANCS'14), 2014, : 149 - 160
  • [3] SOYA: SSD-Based RAID Systems Reliability Simulator
    Chamazcoti, Saeideh Alinezhad
    Safaei, Bardia
    Miremadi, Seyed Ghassem
    2016 INTERNATIONAL CONFERENCE ON SYSTEM RELIABILITY AND SCIENCE (ICSRS 2016), 2016, : 167 - 173
  • [4] On Endurance of Erasure Codes in SSD-based Storage Systems
    Chamazcoti, Saeideh Alinezhad
    Miremadi, Seyed Ghassem
    Asadi, Hossein
    2013 17TH CSI INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND DIGITAL SYSTEMS (CADS 2013), 2013, : 67 - 72
  • [5] Efficient and Consistent NVMM Cache for SSD-Based File System
    Chen, Youmin
    Lu, Youyou
    Chen, Pei
    Shu, Jiwu
    IEEE TRANSACTIONS ON COMPUTERS, 2019, 68 (08) : 1147 - 1158
  • [6] On endurance and performance of erasure codes in SSD-based storage systems
    Chamazcoti, Saeideh Alinezhad
    Delavari, Ziba
    Miremadi, Seyed Ghassem
    Asadi, Hossein
    MICROELECTRONICS RELIABILITY, 2015, 55 (11) : 2453 - 2467
  • [7] A Novel Cache and SSD-based Index Structure for Health Record Indexing
    Du, Yang
    Yildirim-Yayilgan, Sule
    DIGITAL HEALTHCARE EMPOWERING EUROPEANS, 2015, 210 : 993 - 994
  • [8] Power saving-aware prefetching for SSD-based systems
    Laura Prada
    Javier Garcia
    J. Daniel Garcia
    Jesus Carretero
    The Journal of Supercomputing, 2011, 58 : 323 - 331
  • [9] Power saving-aware prefetching for SSD-based systems
    Prada, Laura
    Garcia, Javier
    Daniel Garcia, J.
    Carretero, Jesus
    JOURNAL OF SUPERCOMPUTING, 2011, 58 (03): : 323 - 331
  • [10] Can Erasure Codes Damage Reliability in SSD-Based Storage Systems?
    Chamazcoti, Saeideh Alinezhad
    Safaei, Bardia
    Miremadi, Seyed Ghassem
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2019, 7 (03) : 435 - 446