Propeller: A Scalable Real-Time File-Search Service in Distributed Systems

被引:6
|
作者
Xu, Lei [1 ]
Jiang, Hong [1 ]
Tian, Lei [1 ]
Huang, Ziling [1 ]
机构
[1] Univ Nebraska Lincoln, Lincoln, NE 68588 USA
关键词
D O I
10.1109/ICDCS.2014.46
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
File-search service is a valuable facility to accelerate many analytics applications, because it can drastically reduce the scale of the input data. The main challenge facing the design of large-scale and accurate file-search services is how to support real-time indexing in an efficient and scalable way. To address this challenge, we propose a distributed file-search service, called Propeller, which utilizes a special file-access pattern, called access-causality, to partition file-indices in order to expose substantial access locality and parallelism to accelerate the file-indexing process. The extensive evaluations of Propeller show that it is real-time in file-indexing operations, accurate in file-search results, and scalable in large datasets. It achieves significantly better file-indexing and file-search performance (up to 250x) than a centralized solution (MySQL) and much higher accuracy and substantially lower query latency (up to 22x) than a state-of-the-art desktop search engine (Spotlight).
引用
收藏
页码:378 / 388
页数:11
相关论文
共 50 条
  • [31] MONITORING DISTRIBUTED REAL-TIME SYSTEMS
    SCHMID, U
    REAL-TIME SYSTEMS, 1994, 7 (01) : 33 - 56
  • [32] Monitoring Distributed Real-Time Systems
    于波
    石油化工高等学校学报, 1998, (01) : 72 - 74+87
  • [33] Real-time distributed VXI systems
    Wolfe, Ron
    Graff, John
    EE: Evaluation Engineering, 1991, 30 (05):
  • [34] Real-time Distributed MIMO Systems
    Hamed, Ezzeldin
    Rahul, Hariharan
    Abdelghany, Mohammed A.
    Katabi, Dina
    PROCEEDINGS OF THE 2016 ACM CONFERENCE ON SPECIAL INTEREST GROUP ON DATA COMMUNICATION (SIGCOMM '16), 2016, : 412 - 425
  • [35] Real-time performance estimation for dynamic, distributed real-time systems
    Huh, EN
    Welch, LR
    Mun, Y
    COMPUTATIONAL SCIENCE-ICCS 2002, PT III, PROCEEDINGS, 2002, 2331 : 1071 - 1079
  • [36] Real-time commit protocol for distributed real-time database systems
    Yoon, Y
    Han, M
    Cho, J
    SECOND IEEE INTERNATIONAL CONFERENCE ON ENGINEERING OF COMPLEX COMPUTER SYSTEMS: HELD JOINTLY WITH 6TH CSESAW, 4TH IEEE RTAW, AND SES'96, 1996, : 221 - 225
  • [37] Distributed and Scalable Directory Service in a Parallel File System
    Wang, Lixin
    Lu, Yutong
    Zhang, Wei
    Lei, Yan
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2016, E99D (02): : 313 - 323
  • [38] On Structural Robustness of Distributed Real-Time Systems Towards Uncertainties in Service Times
    Jayachandran, Praveen
    Abdelzaher, Tarek
    31ST IEEE REAL-TIME SYSTEMS SYMPOSIUM (RTSS 2010), 2010, : 317 - 326
  • [39] Enhancing real-time event service for synchronization in object oriented distributed systems
    Singh, G
    Maddula, B
    Zeng, Q
    ISORC 2002: FIFTH IEEE INTERNATIONAL SYMPOSIUM ON OBJECT-ORIENTED REAL-TIME DISTRIBUTED COMPUTING, PROCEEDINGS, 2002, : 233 - 240
  • [40] Highly Scalable Algorithm For Distributed Real-Time Text Indexing
    Narang, Ankur
    Agarwal, Vikas
    Kedia, Monu
    Garg, Vijay K.
    16TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING (HIPC), PROCEEDINGS, 2009, : 332 - 341