Propeller: A Scalable Real-Time File-Search Service in Distributed Systems

Cited by: 6
Authors
Xu, Lei [1]
Jiang, Hong [1]
Tian, Lei [1]
Huang, Ziling [1]
Affiliation
[1] Univ Nebraska Lincoln, Lincoln, NE 68588 USA
DOI
10.1109/ICDCS.2014.46
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
File-search service is a valuable facility to accelerate many analytics applications, because it can drastically reduce the scale of the input data. The main challenge facing the design of large-scale and accurate file-search services is how to support real-time indexing in an efficient and scalable way. To address this challenge, we propose a distributed file-search service, called Propeller, which utilizes a special file-access pattern, called access-causality, to partition file-indices in order to expose substantial access locality and parallelism to accelerate the file-indexing process. The extensive evaluations of Propeller show that it is real-time in file-indexing operations, accurate in file-search results, and scalable in large datasets. It achieves significantly better file-indexing and file-search performance (up to 250x) than a centralized solution (MySQL) and much higher accuracy and substantially lower query latency (up to 22x) than a state-of-the-art desktop search engine (Spotlight).
Pages: 378-388
Page count: 11