On Main-memory Flushing in Microblogs Data Management Systems

被引:0
|
作者
Magdy, Amr [1 ]
Alghamdi, Rami [1 ]
Mokbel, Mohamed F. [1 ]
机构
[1] Univ Minnesota, Dept Comp Sci & Engn, Minneapolis, MN 55455 USA
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Searching microblogs, e.g., tweets and comments, is practically supported through main-memory indexing for scalable data digestion and efficient query evaluation. With continuity and excessive numbers of microblogs, it is infeasible to keep data in main-memory for long periods. Thus, once allocated memory budget is filled, a portion of data is flushed from memory to disk to continuously accommodate newly incoming data. Existing techniques come with either low memory hit ratio due to flushing items regardless of their relevance to incoming queries or significant overhead of tracking individual data items, which limit scalability of microblogs systems in either cases. In this paper, we propose kFlushing policy that exploits popularity of top-k queries in microblogs to smartly select a subset of microblogs to flush. kFlushing is mainly designed to increase memory hit ratio. To this end, it identifies and flushes in-memory data that does not contribute to incoming queries. The freed memory space is utilized to accumulate more useful data that is used to answer more queries from memory contents. When all memory is utilized for useful data, kFlushing flushes data that is less likely to degrade memory hit ratio. In addition, kFlushing comes with a little overhead that keeps high system scalability in terms of high digestion rates of incoming fast data. Extensive experimental evaluation shows the effectiveness and scalability of kFlushing to improve main-memory hit by 26-330% while coping up with fast microblog streams of up to 100K microblog/second.
引用
收藏
页码:445 / 456
页数:12
相关论文
共 50 条
  • [21] A robust main-memory compression scheme
    Ekman, M
    Stenstrom, P
    [J]. 32ND INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, PROCEEDINGS, 2005, : 74 - 85
  • [22] Concurrency control in a main-memory DBMS
    Kim, SW
    [J]. COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2004, 19 (04): : 263 - 272
  • [23] HOT: A Height Optimized Trie Index for Main-Memory Database Systems
    Binna, Robert
    Zangerle, Eva
    Pichl, Martin
    Specht, Guenther
    Leis, Viktor
    [J]. SIGMOD'18: PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2018, : 521 - 534
  • [24] Parallel Materialisation of Datalog Programs in Centralised, Main-Memory RDF Systems
    Motik, Boris
    Nenov, Yavor
    Piro, Robert
    Horrocks, Ian
    Olteanu, Dan
    [J]. PROCEEDINGS OF THE TWENTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2014, : 129 - 137
  • [25] Low-Overhead Asynchronous Checkpointing in Main-Memory Database Systems
    Ren, Kun
    Diamond, Thaddeus
    Abadi, Daniel J.
    Thomson, Alexander
    [J]. SIGMOD'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2016, : 1539 - 1551
  • [26] A Comparative Study of Consistent Snapshot Algorithms for Main-Memory Database Systems
    Li, Liang
    Wang, Guoren
    Wu, Gang
    Yuan, Ye
    Chen, Lei
    Lian, Xiang
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2021, 33 (02) : 316 - 330
  • [27] An extensible architecture for main-memory real-time storage systems
    Cha, SK
    Park, JH
    Lee, SJ
    Park, BD
    Lee, JS
    [J]. THIRD INTERNATIONAL WORKSHOP ON REAL-TIME COMPUTING SYSTEMS AND APPLICATIONS, PROCEEDINGS, 1996, : 67 - 73
  • [28] DimmWitted: A Study of Main-Memory Statistical Analytics
    Zhang, Ce
    Re, Christopher
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2014, 7 (12): : 1283 - 1294
  • [29] The Architecture of the Dalí Main-Memory Storage Manager
    Philip Bohannon
    Daniel Lieuwen
    Rajeev Rastogi
    Avi Silberschatz
    S. Seshadri
    S. Sudarshan
    [J]. Multimedia Tools and Applications, 1997, 4 : 115 - 151
  • [30] Access Path Selection in Main-Memory Optimized Data Systems: Should I Scan or Should I Probe?
    Kester, Michael S.
    Athanassoulis, Manos
    Idreos, Stratos
    [J]. SIGMOD'17: PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2017, : 715 - 730