On Main-memory Flushing in Microblogs Data Management Systems

被引:0
|
作者
Magdy, Amr [1 ]
Alghamdi, Rami [1 ]
Mokbel, Mohamed F. [1 ]
机构
[1] Univ Minnesota, Dept Comp Sci & Engn, Minneapolis, MN 55455 USA
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Searching microblogs, e.g., tweets and comments, is practically supported through main-memory indexing for scalable data digestion and efficient query evaluation. With continuity and excessive numbers of microblogs, it is infeasible to keep data in main-memory for long periods. Thus, once allocated memory budget is filled, a portion of data is flushed from memory to disk to continuously accommodate newly incoming data. Existing techniques come with either low memory hit ratio due to flushing items regardless of their relevance to incoming queries or significant overhead of tracking individual data items, which limit scalability of microblogs systems in either cases. In this paper, we propose kFlushing policy that exploits popularity of top-k queries in microblogs to smartly select a subset of microblogs to flush. kFlushing is mainly designed to increase memory hit ratio. To this end, it identifies and flushes in-memory data that does not contribute to incoming queries. The freed memory space is utilized to accumulate more useful data that is used to answer more queries from memory contents. When all memory is utilized for useful data, kFlushing flushes data that is less likely to degrade memory hit ratio. In addition, kFlushing comes with a little overhead that keeps high system scalability in terms of high digestion rates of incoming fast data. Extensive experimental evaluation shows the effectiveness and scalability of kFlushing to improve main-memory hit by 26-330% while coping up with fast microblog streams of up to 100K microblog/second.
引用
收藏
页码:445 / 456
页数:12
相关论文
共 50 条
  • [1] Adaptive Data Skipping in Main-Memory Systems
    Qin, Wilson
    Idreos, Stratos
    [J]. SIGMOD'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2016, : 2255 - 2256
  • [2] Main-Memory Database Systems
    Kemper, Alfons
    Neumann, Thomas
    [J]. 2014 IEEE 30TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2014, : 1310 - 1310
  • [3] Modern Main-Memory Database Systems
    Larson, Per-Ake
    Levandoski, Justin
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2016, 9 (13): : 1609 - +
  • [4] Identifying Hot and Cold Data in Main-Memory Databases
    Levandoski, Justin J.
    Larson, Per-Ake
    Stoica, Radu
    [J]. 2013 IEEE 29TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2013, : 26 - 37
  • [5] A Data Distribution Strategy for Scalable Main-Memory Database
    Huang, Yunkui
    Zhang, YanSong
    Ji, XiaoDong
    Wang, ZhanWei
    Wang, Shan
    [J]. ADVANCES IN WEB AND NETWORK TECHNOLOGIES, AND INFORMATION MANAGEMENT, 2009, 5731 : 13 - 24
  • [6] Integrating Cluster-Based Main-Memory Accelerators in Relational Data Warehouse Systems
    Knut Stolze
    Felix Beier
    Oliver Koeth
    Kai-Uwe Sattler
    [J]. Datenbank-Spektrum , 2011, 11 (2) : 101 - 110
  • [7] SEMANTICALLY RICH API FOR IN-DATABASE DATA MANIPULATION IN MAIN-MEMORY ERP SYSTEMS
    Borovskiy, Vadym
    Schwarz, Christian
    Zeier, Alexander
    Koch, Wolfgang
    [J]. ICEIS 2011: PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS, VOL 1, 2011, : 253 - 260
  • [8] Guaranteeing the physical consistency of shared data in a main-memory DBMS
    Lim, HJ
    Kim, SW
    [J]. 2002 STUDENT CONFERENCE ON RESEARCH AND DEVELOPMENT, PROCEEDINGS: GLOBALIZING RESEARCH AND DEVELOPMENT IN ELECTRICAL AND ELECTRONICS ENGINEERING, 2002, : 348 - 351
  • [9] DimensionSlice: A main-memory data layout for fast scans of multidimensional data
    Suh, Ilhyun
    Chung, Yon Dohn
    [J]. INFORMATION SYSTEMS, 2020, 94
  • [10] Versioning in Main-Memory Database Systems From MusaeusDB to TardisDB
    Schule, Maximilian E.
    Karnowski, Lukas
    Schmeisser, Josef
    Kleiner, Benedikt
    Kemper, Alfons
    Neumann, Thomas
    [J]. SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT (SSDBM 2019), 2019, : 169 - 180