Survey on Machine Learning for Database Systems

被引:0
|
作者
Meng X. [1 ]
Ma C. [1 ]
Yang C. [1 ]
机构
[1] School of Information, Renmin University of China, Beijing
基金
中国国家自然科学基金;
关键词
Automatic database systems; Database systems; Learned index; Machine learning;
D O I
10.7544/issn1000-1239.2019.20190446
中图分类号
学科分类号
摘要
As one of the most popular technologies, database systems have been developed for more than 50 years, and are mature enough to support many real scenarios. Although many researches still focus on the traditional database optimization tasks, the performance improvement is little. Actually, with the advent of big data, we have met the new gap obstructing the further performance improvement of database systems. The database systems face challenges in two aspects. Firstly, the increase of data volume requires the database system to process tasks more quickly. Secondly, the rapid change of query workload and its diversity make database systems impossible to adjust the system knobs to the optimal configuration in real time. Fortunately, machine learning may be the dawn bringing an unprecedented opportunity for the traditional database systems to lead us to the new optimization direction. In this paper, we introduce how to combine machine learning into the further development of database management systems. We focus on the current research work of machine learning for database systems, mainly including the machine learning for storage management and query optimization, as well as automatic database management systems. This area has also opened various challenges and problems to be solved. Thus, based on the analysis of existing technologies, the future challenges, which may be encountered in machine learning for database systems, are pointed out. © 2019, Science Press. All right reserved.
引用
收藏
页码:1803 / 1820
页数:17
相关论文
共 102 条
  • [1] Re C., Agrawal D., Balazinska M., Et al., Machine learning and databases: The sound of things to come or a cacophony of hype?, Proc of the 2015 ACM SIGMOD Int Conf on Management of Data, pp. 283-284, (2015)
  • [2] Yang C., Du Z., Meng X., Et al., A frequency scaling based performance indicator framework for big data systems, Proc of the 24th Int Conf on DSFAA, pp. 19-35, (2019)
  • [3] Kraska T., Beutel A., Chi E.H., Et al., The case for learned index structures, Proc of the 2018 Int Conf on Management of Data, pp. 489-504, (2018)
  • [4] Huang J., Moore law is dead but GPU will get 1000X faster by 2025
  • [5] GEFORCE RTX 2080 Ti
  • [6] Kipf A., Kipf T., Radke B., Et al., Learned cardinalities: Estimating correlated joins with deep learning, Proc of the 9th Biennial Conf on Innovative Data Systems Research(CIDR), (2019)
  • [7] Van Aken D., Pavlo A., Gordon G.J., Et al., Automatic database management system tuning through large-scale machine learning, Proc of the 2017 ACM Int Conf on Management of Data, pp. 1009-1024, (2017)
  • [8] MySQL-InnoDB startup options and system variables
  • [9] Zhou Z., Machine Learning, (2016)
  • [10] Garcia-Molina H., Ullman J.D., Widom J., Database System Implementation, (2000)