Architecture and Technology of OceanBase Distributed Relational Database

被引:0
|
作者
Yang Z. [1 ]
Yang C. [1 ]
Han F. [1 ]
Wang G. [1 ]
Yang Z. [1 ]
Cheng X. [1 ]
机构
[1] Вeijing OceanВase Technology Co., Ltd., Beijing
关键词
distributed SQL optimizer; distributed transaction; LSM-tree-based storage; multitenant; relational database;
D O I
10.7544/issn1000-1239.202330835
中图分类号
学科分类号
摘要
Relational database is the key information infrastructure of today’s society. The Internet and digitization have brought high concurrency and massive data. Due to their centralized architectures, the processing power and storage capacity of traditional relational databases are stretched. OceanBase is a distributed relational database based on commodity PC servers. It achieves online horizontal scalability, automatic lossless disaster recovery from data center failure and high-ratio data compression. It has been used in finance, government affairs, telecommunication systems, Internet, etc. We introduce the architecture and some key technologies of OceanBase, including distributed transaction processing, LSM-tree-based storage system and distributed SQL optimizer. In addition, we explain in detail the high availability and data consistency of OceanBase, which can ensure that RPO is 0 and RTO is less than 8 seconds. At the same time, it also introduces OceanBase’s multi-tenant mechanism, which adopts a native multi-tenant design within the cluster to implement multiple independent database services in the cluster. Based on the Sysbench and TPC-H evaluation benchmarks, comparative experimental results show that 1) in a stand-alone mode, the performance of OceanBase is 1.27 times to over 2 times that of MySQL; 2) in a single-master mode, the performance of OceanBase is 1.25 times to nearly 2 times that of MySQL; 3) in a multi-master mode, the performance of OceanBase is 1.09 to 3.1 times that of MySQL, and for complex OLAP queries, the performance of OceanBase is 6 to 327 times that of MySQL. © 2024 Science Press. All rights reserved.
引用
收藏
页码:540 / 554
页数:14
相关论文
共 30 条
  • [1] Codd E F., A relational model of data for large shared data banks[J], Communications of the ACM, 13, 6, (1970)
  • [2] Zhenkun Yang, Yang Chuanhui, Han Fusheng, Et al., OceanBase: A 707 million tpmC distributed relational database system[J], Proceedings of the VLDB Endowment, 15, 12, pp. 3385-3397, (2022)
  • [3] Zhifeng Yang, Xu Quanqing, Gao Shanyan, Et al., OceanBase Paetica: A hybrid shared-nothing/shared-everything database for supporting single machine and distributed cluster[J], Proceedings of the VLDB Endowment, 16, 12, pp. 3728-3740, (2023)
  • [4] Serlin O., TPC-C Details: 60, 880, 800 tpmC [EB/OL]
  • [5] Serlin O., TPC-H Result Details: 15, 265, 305 QphH@30000GB
  • [6] Lamport L., The part-time parliament[J], ACM Transactions on Computer Systems, 16, 2, (1998)
  • [7] Gray J., The transaction concept: Virtues and limitations[C], Proc of Int Conf on Very Large Data Bases, pp. 144-154, (1981)
  • [8] Mohan C, Lindsay B, Obermarck R., Transaction management in the R* distributed database management system[J], ACM Transactions on Database Systems, 11, 4, (1986)
  • [9] Berenson H, Bernstein P, Gray J, Et al., A critique of ANSI SQL isolation levels[C], Proc of the 1995 ACM SIGMOD Int Conf on Management of Data, pp. 1-10, (1995)
  • [10] Bernstein P, Goodman N., Multiversion concurrency control−Theory and algorithms[J], ACM Transactions on Database Systems, 8, 4, (1983)