TimeCloth: Fast Point-in-Time Database Recovery in The Cloud

被引:0
|
作者
Deng, Jianjun [1 ]
Lu, Jianan [2 ]
Fan, Hua [1 ]
Liu, Chaoyang [1 ]
Cheng, Shi [1 ]
Fu, Cuiyun [1 ]
Zhou, Wenchao [1 ]
机构
[1] Alibaba Grp, Hangzhou, Peoples R China
[2] Princeton Univ, Princeton, NJ USA
关键词
database recovery; point-in-time recovery; user-triggered recovery; transparent lazy loading;
D O I
10.1145/3626246.3653382
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, we have noted the frequent occurrence of user-triggered database recoveries in the cloud. In contrast to traditional failure-triggered recovery scenarios, they come with unique consumer-centric challenges. Unfortunately, current solutions prove inefficient, falling short either in meeting our customers' requirements or due to their close integration with the native recovery support of the underlying database engines. In this work, we present TimeCloth, a generic cloud-native recovery mechanism that achieves sublinear recovery time while meeting the specific needs of our customers. It comprises a recovery module optimized for fine-grained point-in-time recoveries and an import module enabling nearly instantaneous access to remote tables. The recovery module performs fast log filtering, parallelizes replay of non-conflicting log records and coalesce log records to reduce the volume of replay work. The import module implements a transparent FUSE-based lazy loading mechanism as well as a smart prefetcher to achieve good access performance for remote tables. Collectively, they significantly accelerate user-triggered recoveries in the cloud. TimeCloth is launched in production at Alibaba Cloud for about 15 months. We have witnesses a reduction in RTO among our customers by 44% on average and sometimes up to 92%.
引用
收藏
页码:214 / 226
页数:13
相关论文
共 50 条
  • [31] Calibration of rating grades to point-in-time and through-the-cycle levels of probability of default
    Rubtsov, Mark
    JOURNAL OF RISK MODEL VALIDATION, 2021, 15 (04): : 51 - 74
  • [32] A Single Point-in-Time eGFR Is Not Associated with Increased Risk of Dementia in the Elderly Authors' Reply
    Kurella Tamura, Manjula
    Pajewski, Nicholas
    Weiner, Daniel E.
    JOURNAL OF THE AMERICAN SOCIETY OF NEPHROLOGY, 2020, 31 (12): : 2966 - 2966
  • [33] Point-in-time probability of default term structure models for multiperiod scenario loss projection
    Yang, Bill Huajian
    JOURNAL OF RISK MODEL VALIDATION, 2017, 11 (01): : 73 - 94
  • [34] Surface Recovery: Fusion of Image and Point Cloud
    Hosseinyalamdary, Siavash
    Yilmaz, Alper
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOP (ICCVW), 2015, : 175 - 183
  • [35] Research on BIM Database Based on Point Cloud Model
    Huang Nanxin
    Wang Jia
    2017 29TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2017, : 2733 - 2737
  • [36] Closure of discussion: Point-in-time and extreme-value probability simulation technique for engineering design
    Minguez, R.
    Guanche, Y.
    Mendez, F. J.
    STRUCTURAL SAFETY, 2014, 46 : 5 - 7
  • [37] A point-in-time comparison of the environmental impact of Jersey vs. Holstein milk production.
    Capper, J. L.
    Cady, R. A.
    JOURNAL OF DAIRY SCIENCE, 2010, 93 : 569 - 570
  • [39] Real-Time Point-Cloud-Based Haptic Rendering with Fast Contact Detection
    Iravani, Elham
    Talebi, Heidar Ali
    Zareinejad, Mohammad
    Dehghan, Mohammad Reza
    2016 4TH RSI INTERNATIONAL CONFERENCE ON ROBOTICS AND MECHATRONICS (ICROM), 2016, : 578 - 583
  • [40] RECOVERY TIME OF WILSON CLOUD CHAMBERS
    IVANOV, YS
    FEDOROV, VM
    INSTRUMENTS AND EXPERIMENTAL TECHNIQUES, 1964, (JUN) : 1039 - &