Flexible workload-aware clustering of XML documents

被引:0
|
作者
Bordawekar, R [1 ]
Shmueli, O
机构
[1] IBM TJ Watson Res Ctr, Hawthorne, NY 10532 USA
[2] Technion, Comp Sci Dept, IL-32000 Haifa, Israel
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We investigate workload-directed physical data clustering in native XML database and repository systems. We present a practical algorithm for clustering XML documents, called XC, which is based on Lukes' tree partitioning algorithm. XC carefully approximates certain aspects of Lukes' algorithm so as to substantially reduce memory and time usage. XC can operate with varying degrees of precision, even in memory constrained environments. Experimental results indicate that XC is a superior clustering algorithm in terms of partition quality, with only a slight overhead in performance when compared to a workload-directed depth-first scan and store scheme. We demonstrate that XC is substantially faster than the exact Lukes' algorithm, with only a minimal loss in clustering quality. Results also indicate that XC can exploit application workload information to generate XML clustering solutions that lead to major reduction in page faults for the workload under consideration.
引用
收藏
页码:204 / 218
页数:15
相关论文
共 50 条
  • [1] Towards optimal workload-aware XML to relational schema mapping
    Wang, Xiaoling
    Luan, Jinfeng
    Liu, Guimei
    Zhou, Aoying
    [J]. ANNALS OF OPERATIONS RESEARCH, 2009, 168 (01) : 133 - 150
  • [2] Runtime prediction of parallel applications with workload-aware clustering
    Park, Ju-Won
    Kim, Eunhye
    [J]. JOURNAL OF SUPERCOMPUTING, 2017, 73 (11): : 4635 - 4651
  • [3] Towards optimal workload-aware XML to relational schema mapping
    Xiaoling Wang
    Jinfeng Luan
    Guimei Liu
    Aoying Zhou
    [J]. Annals of Operations Research, 2009, 168 : 133 - 150
  • [4] Runtime prediction of parallel applications with workload-aware clustering
    Ju-Won Park
    Eunhye Kim
    [J]. The Journal of Supercomputing, 2017, 73 : 4635 - 4651
  • [5] Workload-Aware Column Imprints
    Slavitch, Noah
    [J]. SIGMOD'20: PROCEEDINGS OF THE 2020 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2020, : 2865 - 2867
  • [6] cCluster: A Core Clustering Mechanism for Workload-Aware Virtual Machine Scheduling
    Dehsangi, Mostafa
    Asyabi, Esmail
    Sharifi, Mohsen
    Azhari, Seyed Vahid
    [J]. 2015 3RD INTERNATIONAL CONFERENCE ON FUTURE INTERNET OF THINGS AND CLOUD (FICLOUD) AND INTERNATIONAL CONFERENCE ON OPEN AND BIG (OBD), 2015, : 248 - 255
  • [7] Workload-aware histograms for remote applications
    Malik, Tanu
    Burns, Randal
    [J]. DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2008, 5182 : 402 - +
  • [8] STHoles: A multidimensional workload-aware histogram
    Bruno, N
    Chaudhuri, S
    Gravano, L
    [J]. SIGMOD RECORD, 2001, 30 (02) : 211 - 222
  • [9] Workload-Aware Approximate Computing Configuration
    Ma, Dongning
    Thapa, Rahul
    Wang, Xingjian
    Jiao, Xun
    Hao, Cong
    [J]. PROCEEDINGS OF THE 2021 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2021), 2021, : 920 - 925
  • [10] Workload-Aware Periodic Interconnect BIST
    Sadeghi-Kohan, Somayeh
    Hellebrand, Sybille
    Wunderlich, Hans-Joachim
    [J]. IEEE DESIGN & TEST, 2024, 41 (04) : 50 - 55