Parallel XPath query based on cost optimization

被引:3
|
作者
Chen, Rongxin [1 ,2 ]
Wang, Zhijin [1 ]
Su, Hang [3 ]
Xie, Shutong [1 ]
Wang, Zongyue [1 ]
机构
[1] Jimei Univ, Comp Engn Coll, Xiamen, Peoples R China
[2] Digital Fujian Big Data Modeling & Intelligent Co, Xiamen, Peoples R China
[3] Beijing Univ Technol, Coll Comp Sci, Beijing, Peoples R China
来源
JOURNAL OF SUPERCOMPUTING | 2022年 / 78卷 / 04期
关键词
XPath query; Relation index; Cost estimation; Load balancing; Parallel effectiveness;
D O I
10.1007/s11227-021-04074-y
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The performance of XPath query is the key factor to the capacity of XML processing. It is an important way to improve the performance of XPath by making full use of multi-threaded computing resources for parallel processing. However, in the process of XPath parallelization, load imbalance and thread inefficiency often lead to the decline of parallel performance. In this paper, we propose a cost optimization-based parallel XPath query method named coPXQ. This method improves the parallel processing effect of navigational XPath query through a series of optimization measures. The main measures include as follows: first, by optimizing the storage of XML node relation index, both storage and access efficiency of the index are improved. Secondly, load balancing is realized by a new cost estimation method according to the number of XML node relations to optimize parallel relation index creation and parallel primitive execution. Thirdly, the strategy of determining the number of worker threads based on parallel effectiveness estimation is utilized to ensure the effective use of threads in query. Compared with the existing typical methods, the experimental results show that our method can obtain better parallel performance.
引用
收藏
页码:5420 / 5449
页数:30
相关论文
共 50 条
  • [31] Parallel XPath evaluation based on node relation matrix
    Chen, Rongxin
    Liao, Husheng
    Wang, Zongyue
    Journal of Computational Information Systems, 2013, 9 (19): : 7583 - 7592
  • [32] Communication Cost in Parallel Query Evaluation A Tutorial
    Suciu, Dan
    PODS'17: PROCEEDINGS OF THE 36TH ACM SIGMOD-SIGACT-SIGAI SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS, 2017, : 319 - 319
  • [33] XPath Query Relaxation through Rewriting Rules
    Fazzinga, Bettina
    Flesca, Sergio
    Furfaro, Filippo
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2011, 23 (10) : 1583 - 1600
  • [34] On the expressiveness of generalization rules for XPath query relaxation
    Fazzinga, Bettina
    Flesca, Sergio
    Furfaro, Filippo
    PROCEEDINGS OF THE FOURTEENTH INTERNATIONAL DATABASE ENGINEERING & APPLICATIONS SYMPOSIUM (IDEAS '10), 2010, : 157 - 168
  • [35] Semantic XPath query transformation: Opportunities and performance
    Le, Dung Xuan Thi
    Bressan, Stephane
    Taniar, David
    Rahayu, Wenny
    ADVANCES IN DATABASES: CONCEPTS, SYSTEMS AND APPLICATIONS, 2007, 4443 : 994 - +
  • [36] The complexity of XPath query evaluation and XML typing
    Gottlob, G
    Koch, C
    Pichler, R
    JOURNAL OF THE ACM, 2005, 52 (02) : 284 - 335
  • [37] An XPATH query processor for a decomposition storage scheme
    Ko, YK
    Hong, EK
    Kim, M
    WEB AND COMMUNICATION TECHNOLOGIES AND INTERNET-RELATED SOCIAL ISSUES - HSI 2003, 2003, 2713 : 197 - 206
  • [38] CostFed: Cost-Based Query Optimization for SPARQL Endpoint Federation
    Saleem, Muhammad
    Potocki, Alexander
    Soru, Tommaso
    Hartig, Olaf
    Ngomo, Axel-Cyrille Ngonga
    PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON SEMANTIC SYSTEMS, 2018, 137 : 163 - 174
  • [39] XPath query completeness verification algorithms in DAS
    Bao X.
    Jin Y.
    Journal of Networks, 2010, 5 (10) : 1143 - 1150
  • [40] From the modeling of parallel relational query processing to query optimization and simulation
    Brunie, Lionel
    Kosch, Harald
    Wohner, Wolfgang
    Parallel Processing Letters, 1998, 8 (01): : 51 - 62