Parallel XPath query based on cost optimization

Cited by: 3
Authors
Chen, Rongxin [1 ,2 ]
Wang, Zhijin [1 ]
Su, Hang [3 ]
Xie, Shutong [1 ]
Wang, Zongyue [1 ]
Affiliations
[1] Jimei Univ, Comp Engn Coll, Xiamen, Peoples R China
[2] Digital Fujian Big Data Modeling & Intelligent Co, Xiamen, Peoples R China
[3] Beijing Univ Technol, Coll Comp Sci, Beijing, Peoples R China
Source
JOURNAL OF SUPERCOMPUTING | 2022 / Vol. 78 / Issue 04
Keywords
XPath query; Relation index; Cost estimation; Load balancing; Parallel effectiveness;
DOI
10.1007/s11227-021-04074-y
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812 ;
Abstract
XPath query performance is a key factor in XML processing capacity. Parallel processing that makes full use of multi-threaded computing resources is an important way to improve it. However, when XPath queries are parallelized, load imbalance and thread inefficiency often degrade parallel performance. In this paper, we propose coPXQ, a parallel XPath query method based on cost optimization. The method improves the parallel processing of navigational XPath queries through a series of optimization measures. The main measures are as follows. First, the storage of the XML node relation index is optimized, which improves both the storage efficiency and the access efficiency of the index. Second, a new cost estimation method based on the number of XML node relations provides load balancing, which optimizes parallel relation index creation and parallel primitive execution. Third, the number of worker threads is determined by parallel effectiveness estimation, which ensures that threads are used effectively during a query. Experimental results show that our method achieves better parallel performance than existing typical methods.
Pages: 5420-5449
Page count: 30