Parallel XPath query based on cost optimization

被引:3
|
作者
Chen, Rongxin [1 ,2 ]
Wang, Zhijin [1 ]
Su, Hang [3 ]
Xie, Shutong [1 ]
Wang, Zongyue [1 ]
机构
[1] Jimei Univ, Comp Engn Coll, Xiamen, Peoples R China
[2] Digital Fujian Big Data Modeling & Intelligent Co, Xiamen, Peoples R China
[3] Beijing Univ Technol, Coll Comp Sci, Beijing, Peoples R China
来源
JOURNAL OF SUPERCOMPUTING | 2022年 / 78卷 / 04期
关键词
XPath query; Relation index; Cost estimation; Load balancing; Parallel effectiveness;
D O I
10.1007/s11227-021-04074-y
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The performance of XPath query is the key factor to the capacity of XML processing. It is an important way to improve the performance of XPath by making full use of multi-threaded computing resources for parallel processing. However, in the process of XPath parallelization, load imbalance and thread inefficiency often lead to the decline of parallel performance. In this paper, we propose a cost optimization-based parallel XPath query method named coPXQ. This method improves the parallel processing effect of navigational XPath query through a series of optimization measures. The main measures include as follows: first, by optimizing the storage of XML node relation index, both storage and access efficiency of the index are improved. Secondly, load balancing is realized by a new cost estimation method according to the number of XML node relations to optimize parallel relation index creation and parallel primitive execution. Thirdly, the strategy of determining the number of worker threads based on parallel effectiveness estimation is utilized to ensure the effective use of threads in query. Compared with the existing typical methods, the experimental results show that our method can obtain better parallel performance.
引用
收藏
页码:5420 / 5449
页数:30
相关论文
共 50 条
  • [21] Query Translation from SQL to XPath
    Vidhya, P. M.
    Samuel, Philip
    2009 WORLD CONGRESS ON NATURE & BIOLOGICALLY INSPIRED COMPUTING (NABIC 2009), 2009, : 1748 - +
  • [22] A fuzzy extension of the XPath query language
    Campi, Alessandro
    Damiani, Ernesto
    Guinea, Sam
    Marrara, Stefania
    Pasi, Gabriella
    Spoletini, Paola
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2009, 33 (03) : 285 - 305
  • [23] How XPath Query Minimization Impacts Query Processing Performance
    Lee, Ki-Hoon
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2012, E95D (09): : 2258 - 2264
  • [24] A fuzzy extension of the XPath query language
    Alessandro Campi
    Ernesto Damiani
    Sam Guinea
    Stefania Marrara
    Gabriella Pasi
    Paola Spoletini
    Journal of Intelligent Information Systems, 2009, 33 : 285 - 305
  • [25] A fuzzy extension for the XPath query language
    Campi, Alessandro
    Guinea, Sam
    Spoletini, Paola
    FLEXIBLE QUERY ANSWERING SYSTEMS, PROCEEDINGS, 2006, 4027 : 210 - 221
  • [26] Cost-based query optimization for multi reachability joins
    Cheng, Jiefeng
    Yu, Jeffrey Xu
    Ding, Bolin
    ADVANCES IN DATABASES: CONCEPTS, SYSTEMS AND APPLICATIONS, 2007, 4443 : 18 - +
  • [27] Cost-Based Query Optimization via AI Planning
    Robinson, Nathan
    McIlraith, Sheila A.
    Toman, David
    PROCEEDINGS OF THE TWENTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2014, : 2344 - 2351
  • [28] Query optimization technique for parallel databases
    College of Computer Sci. and Technol., Huazhong Univ. of Sci. and Technol., Wuhan 430074, China
    Huazhong Ligong Daxue Xuebao, 2006, 3 (11-13+20):
  • [29] XPath Query Technology of XML Data Stream Based on Structure Index
    Wei, Xianmin
    ADVANCED MATERIALS SCIENCE AND TECHNOLOGY, PTS 1-2, 2011, 181-182 : 103 - 108
  • [30] Flow algorithms for parallel query optimization
    Deshpande, Amol
    Hellerstein, Lisa
    2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2008, : 754 - +