Boosting exact pattern matching with extreme gradient boosting (and more)

Times cited: 0
Authors
Susik, Robert [1]
Grabowski, Szymon [1]
Affiliations
[1] Lodz Univ Technol, Inst Appl Comp Sci, Lodz, Poland
Source
JOURNAL OF SUPERCOMPUTING | 2025 / Vol. 81 / Issue 05
Keywords
Text matching; Algorithm selection; Machine learning; Gradient boosting; Pattern matching;
DOI
10.1007/s11227-025-07165-2
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Pattern matching is a well-known problem in computer science. Over the years, dozens of exact pattern matching algorithms have been developed. Clearly, search speed is usually the most important aspect, but it is difficult to tell which algorithm is fastest for a specific (given) pattern. Most applications, programming languages, and domain-specific tools maintain a single algorithm for exact pattern matching that may not be the best choice for all use cases. The key finding of this study is that the pattern itself contains information about which algorithm should be used to search for it. We take advantage of this fact to develop a solution that enables faster pattern searching by leveraging machine learning models to select the best-performing algorithm for a given pattern. The selection method uses machine learning models such as Random Forest, Extra Trees, AdaBoost, Bootstrap Aggregation, and Gradient Boosting. The proposed solution is online, i.e., does not require prior reading of the text and is based on the information extracted from the pattern. Experiments show that it is 11% faster than the fastest (on average) exact pattern matching algorithm.
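The selection idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the feature set (length, number of distinct symbols, entropy), the two candidate algorithms, and the hand-written rule in select_algorithm are all assumptions made for illustration. In the paper's setting, a trained model such as Gradient Boosting would take the place of that rule. As in the abstract, the choice depends only on the pattern and is made before the text is read.

```python
import math
from collections import Counter
from typing import Callable, Dict, List


def pattern_features(pattern: str) -> Dict[str, float]:
    """Features computed from the pattern alone (the text is never inspected)."""
    counts = Counter(pattern)
    m = len(pattern)
    entropy = -sum((c / m) * math.log2(c / m) for c in counts.values())
    return {"length": float(m), "distinct": float(len(counts)), "entropy": entropy}


def naive_search(text: str, pattern: str) -> List[int]:
    """Check every alignment; no preprocessing cost."""
    m = len(pattern)
    return [i for i in range(len(text) - m + 1) if text[i:i + m] == pattern]


def horspool_search(text: str, pattern: str) -> List[int]:
    """Boyer-Moore-Horspool: skip alignments using a bad-character shift table."""
    m, n = len(pattern), len(text)
    # Shift for each symbol of pattern[:-1]; later occurrences overwrite earlier ones.
    shift = {ch: m - 1 - i for i, ch in enumerate(pattern[:-1])}
    hits, i = [], 0
    while i <= n - m:
        if text[i:i + m] == pattern:
            hits.append(i)
        # Symbols absent from the table allow a full shift of m.
        i += shift.get(text[i + m - 1], m)
    return hits


def select_algorithm(features: Dict[str, float]) -> Callable[[str, str], List[int]]:
    """Hypothetical stand-in for a trained classifier's prediction.

    The thresholds below are illustrative assumptions, not results from the paper.
    """
    if features["length"] >= 4 and features["distinct"] >= 2:
        return horspool_search
    return naive_search


def search(text: str, pattern: str) -> List[int]:
    """Pick an algorithm from the pattern only, then run it on the text."""
    if not pattern:
        raise ValueError("pattern must be non-empty")
    algorithm = select_algorithm(pattern_features(pattern))
    return algorithm(text, pattern)


if __name__ == "__main__":
    print(search("abracadabra", "abra"))
    print(search("aaaa", "aa"))
```

Because every candidate returns the same match positions, the selector only affects speed, never correctness, so a misprediction costs time but not accuracy.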
Pages: 15