Boosting exact pattern matching with extreme gradient boosting (and more)

被引:0
|
作者
Susik, Robert [1 ]
Grabowski, Szymon [1 ]
机构
[1] Lodz Univ Technol, Inst Appl Comp Sci, Lodz, Poland
来源
JOURNAL OF SUPERCOMPUTING | 2025年 / 81卷 / 05期
关键词
Text matching; Algorithm selection; Machine learning; Gradient boosting; Pattern matching;
D O I
10.1007/s11227-025-07165-2
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Pattern matching is a well-known problem in computer science. Over the years, dozens of exact pattern matching algorithms have been developed. Clearly, search speed is usually the most important aspect, but it is difficult to tell which algorithm is fastest for a specific (given) pattern. Most applications, programming languages, and domain-specific tools maintain a single algorithm for exact pattern matching that may not be the best choice for all use cases. The key finding of this study is that the pattern itself contains information about which algorithm should be used to search for it. We take advantage of this fact to develop a solution that enables faster pattern searching by leveraging machine learning models to select the best-performing algorithm for a given pattern. The selection method uses machine learning models such as Random Forest, Extra Trees, AdaBoost, Bootstrap Aggregation, and Gradient Boosting. The proposed solution is online, i.e., does not require prior reading of the text and is based on the information extracted from the pattern. Experiments show that it is 11% faster than the fastest (on average) exact pattern matching algorithm.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] Investigation on eXtreme Gradient Boosting for cutting force prediction in milling
    Heitz, Thomas
    He, Ning
    Ait-Mlouk, Addi
    Bachrathy, Daniel
    Chen, Ni
    Zhao, Guolong
    Li, Liang
    JOURNAL OF INTELLIGENT MANUFACTURING, 2025, 36 (01) : 285 - 301
  • [32] EXTREME GRADIENT BOOSTING REGRESSION MODEL FOR SOIL THERMAL CONDUCTIVITY
    Yurttakal, Ahmet Hasim
    THERMAL SCIENCE, 2021, 25 : S1 - S7
  • [33] Investigation on eXtreme Gradient Boosting for cutting force prediction in milling
    Heitz, Thomas
    He, Ning
    Ait-Mlouk, Addi
    Bachrathy, Daniel
    Chen, Ni
    Zhao, Guolong
    Li, Liang
    JOURNAL OF INTELLIGENT MANUFACTURING, 2025, 36 (01) : 285 - 301
  • [34] Electricity Theft Detection Base on Extreme Gradient Boosting in AMI
    Yan, Zhongzong
    Wen, He
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2021, 70
  • [35] Metaheuristic Optimized Extreme Gradient Boosting Milling Maintenance Prediction
    Bozovic, Aleksandra
    Jovanovic, Luka
    Desnica, Eleonora
    Bacanin, Nebojsa
    Zivkovic, Miodrag
    Antonijevic, Milos
    Mani, Joseph P.
    FOURTH CONGRESS ON INTELLIGENT SYSTEMS, VOL 1, CIS 2023, 2024, 868 : 361 - 374
  • [36] Pavement aggregate shape classification based on extreme gradient boosting
    Pei, Lili
    Sun, Zhaoyun
    Yu, Ting
    Li, Wei
    Hao, Xueli
    Hu, Yuanjiao
    Yang, Chunmei
    CONSTRUCTION AND BUILDING MATERIALS, 2020, 256
  • [37] Power Grid Stability Identification Based on eXtreme Gradient Boosting
    Liu, Wei
    Sun, Yixin
    Chen, Guang
    Chen, Ruixin
    PROCEEDINGS OF 2019 IEEE 3RD INTERNATIONAL ELECTRICAL AND ENERGY CONFERENCE (CIEEC), 2019, : 1636 - 1642
  • [38] Fault Diagnosis of Centrifugal Chiller Based on Extreme Gradient Boosting
    Liu, Yaxiang
    Liang, Tao
    Zhang, Mengxin
    Jing, Nijie
    Xia, Yudong
    Ding, Qiang
    BUILDINGS, 2024, 14 (06)
  • [39] Predicting Systemic Banking Crises using Extreme Gradient Boosting
    Alaminos, D.
    Fernandez-Gamez, M. A.
    Santos, Jose Antonio C.
    Campos-Soria, J. A.
    JOURNAL OF SCIENTIFIC & INDUSTRIAL RESEARCH, 2019, 78 (09): : 571 - 575
  • [40] Extreme Gradient Boosting Regression Model for Soil Available Boron
    F. Gökmen
    V. Uygur
    E. Sukuşu
    Eurasian Soil Science, 2023, 56 : 738 - 746