Entropy-Based Approach in Selection Exact String-Matching Algorithms

被引:5
|
作者
Markic, Ivan [1 ]
Stula, Maja [2 ]
Zoric, Marija [3 ]
Stipanicev, Darko [2 ]
机构
[1] Univ Split, Fac Elect Engn Mech Engn & Naval Architecture, Split 21000, Croatia
[2] Univ Split, Dept Elect & Comp, Fac Elect Engn Mech Engn & Naval Architecture, Split 21000, Croatia
[3] Univ Split, Fac Elect Engn Mech Engn & Naval Architecture, IT Dept, Split 21000, Croatia
关键词
exact string-matching; algorithm efficiency; algorithm performance; entropy; comparison; testing framework;
D O I
10.3390/e23010031
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
The string-matching paradigm is applied in every computer science and science branch in general. The existence of a plethora of string-matching algorithms makes it hard to choose the best one for any particular case. Expressing, measuring, and testing algorithm efficiency is a challenging task with many potential pitfalls. Algorithm efficiency can be measured based on the usage of different resources. In software engineering, algorithmic productivity is a property of an algorithm execution identified with the computational resources the algorithm consumes. Resource usage in algorithm execution could be determined, and for maximum efficiency, the goal is to minimize resource usage. Guided by the fact that standard measures of algorithm efficiency, such as execution time, directly depend on the number of executed actions. Without touching the problematics of computer power consumption or memory, which also depends on the algorithm type and the techniques used in algorithm development, we have developed a methodology which enables the researchers to choose an efficient algorithm for a specific domain. String searching algorithms efficiency is usually observed independently from the domain texts being searched. This research paper aims to present the idea that algorithm efficiency depends on the properties of searched string and properties of the texts being searched, accompanied by the theoretical analysis of the proposed approach. In the proposed methodology, algorithm efficiency is expressed through character comparison count metrics. The character comparison count metrics is a formal quantitative measure independent of algorithm implementation subtleties and computer platform differences. The model is developed for a particular problem domain by using appropriate domain data (patterns and texts) and provides for a specific domain the ranking of algorithms according to the patterns' entropy. The proposed approach is limited to on-line exact string-matching problems based on information entropy for a search pattern. Meticulous empirical testing depicts the methodology implementation and purports soundness of the methodology.
引用
收藏
页码:1 / 19
页数:19
相关论文
共 50 条
  • [31] Bit-Parallel Algorithms for Exact Circular String Matching
    Chen, Kuei-Hao
    Huang, Guan-Shieng
    Lee, Richard Chia-Tung
    COMPUTER JOURNAL, 2014, 57 (05): : 731 - 743
  • [32] Entropy-based convergence rates of greedy algorithms
    Li, Yuwen
    Siegel, Jonathan W.
    MATHEMATICAL MODELS & METHODS IN APPLIED SCIENCES, 2024, 34 (05): : 779 - 802
  • [33] A generic approach to detect design patterns in model transformations using a string-matching algorithm
    Chihab eddine Mokaddem
    Houari Sahraoui
    Eugene Syriani
    Software and Systems Modeling, 2022, 21 : 1241 - 1269
  • [34] A generic approach to detect design patterns in model transformations using a string-matching algorithm
    Mokaddem, Chihab Eddine
    Sahraoui, Houari
    Syriani, Eugene
    SOFTWARE AND SYSTEMS MODELING, 2022, 21 (03): : 1241 - 1269
  • [35] Multiscale Fuzzy Entropy-Based Feature Selection
    Wang, Zhihong
    Chen, Hongmei
    Yuan, Zhong
    Wan, Jihong
    Li, Tianrui
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2023, 31 (09) : 3248 - 3262
  • [36] Entropy-based pattern matching for document image compression
    Zhang, Q
    Danskin, JM
    INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, PROCEEDINGS - VOL II, 1996, : 221 - 224
  • [37] New Entropy-Based Method for Gene Selection
    Mahmoodian, Hamid
    Marhaban, M. H.
    Rahim, R. Abdul
    Rosli, R.
    Saripan, I.
    IETE JOURNAL OF RESEARCH, 2009, 55 (04) : 162 - 168
  • [38] Relative entropy-based feature matching for image retrieval
    Shao, Y
    Celenk, M
    INTERNET IMAGING, 2000, 3964 : 70 - 78
  • [39] Robot Evaluation and Selection with Entropy-Based Combination Weighting and Cloud TODIM Approach
    Wang, Jing-Jing
    Miao, Zhong-Hua
    Cui, Feng-Bao
    Liu, Hu-Chen
    ENTROPY, 2018, 20 (05)
  • [40] Exact String Matching Algorithms: Survey, Issues, and Future Research Directions
    Hakak, Saqib Iqbal
    Kamsin, Amirrudin
    Shivakumara, Palaiahnakote
    Gilkar, Gulshan Amin
    Khan, Wazir Zada
    Imran, Muhammad
    IEEE ACCESS, 2019, 7 : 69614 - 69637