Fast exact string pattern-matching algorithms adapted to the characteristics of the medical language

被引:16
|
作者
Lovis, C
Baud, RH
机构
[1] Univ Hosp Geneva, Div Med Informat, CH-1211 Geneva 4, Switzerland
[2] Puget Sound Hlth Care Syst, Seattle, WA USA
关键词
D O I
10.1136/jamia.2000.0070378
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective: The authors consider the problem of exact string pattern matching using algorithms that do not require any preprocessing. To choose the most appropriate algorithm, distinctive features of the medical language must be taken into account. The characteristics of medical language are emphasized in this regard, the best algorithm of those reviewed is proposed, and detailed evaluations of time complexity for processing medical texts are provided. Design: The authors first illustrate and discuss the techniques of various string pattern-matching algorithms. Next, the source code and the behavior of representative exact string pattern-matching algorithms are presented in a comprehensive manner to promote their implementation. Detailed explanations of the use of various techniques to improve performance are given. Measurements: Real-time measures of time complexity with English medical texts are presented. They lead to results distinct from those found in the computer science literature, which are typically computed with normally distributed texts. Results: The Boyer-Moore-Horspool algorithm achieves the best overall results when used with medical texts. This algorithm usually performs at least twice as fast as the other algorithms tested. Conclusion: The time performance of exact string pattern matching can be greatly improved ii: an efficient algorithm is used. Considering the growing amount of text handled in the electronic patient record, it is worth implementing this efficient algorithm.
引用
收藏
页码:378 / 391
页数:14
相关论文
共 50 条
  • [21] EFFICIENT RANDOMIZED PATTERN-MATCHING ALGORITHMS.
    Karp, Richard M.
    Rabin, Michael O.
    IBM Journal of Research and Development, 1987, 31 (02): : 249 - 260
  • [22] Pattern-matching and text-compression algorithms
    Crochemore, M
    Lecroq, T
    ACM COMPUTING SURVEYS, 1996, 28 (01) : 39 - 41
  • [23] STOCHASTIC OPTIMIZATION ALGORITHMS - AN APPLICATION TO PATTERN-MATCHING
    BENGTSSON, M
    PATTERN RECOGNITION LETTERS, 1990, 11 (11) : 717 - 724
  • [24] A PRACTICAL METHOD FOR IMPLEMENTING STRING PATTERN-MATCHING MACHINES
    AOE, JI
    INFORMATION SCIENCES, 1992, 64 (1-2) : 95 - 114
  • [25] A simple fast hybrid pattern-matching algorithm
    Franek, Frantisek
    Jennings, Christopher G.
    Smyth, W. F.
    JOURNAL OF DISCRETE ALGORITHMS, 2007, 5 (04) : 682 - 695
  • [26] FAST 2-DIMENSIONAL PATTERN-MATCHING
    BAEZAYATES, R
    REGNIER, M
    INFORMATION PROCESSING LETTERS, 1993, 45 (01) : 51 - 57
  • [27] VLSI NEURAL NETWORK FOR FAST PATTERN-MATCHING
    GRAF, HP
    HUBBARD, W
    NEURAL NETWORKS FROM MODELS TO APPLICATIONS, 1989, : 725 - 732
  • [28] A simple fast hybrid pattern-matching algorithm
    Franek, F
    Jennings, CG
    Smyth, WF
    COMBINATORIAL PATTERN MATCHING, PROCEEDINGS, 2005, 3537 : 288 - 297
  • [29] On a Comprehensive Implementation and Performance Evaluation Framework of Fast Algorithms for Exact String Matching
    Cakeri, Dion
    Karras, Dimitrios A.
    Proceedings - 2022 International Conference on Computing, Networking, Telecommunications and Engineering Sciences Applications, CoNTESA 2022, 2022, : 23 - 30
  • [30] STRING OVERLAPS, PATTERN-MATCHING, AND NON-TRANSITIVE GAMES
    GUIBAS, LJ
    ODLYZKO, AM
    JOURNAL OF COMBINATORIAL THEORY SERIES A, 1981, 30 (02) : 183 - 208