Analysis of Boyer-Moore-Horspool string-matching heuristic

被引:0
|
作者
Mahmoud, HM [1 ]
Smythe, RT [1 ]
Regnier, M [1 ]
机构
[1] INST NATL RECH INFORMAT & AUTOMAT,F-78153 LE CHESNAY,FRANCE
关键词
string matching; analysis of algorithms; central limit theorem;
D O I
10.1002/(SICI)1098-2418(199701/03)10:1/2<169::AID-RSA9>3.0.CO;2-T
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
e investigate the probabilistic behavior of a string-matching heuristic used for searching for the occurrences of a pattern in a random text. Our investigation covers the two cases when the pattern itself is fixed or random. Under suitable normalization we show that the total search time is asymptotically normally distributed in the case of fixed pattern, whereas in the case of random pattern the distribution of the search time becomes a mixture of degenerate distributions. An instrumental recurrence equation is obtained by shifting the pattern within the text. To handle the sum of dependent random variables appearing in the recurrence, analytic methods based on the behavior of the shift generating function near its dominant singularity in the complex plane are devised to yield moment calculation and the asymptotic distributions. Adaptation of the standard central limit theorem under mixing conditions complements our analytic toolkit. (C) 1997 John Wiley & Sons, Inc.
引用
收藏
页码:169 / 186
页数:18
相关论文
共 50 条