On Tuning the Bad-Character Rule: the Worst-Character Rule

Domenico Cantone; Simone Faro

arxiv: 1012.1338 · v1 · pith:ZPSHENS2new · submitted 2010-12-06 · 💻 cs.DS

On Tuning the Bad-Character Rule: the Worst-Character Rule

Domenico Cantone , Simone Faro This is my paper

classification 💻 cs.DS

keywords ruleworst-characterbad-charactercaseresultstextsaccordingachieves

0 comments

read the original abstract

In this note we present the worst-character rule, an efficient variation of the bad-character heuristic for the exact string matching problem, firstly introduced in the well-known Boyer-Moore algorithm. Our proposed rule selects a position relative to the current shift which yields the largest average advancement, according to the characters distribution in the text. Experimental results show that the worst-character rule achieves very good results especially in the case of long patterns or small alphabets in random texts and in the case of texts in natural languages.

This paper has not been read by Pith yet.

On Tuning the Bad-Character Rule: the Worst-Character Rule

discussion (0)