Z-algorithm¶
The Z-array of a string of length contains for each the length of longest substring of s that begins at position and is a prefix of .
Thus, tells us that equals .
Many string processing problems can be efficiently solved using Z-array.
Algorithm description¶
Next we describe an algorithm, called the Z-algorithm, that efficiently constructs the Z-array in time.
The algorithm calculates the Z-array values from left to right by both using information already stored in the Z-array and comparing substrings character by character.
To efficiently calculate the Z-array values, the algorithm maintains a range such that are equal, we can use this information when calculating Z-values for positions .
At each poisition , we first check the value of .
if , we know that .
However, if equals and to determine the value of we need to compare the substring character by character.
Still, the algorithm works in time, because we start comparing at positions and .
Using Z-array¶
It if often a matter of taste whether to use string hashing or the Z-algorithm.
Unlike hashing, the Z-algorithm always works and there is no risk for collisions.
On the other hand, Z-algorithm is more difficult to implement and some problems can only be solved using hashing.
Implementation¶
Here is a short impelmentation of the Z-algorithm that returns a vector that corresponds to the Z-array.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 |
|