in reply to Seeking with 'x' in unpack and out of bounds reads
unpack, unlike regexes, allows partial matches. Eg, with $str = "abc", $str =~ /(.{2})*/; will match "ab" but unpack "(a2)*", "abc" will return "ab", "c". In the case of unpack, the second iteration of the sub pattern "a2" is partial. Without 'x', you can tell that a match is partial in unpack because the output is incompatible with the pattern. In my example, you expected 2 characters but got one, so this has to be a partial match. However, 'x' doesn't have a direct effect on the output, so a partial match involving 'x' must be communicated through an error.
Note that cases like unpack "(xa)*", "abcd"; don't die because this is not a partial match, the sub pattern "xa" matches exactly 2 times, which is a valid value for *. unpack "(ax)*", "abc"; does die because there is one full match, and then a partial match of the sub pattern (a has matched but not x). unpack "(xa)*", "abc"; still doesn't die because although the match is partial, it fails on 'a' which communicates the failure by returning an empty string (which is not a valid value for "a")
This means you can always tell if there was a partial match. If the pattern matched partially and failed on a token that isn't x, you will get an output that is incompatible with the pattern (eg '.' for a2). If the pattern matched partially because it couldn't skip a byte, it will die. In all other cases, the match was complete.
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re^2: Seeking with 'x' in unpack and out of bounds reads
by mxb (Pilgrim) on Apr 27, 2018 at 12:31 UTC | |
by Eily (Monsignor) on Apr 27, 2018 at 13:27 UTC | |
by vr (Curate) on Apr 27, 2018 at 15:36 UTC | |
by Eily (Monsignor) on Apr 27, 2018 at 15:56 UTC | |
by BillKSmith (Monsignor) on Apr 27, 2018 at 14:56 UTC |