Hence the idea to use punctuation as the structural element inherent to the text itself, to break it up into units that can be compared.
My approach would be to apply s/[\s\n]+/ /gs and s/([\.,:;\!\?])\s/$1\n/gs, then diff the resulting lines.
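A minimal sketch of that idea, for illustration only. The sub name punctuation_units and the sample strings are made up here, and the set-based comparison at the end is a crude stand-in for a real diff (it ignores order and repeated units). The character classes are simplified to \s+ and [.,:;!?], which should match the same text as the substitutions above.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Hypothetical helper: normalise whitespace, then break the text into
# units that each end at a punctuation mark followed by whitespace.
sub punctuation_units {
    my ($text) = @_;
    $text =~ s/\s+/ /g;                # collapse whitespace runs, newlines included
    $text =~ s/([.,:;!?])\s/$1\n/g;    # newline after punctuation + whitespace
    return split /\n/, $text;
}

# Made-up sample input: same text, reflowed, with one word changed.
my @old = punctuation_units("Hello, world.  This is\na test; it works.");
my @new = punctuation_units("Hello, world. This is a quiz; it works.");

# Naive comparison standing in for diff: report units unique to each side.
my %in_old = map { $_ => 1 } @old;
my %in_new = map { $_ => 1 } @new;
print "- $_\n" for grep { !$in_new{$_} } @old;
print "+ $_\n" for grep { !$in_old{$_} } @new;
```

The reflowed line break and the doubled space do not show up as differences; only the unit containing the changed word does. For real use one would write the units of each version to a file, one per line, and run diff on the two files.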
cheers,
--shmem
_($_=" "x(1<<5)."?\n".q·/)Oo. G°\ /
/\_¯/(q /
---------------------------- \__(m.====·.(_("always off the crowd"))."·
");sub _{s./.($e="'Itrs `mnsgdq Gdbj O`qkdq")=~y/"-y/#-z/;$e.e && print}
In reply to Re^3: web_diff.pl
by shmem
in thread web_diff.pl
by ciderpunx