Sounds to me like you're thinking about something like a "web diff". The problem here is, the rules are not well established as for, say, diff and patch, and telling big from minor changes is not easy either. E.g. you could move block level elements around in the source and see no difference in rendering, the rules for that being in a css.
If you are interested in text only, it's easy I think. Text in normal files breaks at line ends, which is not the case in HTML. I'd suggest stripping the text from the HTML (with e.g. Tom Christiansens striphtml) removing empty lines and leading/trailing whitespace, jam it together and break it again at punctuation. The text between punctuations are your lines then, which you could run through diff.
If you are interested in markup changes/layout, you could compile the page into a DOM tree (e.g. with HTML::Tree), and compare it's content starting from the twigs.
Just some ideas...
--shmem
_($_=" "x(1<<5)."?\n".q·/)Oo. G°\ /
/\_¯/(q /
---------------------------- \__(m.====·.(_("always off the crowd"))."·
");sub _{s./.($e="'Itrs `mnsgdq Gdbj O`qkdq")=~y/"-y/#-z/;$e.e && print}
Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
Read Where should I post X? if you're not absolutely sure you're posting in the right place.
Please read these before you post! —
Posts may use any of the Perl Monks Approved HTML tags:
- a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
| |
For: |
|
Use: |
| & | | & |
| < | | < |
| > | | > |
| [ | | [ |
| ] | | ] |
Link using PerlMonks shortcuts! What shortcuts can I use for linking?
See Writeup Formatting Tips and other pages linked from there for more info.