in reply to Re: Extract a string from the line
in thread Extract a string from the line

> What LanX said. Do not do it with regex.

Yeah, but for clarification, I said "in most cases". :)

Sometimes the XML is just so static and restricted that using a full parser would be overkill.

The output of pdftohtml -xml is one example of that.

PS: if you want to allow optional whitespace, you might also want to add the /s modifier, so that "." matches newlines too.
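To illustrate the /s point, here is a minimal sketch. The sample string is made up (only loosely modelled on what pdftohtml -xml emits) and the pattern is my own assumption, not the regex from the original question, so adapt both to your real data.

```perl
use strict;
use warnings;

# Hypothetical sample line; the element content is deliberately
# broken across two lines to show the effect of /s.
my $xml = qq{<text top="10" left="20" font="0">Hello\nWorld</text>};

# Without /s, "." does not match the newline, so the match fails
# and $without stays undefined.
my ($without) = $xml =~ m{<text\b[^>]*>\s*(.*?)\s*</text>};

# With /s, "." also matches newlines, so the capture spans both lines.
my ($with) = $xml =~ m{<text\b[^>]*>\s*(.*?)\s*</text>}s;

print defined $without ? "without /s: $without\n" : "without /s: no match\n";
print "with /s: $with\n";
```

Note that \s and negated classes like [^>] match newlines with or without /s; the modifier only changes what "." matches.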

Cheers Rolf
(addicted to the Perl Programming Language :)