in reply to Re: web::scraper and regex
in thread web::scraper and regex

Web::Scraper resp. the underlying HTML::Selector::XPath doesn't understand XPath regular expressions - they came in with XPath version 2.0 which it doesn't support. The correct syntax for a contains() query would be:

:) FWIW/AFAIK http://www.w3.org/TR/xpath/#function-contains doesn't take a regex, it only takes strings, but I could be reading that wrong

OTOH, perl regex are supported HTML::TreeBuilder::XPath and regular expressions

Replies are listed 'Best First'.
Re^3: web::scraper and regex
by Corion (Patriarch) on Sep 07, 2015 at 08:39 UTC

    I'm sorry I was unclear - I didn't expect regular expressions but most of the time I'm content with matching substrings.

    I have to reinvestigate how to use Perl regular expressions in HTML::Selector::XPath and how these could be passed on downwards to HTML::TreeBuilder::XPath, thanks!