$ cat junk.html <ul><li>The following Host Names were requested from a host database:< +/li> <ul> <li>192.5.5.241</li> . . . </ul></ul> $ cat jonk.xsh open --format html "junk.html"; # ls --indent /; for //ul { pwd; for ./li { pwd; print text(); }; echo; }; echo; $ xsh -q jonk.xsh /html/body/ul /html/body/ul/li The following Host Names were requested from a host database: /html/body/ul/ul /html/body/ul/ul/li 192.5.5.241
See also xpather.pl/htmltreexpather.pl which can give you paths to start with, and all the links here Re: Retrieve select information from HTML, they're examples(for tree-xpath and others)/walkthroughs/tutorials ... XML::XSH2/https://metacpan.org/pod/distribution/XML-XSH2/XSH2.pod#open,
In reply to Re: question about lookaheads and threatexpert/html parsing
by Anonymous Monk
in thread question about lookaheads and threatexpert/html parsing
by ejc1
| For: | Use: | ||
| & | & | ||
| < | < | ||
| > | > | ||
| [ | [ | ||
| ] | ] |