in reply to Re: HTML::TreeBuilder::XPath not loading the complete $page
in thread HTML::TreeBuilder::XPath not loading the complete $page
I looked line by line at the output html file containing $response->content
It seems that sometimes there's an internal error printed inside another HTML tag...
In other words, the $response->content goes as follows:
<html> <head> ... ... </head> <body> ... ... ... <DIV></DIV> # Div number 12 <HTML> <HEAD> </HEAD> <BODY> <p>You have an error blah blah blah</p> </BODY> </HTML> <DIV></DIV> # Div number 13 and so on until number 19 ... ... </body> </html>
The problem is that it's not inside any iframe or something like that, so the onl way out that comes to my mind is use a while to look for such issues and take them out, then regenerate the output and go on...
Does TreeBuilder have an option to avoid this things?? Google is not helping me
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re^3: HTML::TreeBuilder::XPath not loading the complete $page
by tangent (Parson) on Mar 19, 2013 at 21:00 UTC | |
by Lord Gartlar (Initiate) on Mar 19, 2013 at 21:26 UTC | |
|
Re^3: HTML::TreeBuilder::XPath not loading the complete $page
by Anonymous Monk on Mar 20, 2013 at 06:56 UTC |