in reply to Re^2: How to parse not closed HTML tags that don't have any attributes? (updated)
in thread How to parse not closed HTML tags that don't have any attributes?
no matter what I do, the <br/> element in it always gets removed, so it seems to me that I cannot search inside the address string.
I can't reproduce this (see the output in the [[]]s below), and you haven't said what your expected output is or what you mean by "search inside the address string" - see How do I post a question effectively? and Short, Self-Contained, Correct Example. As an example, I can replace the <br/> like so:
use warnings; use strict; use Mojo::DOM; use Mojo::Util qw/trim/; my $dom = Mojo::DOM->new(<<'HTML'); <div class="address"> <div class="icon"></div> <address> Sample Street 123<br/>45678 Randomcity </address> </div> HTML my $addr = $dom->find('.address address')->first; print "[[$addr]]\n"; $addr->find('br')->map('replace',"\n"); print "[", trim($addr->text), "]\n"; __END__ [[<address> Sample Street 123<br>45678 Randomcity </address>] +] [Sample Street 123 45678 Randomcity]
Edit: Forgot to remove the "(updated)" from the node's title before it got a reply. At the time of writing this node and its reply were not actually updated.
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re^4: How to parse not closed HTML tags that don't have any attributes? (updated)
by Rantanplan (Novice) on Mar 07, 2021 at 16:40 UTC | |
by haukex (Archbishop) on Mar 07, 2021 at 17:47 UTC | |
by Rantanplan (Novice) on Mar 08, 2021 at 14:39 UTC | |
by haukex (Archbishop) on Mar 08, 2021 at 16:06 UTC | |
by Rantanplan (Novice) on Mar 09, 2021 at 13:39 UTC |