in reply to Re: substitution in textual area of HTML file
in thread substitution in textual area of HTML file
Hi marto, nice script to start with. I changed it slightly so that it picks the text to replace with a regex match instead of by index.
#!/usr/bin/perl
use strict;
use warnings;
use feature 'say';
use Mojo::DOM;

my $html = '<ul><div class="txt" style="position:absolute; left:84px; top:73px;"><span id="f1" style="font-size:11px;vertical-align:baseline;color:rgba(0,0,0,1);">technology of S2S translation, also known as Spoken Language Translation (SLT),</span></div><div class="txt" style="position:absolute; left:44px; top:73px;"><span id="f1" style="font-size:11px;vertical-align:baseline;color:rgba(0,0,0,1);">is a new application of AI,</span></div><li>there</li><li>everyone</li></ul>';

my $dom = Mojo::DOM->new( $html );

# find each span tag
foreach my $id ( $dom->find('span')->each ){
    my $text = $id->text;
    if ( $text =~ /Spoken/ ){
        $text =~ s/Spoken/Derp!/g;
        # note: replace() swaps the whole <span> element for the new text
        $id->replace( $text );
    }
}

# print the updated DOM
say $dom->content;
Now I can start working on the second phase: matching n-grams across two adjacent DOMs.
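One thing to watch in the script above: replace() swaps the whole span element for the new text, so the id and style attributes are lost. If the markup has to survive, something along these lines might work. It is only a sketch: replace_in_text is a made-up helper name, not part of Mojo::DOM, and it assumes the matched elements contain plain text only (content() would overwrite any child elements).

```perl
#!/usr/bin/perl
use strict;
use warnings;
use feature 'say';
use Mojo::DOM;
use Mojo::Util 'xml_escape';

# Hypothetical helper (not from the thread): run a regex substitution on the
# text of every element matched by $selector and keep the element itself.
sub replace_in_text {
    my ( $dom, $selector, $pattern, $replacement ) = @_;
    my $changed = 0;
    for my $el ( $dom->find($selector)->each ) {
        my $text = $el->text;
        # s///g returns the number of substitutions, so no separate match test
        next unless $text =~ s/$pattern/$replacement/g;
        # content() parses its argument as HTML, so escape the plain text first
        $el->content( xml_escape($text) );
        $changed++;
    }
    return $changed;
}

my $dom = Mojo::DOM->new(
    '<div><span id="f1">Spoken Language Translation</span>'
  . '<span id="f2">is a new application of AI</span></div>'
);

my $count = replace_in_text( $dom, 'span', qr/Spoken/, 'Derp!' );
say "elements changed: $count";
say $dom;
```

Passing the pattern as a qr// object keeps the regex matching and makes the selector and replacement easy to vary between calls.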