in reply to HTML entities converted to Non-Latin-1 format...
Can you give us an example of what you want to do with the HTML (i.e. sample HTML input and the output you are trying to achieve)? I'm not sure that you really want to just get rid of them. But if that is what you want, the following will convert the code points to Latin-1, removing any code points that Latin-1 cannot represent:
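    use Encode;
    # The coderef is called for each character Latin-1 cannot represent;
    # returning '' drops that character from the output.
    my $latin1 = encode('iso-8859-1', $code_points, sub { '' });

However, as I said, I think your problem might be handled better in a different way.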
Re^2: HTML entities converted to Non-Latin-1 format...
by vishNugupt (Novice) on Apr 20, 2008 at 19:40 UTC
by clinton (Priest) on Apr 20, 2008 at 20:52 UTC