hexdump -C etest.txt
00000000  57 65 72 20 42 61 72 62  61 72 61 20 6c 69 76 65  |Wer Barbara live|
00000010  20 65 72 6c 65 62 65 6e  20 6d c3 b6 63 68 74 65  | erleben m..chte|
00000020  2c 20 68 61 74 20 69 6e  20 4d c3 bc 6e 63 68 65  |, hat in M..nche|
00000030  6e 20 69 6d 6d 65 72 20  77 69 65 64 65 72 20 64  |n immer wieder d|
00000040  69 65 20 47 65 6c 65 67  65 6e 68 65 69 74 2c 20  |ie Gelegenheit, |
00000050  73 69 65 20 73 69 6e 67  65 6e 20 7a 75 20 68 c3  |sie singen zu h.|
00000060  b6 72 65 6e 2e 20 42 65  73 6f 6e 64 65 72 65 20  |.ren. Besondere |
00000070  41 75 66 74 72 69 74 74  65 20 77 65 72 64 65 20  |Auftritte werde |
00000080  69 63 68 20 61 62 20 73  6f 66 6f 72 74 20 69 6d  |ich ab sofort im|
00000090  20 41 6e 73 63 68 6c 75  c3 9f 20 61 6e 20 64 69  | Anschlu.. an di|
000000a0  65 20 45 6e 67 65 6c 77  6f 72 74 65 20 61 6e 6b  |e Engelworte ank|
000000b0  c3 bc 6e 64 69 67 65 6e  2e 0a 0a                 |..ndigen...|
000000bb

The above is a cut-and-paste of text from webpage.html, saved as etest.txt. Both the HTML and the txt file show the same missing characters.

I did read several things about UTF-8. I suppose my confusion is this: if I create the file myself, I get my Latin-1; if I didn't create the file, there is only ASCII.
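The bytes in the dump can be checked directly. Below is a minimal sketch (written in Python purely for illustration; `decode_report` is a made-up helper name, not something from this thread) that takes a few byte runs copied from the hexdump above and tries to read them as UTF-8, as Latin-1, and as plain ASCII:

```python
# Byte runs copied by hand from the hexdump above.
SAMPLES = {
    "moechte":  bytes.fromhex("6d c3 b6 63 68 74 65"),
    "Muenchen": bytes.fromhex("4d c3 bc 6e 63 68 65 6e"),
    "Anschluss": bytes.fromhex("41 6e 73 63 68 6c 75 c3 9f"),
}


def decode_report(raw: bytes) -> dict:
    """Hypothetical helper: show how the same bytes read under each encoding."""
    report = {
        # Latin-1 maps every byte to a character, so this never fails.
        "latin1": raw.decode("latin-1"),
        # Pure ASCII only if no byte has the high bit set.
        "is_ascii": all(b < 0x80 for b in raw),
    }
    try:
        report["utf8"] = raw.decode("utf-8")
    except UnicodeDecodeError:
        report["utf8"] = None  # not valid UTF-8
    return report


if __name__ == "__main__":
    for label, raw in SAMPLES.items():
        print(label, raw.hex(" "), decode_report(raw))
```

If the dump is accurate, the sequences c3 b6, c3 bc and c3 9f are the two-byte UTF-8 encodings of ö, ü and ß, which would suggest the file is UTF-8 rather than Latin-1 or plain ASCII. Read as Latin-1, the same bytes come out as two characters each (for example "mÃ¶chte"). The dots in the right-hand column of `hexdump -C` are only its placeholder for bytes outside printable ASCII, so they do not by themselves mean the characters are missing from the file.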


In reply to Re^4: HTML::Parser, file, print to Terminal by victor_charlie
in thread HTML::Parser, file, print to Terminal by victor_charlie
