If I compose a short line of text in my gedit (Linux equivalent for Notepad).
Nüne istá baßt alongnöw.and I use the simple code:
#!/usr/bin/perl -w # legaget.pl use strict; my $filename = "ligatext.txt"; open FILE, "<", $filename or die $1; while( my $line = <FILE> ) { print $line; } close(FILE);
My terminal does in fact show those ligatures. However, to grab a webpage.html off the web, that I might add is not W3.org compliant; they don't use the meta line encoding='iso-xxxx-x'> The browser shows the ligatures, the file saved of that webpage will show the ligatures in gedit word processor, but... the same code above will throw in the < ? > symbol with print to Terminal.
I might add I have fought this same thing with MSWord files, as MS puts the Unicode country code in the first byte of their Word.doc format as a hex.
Yes, I've read binmode <STDOUT> description, they don't show an example. Can you give me a short snippet, let me try it???? Maybe, Open a file, read a line at a time to print to Terminal?
In reply to Re^2: HTML::Parser, file, print to Terminal
by victor_charlie
in thread HTML::Parser, file, print to Terminal
by victor_charlie
| For: | Use: | ||
| & | & | ||
| < | < | ||
| > | > | ||
| [ | [ | ||
| ] | ] |