Godsrock37 has asked for the wisdom of the Perl Monks concerning the following question:
Fellow Perlmonks...
I have come across an issue when pulling data down from a site that uses the UTF-8 character set. The site might have a line like this: "something something Resistance Ω"
Now when I pull it down and put it into a database it gives me "funky symbols." What character set does Perl use? At this point it doesn't help me to go back, regrab the content, and convert it: there are 230K pages, and the data is already in a database that used the latin-1 charset (I've since converted it to UTF-8 in an attempt to fix my issue, to no avail). I'm pulling it out with PHP to do some work on it, but I think if I knew what Perl might have done with it I might have more success converting it.
Any thoughts? To be completely honest I could be barking up the wrong tree and it might have nothing to do with perl converting it at all... but I'm running out of ideas here.
Thanks for any and all help
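
For readers following along, here is a minimal sketch of one common way this kind of "funky symbols" output arises. It assumes (the post does not confirm this) that the fetched UTF-8 bytes were never decoded, so Perl treated each byte as a Latin-1 character, and that they were then encoded as UTF-8 a second time on the way into storage. The helper names `double_encode` and `repair_double_encoding` are hypothetical and exist only for illustration; `decode` and `encode` come from the core `Encode` module.

```perl
use strict;
use warnings;
use Encode qw(decode encode);

# Raw bytes as fetched from a UTF-8 page.
# "Ω" (U+03A9) is the two bytes 0xCE 0xA9 in UTF-8.
my $fetched = "Resistance \xCE\xA9";

# Hypothetical helper: simulate undecoded bytes being treated as
# Latin-1 characters and then written out as UTF-8 again.
sub double_encode {
    my ($bytes) = @_;
    return encode('UTF-8', $bytes);
}

# Hypothetical helper: undo exactly one round of the double encoding above.
sub repair_double_encoding {
    my ($bytes) = @_;
    my $latin1_chars = decode('UTF-8', $bytes);             # Latin-1 code points
    my $original     = encode('ISO-8859-1', $latin1_chars); # original UTF-8 bytes
    return decode('UTF-8', $original);                      # real character string
}

my $text     = decode('UTF-8', $fetched);    # what the fetch should have produced
my $mangled  = double_encode($fetched);      # assumed state of the stored data
my $repaired = repair_double_encoding($mangled);

printf "undecoded length: %d\n", length $fetched;  # each byte counted as a character
printf "decoded length:   %d\n", length $text;     # the omega counted once
printf "mangled bytes:    %s\n",
    join ' ', map { sprintf '%02X', ord } split //, $mangled;
printf "repair matches:   %s\n", ($repaired eq $text ? 'yes' : 'no');
```

If the stored rows really were encoded twice, the omega would show up as two accented or symbol characters instead of one, and the repair step would bring it back. If the data was damaged some other way (for example, truncated or replaced with `?`), this round trip will not recover it, so it is worth trying on a copy of a few rows first.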
Re: A Character Set Enquiry
by pc88mxer (Vicar) on Jul 10, 2008 at 21:06 UTC
by moritz (Cardinal) on Jul 10, 2008 at 21:21 UTC
by ysth (Canon) on Jul 11, 2008 at 03:38 UTC

Re: A Character Set Enquiry
by moritz (Cardinal) on Jul 10, 2008 at 21:09 UTC
by Godsrock37 (Sexton) on Jul 11, 2008 at 12:50 UTC
by moritz (Cardinal) on Jul 13, 2008 at 16:38 UTC
by Godsrock37 (Sexton) on Jul 11, 2008 at 14:18 UTC
by massa (Hermit) on Jul 11, 2008 at 16:07 UTC

Re: A Character Set Enquiry
by waba (Monk) on Jul 11, 2008 at 17:15 UTC