Latest inspiration occurred when I read that PHP didn't handle UTF-8 natively. I tried changing a few of the mbstring settings in my php.ini file and restarted apache. MBstrings refers to multibyte strings and looked promising. My phpinfo output looked like this:
mbstring
Multibyte Support enabled
Japanese support enabled
Simplified chinese support enabled
Traditional chinese support enabled
Korean support enabled
Russian support enabled
HTTP input encoding translation enabled
Multibyte (japanese) regex support enabled
Directive Local Value Master Value
mbstring.detect_order no value no value
mbstring.encoding_translation On On
mbstring.func_overload 0 0
mbstring.http_input UTF-8 UTF-8
mbstring.http_output UTF-8 UTF-8
mbstring.internal_encoding UTF-8 UTF-8
mbstring.language Neutral Neutral
mbstring.substitute_character no value no value
For the record this didn't work. Can anyone confirm they are having problems importing Unicode / UTF-8 characters in any other language? Maybe its just an issue with Chinese/multibyte characters, rather than an issue with Unicode in general.
Bookmarks