applePi wrote: but shouldn't this be UTF-8 and not UTF-16?
since we chose UTF-8 as the encoding in PureBasic.
As everyone who posted before I completed this message said, the value is the same Unicode character (or codepoint) whether it is encoded as UTF-8 or UTF-16.
Looking at the example code you provided:
If the source code is encoded in UTF-8, the strings are converted to UTF-16 when the program is compiled, so the value displayed (i.e. 1071) is the value of the character as encoded in UTF-16. For this character that is a single 16-bit unit equal to the codepoint itself, 1071 ($042F).
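To make the point concrete, here is a small sketch in Python (not PureBasic, purely for illustration). It assumes the character behind 1071 is "Я" (U+042F), which is what that value corresponds to; the variable names are made up for this example. The bytes differ between the two encodings, but the codepoint stays the same.

```python
# Illustrative sketch only; variable names are hypothetical.
ch = "Я"  # CYRILLIC CAPITAL LETTER YA, codepoint U+042F (assumed from the value 1071)

code_point = ord(ch)                  # the Unicode codepoint, independent of encoding
utf8_bytes = ch.encode("utf-8")       # 2 bytes in UTF-8
utf16_bytes = ch.encode("utf-16-le")  # 2 bytes (one 16-bit unit) in UTF-16

# Read the single UTF-16 unit back as a number: it equals the codepoint.
utf16_unit = int.from_bytes(utf16_bytes, "little")

print(code_point)         # 1071
print(utf8_bytes.hex())   # d0af
print(utf16_bytes.hex())  # 2f04
print(utf16_unit)         # 1071
```

So a program that works on UTF-16 strings internally reports 1071 for this character, and decoding the UTF-8 bytes gives the same 1071.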
applePi wrote: too big a number?
any additional explanation about these numbers would be appreciated
Unicode codepoints can actually extend up to $10FFFF in hex (1,114,111 in decimal). The only complication in the UTF-16 encoding comes with values over $FFFF, because those codepoints take 4 bytes (two 16-bit units, called a surrogate pair) to encode instead of 2 bytes. I won't go into those details unless you are really interested in knowing.
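For anyone who does want those details, the following is a minimal sketch of how UTF-16 splits a codepoint above $FFFF into two 16-bit units. It is written in Python rather than PureBasic, and the function name `utf16_units` is a hypothetical helper invented for this example, not part of any library.

```python
# Illustrative sketch only; utf16_units is a hypothetical helper name.
def utf16_units(code_point):
    """Return the list of 16-bit UTF-16 units that encode one codepoint."""
    if not 0 <= code_point <= 0x10FFFF:
        raise ValueError("outside the Unicode range 0..$10FFFF")
    if 0xD800 <= code_point <= 0xDFFF:
        raise ValueError("surrogate values are reserved and are not characters")
    if code_point <= 0xFFFF:
        # Fits in one unit: the unit is simply the codepoint (2 bytes).
        return [code_point]
    # Above $FFFF: subtract $10000, leaving a 20-bit value,
    # then split it into two 10-bit halves (4 bytes in total).
    offset = code_point - 0x10000
    high = 0xD800 + (offset >> 10)    # top 10 bits
    low = 0xDC00 + (offset & 0x3FF)   # bottom 10 bits
    return [high, low]

print([hex(u) for u in utf16_units(1071)])      # ['0x42f']
print([hex(u) for u in utf16_units(0x1F600)])   # ['0xd83d', '0xde00']
print([hex(u) for u in utf16_units(0x10FFFF)])  # ['0xdbff', '0xdfff']
```

Values up to $FFFF come back as a single unit, so 1071 is unaffected; only the larger codepoints need the pair.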