In order to allow the automatic detection of the byte order, it has become customary on some platforms (notably Win32) to start every Unicode file with the character U+FEFF (ZERO WIDTH NO-BREAK SPACE), also known as the Byte-Order Mark (BOM). Its byte-swapped equivalent U+FFFE is not a valid Unicode character, therefore it helps to unambiguously distinguish the Bigendian and Littleendian variants of UTF-16 and UTF-32.
0xFE 0xFF 0x4E 0x2D
0xFF 0xFE 0x2D 0x4E
0x4E 0x2D
U+0000 - U+007F:0xxxxxxx
U+0080 - U+07FF:110xxxxx 10xxxxxx
U+0800 - U+FFFF:1110xxxx 10xxxxxx 10xxxxxx
11100100 10111000 10101101 = 0xE4 0xB8 0xAD
0xEF 0xBB 0xBF 0xE4 0xB8 0xAD
0xE4 0xB8 0xAD
歡迎光臨 PALMisLIFE 討論區 (http://f.pil.tw/) | Powered by Discuz! X2.5 |