First char | Last char | Number of bits | Binary sequence |
\u0000 | \u007F | 7 | 0xxxxxxx |
\u0080 | \u07FF | 11 | 110xxxxx 10xxxxxx |
\u0800 | \uFFFF | 16 | 1110xxxx 10xxxxxx 10xxxxxx |
é (e with acute accent) in Latin-1
0xE9 = 1110 1001, i.e. \u00E9, which falls in the \u0080 to \u07FF row (2 bytes, 11 bits: 000 1110 1001)
first byte (prefix 110 + top 5 of the 11 bits)
xxx0 0011
second byte (prefix 10 + low 6 bits)
xx10 1001
1100 0011 and 1010 1001
0xC3 and 0xA9
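The same bit-shuffling can be sketched in Python. This is a minimal illustration covering only the three rows of the table above; the function name `utf8_encode` is made up for this note, and the result is checked against Python's built-in `str.encode`.

```python
def utf8_encode(cp: int) -> bytes:
    """Encode one code point by hand, using only the 1-, 2- and 3-byte rows.

    Illustrative sketch: it does not handle code points above 0xFFFF and
    does not reject surrogates (0xD800 to 0xDFFF) the way a real encoder does.
    """
    if cp < 0:
        raise ValueError("code point must be non-negative")
    if cp <= 0x7F:
        # 0xxxxxxx: 7 payload bits, single byte
        return bytes([cp])
    if cp <= 0x7FF:
        # 110xxxxx 10xxxxxx: top 5 bits, then low 6 bits
        return bytes([0b11000000 | (cp >> 6),
                      0b10000000 | (cp & 0b111111)])
    if cp <= 0xFFFF:
        # 1110xxxx 10xxxxxx 10xxxxxx: top 4 bits, middle 6, low 6
        return bytes([0b11100000 | (cp >> 12),
                      0b10000000 | ((cp >> 6) & 0b111111),
                      0b10000000 | (cp & 0b111111)])
    raise ValueError("code point outside the range covered by the table")


# Latin-1 byte 0xE9 decodes to the character é (code point 0xE9)
ch = b"\xe9".decode("latin-1")
print(utf8_encode(ord(ch)))                         # b'\xc3\xa9'
print(utf8_encode(ord(ch)) == ch.encode("utf-8"))   # True
```

The two output bytes are 0xC3 and 0xA9, matching the hand calculation.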
above example in little endian