UTF-8

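Code points U+0000 to U+FFFF take at most three bytes in UTF-8; code points U+10000 to U+10FFFF take four bytes, which is the maximum.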

https://en.wikipedia.org/wiki/UTF-8

UTF-8 encodes each of the 1,112,064 valid code points in the Unicode code space (1,114,112 code points minus 2,048 surrogate code points) using one to four 8-bit bytes (a group of 8 bits is known as an octet in the Unicode Standard). 
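The byte counts and the code point total quoted above can be checked with a small sketch. The helper name `utf8_length` and the constant `VALID_CODE_POINTS` are made up for this note and are not part of any library; the range boundaries follow the standard UTF-8 layout, and Python's built-in `str.encode` is used only as a cross-check.

```python
def utf8_length(code_point: int) -> int:
    """Return how many bytes UTF-8 uses for one code point (hypothetical helper)."""
    if not 0 <= code_point <= 0x10FFFF:
        raise ValueError("outside the Unicode code space")
    if 0xD800 <= code_point <= 0xDFFF:
        raise ValueError("surrogate code points cannot be encoded in UTF-8")
    if code_point <= 0x7F:      # U+0000..U+007F: ASCII range
        return 1
    if code_point <= 0x7FF:     # U+0080..U+07FF
        return 2
    if code_point <= 0xFFFF:    # U+0800..U+FFFF: rest of the Basic Multilingual Plane
        return 3
    return 4                    # U+10000..U+10FFFF: supplementary planes


# 1,114,112 code points in total, minus the 2,048 surrogates.
VALID_CODE_POINTS = 0x110000 - (0xDFFF - 0xD800 + 1)

# Cross-check the helper against Python's own encoder on a few sample code points.
for cp in (0x41, 0xE9, 0x4E2D, 0xFFFF, 0x10000, 0x1F600, 0x10FFFF):
    assert utf8_length(cp) == len(chr(cp).encode("utf-8"))
```

For example, `utf8_length(0x4E2D)` (the character 中) falls in the three-byte range, while anything above U+FFFF, such as an emoji, needs four bytes.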

posted @ 2017-01-26 16:01  papering