UTF-8 Β· 1-4 bytes per code point
unseel.com Β· Pike & Thompson 1992 Β· self-synchronizing
Bytes β€”
Range β€”
Stage code points
ASCII (1 byte)
Latin/Greek (2 bytes)
BMP / CJK (3 bytes)
Supplementary (4 bytes)
Leading byte pattern
Unseel.com Β· UTF-8