UTF-8 Decoder

v1.2, 2018-08-20 by Robert Giordano

In the old days (circa 2009), everyone used an ASCII character set where each character was represented with a single byte. A more modern approach is to use the UTF-8 character set because it contains thousands of characters. For backwards compatibility with ASCII, the first 128 characters (0 to 127) are encoded the same. Characters above ASCII 127 are encoded with a multi-byte string. This simple decoder shows the individual bytes for these characters and the cooresponding escape code.

Character:


HTML Escape String:


Unicode Escape String:


Multi-byte (Hex):


Multi-byte (Decimal):