Word info

UTF-8

Proper noun

Meaning

UTF-8

(computing) Unicode Transformation Format-8, a variable-width encoding scheme for Unicode characters, using sequences of one to four one-byte (eight-bit) code units per character.

Source: en.wiktionary.org

Examples

A BOM can also appear if another encoding with a BOM is translated to UTF-8 without stripping it. Source: Internet

Additional bits added by the UTF-8 encoding process are shown in black. Source: Internet

And ASCII bytes do not occur when encoding non-ASCII code points into UTF-8, making UTF-8 safe to use within most programming and document languages that interpret certain ASCII characters in a special way, e.g. as end of string. Source: Internet

As a result, text in (for example) Chinese, Japanese or Hindi will take more space in UTF-8 if there are more of these characters than there are ASCII characters. Source: Internet

Another factor contributing in the same direction, is the arrival of UTF-8 — which greatly diminishes the need for other encodings, and thus modern editors tends to default, as recommended by the HTML5 specification, citation to UTF-8. Source: Internet

Code points in Planes 1 through 16 (supplementary planes) are accessed as surrogate pairs in UTF-16 and encoded in four bytes in UTF-8. Source: Internet

Close letter words and terms