
Unicode, or Unicode (English Unicode) - a standard character encoding, which allows labels to provide virtually all written languages.
The standard proposed in 1991 a non-profit organization «Unicode Consortium» (engl. Unicode Consortium, Unicode Inc.), Which unites the largest IT-company. Applying this standard allows to encode a very large number of characters from different languages: in the documents can be side by side Unicode Chinese characters, mathematical symbols, Greek letters, Latin and Cyrillic script, while becoming redundant code pages.
The standard consists of two main parts: a universal character set (UCS, Universal Character Set) and a family of encodings (UTF, Unicode Transformation Format). The universal character set defines character codes-one correspondence - the elements of the code space of negative integers. Family encoding engine determines the sequence of codes UCS.
The codes in the Unicode standard is divided into several areas. Area codes from U +0000 to U +007 F contains a set of ASCII characters with the appropriate codes. Then there are the field marks of different languages, punctuation, and technical symbols. Some of the codes is reserved for future use. Under the Cyrillic characters are the area codes of the characters U +0400 to U +052 F, of U +2 DE0 to U +2 DFF, from U + A640 to U + A69F (see Cyrillic Unicode).
0 коммент.:
Отправить комментарий