ISO-IR-153 explained

ISO-IR-153
Standard: ST SEV 358-88, GOST R 34.303-92 (see below)
Languages: Russian, Bulgarian
Classification: Extended ASCII
Extensions: ISO-8859-5, IBM-1124, ISO-IR-200, ISO-IR-201
Preceded by: KOI8-B
Based on: Main code page

ISO-IR-153[1] (ST SEV 358-88) is an 8-bit character set that covers the Russian and Bulgarian alphabets. Unlike the KOI encodings, it lists the Cyrillic letters in their traditional alphabetical order. It became the basis for ISO/IEC 8859-5 and the Cyrillic Unicode block.
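The difference in letter order can be checked by sorting the alphabet by encoded byte value. The following is a minimal sketch, not a reference implementation. It assumes Python's built-in `iso8859_5` codec as a stand-in for ISO-IR-153 (reasonable only because ISO-8859-5 is a superset) and uses `koi8_r` as a representative KOI encoding. The helper name `sort_by_bytes` is made up for this illustration, and Ё/ё are left out because they sit outside the main letter runs.

```python
# Sketch: does byte order match alphabetical order?
# Assumption: "iso8859_5" stands in for ISO-IR-153 (ISO-8859-5 is a superset).
# Assumption: "koi8_r" is used as a representative KOI encoding.

# Russian lowercase letters а..я in alphabetical order (U+0430..U+044F), without ё.
ALPHABET = "".join(chr(cp) for cp in range(0x0430, 0x0450))


def sort_by_bytes(text: str, codec: str) -> str:
    """Return the characters of text ordered by their encoded byte values."""
    return "".join(sorted(text, key=lambda ch: ch.encode(codec)))


iso_order = sort_by_bytes(ALPHABET, "iso8859_5")
koi_order = sort_by_bytes(ALPHABET, "koi8_r")

print("ISO-8859-5 byte order is alphabetical:", iso_order == ALPHABET)
print("KOI8-R byte order is alphabetical:   ", koi_order == ALPHABET)
print("KOI8-R byte order:", koi_order)
```

A plain byte-wise sort therefore yields alphabetical order for these letters in the ISO-IR-153 layout, while the KOI layout (arranged to match Latin transliterations) does not.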

Standards and Naming

The name ISO-IR-153 refers to this set's number in the ISO-IR registry, and marks it as a set which may be used within ISO/IEC 2022.

ISO-IR-153 is a subset of ISO/IEC 8859-5 (synchronised with ECMA-113 since 1988).[2] The ISO-IR-153 documentation cites ST SEV 358-88 as the source standard. It also cites the earlier GOST 19768-74, which defines KOI-8 and to which the first version of ECMA-113 (i.e. ISO-IR-111) conformed. That citation appears to be in error, because ISO-IR-153 does not follow the KOI-8 layout and instead uses a close modification of the letter layout from the Main code page.[3] The ISO-IR-153 encoding was intended to replace GOST 19768-74, and is sometimes referred to as GOST-19768-87.[4] [5] This confusion has led to a common misconception that ISO-8859-5 was defined in or based on GOST 19768-74.

Regardless of how accurate these names are, the IANA lists GOST_19768-74 and ST_SEV_358-88 as labels which may be used for the ISO-IR-153 encoding on the Internet, with reference to RFC 1345, which assigns it those labels.[6] [7]

GOST R 34.303-92 includes the ISO-IR-153 code page and dubs it KOI-8 V1 (in addition to using KOI-8 N1 and KOI-8 N2 for two Alternative code page/Code page 866 variants).[8]

Character set

The following table shows the ISO-IR-153 encoding. Each character is shown with its equivalent Unicode code point.

The encoding closely resembles the letter subset of the Cyrillic part of the Main code page, apart from the relocation of the uppercase Ё from 0xF0 to 0xA1. ISO-8859-5 is a superset.
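The letter positions can be illustrated with a short sketch. It is not derived from the registry document itself. It assumes Python's built-in `iso8859_5` codec and relies on ISO-8859-5 being a superset, so the byte ranges shown (0xB0 to 0xEF for the main letters, 0xA1 and 0xF1 for Ё and ё) are those of ISO-8859-5.

```python
# Sketch: letter positions, decoded through the ISO-8859-5 superset.
# Assumption: Python's "iso8859_5" codec agrees with ISO-IR-153 for these bytes.
CODEC = "iso8859_5"

upper = bytes(range(0xB0, 0xD0)).decode(CODEC)  # А..Я
lower = bytes(range(0xD0, 0xF0)).decode(CODEC)  # а..я
yo_upper = b"\xa1".decode(CODEC)                # Ё, outside the main run
yo_lower = b"\xf1".decode(CODEC)                # ё

for label, text in (("0xB0-0xCF", upper), ("0xD0-0xEF", lower),
                    ("0xA1", yo_upper), ("0xF1", yo_lower)):
    code_points = " ".join(f"U+{ord(ch):04X}" for ch in (text[0], text[-1]))
    print(f"{label}: {text}  (first/last: {code_points})")

# For the main run, byte value and Unicode code point differ by a constant.
offsets = {ord(ch) - byte for byte, ch in zip(range(0xB0, 0xF0), upper + lower)}
print("byte-to-code-point offset:", [hex(o) for o in offsets])
```

The single constant offset for the 0xB0 to 0xEF range reflects the statement in the lead that this layout became the basis for the Cyrillic Unicode block.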

Notes and References

  1. ISO-IR-153 (1 December 1989).
  2. ECMA-113. Ecma International.
  3. Nechayev, Valentin (2013) [2001]. Review of 8-bit Cyrillic encodings universe. Archived 2016-12-05 at https://web.archive.org/web/20161205134629/http://segfault.kiev.ua/cyrillic-encodings/ . Retrieved 2016-12-05.
  4. Czyborra, Roman (1998-11-30) [1998-05-25]. The Cyrillic Charset Soup. Archived 2016-12-03 at https://web.archive.org/web/20161203230933/http://czyborra.com/charsets/cyrillic.html . Retrieved 2016-12-03. Quote: "[…] in the meantime GOST had inhaled some perestroika and declared the installed base and KOI correspondence less important and revised its 19768 standard from 1974 in 1987 into an incompatible new GOST 19768-87 […]"
  5. gost19768-87 TXT.GZ file.
  6. Character Sets. IANA.
  7. Simonsen, Keld (1992). Character Mnemonics & Character Sets. Requests for Comments. RFC 1345. doi:10.17487/rfc1345.
  8. 8-bit coded character sets. 8-bit code for information interchange.