ISO/IEC 8859-2 explained

ISO/IEC 8859-2
Mime:ISO-8859-2
Alias:iso-ir-101, csISOLatin2, latin2, l2, IBM1111
Standard:ECMA-94

1986, ISO/IEC 8859

Lang:(see below)
Extends:US-ASCII
Basedon:ISO-8859-1
Otherrelated:Windows-1250, MacCroatian
Classification:Extended ASCII, ISO/IEC 8859

ISO/IEC 8859-2:1999, Information technology — 8-bit single-byte coded graphic character sets — Part 2: Latin alphabet No. 2, is part of the ISO/IEC 8859 series of ASCII-based standard character encodings, first edition published in 1987. It is informally referred to as "Latin-2". It is generally intended for Central[1] or "Eastern European" languages that are written in the Latin script. Note that ISO/IEC 8859-2 is very different from code page 852 (MS-DOS Latin 2, PC Latin 2) which is also referred to as "Latin-2" in Czech and Slovak regions.[2] Almost half the use of the encoding is for Polish, and it's the main legacy encoding for Polish, while virtually all use of it has been replaced by UTF-8 (on the web).

ISO-8859-2 is the IANA preferred charset name for this standard when supplemented with the C0 and C1 control codes from ISO/IEC 6429. Less than 0.04% of all web pages use ISO-8859-2 as of October 2022.[3] [4] Microsoft has assigned code page 28592 a.k.a. Windows-28592 to ISO-8859-2 in Windows. IBM assigned code page 912 to ISO 8859-2,[5] until that code page was extended in 1999.[6] Code page 1111 is similar, but replaces byte B0 ° (degree sign) with U+02DA ˚ (ring above).

Windows-1250 is similar to ISO-8859-2 and has all the printable characters it has and more. However a few of them are rearranged (unlike Windows-1252, which keeps all printable characters from ISO-8859-1 in the same place).

Language coverage

These code values can be used for the following languages:

Code page layout

Differences from ISO-8859-1 have the Unicode code point number underneath.

See also

External links

8-Bit Single Byte Coded Graphic Character Sets - Latin Alphabets No. 1 to No. 4 2nd edition (June 1986)

Notes and References

  1. Web site: Microsoft Outlook Message Encodings. 10 January 2017 .
  2. Web site: The Czech and Slovak Character Encoding Mess Explained . 2022-02-27 . luki.sdf-eu.org.
  3. Web site: Usage Statistics and Market Share of ISO-8859-2 for Websites, October 2022 . 2022-10-23 . w3techs.com.
  4. Web site: Historical trends in the usage statistics of character encodings for websites, February 2022.
  5. Web site: Icu-data/Charset/Data/XML/Ibm-912_P100-1995.XML at main · unicode-org/Icu-data . .
  6. Web site: Icu-data/Charset/Data/Ucm/Ibm-912_P100-1999.ucm at main · unicode-org/Icu-data . .