Windows-1254 Explained

Windows-1254
Mime:windows-1254
Alias:cp1254 (Code page 1254)
By:Microsoft
Standard:WHATWG Encoding Standard
Lang:Turkish
Extends:ISO 8859-9 (without single-byte C1 controls)
Classification:extended ASCII, Windows-125x

Windows-1254 is a code page used under Microsoft Windows (and for the web), to write Turkish that it was designed for (and the vast majority of users use it for that language, even though it can also be used for some other languages). Characters with codepoints A0 through FF are compatible with ISO 8859-9, but the CR range, which is reserved for C1 control codes in ISO 8859, is instead used for additional characters (analogous to the relationship between ISO-8859-1 and Windows-1252). It matches Windows-1252 except for the replacement of six Icelandic characters (Ðð, Ýý, Þþ) with characters unique to the Turkish alphabet (Ğğ, İ, ı, Şş).

The WHATWG Encoding Standard, which specifies the character encodings which are permitted in HTML5 and which compliant browsers must support,[1] includes Windows-1254, which is used for both the Windows-1254 and ISO-8859-9 labels.[2] Unicode is preferred for modern applications; authors of new pages and the designers of new protocols are instructed to use UTF-8 instead.[3], less than 0.05% of all web pages use Windows-1254, and less than 0.05% use ISO-8859-9,[4] [5] which the WHATWG also requires web browsers to handle as Windows-1254. Since 2.2% of all websites located in Turkey use ISO-8859-9, plus the 1.3% that actually declare Windows-1254 used, in effect, 3.5% of websites there use Windows-1254.[6]

IBM uses code page 1254 (CCSID 1254 and euro sign extended CCSID 5350) for Windows-1254.[7] [8] [9]

Character set

The following table shows Windows-1254. Each character is shown with its Unicode equivalent.

See also

External links

Notes and References

  1. Web site: 8.2.2.3. Character encodings . HTML 5.1 2nd Edition . . User agents must support the encodings defined in the WHATWG Encoding standard, including, but not limited to […].
  2. Web site: Legacy single-byte encodings . Encoding Standard . van Kesteren . Anne . Anne van Kesteren . WHATWG.
  3. Web site: Names and labels . Encoding Standard . van Kesteren . Anne . Anne van Kesteren . WHATWG.
  4. Web site: Historical trends in the usage of character encodings for websites. w3techs.com.
  5. Web site: Frequently Asked Questions. w3techs.com.
  6. Web site: Distribution of character encodings among websites that use Turkey. w3techs.com. 2023-02-23.
  7. Web site: Code page 1254 information document. https://web.archive.org/web/20160303214103/http://www-01.ibm.com/software/globalization/cp/cp01254.html. 2016-03-03.
  8. Web site: CCSID 1254 information document. https://web.archive.org/web/20160326215218/http://www-01.ibm.com/software/globalization/ccsid/ccsid1254.html. 2016-03-26.
  9. Web site: CCSID 5350 information document. https://web.archive.org/web/20141129212606/http://www-01.ibm.com/software/globalization/ccsid/ccsid5350.html. 2014-11-29.