CJK Compatibility explained

Rangestart:3300
Rangeend:33FF
1 0 0:187
1 1:62
4 0:7
Note:[1] [2]

CJK Compatibility is a Unicode block containing square symbols (both CJK and Latin alphanumeric) encoded for compatibility with East Asian character sets. In Unicode 1.0, it was divided into two blocks, named CJK Squared Words (U+3300–U+337F) and CJK Squared Abbreviations (U+3380–U+33FF).[3] The square forms can have different presentations when they are used in horizontal or vertical text. For example, the characters (from Japanese: ボルト) and (from Japanese: トン) should look different in horizontal and in vertical right-to-left:[4]

Characters U+337B through U+337E are the Japanese era calendar scheme symbols Heisei (㍻), Shōwa (㍼), Taishō (㍽) and Meiji (㍾) (also available in certain legacy sets, such as the "NEC special characters" extension for JIS X 0208, as included in Microsoft's version and later JIS X 0213).[5] The Reiwa era symbol is in Enclosed CJK Letters and Months (the CJK Compatibility block having been fully allocated by the time of its commencement).

Block

History

The following Unicode-related documents record the purpose and process of defining specific characters in the CJK Compatibility block:

See also

Notes and References

  1. Web site: Unicode character database. The Unicode Standard. 2023-07-26.
  2. Web site: Enumerated Versions of The Unicode Standard. The Unicode Standard. 2023-07-26.
  3. Web site: 3.8: Block-by-Block Charts . The Unicode Standard . version 1.0 . Unicode Consortium.
  4. Web site: Lunde . Ken . Ishi . Koji . Ken Lunde . UAX #50: Unicode Vertical Text Layout . www.unicode.org . 13 June 2024 . en . 2023-07-17.
  5. Web site: A Brief History of Japan's Era Name Ligatures . Lunde . Ken . Ken Lunde . CJK Type Blog . 2019-03-21 . Adobe Inc.