CJK Unified Ideographs Extension C explained

Rangestart:2A700
Rangeend:2B73F
Script1:Han
5 2:4149
14 0:4
15 0:1
Note:[1] [2]

__FORCETOC__CJK Unified Ideographs Extension C is a Unicode block containing rare and historic CJK ideographs for Chinese, Japanese, Korean, and Vietnamese submitted to the Ideographic Research Group between 2002 and 2006, plus five "urgently needed" characters added in Unicode versions 14.0 and 15.0, some of which had previously been mistakenly unified with other characters.

The block has dozens of ideographic variation sequences registered in the Unicode Ideographic Variation Database (IVD).[3] [4] These sequences specify the desired glyph variant for a given Unicode character.

Note that the Katakana ligature (U+2A708) has been erroneously encoded in this block as a Han character.[5]

Block

History

The following Unicode-related documents record the purpose and process of defining specific characters in the CJK Unified Ideographs Extension C block:

Notes and References

  1. Web site: Unicode character database. The Unicode Standard. 2023-07-26.
  2. Web site: Enumerated Versions of The Unicode Standard. The Unicode Standard. 2023-07-26.
  3. Web site: Ideographic Variation Database. Unicode Consortium.
  4. Web site: UTS #37, Unicode Ideographic Variation Database. Unicode Consortium.
  5. https://hc.jsecs.org/irg/ws2021/app/?id=00020 IRG Working Set 2021