Type: | Non-profit consortium |
Unicode Consortium | |
Native Name: | Unicode, Inc. |
Founding Location: | California, US |
Location City: | Mountain View, California, US |
Coordinates: | 37.4118°N -122.071°W |
Status: | 501(c)(3)[1] California nonprofit benefit corporation |
Tax Id: | 77-0269756 |
Employees: | 3[2] |
Employees Year: | 2018 |
Volunteers: | 10 |
Volunteers Year: | 2018 |
Revenue: | $467,576 |
Revenue Year: | 2018 |
Expenses: | $470,257 |
Expenses Year: | 2018 |
Purpose: | "To develop, extend and promote use of various standards, data, and open source software libraries which specify the representation of text in modern software[,] ... allowing data to be shared across multiple platforms, languages and countries without corruption" |
The Unicode Consortium (legally Unicode, Inc.) is a 501(c)(3) non-profit organization incorporated and based in Mountain View, California, U.S.[3] Its primary purpose is to maintain and publish the Unicode Standard which was developed with the intention of replacing existing character encoding schemes that are limited in size and scope, and are incompatible with multilingual environments.
The consortium describes its overall purpose as:
Unicode's success at unifying character sets has led to its widespread adoption in the internationalization and localization of software.[4] The standard has been implemented in many technologies, including XML, the Java programming language, Swift, and modern operating systems.[5]
Members are usually but not limited to computer software and hardware companies with an interest in text-processing standards,[6] including Adobe, Apple, the Bangladesh Computer Council, Emojipedia, Facebook, Google, IBM, Microsoft, the Omani Ministry of Endowments and Religious Affairs, Monotype Imaging, Netflix, Salesforce, SAP SE, Tamil Virtual Academy, and the University of California, Berkeley.[7] [8] [9] Technical decisions relating to the Unicode Standard are made by the Unicode Technical Committee (UTC).[10]
The project to develop a universal character encoding scheme called Unicode was initiated in 1987 by Joe Becker, Lee Collins, and Mark Davis.[11] [12] The Unicode Consortium was incorporated in California on January 3, 1991,[13] with the stated aim to develop, extend, and promote the use of the Unicode Standard.[14] Mark Davis was the president of the Unicode Consortium from when the Consortium was incorporated in 1991 until 2023, when he changed roles to CTO.[15]
The Unicode Consortium cooperates with many standards development organizations, including ISO/IEC JTC 1/SC 2 and W3C.[16] While Unicode is often considered equivalent to ISO/IEC 10646, and the character sets are essentially identical, the Unicode standard imposes additional restrictions on implementations that ISO/IEC 10646 does not.[17] Apart from The Unicode Standard (TUS) and its annexes (UAX), the Unicode Consortium also maintains the CLDR, collaborated with the IETF on IDNA,[18] [19] and publishes related standards (UTS), reports (UTR), and utilities.[20] [21]
The group selects the emoji icons used by the world's smartphones, based on submissions from individuals and organizations who present their case with evidence for why each one is essential.[22]
The Unicode Technical Committee (UTC) meets quarterly to decide whether new characters will be encoded. A quorum of half of the Consortium's full members is required.[23]
As of May 2024, there are nine full members, all of which are tech companies: Adobe, Airbnb, Apple, Google, Meta, Microsoft, Netflix, Salesforce, and Translated.[24]
The UTC accepts documents from any organization or individual, whether they are members of the Unicode Consortium or not.[25] [26] The UTC holds its meetings behind closed doors.[27] As of July 2020, the UTC rules on both emoji and script proposals at the same meeting.
Due to the COVID-19 pandemic's effect on travel, the meetings, which used to be hosted on the campuses of various tech companies who would open their doors to the Consortium for free, were in 2020 held online via Zoom,[28] although the discussions remain confidential.
The UTC prefers to work by consensus, but on particularly contentious issues, votes may be necessary.[29] After it meets, the UTC releases a public statement on each proposal it considered. Due to the volume of incoming proposals, various subcommittees, such as the Script Ad Hoc Group and Emoji Subcommittee, exist to submit recommendations to the full UTC en banc.[30] The UTC is under no obligation to heed these recommendations, although in practice it usually does.
The Unicode Consortium maintains a History of Unicode Release and Publication Dates.
Publications include: