See also: Inputting Esperanto text on computers. There are two conventional sets ASCII substitutions for the letters in the Esperanto alphabet that have diacritics, as well as a number of graphic work-arounds.
The diacritics of Esperanto were designed with a French manual typewriter in mind, as French was the international language at the time Esperanto was developed. French typewriters have a dead key for the circumflex that can be used in combination with any other key. In handwritten Esperanto, the diacritics pose no problem. However, since the Esperanto letters with diacritics do not appear on standard computer keyboard layouts (French computer keyboards, unlike manual typewriters, typically assign the circumflex only to letters that bear it in French orthography), various alternative methods have been devised for inputting them or substituting them in type. The original method, suggested by Zamenhof for people who did not have access to a French typewriter, was a set of digraphs in h, now known as the "Zamenhof-system" or "h-system". With the rise of computer word processing, the so-called "x-system" has become equally popular. With the advent of Unicode and more easily customized computer keyboards, the need for such workarounds has lessened.
There are two alternative orthographies in common use, which replace the circumflex letters with either h digraphs or x digraphs. Another system sometimes noted is a 'QWXY system'; this is a carry-over from an early Esperanto keyboard app named, with which the Q W X and Y keys were assigned to the letters (ĥ), (ŭ), (ŝ), (ĵ), and the key sequences TX and DY to the letters (ĉ) and (ĝ).[1] There are also graphic work-arounds such as approximating the circumflexes with carets.
H-system | |
Type: | Alphabet |
Also Known As: | H-sistemo |
Creator: | L. L. Zamenhof |
Date: | 1888[2] |
Iso15924 Note: | [IETF] eo-hsistemo[3] |
Ipa-Note: | none |
The original method of working around the diacritics was developed by the creator of Esperanto himself, L. L. Zamenhof. He recommended using Esperanto: u in place of Esperanto: ŭ, and digraphs with Esperanto: h for the circumflex letters. For example, Esperanto: ŝ is replaced by Esperanto: sh, as in Esperanto: shanco for Esperanto: ŝanco (chance). Where proper orthography has Esperanto: sh, the letters should be separated with an apostrophe or a hyphen, as in Esperanto: ses-hora (six-hour) or Esperanto: flug'haveno (airport).[4]
Unfortunately, simplistic ASCII-based rules for sorting words fail badly when sorting h-digraphs, because lexicographically words in Esperanto: ĉ should follow all words in Esperanto: c and precede words in Esperanto: d. The word Esperanto: ĉu should be placed after Esperanto: ci, but sorted in the h-system, Esperanto: chu would appear before Esperanto: ci.
X-system | |
Type: | Alphabet |
Also Known As: | X-sistemo, x-kodo |
Date: | by 1962[5] |
Iso15924 Note: | [IETF] eo-xsistemo[6] |
Ipa-Note: | none |
A more recent system for typing in Esperanto is the so-called "x-system", which uses Esperanto: x instead of Esperanto: h for the digraphs, including Esperanto: ux for Esperanto: ŭ. For example, Esperanto: ŝ is represented by Esperanto: sx, as in Esperanto: sxi for Esperanto: ŝi and Esperanto: sxanco for Esperanto: ŝanco.
X-digraphs solve those problems of the h-system:
The x-system has become as popular as the h-system, but it has long been perceived as being contrary to the Fundamento de Esperanto. However, in its 2007 decision, the Akademio de Esperanto has issued general permission for the use of surrogate systems for the representation of the diacritical letters of Esperanto, under the condition that this is being done only "when the circumstances do not permit the use of proper diacritics, and when due to a special need the h-system fixed in the Fundamento is not convenient."[7] This provision covers situations such as using the x-system as a technical solution (to store data in plain ASCII) yet still displaying proper Unicode characters to the end user.
A practical problem of digraph substitution that the x-system does not completely resolve is in the complication of bilingual texts. Esperanto: Ux for Esperanto: ŭ is especially problematic when used alongside French text, because many French words end in French: aux or French: eux. Aux, for example, is a word in both languages (Esperanto: aŭ in Esperanto). Any automatic conversion of the text will alter the French words as well as the Esperanto. A few English words like "auxiliary" and "Euxine" can also suffer from such search-and-replace routines. One common solution, such as the one used in Wikipedia's MediaWiki software, is to use Esperanto: xx to escape the Esperanto: ux to Esperanto: ŭ conversion, e.g. "Esperanto: auxx" produces "aux".[8] [9] A few people have also proposed using "Esperanto: vx" instead of "Esperanto: ux" for Esperanto: ŭ to resolve this problem, but this variant of the system is rarely used.
Ĉ = Cy
Ĝ = Gy
Ĥ = X
Ĵ = Jy
Ŝ = Sy
Ŭ = W
For example: eĥoŝanĝoj ĉiuĵaŭde ("echo-change every Thursday") becomes "exosyangyoj cyiujyawde".
There are several ad hoc workarounds used in email or on the internet, where the proper letters are often not supported, as seen also in non-ASCII orthographies such as German. These "slipped-hat" conventions make use of the caret (^) or greater than sign (>) to represent the circumflex. For example, ŝanco may be written ^sanco, s^anco, or s>anco.[10] However, they have generally fallen out of favor. Before the internet age, had proposed shifting the caret onto the following vowel, since French circumflex vowels are supported in printing houses. That is, one would write ehôsângôj cîujâude for the nonsense phrase eĥoŝanĝoj ĉiuĵaŭde ("echo-change every Thursday").[11] However, this proposal has never been adopted.