The ISO basic Latin alphabet is an international standard (beginning with ISO/IEC 646) for a Latin-script alphabet that consists of two sets (uppercase and lowercase) of 26 letters, codified in various national and international standards and used widely in international communication. They are the same letters that comprise the current English alphabet. Since medieval times, they are also the same letters of the modern Latin alphabet. The order is also important for sorting words into alphabetical order.
The two sets contain the following 26 letters each:[1]
Uppercase letter set | A | B | C | D | E | F | G | H | I | J | K | L | M | N | O | P | Q | R | S | T | U | V | W | X | Y | Z | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Lowercase letter set | a | b | c | d | e | f | g | h | i | j | k | l | m | n | o | p | q | r | s | t | u | v | w | x | y | z |
By the 1960s it became apparent to the computer and telecommunications industries in the First World that a non-proprietary method of encoding characters was needed. The International Organization for Standardization (ISO) encapsulated the Latin script in their (ISO/IEC 646) 7-bit character-encoding standard. To achieve widespread acceptance, this encapsulation was based on popular usage. The standard was based on the already published American Standard Code for Information Interchange, better known as ASCII, which included in the character set the 26 × 2 letters of the English alphabet. Later standards issued by the ISO, for example ISO/IEC 8859 (8-bit character encoding) and ISO/IEC 10646 (Unicode Latin), have continued to define the 26 × 2 letters of the English alphabet as the basic Latin script with extensions to handle other letters in other languages.[1]
See main article: Basic Latin (Unicode block).
The Unicode block that contains the alphabet is called "C0 Controls and Basic Latin". Two subheadings exist:[2]
There are also another two sets in the Halfwidth and Fullwidth Forms block:[3]
In ASCII the letters belong to the printable characters and in Unicode since version 1.0 they belong to the block "C0 Controls and Basic Latin". In both cases, as well as in ISO/IEC 646, ISO/IEC 8859 and ISO/IEC 10646 they are occupying the positions in hexadecimal notation 41 to 5A for uppercase and 61 to 7A for lowercase.
Not case sensitive, all letters have code words in the ICAO spelling alphabet and can be represented with Morse code.
All of the lowercase letters are used in the International Phonetic Alphabet (IPA). In X-SAMPA and SAMPA these letters have the same sound value as in IPA.
See also: List of Latin-script alphabets. The list below only includes alphabets that include all the 26 letters but exclude:
Notable omissions due to these rules include Spanish, Esperanto, Filipino and German. The German alphabet is sometimes considered by tradition to contain only 26 letters (with (ä), (ö), (ü) considered variants and (ß) considered a ligature of (ſ) (long s) and (s)), but the current German orthographic rules include (ä), (ö), (ü), (ß) in the alphabet placed after (Z). In Spanish orthography, the letters (n) and (ñ) are distinct; the tilde is not considered a diacritic in this case.
Alphabet | Diacritic | Multigraphs (not constituting distinct letters) | Ligatures |
---|---|---|---|
Afrikaans alphabet | á, ä, é, è, ê, ë, í, î, ï, ó, ô, ö, ú, û, ü, ý | Digraphs ⟨aa⟩, ⟨ai⟩, ⟨ch⟩, ⟨ee⟩, ⟨ei⟩, ⟨eu⟩, ⟨gh⟩, ⟨ie⟩, ⟨nj⟩, ⟨ng⟩ ⟨oe⟩, ⟨oi⟩, ⟨oo⟩, ⟨ou⟩, ⟨sj⟩, ⟨tj⟩, ⟨ts⟩, ⟨ui⟩, ⟨uu⟩Trigraphs ⟨aai⟩, ⟨eeu⟩, ⟨oei⟩, ⟨ooi⟩ | ʼn (Napostrophe) |
Aragonese alphabet (Academia de l'Aragonés orthography) | á, é, í, ó, ú, ü, lꞏl | ⟨ch⟩, ⟨gu⟩, ⟨ll⟩, ⟨ny⟩, ⟨qu⟩, ⟨rr⟩, ⟨tz⟩ | |
Catalan alphabet | à, é, è, í, ï, ó, ò, ú, ü, ç, lꞏl | ⟨gu⟩, ⟨ig⟩, ⟨ix⟩, ⟨ll⟩, ⟨ny⟩, ⟨qu⟩, ⟨rr⟩, ⟨ss⟩ | |
Dutch alphabet | ä, é, è, ë, ï, ö, ü | The digraph ⟨ij⟩ is sometimes considered to be a separate letter. When that is the case, it usually replaces or is intermixed with ⟨y⟩. Other digraphs: ⟨aa⟩, ⟨ae⟩, ⟨ai⟩, ⟨au⟩, ⟨ch⟩, ⟨ee⟩, ⟨ei⟩, ⟨eu⟩, ⟨ie⟩, ⟨oe⟩, ⟨oi⟩, ⟨oo⟩, ⟨ou⟩, ⟨ui⟩, ⟨uu⟩ | |
English alphabet | ⟨sh⟩, ⟨ch⟩, ⟨ea⟩, ⟨ou⟩, ⟨th⟩, ⟨ph⟩, ⟨ng⟩ | æ, œ (both archaic) | |
French alphabet | à, â, ç, é, è, ê, ë, î, ï, ô, ù, û, ü, ÿ | ⟨ai⟩, ⟨au⟩, ⟨ei⟩, ⟨eu⟩, ⟨oi⟩, ⟨ou⟩, ⟨eau⟩, ⟨ch⟩, ⟨ph⟩, ⟨gn⟩, ⟨an⟩, ⟨am⟩, ⟨en⟩, ⟨em⟩, ⟨in⟩, ⟨im⟩, ⟨on⟩, ⟨om⟩, ⟨un⟩, ⟨um⟩, ⟨yn⟩, ⟨ym⟩, ⟨ain⟩, ⟨aim⟩, ⟨ein⟩, ⟨oin⟩, ⟨aî⟩, ⟨eî⟩ | æ (rare), œ(mandatory) |
Hmong Latin alphabet | ⟨bh⟩, ⟨bl⟩, ⟨ch⟩, ⟨dh⟩, ⟨dl⟩, ⟨gh⟩, ⟨hl⟩, ⟨hm⟩, ⟨hn⟩, ⟨jh⟩, ⟨kh⟩, ⟨ml⟩, ⟨nc⟩, ⟨nq⟩, ⟨nr⟩, ⟨nt⟩, ⟨nx⟩, ⟨ny⟩, ⟨ph⟩, ⟨pl⟩, ⟨qh⟩, ⟨rh⟩, ⟨th⟩, ⟨ts⟩, ⟨tx⟩, ⟨xy⟩, ⟨bhl⟩, ⟨dhl⟩, ⟨hml⟩, ⟨hny⟩, ⟨nch⟩, ⟨ndl⟩, ⟨ngh⟩, ⟨nrh⟩, ⟨nth⟩, ⟨nxh⟩, ⟨phl⟩, ⟨tsh⟩, ⟨txh⟩, ⟨ndhl⟩ | ||
Italian alphabet (extended) | à, è, é, ì, î (formal), ò, ó, ù | ⟨ch⟩, ⟨ci⟩, ⟨gh⟩, ⟨gi⟩, ⟨gl⟩, ⟨gli⟩, ⟨gn⟩, ⟨sc⟩, ⟨sci⟩ | |
Ido alphabet | ⟨qu⟩, ⟨ch⟩, ⟨sh⟩ | ||
Indonesian alphabet | ⟨kh⟩, ⟨ng⟩, ⟨ny⟩, ⟨sy⟩, diphthongs: ⟨ai⟩, ⟨au⟩, ⟨ei⟩, ⟨oi⟩ | ||
Interlingua alphabet | ⟨ch⟩, ⟨ph⟩, ⟨qu⟩, ⟨rh⟩, ⟨sh⟩ | ||
Javanese Latin alphabet | é, è | ⟨dh⟩, ⟨kh⟩, ⟨ng⟩, ⟨ny⟩, ⟨sy⟩, ⟨th⟩ | |
Latino sine flexione alphabet* | ⟨ae⟩, ⟨ch⟩, ⟨oe⟩, ⟨ph⟩, ⟨qu⟩, ⟨rh⟩, ⟨th⟩ [8] | ||
Luxembourgish alphabet | ä, é, ë | ⟨aa⟩, ⟨ch⟩, ⟨ck⟩, ⟨ee⟩, ⟨ei⟩, ⟨ie⟩, ⟨ii⟩, ⟨ng⟩, ⟨oo⟩, ⟨ou⟩, ⟨qu⟩, ⟨ue⟩, ⟨uu⟩, ⟨sch⟩ | |
Malay alphabet | ⟨gh⟩, ⟨kh⟩, ⟨ng⟩, ⟨ny⟩, ⟨sy⟩ | ||
Portuguese alphabet | ⟨ch⟩, ⟨lh⟩, ⟨nh⟩, ⟨rr⟩, ⟨ss⟩, ⟨am⟩, ⟨em⟩, ⟨im⟩, ⟨om⟩, ⟨um⟩, ⟨ãe⟩, ⟨ão⟩, ⟨õe⟩ | ||
Sundanese Latin alphabet | é | ⟨eu⟩, ⟨ng⟩, ⟨ny⟩ | |
Xhosa alphabet | ⟨bh⟩, ⟨ch⟩, ⟨dl⟩, ⟨dy⟩, ⟨dz⟩, ⟨gc⟩, ⟨gq⟩, ⟨gr⟩, ⟨gx⟩, ⟨hh⟩, ⟨hl⟩, ⟨kh⟩, ⟨kr⟩, ⟨krh⟩, ⟨lh⟩, ⟨mh⟩, ⟨nc⟩, ⟨ng⟩, ⟨ngʼ⟩, ⟨ngc⟩, ⟨ngh⟩, ⟨ngq⟩, ⟨ngx⟩, ⟨nh⟩, ⟨nkc⟩, ⟨nkq⟩, ⟨nkx⟩, ⟨nq⟩, ⟨nx⟩, ⟨ny⟩, ⟨nyh⟩, ⟨ph⟩, ⟨qh⟩, ⟨rh⟩, ⟨sh⟩, ⟨th⟩, ⟨ths⟩, ⟨thsh⟩, ⟨ts⟩, ⟨tsh⟩, ⟨ty⟩, ⟨tyh⟩, ⟨wh⟩, ⟨xh⟩, ⟨yh⟩, ⟨zh⟩ | ||
Zulu alphabet | ⟨bh⟩, ⟨ch⟩, ⟨dl⟩, ⟨dy⟩, ⟨gc⟩, ⟨gq⟩, ⟨gx⟩, ⟨hh⟩, ⟨hl⟩, ⟨kh⟩, ⟨kl⟩, ⟨mb⟩, ⟨nc⟩, ⟨ng⟩, ⟨ngc⟩, ⟨ngq⟩, ⟨ngx⟩, ⟨nj⟩, ⟨nk⟩, ⟨nq⟩, ⟨ntsh⟩, ⟨nx⟩, ⟨ny⟩, ⟨ph⟩, ⟨qh⟩, ⟨sh⟩, ⟨th⟩, ⟨ts⟩, ⟨tsh⟩, ⟨xh⟩ |
The Roman (Latin) alphabet is commonly used for column numbering in a table or chart. This avoids confusion with row numbers using Arabic numerals. For example, a 3-by-3 table would contain columns A, B, and C, set against rows 1, 2, and 3. If more columns are needed beyond Z (normally the final letter of the alphabet), the column immediately after Z is AA, followed by AB, and so on (see bijective base-26 system). This can be seen by scrolling far to the right in a spreadsheet program such as Microsoft Excel or LibreOffice Calc.
These are double-digit "letters" for table columns, in the same way that 10 through 99 are double-digit numbers. The Greek alphabet has a similar extended form that uses such double-digit letters if necessary, but it is used for chapters of a fraternity as opposed to columns of a table.
Such double-digit letters for bullet points are AA, BB, CC, etc., as opposed to the number-like place value system explained above for table columns.