Beta Code was a method of representing, using only ASCII characters, the characters, accents, and formatting found in ancient Greek texts (and other ancient languages). Its aim was to be not merely a romanization of the Greek alphabet, but to represent faithfully a wide variety of source texts – including formatting as well as rare or idiosyncratic characters. For most applications, it has been obsoleted by Unicode.
Beta Code was developed by David W. Packard in the late 1970s and adopted by Thesaurus Linguae Graecae in 1981. It became the standard for encoding polytonic Greek and was also used by a number of other projects such as the Perseus Project, the Packard Humanities Institute, the Duke collection of Documentary Papyri, and the Greek Epigraphy Project at Cornell and Ohio State University. Beta Code can be easily converted to a variety of systems for display, most notably Unicode.[1] Most of these projects have since converted their data to Unicode. For example, Perseus originally encoded all its Ancient Greek texts using Beta code,[2] but now releases them as Unicode.
Α |
| α | A | ||
Β |
| β | B | ||
Γ |
| γ | G | ||
Δ |
| δ | D | ||
Ε |
| ε | E | ||
Ϝ |
| ϝ | V | ||
Ζ |
| ζ | Z | ||
Η |
| η | H | ||
Θ |
| θ | Q | ||
Ι |
| ι | I | ||
Κ |
| κ | K | ||
Λ |
| λ | L | ||
Μ |
| μ | M | ||
Ν |
| ν | N | ||
Ξ |
| ξ | C | ||
Ο |
| ο | O | ||
Π |
| π | P | ||
Ρ |
| ρ | R | ||
Σ |
| σ | S, S1 | ||
ς | S, S2, J | ||||
Ϲ |
| ϲ | S3 | ||
Τ |
| τ | T | ||
Υ |
| υ | U | ||
Φ |
| φ | F | ||
Χ |
| χ | X | ||
Ψ |
| ψ | Y | ||
Ω |
| ω | W |
. | . | Period | |
, | , | Comma | |
· | Colon (Ano Stigme) | ||
Question Mark | |||
’ | ' | Apostrophe | |
‐ | - | Hyphen | |
— | _ | Dash | |
ʹ | Numeral (Keraia) |
̓ | ) | ἐν | E)N | ||
̔ | ( | ὁ, οἱ | O(OI( | ||
́ | / | πρός | PRO/S | ||
͂ | = | τῶν | TW=N | ||
̀ | \ | πρὸς | PRO\S | ||
̈ | + | προϊέναι | PROI+E/NAI | ||
ͅ | | | τῷ | TW= | ||
̄ | & | μαχαίρᾱς | MAXAI/RA&S | ||
̆ | ' | μάχαιρᾰ | MA/XAIRA' |
These are normally postfix operators, as in the examples above, but for capitalized words, accents come between the * and the letter. The documentation does not otherwise define a required or canonical order for accents. In some data sources, breathing is normally seen before a tonal accent,[3] and one implementation defines a canonical order of breathing, accent, iota subscript.[4] In some implementations, the ordering of the accents can determine the arrangement of the combining characters after conversion to Unicode.[5] Many implementations do not implement macronization.