Avestan | |
Time: | 400 - 1000 CE |
Fam1: | Egyptian hieroglyphs[1] |
Sample: | Bodleian_J2_fol_175_Y_28_1.jpg |
Unicode: | U+10B00 - U+10B3F |
Iso15924: | Avst |
The Avestan alphabet (Avestan: {{script|Avst| transliteration:, Middle Persian: transliteration:, transcription: Pahlavi: dēn dēbīrē, Persian: دین دبیره|translit=din dabire) is a writing system developed during Iran's Sasanian era (226 - 651 CE) to render the Avestan language.
As a side effect of its development, the script was also used for Pazend, a method of writing Middle Persian that was used primarily for Zend commentaries on the texts of the Avesta. In the texts of Zoroastrian tradition, the alphabet is referred to as "the religion's script" (Pahlavi: dēn dibīrih in Middle Persian and Persian: din dabireh in New Persian).
The development of the Avestan alphabet was initiated by the need to represent recited Avestan language texts correctly. The various text collections that today constitute the canon of Zoroastrian scripture are the result of a collation that occurred in the 4th century, probably during the reign of Shapur II (309 - 379). It is likely that the Avestan alphabet was an ad hoc[2] innovation related to this—"Sassanid archetype"—collation.
The enterprise, "which is indicative of a Mazdean revival and of the establishment of a strict orthodoxy closely connected with the political power, was probably caused by the desire to compete more effectively with Buddhists, Christians, and Manicheans, whose faith was based on a revealed book". In contrast, the Zoroastrian priesthood had for centuries been accustomed to memorizing scripture—following by rote the words of a teacher-priest until they had memorized the words, cadence, inflection and intonation of the prayers. This they passed on to their pupils in turn, so preserving for many generations the correct way to recite scripture. This was necessary because the priesthood considered (and continue to consider) precise and correct enunciation and cadence a prerequisite of effective prayer. Further, the recitation of the liturgy was (and is) accompanied by ritual activity that leaves no room to attend to a written text.
The ability to correctly render Avestan did, however, have a direct benefit: By the common era, the Avestan language words had almost ceased to be understood, which led to the preparation of the Zend texts (from Avestan zainti "understanding"), commentaries on and translations of the canon. The development of the Avestan alphabet allowed these commentaries to interleave quotation of scripture with explanation thereof. The direct effect of these texts was a "standardized" interpretation of scripture that survives to the present day. For scholarship these texts are enormously interesting, since they occasionally preserve passages that have otherwise been lost.
The 9th - 12th century texts of Zoroastrian tradition suggest that there was once a much larger collection of written Zoroastrian literature, but these texts—if they ever existed—have since been lost, and it is hence not known what script was used to render them. The question of the existence of a pre-Sassanid "Arsacid archetype" occupied Avestan scholars for much of the 19th century, and, "[w]hatever may be the truth about the Arsacid Avesta, the linguistic evidence shows that even if it did exist, it can not have had any practical influence, since no linguistic form in the Vulgate can be explained with certainty as resulting from wrong transcription and the number of doubtful cases is minimal; in fact it is being steadily reduced. Though the existence of an Arsacid archetype is not impossible, it has proved to contribute nothing to Avestan philology."
The Pahlavi script, upon which the Avestan alphabet is based, was in common use for representing various Middle Iranian languages, but was not adequate for representing a religious language that demanded precision since Pahlavi was a simplified abjad syllabary with at most 22 symbols, most of which were ambiguous (i.e. could represent more than one sound).
In contrast, Avestan was a full alphabet, with explicit characters for vowels, and allowed for phonetic disambiguation of allophones. The alphabet included many characters (a, i, k, t, p, b, m, n, r, s, z, š, xv) closely resembling Book Pahlavi of the early Islamic Persia, while some (ā, γ) are characters that only exist in the older (6th–7th c. AD) Psalter Pahlavi script (in later cursive Pahlavi γ and k have the same symbol).[3] Some of the vowels, such as ə appear to derive from Greek cursives. Avestan o is a special form of Pahlavi
l that exists only in Aramaic signs. Some letters (e.g. ŋ́, ṇ, ẏ, v), are free inventions.[4]Avestan script, like Pahlavi script and Aramaic script also, is written from right to left. In Avestan script, letters are not connected, and ligatures are "rare and clearly of secondary origin".
In total, the Avestan alphabet has 37 consonants and 16 vowels. There are two main transcription schemes for Avestan, the newer orthography used by Karl Hoffmann and the older one used by Christian Bartholomae.
Letter | Transcription[5] | IPA | Unicode | |
---|---|---|---|---|
Hoff. | Bar. | |||
a | a | pronounced as //a// | U+10B00: AVESTAN LETTER A | |
ā | ā | pronounced as //aː// | U+10B01: AVESTAN LETTER AA | |
å | - | pronounced as //ɒ// | U+10B02: AVESTAN LETTER AO | |
ā̊ | å | pronounced as //ɒː// | U+10B03: AVESTAN LETTER AAO | |
ą | ą | pronounced as //ã// | U+10B04: AVESTAN LETTER AN | |
ą̇ | - | pronounced as //ãː// | U+10B05: AVESTAN LETTER AAN | |
ə | ə | pronounced as //ə// | U+10B06: AVESTAN LETTER AE | |
ə̄ | ə̄ | pronounced as //əː// | U+10B07: AVESTAN LETTER AEE | |
e | e | pronounced as //e// | U+10B08: AVESTAN LETTER E | |
ē | ē | pronounced as //eː// | U+10B09: AVESTAN LETTER EE | |
o | o | pronounced as //ɔ// | U+10B0A: AVESTAN LETTER O | |
ō | ō | pronounced as //oː// | U+10B0B: AVESTAN LETTER OO | |
i | i | pronounced as //ɪ// | U+10B0C: AVESTAN LETTER I | |
ī | ī | pronounced as //iː// | U+10B0D: AVESTAN LETTER II | |
u | u | pronounced as //ʊ// | U+10B0E: AVESTAN LETTER U | |
ū | ū | pronounced as //uː// | U+10B0F: AVESTAN LETTER UU | |
k | k | pronounced as //k// | U+10B10: AVESTAN LETTER KE | |
x | x | pronounced as //x// | U+10B11: AVESTAN LETTER XE | |
x́ | ḣ | pronounced as //xʲ//, pronounced as //ç// | U+10B12: AVESTAN LETTER XYE | |
pronounced as //xʷ// | U+10B13: AVESTAN LETTER XVE | |||
g | g | pronounced as //ɡ// | U+10B14: AVESTAN LETTER GE | |
ġ | - | pronounced as //ɡʲ//, pronounced as //ɟ// | U+10B15: AVESTAN LETTER GGE | |
γ | γ | pronounced as //ɣ// | U+10B16: AVESTAN LETTER GHE | |
c | č | pronounced as //t͡ʃ// | U+10B17: AVESTAN LETTER CE | |
j | ǰ | pronounced as //d͡ʒ// | U+10B18: AVESTAN LETTER JE | |
t | t | pronounced as //t// | U+10B19: AVESTAN LETTER TE | |
θ | θ | pronounced as //θ// | U+10B1A: AVESTAN LETTER THE | |
d | d | pronounced as //d// | U+10B1B: AVESTAN LETTER DE | |
δ | δ | pronounced as //ð// | U+10B1C: AVESTAN LETTER DHE | |
t̰ | t̰ | pronounced as //t̚//[6] | U+10B1D: AVESTAN LETTER TTE | |
p | p | pronounced as //p// | U+10B1E: AVESTAN LETTER PE | |
f | f | pronounced as //f// | U+10B1F: AVESTAN LETTER FE | |
b | b | pronounced as //b// | U+10B20: AVESTAN LETTER BE | |
β | w | pronounced as //β// | U+10B21: AVESTAN LETTER BHE | |
ŋ | ŋ | pronounced as //ŋ// | U+10B22: AVESTAN LETTER NGE | |
ŋ́ | ŋ́ | pronounced as //ŋʲ// | U+10B23: AVESTAN LETTER NGYE | |
- | pronounced as //ŋʷ// | U+10B24: AVESTAN LETTER NGVE | ||
n | n | pronounced as //n// | U+10B25: AVESTAN LETTER NE | |
ń | - | pronounced as //ɲ// | U+10B26: AVESTAN LETTER NYE | |
ṇ | n, m | pronounced as //ŋ// | U+10B27: AVESTAN LETTER NNE | |
m | m | pronounced as //m// | U+10B28: AVESTAN LETTER ME | |
m̨ | - | pronounced as //m̥//, pronounced as //mʰ// | U+10B29: AVESTAN LETTER HME | |
ẏ | y | pronounced as //j// | U+10B2A: AVESTAN LETTER YYE | |
y | pronounced as //j// | U+10B2B: AVESTAN LETTER YE | ||
ii | pronounced as //ii̯// | U+10B0C: AVESTAN LETTER I (doubled) | ||
v | v | pronounced as //w// | U+10B2C: AVESTAN LETTER VE | |
uu | pronounced as //uu̯// | U+10B0E: AVESTAN LETTER U (doubled) | ||
r | r | pronounced as //r// | U+10B2D: AVESTAN LETTER RE | |
l | l | pronounced as //l// | U+10B2E: AVESTAN LETTER LE | |
s | s | pronounced as //s// | U+10B2F: AVESTAN LETTER SE | |
z | z | pronounced as //z// | U+10B30: AVESTAN LETTER ZE | |
š | š | pronounced as //ʃ// | U+10B31: AVESTAN LETTER SHE | |
ž | ž | pronounced as //ʒ// | U+10B32: AVESTAN LETTER ZHE | |
š́ | š | pronounced as //ɕ// | U+10B33: AVESTAN LETTER SHYE | |
ṣ̌ | pronounced as //ʂ// | U+10B34: AVESTAN LETTER SSHE | ||
h | h | pronounced as //h// | U+10B35: AVESTAN LETTER HE | |
Letter | Hoff. | Bar. | IPA | Unicode |
Transcription |
Later, when writing Middle Persian in the script (i.e. Pazend), another consonant was added to represent the pronounced as //l// phoneme that didn't exist in the Avestan language.
Four ligatures are commonly used in Avestan manuscripts:
U+200C ZERO WIDTH NON-JOINER can be used to prevent ligatures if desired.For example, compare (U+10B31 10B00) with (U+10B31 200C 10B00).
Fossey[7] lists 16 ligatures, but most are formed by the interaction of swash tails.
Digits and numbers can be seen on the Faulmann chart above.
Words and the end of the first part of a compound are separated by a dot (in a variety of vertical positions). Beyond that, punctuation is weak or non-existent in the manuscripts, and in the 1880s Karl Friedrich Geldner had to devise one for standardized transcription. In his system, which he developed based on what he could find, a triangle of three dots serves as a colon, a semicolon, an end of sentence or end of section; which is determined by the size of the dots and whether there is one dot above and two below, or two above and one below. Two above and one below signify—in ascending order of "dot" size—colon, semicolon, end of sentence or end of section.
Mark | Function | Unicode | |
---|---|---|---|
word separator | U+2E31: WORD SEPARATOR MIDDLE DOT | ||
U+00B7: MIDDLE DOT | |||
U+002E: FULL STOP | |||
abbreviation or repetition | U+10B39: AVESTAN ABBREVIATION MARK | ||
colon | U+10B3A: TINY TWO DOTS OVER ONE DOT PUNCTUATION | ||
semicolon | U+10B3B: SMALL TWO DOTS OVER ONE DOT PUNCTUATION | ||
end of sentence | U+10B3C: LARGE TWO DOTS OVER ONE DOT PUNCTUATION | ||
alternative mark for end of sentence (found in Avestan texts but not used by Geldner) | U+10B3D: LARGE ONE DOT OVER TWO DOTS PUNCTUATION | ||
end of section (may be doubled for extra finality) | U+10B3E: LARGE TWO RINGS OVER ONE RING PUNCTUATION | ||
alternative mark for end of section (found in Avestan texts but not used by Geldner) | U+10B3F: LARGE ONE RING OVER TWO RINGS PUNCTUATION |
See main article: Avestan (Unicode block).
The Avestan alphabet was added to the Unicode Standard in October, 2009 with the release of version 5.2.
The characters are encoded at U+10B00 - 10B35 for letters (ii and uu are not represented as single characters, but as sequences of characters) and U+10B38 - 10B3F for punctuation.