The Caucasian languages comprise a large and extremely varied array of languages spoken by more than ten million people in and around the Caucasus Mountains, which lie between the Black Sea and the Caspian Sea.
Linguistic comparison allows the classification of these languages into several different language families, with little or no discernible affinity to each other. However, the languages of the Caucasus are sometimes mistakenly referred to as a family of languages.[1] According to Asya Pereltsvaig, "grammatical differences between the three groups of languages are considerable. [...] These differences force the more conservative historical linguistics to treat the three language families of the Caucasus as unrelated."[2]
Three of these families have no current indigenous members outside the Caucasus, and are considered indigenous to the area. The term Caucasian languages is generally restricted to these families, which are spoken by about 11.2 million people.[3]
The Northeast and Northwest Caucasian families are notable for their high number of consonant phonemes (inventories range up to the 80–84 consonants of Ubykh). The consonant inventories of the South Caucasian languages, however, are not nearly as extensive, ranging from 28 (Georgian) to 30 (Laz) – comparable to languages like Russian (up to 37 consonant phonemes, depending on definition), Arabic (28 phonemes), and Western European languages (often more than 20 phonemes).
The autochthonous languages of the Caucasus share some areal features, such as the presence of ejective consonants and a highly agglutinative structure, and, with the sole exception of Mingrelian, all of them exhibit a greater or lesser degree of ergativity. Many of these features are shared with other languages that have been in the Caucasus for a long time, such as Ossetian (which has ejective sounds but no ergativity).
Since the birth of comparative linguistics in the 19th century, the riddle of the apparently isolated Caucasian language families has attracted the attention of many scholars, who have endeavored to relate them to each other or to languages outside the Caucasus region.[4] The most promising proposals are connections between the Northeast and Northwest Caucasian families and each other or with languages formerly spoken in Anatolia and northern Mesopotamia.[5]
See main article: North Caucasian languages. Linguists such as Sergei Starostin see the Northeast (Nakh-Dagestanian) and Northwest (Abkhaz–Adyghe) families as related and propose uniting them in a single North Caucasian family, sometimes called Caucasic or simply Caucasian. This theory excludes the South Caucasian languages, thereby proposing two indigenous language families.[6] While these two families share many similarities, their morphological structure, with many morphemes consisting of a single consonant, make comparison between them unusually difficult, and it has not been possible to establish a genetic relationship with any certainty.
See main article: Ibero-Caucasian languages. There are no known affinities between the South Caucasian and North Caucasian families. Nevertheless, some scholars have proposed the single name Ibero-Caucasian for all the Caucasian language families, North and South, in an attempt to unify the Caucasian languages under one family.
Some linguists have claimed affinities between the Northwest Caucasian (Circassian) family and the extinct Hattic language of central Anatolia. See the article on Northwest Caucasian languages for details.
See main article: Alarodian languages. Alarodian is a proposed connection between Northeast Caucasian and the extinct Hurro-Urartian languages of Anatolia.
See main article: Dené–Caucasian languages. Linguists such as Sergei Starostin have proposed a Dené–Caucasian macrofamily, which includes the North Caucasian languages together with Basque, Burushaski, Na-Dené, Sino-Tibetan, and Yeniseian. This proposal is rejected by most linguists.
Other languages historically and currently spoken in the Caucasus area can be placed into families with a much wider geographical distribution.
The predominant Indo-European language in the Caucasus is Armenian, spoken by the Armenians (circa 6.7 million speakers). The Ossetians, speaking the Ossetian language, form another group of around 700,000 speakers. Other Indo-European languages spoken in the Caucasus include Greek (Pontic Greek), Persian (including Tat Persian), Kurdish, Talysh, Judeo-Tat, and the Slavic languages, such as Russian and Ukrainian, whose speakers number over a third of the total population of the Caucasus.
Two dialects of Neo-Aramaic are spoken in the Caucasus: Assyrian Neo-Aramaic, with around 30,000 speakers, and Bohtan Neo-Aramaic, with around 1,000 speakers. Both of these were brought to the Caucasus by ethnic Assyrians fleeing the Sayfo or Assyrian genocide during World War I.
A dialect of Arabic known as Shirvani Arabic was spoken natively in parts of Azerbaijan and Dagestan throughout medieval times until the early 20th century.[7] [8] In the nineteenth century, it was considered that the best literary Arabic was spoken in the mountains of Dagestan.[9]
Several Turkic languages are spoken in the Caucasus. Of these, Azerbaijani is predominant, with around 9 million speakers in Azerbaijan and more than 10 million in North Western Iran. Other Turkic languages spoken include Karachay-Balkar, Kumyk, Nogai, Turkish, Turkmen and Urum.
Kalmyk Oirat, spoken by descendants of Oirat-speakers from East Asia, is a Mongolic language.
Below are selected basic vocabulary items for all three language families of the Caucasus.
gloss | Proto-NE Caucasian[10] | Proto-NW Caucasian[11] | Proto-Kartvelian[12] | Georgian | |
---|---|---|---|---|---|
eye | Uncoded languages: *(b)ul, *(b)al | Uncoded languages: *b-la | Uncoded languages: *twal- | Georgian: tvali | |
tooth | Uncoded languages: *cVl- | Uncoded languages: *ca | GZ Uncoded languages: *ḳb-il- | Georgian: k’bili | |
tongue | Uncoded languages: *maʒ-i | Uncoded languages: *bza | Uncoded languages: *nena- | Georgian: ena | |
hand, arm | Uncoded languages: *kV, *kol- | Uncoded languages: *q’a | Uncoded languages: *qe- | Georgian: xeli | |
back (of body) | Uncoded languages: *-uqq’ | Uncoded languages: *pxá | Georgian: zurgi | ||
heart | Uncoded languages: *rVk’u / *Vrk’u | Uncoded languages: *g°ə | Uncoded languages: *gul- | Georgian: guli | |
meat | Uncoded languages: *(CV)-(lV)ƛƛ’ | Uncoded languages: *Lə | GZ Uncoded languages: *qorc- | Georgian: xorci | |
sun | Uncoded languages: *bVrVg | Uncoded languages: *dəɣa | Uncoded languages: *mz₁e- | Georgian: mze | |
moon | Uncoded languages: *baʒVr / *buʒVr | Uncoded languages: *məʒa | Uncoded languages: *tute- | Georgian: mtvare | |
earth | Uncoded languages: *(l)ončči | Uncoded languages: *č’ə-g°ə (P-Circassian) | Georgian: dedamiʦ’a | ||
water | Uncoded languages: *ɬɬin | Uncoded languages: *psə (P-Circassian) | GZ Uncoded languages: *c̣q̣a- | Georgian: ʦ’q’ali | |
fire | Uncoded languages: *c’ar(i), *c’ad(i) | Uncoded languages: *məć’°a | GZ Uncoded languages: *ʓec₁xl- | Georgian: cecxli; xanʒari | |
ashes | Uncoded languages: *rV-uqq’ / *rV-uƛƛ’ | Uncoded languages: *tq°a | Uncoded languages: *ṭuṭa- | Georgian: perpli | |
road | Uncoded languages: *-eqq’ / *-aqq’ | Uncoded languages: *məʕ°á | GZ Uncoded languages: *gza- | Georgian: gza | |
name | Uncoded languages: *cc’Vr, *cc’Vri | Uncoded languages: *(p’)c’a | Uncoded languages: *ʓ₁ax-e- | Georgian: saxeli; gvari | |
kill | Uncoded languages: *-Vƛ’ | Uncoded languages: *ƛ’ə́ | Georgian: k’vla | ||
burn | Uncoded languages: *-Vk’ | Uncoded languages: *ca; *bla/ə | Uncoded languages: *c₁x- | Georgian: ʦ’va | |
know | Uncoded languages: *(-)Vc’ | Uncoded languages: *ć’a | Georgian: codna | ||
black | Uncoded languages: *alč’i- (*ʕalč’i-) | Uncoded languages: *ć’°a | Georgian: šavi | ||
round | Uncoded languages: *goRg / *gog-R- | Georgian: mrgvali | |||
dry | Uncoded languages: *-aqq’(u) / *-uqq’ | Uncoded languages: *ʕ°ə́ | Uncoded languages: *šwer-, *šwr- | Georgian: mšrali | |
thin | Uncoded languages: *(C)-uƛ’Vl- | Uncoded languages: *č’°a | GZ Uncoded languages: *ttx-el- | Georgian: txeli | |
what | Uncoded languages: *sti- | Uncoded languages: *sə-tʰə; *śə-da (P-Circassian) | Uncoded languages: *ma- | Georgian: ra | |
one | Uncoded languages: *cV (*cʕV ?) | Uncoded languages: *za | GZ Uncoded languages: *ert- | Georgian: erti | |
five | Uncoded languages: *(W)-ƛƛi / *ƛƛwi | Uncoded languages: *txᵒə | Uncoded languages: *xut- | Georgian: xuti |