Hindustani etymology explained

Hindustani, also known as Hindi-Urdu, is the vernacular form of two standardized registers used as official languages in India and Pakistan, namely Hindi and Urdu. It comprises several closely related dialects in the northern, central and northwestern parts of the Indian subcontinent but is mainly based on Khariboli of the Delhi region. As an Indo-Aryan language, Hindustani has a core base that traces back to Sanskrit but as a widely-spoken lingua franca, it has a large lexicon of loanwords,[1] [2] acquired through centuries of foreign rule and ethnic diversity.

Standard Hindi derives much of its formal and technical vocabulary from Sanskrit while standard Urdu derives much of its formal and technical vocabulary from Persian and Arabic. Standard Hindi and Urdu are used primarily in public addresses and radio or TV news, while the everyday spoken language is one of the several varieties of Hindustani, whose vocabulary contains words drawn from Persian, Arabic, and Sanskrit. In addition, spoken Hindustani includes words from English and the Dravidian languages, as well as several others.

Hindustani developed over several centuries throughout much of the northern subcontinent including the areas that comprise modern-day India, Pakistan, and Nepal. In the same way that the core vocabulary of English evolved from Old English (Anglo-Saxon) but assimilated many words borrowed from French and other languages (whose pronunciations often changed naturally so as to become easier for speakers of English to pronounce), what may be called Hindustani can be said to have evolved from Sanskrit while borrowing many Persian and Arabic words over the years, and changing the pronunciations (and often even the meanings) of these words to make them easier for Hindustani speakers to pronounce. Many Persian words entered the Hindustani lexicon due to the influence of the Mughal rulers of north India, who followed a very Persianised culture and also spoke Persian. Many Arabic words entered Hindustani via Persian, which had previously been assimilated into the Persian language due to the influence of Arabs in the area. The dialect of Persian spoken by the Mughal ruling elite was known as 'Dari', which is the dialect of Persian spoken in modern-day Afghanistan. Therefore, Hindustani is the naturally developed common language of north India. This article will deal with the separate categories of Hindustani words and some of the common words found in the Hindustani language.

Categorization

Hindustani words, apart from loans, basically derive from two linguistic categories:

According to the traditional categorization in Hindi (also found in other Indo-Aryan languages except Urdu) the loanwords are classed as tatsam (Hindi: तत्सम "as it is, same as therein") for Sanskrit loans and vides͟hī (Hindi: विदेशी "foreign, non-native") for non-Sanskrit loans,[5] such as those from Persian or English, respectively contrasting with tadbhava and deśaja words.

The most common words in Hindustani are tadbhavas.

Examples of derivations

Second person pronouns

In Hindustani, the pronoun āp (आप) "[one]self", originally used as a third person honorific plural, denotes respect or formality (politeness) and originates from Prakrit appā, which derived from Sanskrit ātman,[6] which refers to the higher self or level of consciousness.

The pronoun (तू) and its grammatically plural form tum (तुम) (also the second person honorific plural) denote informality, familiarity or intimacy and originate respectively from tuhuṃ and tumma from Prakrit tuṃ and its variant tumaṃ, which derived from Sanskrit tvam, nominative singular of युष्मद् yuṣmad (the base of the second person plural pronoun).[7] [8] In modern usage, is widely used to display a range of attitudes depending on the context, from extreme informality (impoliteness) to extreme intimacy to outright disrespect and even extreme reverence. Usage of in most contexts is considered highly offensive in the formal register except when addressing God as a display of spiritual intimacy. This is very similar to the usage of "thou" in archaic English and many other Indo-European languages showing T–V distinction.

Present "be" verb

One of the most common words in Hindustani, the copula hai (है) and its plural form haiṉ (हैं) − present forms of honā (होना, meaning "to be" and originating from Prakrit bhodi derived from Sanskrit bhavati "to happen")[9] − rather originate from the following developments:

Shortening of ahai occurred in Hindustani resulting in hai probably to fulfill the symmetry of the other grammatical forms of honā. Ahai can be found in some older works of Hindustani literature and its evidence can also be seen in other closely related Indo-Aryan languages such as Marathi (आहे āhe) or Sindhi (آهي āhe).

Perfective "go" verb

The verb jānā (जाना, "to go"), which originates from Prakrit jādi derived from Sanskrit yāti ("to move"; root ), however has its perfective form originating from another Prakrit word gaya derived from Sanskrit gata, past participle of gacchati ("to go"; root gam or gacch),[10] for example, in gayā (गया, "went, gone").

Some other words

The word ājā (आजा) has also been used in Northern India and Pakistan for "grandfather". It is indeed derived from arya meaning "sir" in this case. Jain nuns are addressed either as Aryika or Ajji.

The word dādā (दादा) also has a similar meaning which varies by region. It is used in some regions for "father", in other regions for "older brother", or even for "grandfather" in other regions. This word is an amalgam of two sources:

The word baṛā (बड़ा "older, bigger, greater") is derived from the Sanskrit vridhha through Prakrit vaḍḍha.

Desi words

The term Desi words is used to describe the component of the lexicon in Indo-Aryan languages which is non-Indo-Aryan in origin, but native to other language families of the Indian subcontinent. Examples of Desi words in Hindustani include:[11] [12] loṭā (लोटा) "lota (water vessel)", kapās (कपास) "cotton", kauṛī (कौड़ी) "cowrie (shell money)", ṭhes (ठेस) "wound, injury", jhaṉḍā (झंडा) "flag", mukkā (मुक्का) "fist, punch", lakṛī (लकड़ी) "wood", ṭharrā (ठर्रा) "tharra (liquor)", čūhā (चूहा) "mouse, rat", čūlhā (चूल्हा) "stove, oven", pagṛī (पगड़ी) "turban", luṉgī (लुंगी) "lungi (sarong)", ghoṭālā (घोटाला) "scam", dāṉḍī (दांडी) "salt", jholā (झोला) "bag, satchel", ṭakkar (टक्कर) "crash, collision, confrontation", kākā (काका) "paternal uncle", uṭpaṭāṉg/ūṭpaṭāṉg (उटपटांग/ऊट-पटांग) "ludicrous", ḍabbā/ḍibbā (डब्बा/डिब्बा) "box, container" and jhuggī (झुग्गी) "hut"

Onomatopoeic words

Nouns: gaṛbaṛ (गड़बड़) "disorder, disturbance", dhaṛām (धड़ाम) "thud", bakbak (बक-बक) "chatter/chitter-chatter", khusur pusar (खुसुर-पुसर) "whisper", jhilmil (झिलमिल) "shimmer", ṭhakṭhak (ठक-ठक) "knock knock", khaṭpaṭ (खटपट) "quarrel, disagreement"
Verbs: khaṭkhaṭānā (खटखटाना) "to knock", gaḍgaḍānā (गडगडाना) "to rumble, to fuss", jagmagānā (जगमगाना) "to shine/glitter", hinhinānā (हिनहिनाना) "to neigh", phusphusānā (फुसफुसाना) "to whisper"
Adjectives and Adverbs: čaṭpaṭ (चट-पट) "in a jiffy", tharthar (थर-थर) "with jerky motion (characteristic of shaking or trembling)", čaṭpaṭā (चटपटा) "dextrous, spicy", čipčipā (चिपचिपा) "sticky, slimy", čiṛčiṛā (चिड़चिड़ा) "irritable", gaṛbaṛiyā (गड़बड़िया) "chaotic, messy"

Loanwords

Due to the language's status as a lingua franca, Hindustani's vocabulary has a large inventory of loanwords, the largest number of which are adopted from Punjabi. Punjabi borrowings often bear sound changes from the parent Prakrit and Sanskrit vocabulary which did not occur in Hindustani, particularly the preservation of short vowels in initial syllables and the gemination of the following consonant. A certain amount of vocabulary from other South Asian languages, Persian, Arabic, and English has been loaned indirectly into Hindustani through Punjabi. Other Indic languages which exist in a state of diglossia with Hindustani and are prone to mutual borrowing include Rajasthani, the Western Pahari languages, Haryanvi, Bhojpuri, Marathi, Nepali, and Gujarati. Besides these, common sources of loan words include those manually adopted from Classical Sanskrit, Classical Persian, Arabic, Chagatai Turkic, Portuguese and English, as well as Mandarin Chinese and French to a lesser extent.

Classical Sanskrit

Phonetic alterations

Many Classical Sanskrit words which were not learned borrowings underwent phonetic alterations. In the vernacular form, these include the merger of Sanskrit श (śa) and ष (ṣa), ण (ṇa) and न (na) as well as ऋ (r̥) and रि (ri). Other common alterations were s͟h [/ʃ/] (श) becoming s [/s/] (स), v/w [/ʋ/, /w/] (व) becoming b [/b/] (ब) and y [/j/] (य) becoming j [/dʒ/] (ज). Short vowels were also sometimes introduced to break up consonant clusters. Such words in Hindi (and other Indo-Aryan languages except Urdu) are called ardhatatsam (Hindi: अर्धतत्सम "semi-tatsam").

Hindustani Meaning Sanskrit Corresponding Persian loan
ardhatatsam tatsam
baras (बरस) varṣ (Hindi: वर्ष) year वर्ष (varṣa) sāl (साल), san (सन)
des (देस) deśa (Hindi: देश) country देश (deśa) mulk (मुल्क), vatan (वतन)
bāsī (बासी) vāsī (Hindi: वासी) inhabitant वासी (vāsī) bāśindā (बाशिन्दा)
jantar (जन्तर) yantra (Hindi: यन्त्र) device यन्त्र (yantra) ālā (आला)
rāt (रात) rātri (Hindi: रात्रि) night रात्रि (rātri) śab (शब), lail (लैल)
ādhā (आधा) ardh (Hindi: अर्ध) half अर्ध (ardha) nisf (निस्फ़), nem (नेम)
sūraj (सूरज) sūrya (Hindi: सूर्य) sun सूर्य (sūrya) śams (शम्स), āftāb (आफ़ताब)
pūrā (पूरा) pūrṇ (Hindi: पूर्ण) complete पूर्ण (pūrṇa) tamām (तमाम), mukammal (मुकम्मल)
ūncā/ūm̐cā (ऊंचा/ऊँचा) ucc (Hindi: उच्च) high/tall उच्च (ucca) buland (बुलन्द), aʾalā (आ'ला)
kām (काम) karm (Hindi: कर्म) action कर्म (karma) kār (कार)
nēm (नेम) niyam (Hindi: नियम) rule नियम (niyama) qāʾedā (क़ाएदा)
dakhin (दखिन) dakṣiṇa (Hindi: दक्षिण) south दक्षिण (dakṣiṇa) junūb (जुनूब)
nain (नैन) nayan (Hindi: नयन) eye नयन (nayana) caśm (चश्म)
lāj (लाज) lajjā (Hindi: लज्जा) shame लज्जा (lajjā) Śarm (शर्म), hayā (हया)

Classical Persian

Persian words which were not later artificially added were loaned from Classical Persian, the historical variety of the tenth, eleventh and twelfth centuries, which continued to be used as literary language and lingua franca under the Persianate dynasties of the Late Middle Ages and Early Modern Era and is not the same as Modern Persian (though the Dari Persian of Afghanistan is a direct descendant).

Borrowings

Persian loanwords in Hindustani are mainly borrowed nouns and adjectives as well as adverbs and conjunctions and some other parts of speech.

Hindustani Meaning Persian Corresponding Sanskrit loan
sāyā (साया) shadow/shade سایه (sāya) čhāyā (छाया)
pares͟hān (परेशान) anxious پرِیشان (parēšān) čintit (चिंतित)
hames͟hā (हमेशा) always/forever همِیشه (hamēša) sadaiv (सदैव), sadā (सदा)
k͟hus͟hī (ख़ुशी) happiness خوشی (xušī) ānand (आनंद), sukh (सुख)
sabzī (सब्ज़ी) vegetable سبزی (sabzī) sāg (साग), s͟hāk (शाक)
mehrbān (मेहरबान) kind مهربان (meherbān) dayālu (दयालु), karunāmaya (करुणामय کرُنامیَ)
agar (अगर) if اگر (agar) yadi (यदि)
dīvār (दीवार) wall دیوار (dīwār) bhīt (भीत)
darvāzā (दरवाज़ा) door/gate دروازه (darwāza) dwār (द्वार)
andar (अंदर) inside/in اندر (andar) bhītar (भीतर)
tāzā (ताज़ा) fresh تازه (tāza) nirjar (निर्जर), jarhīn (जरहीन جرہین)
roz (रोज़) day رُوز (rōz) din (दिन), diwas (दिवस دِوس)
s͟hahr (शहर) city شهر (šahr) nagar (नगर)
hind (हिंद) India هند (hind) bhārat (भारत)
ki (कि) that (conjunction) که (ki) -
vāh (वाह) wow واه (wāh) -

From stems:

Present:

Hindustani Meaning Persian verb Non-Persian alternative
par (पर) wing پریدن (parīdan, "to fly") paṉkh (पंख)
pasand (पसंद) liked, liking پسندیدن (pasandīdan, "to prefer") čahit (चहित), čāhat (चाहत)
k͟hwāb/k͟hāb (ख़्वाब/ख़ाब) dream خوابیدن (xābīdan, "to sleep") sapnā (सपना), swapna (स्वप्न)

Past:

Hindustani Meaning Persian verb Non-Persian alternative
āmad (आमद) arrival آمدن (āmadan, "to come") āgaman (आगमन)
s͟hikast (शिकस्त) defeat, defeated شکستن (šikastan, "to break") parājay (पराजय), parājit (पराजित), hār (हार)
giraft (गिरफ़्त) grip, gripped گرفتن (giriftan, "to grab") pakaṛ (पकड़), jabt (जब्त)

From participles:

Present:

Hindustani Meaning Persian verb Non-Persian alternative
āyindā/āʾindā (आइन्दा) future آمدن (āmadan, "to come") bhaviṣya (भविष्य), āgāmī (आगामी آگامی)
parindā (परिन्दा) bird پریدن (parīdan, "to fly") pančhī (पंछी), pakṣī (पक्षी)
zindā (ज़िन्दा) living, alive زیستن (zīstan, "to live") jīvit (जीवित), jīvant (जीवंत جیونت)

Past:

Hindustani Meaning Persian verb Non-Persian alternative
bastā (बस्ता) bag, sack بستن (bastan, "to bind") thailā (थैला)
pasandīdā (पसन्दीदा) favorite پسندیدن (pasandīdan, "to prefer") priya (प्रिय)
murdā (मुर्दा) dead مردن (murdan, "to die") mr̥t (मृत), hat (हत ہت)

By adding noun suffix ـِش (-iš):

Hindustani Meaning Persian verb Non-Persian alternative
parvaris͟h (परवरिश) upbringing, rearing پروردن (parwardan, "to foster") pālanpoṣaṇ (पालन-पोषण)
kos͟his͟h (कोशिश) effort, attempt کوشیدن (kōšīdan, "to attempt") prayās (प्रयास)
varzis͟h (वर्ज़िश) exercise ورزیدن (warzīdan, "to exercise") vyāyām (व्यायाम)
āzmāʾis͟h (आज़माइश) trial, test آزمودن (āzmūdan, "to test") vičāraṇ (विचारण), parīkṣaṇ (परीक्षण)

By forming composite words with Arabic:

Hindustani Meaning Persian affix Arabic element Non-Persian alternative
k͟hūbsūrat (ख़ूबसूरत) beautiful خوب (xūb, "good") صورت (sūrat, "appearance") sundar (सुंदर)
darasal (दरअसल) actually در (dar, "at, in") اصل (asl, "reality") vastutah (वस्तुत:), vastabik (वास्तबिक واستبِک)
fīsad (फ़ीसद) percent صد (sad, "hundred") فی (, "in, at") pratis͟hat (प्रतिशत), s͟hatansh (शतांश ستانش)
rahmdil (रहमदिल) compassionate دل (dil, "heart") رحم (rahm, "mercy") kr̥pālū (कृपालु), saday (सदय)

Loaned Verbs

A substantial number of Hindustani verbs have been loaned from Punjabi, however, verb stems originating in less closely related languages are relatively rare. There are a few common verbs formed directly out of Persian stems (or nouns in some cases) listed below.

Hindustani verb Verb meaning Persian stem Stem meaning Non-Persian alternative
k͟harīdnā (ख़रीदना) to buy خرید (xarīd) - noun buy, purchase kray karnā (क्रय करना), mol lenā (मोल लेना)
guzārnā (गुज़ारना) to pass (transitive), to spend گذار (guẕār) letting bitānā (बिताना)
navāznā (नवाज़ना) to bestow, to patronize, to favor نواز (nawāz) playing, caressing pradān karnā (प्रदान करना), arpit karnā (अर्पित करना), kr̥pā karnā (कृपा करना), sahāyatā denā (सहायता देना)
guzarnā (गुज़रना) to pass (intransitive), to occur گذر (guẕar) passing bītnā (बीतना)
farmānā (फ़रमाना) to dictate, to say (formal) فرما (farmā) ordering, saying (formal) āgyā karnā (आज्ञा करना), ādes͟h karnā (आदेश करना)
badalnā (बदलना) to change بدل (badl) - noun substitute, change parivartan karnā (परिवर्तन करना), vinimay karnā (विनिमय करना), palṭā denā (पलटा देना)
laraznā (लरज़ना) to tremble لرز (laraz) shivering kāṉpnā (कांपना)
guzrānnā (गुज़रानना) to pass time, to present, to adduce گذران (guẕarān) passing time prastut karnā (प्रस्तुत करना), sāmne rakhnā (सामने रखना)

Arabic

Some of the most commonly used words from Arabic, all entering the language through Persian, include vaqt (वक़्त) "time", qalam (क़लम) "pen", kitāb (किताब) "book", qarīb (क़रीब) "near", sahīh/sahī (सहीह/सही) "correct, right", g͟harīb (ग़रीब) "poor", amīr (अमीर) "rich", duniyā (दुनिया) "world", hisāb (हिसाब) "calculation", qudrat (क़ुदरत) "nature", nasīb (नसीब) "fate, luck, fortune", ajīb (अजीब) "strange, unusual", qānūn (क़ानून) "law", filhāl (फ़िलहाल) "currently", sirf (सिर्फ़) "only, mere", taqrīban (तक़रीबन) "close to, about", k͟habar (ख़बर) "news", ak͟hbār (अख़बार) "newspaper", qilā (क़िला) "fort", kursī (कुर्सी) "chair, seat", s͟harbat (शर्बत) "drink, beverage", muāf/māf (मुआफ़/माफ़) "forgiven, pardoned", zarūrī (ज़रूरी) "necessary", etc.[13]

Chagatai Turkic

There are a very small number of Turkic words in Hindustani, numbering as little as 24 according to some sources,[14] all entering the language through Persian.Other words attributed to Turkish, the most widely spoken Turkic language, are actually words which are common to Hindustani and Turkish but are of non-Turkic origins, mostly Perso-Arabic.[15] Both languages also share mutual loans from English.Most notably, some honorifics and surnames common in Hindustani are Turkic due to the influence of the ethnically Turkic Mughals - these include k͟hānam (ख़ानम), bājī (बाजी) "sister", and begam (बेगम). Common surnames include k͟hān (ख़ान), čug͟htāʾī (चुग़ताई), pās͟hā (पाशा), and arsalān (अर्सलान). Common Turkic words used in everyday Hindustani are qaiṉčī/qainčī (क़ैंची) "scissors", annā (अन्ना) "governess", tamg͟hā (तमग़ा) "stamp, medal", and čaqmaq (चक़मक़) "flint".

Mandarin Chinese

There are not many Chinese words that were loaned into Hindustani in spite of geographical proximity.

Hindustani Meaning Chinese/Sinitic Note
čāy/čāʾe (चाय) tea 茶 (chá) Derived through Persian Persian: چای (čāy)
čīn (चीन) China 秦 (qín) Derived through Sanskrit Sanskrit: चीन (cīna)
līčī (लीची) lychee 茘枝 (lìzhī)

European languages

Portuguese

A small number of Hindustani words were derived from Portuguese due to interaction with colonists and missionaries. These include the following:

Hindustani Meaning Portuguese
anannās/anānās (अनन्नास/अनानास) pineapple ananás
pādrī (पाद्री) priest padre
bālṭī (बाल्टी) bucket balde
čābī (चाबी) key chave
girjā (गिर्जा) church igreja
almārī (अलमारी) cupboard armário
botal (बोतल) bottle botelha
aspatāl (अस्पताल) hospital Hospital
olandez/valandez (ओलंदेज़/वलंदेज़) Dutch holandês

French

A few French loans exist in Hindustani resulting from French colonial settlements in India. Other French words such as s͟hemīz (शेमीज़) "chemise" and kūpan (कूपन) "coupon" have entered the language through English.

Hindustani Meaning French
kārtūs (कारतूस) cartridge cartouche
restorāṉ (रेस्तोरां/रेस्तोराँ) restaurant restaurant

English

See also: Hinglish. Loanwords from English were borrowed through interaction with the British East India Company and later British rule. English-language education for the native administrative and richer classes during British rule accelerated the adoption of English vocabulary in Hindustani. Many technical and modern terms were and still are borrowed from English, such as ḍākṭar/ḍôkṭar (डाक्टर/डॉक्टर) "doctor", ṭaiksī (टैक्सी) "taxi", and kilomīṭar (किलोमीटर) "kilometer".

Photo-semantic matching

Some loanwords from English undergo a significant phonetic transformation. This can either be done intentionally, in order to nativize words or to make them sound more or less "English-sounding", or happen naturally. Words often undergo a phonetic change in order to make them easier for native speakers to pronounce while others change due to a lack of English education or incomplete knowledge of English phonetics, where an alternate pronunciation becomes an accepted norm and overtakes the original as the most used pronunciation.

Hindustani English
darjan (दर्जन)[16] dozen
tijorī (तिजोरी) treasury
satalta (सतलता) subtlety
māčis (माचिस) match(es)
godām (गोदाम) godown
bigul (बिगुल) bugle
raṉgrūṭ (रंगरूट) recruit
ṭamāṭar (टमाटर) tomato
kābīnā (काबीना) cabinet
ketlī (केतली) kettle
darāz (दराज़) drawer(s)
bam (बम) bomb
lālṭen (लालटेन) lantern
būčaṛ (बूचड़) butcher
ṭaṉkī (टंकी) tank
baksā (बक्सा) box
janvarī (जनवरी) January

Sources

Notes and References

  1. Web site: A Guide to Hindi. BBC - Languages - Hindi. BBC. 11 December 2015.
  2. Web site: Kumar. Nitin. Hindi & Its Origin. Hindi Language Blog. 28 June 2011. 11 December 2015.
  3. Grierson . George . Indo-Aryan Vernaculars (Continued) . Bulletin of the School of Oriental Studies . 3 . 1 . 51–85 . 1920. 10.1017/S0041977X00087152 . 161798254 . at pp. 67-69.
  4. https://web.archive.org/web/20080102012959/http://linguistlist.org/forms/langs/LLDescription.cfm?code=psu
  5. Masica, p. 65
  6. Web site: aap. rekhta.org. Rekhta Foundation. 2016-12-24.
  7. Web site: tum. rekhta.org. Rekhta Foundation. 2016-12-25.
  8. Web site: tu. rekhta.org. Rekhta Foundation. 2016-12-25.
  9. Web site: Platts . John T. . هونا होना honā . A Dictionary of Urdu, Classical Hindi, and English (Digital Dictionaries of South Asia) . 29 February 2020.
  10. Book: Monier-Williams . Monier . Cappeller . Carl . Leumann . Ernst . Monier-Williams . Montagu Sneade Faithfull . A Sanskrit-English dictionary, etymologically and philologically arranged, with special reference to cognate Indo-European languages, by Sir Monier Monier-Williams, ... New edition, greatly enlarged and improved with the collaboration of professor E. Leumann, ... professor C. Cappeller, ... and other scholars. . 1899 . The Clarendon Press . 458052227 . en.
  11. Web site: Deshaj, Videshaj and Sankar Shabd Examples, Types and Definition . www.successcds.net. 28 May 2021 .
  12. Web site: देशज, संकर और विदेशी शब्द . Hindi Gyan Ganga . 17 February 2020.
  13. Web site: A قميص qamīṣ, vulg. qamīz, kamīj, s.m. A shirt; a shift; a chemise (cf. It. camicia; Port. camisa).. Platts. John T.. A Dictionary of Urdu, Classical Hindi, and English. University of Chicago. 6 December 2014.
  14. News: Anwer. Syed Mohammed. Language: Urdu and the borrowed words. dawn.com. November 13, 2011.
  15. Maldonado García. María Isabel. Yapici. Mustafa. Common Vocabulary in Urdu and Turkish Language: A Case of Historical Onomasiology. Pakistan Vision. 2014. 15. 1. 194–225.
  16. With intrusive hypercorrectional 'r' via arhotic British English