Hindustani, also known as Hindi-Urdu, is the vernacular form of two standardized registers used as official languages in India and Pakistan, namely Hindi and Urdu. It comprises several closely related dialects in the northern, central and northwestern parts of the Indian subcontinent but is mainly based on Khariboli of the Delhi region. As an Indo-Aryan language, Hindustani has a core base that traces back to Sanskrit, but as a widely spoken lingua franca it has a large lexicon of loanwords,[1] [2] acquired through centuries of foreign rule and ethnic diversity.
Standard Hindi derives much of its formal and technical vocabulary from Sanskrit while standard Urdu derives much of its formal and technical vocabulary from Persian and Arabic. Standard Hindi and Urdu are used primarily in public addresses and radio or TV news, while the everyday spoken language is one of the several varieties of Hindustani, whose vocabulary contains words drawn from Persian, Arabic, and Sanskrit. In addition, spoken Hindustani includes words from English and the Dravidian languages, as well as several others.
Hindustani developed over several centuries throughout much of the northern subcontinent, including the areas that comprise modern-day India, Pakistan, and Nepal. Just as the core vocabulary of English evolved from Old English (Anglo-Saxon) but assimilated many words borrowed from French and other languages, whose pronunciations often shifted to become easier for English speakers, Hindustani can be said to have evolved from Sanskrit while borrowing many Persian and Arabic words over the years, changing their pronunciations (and often even their meanings) to suit Hindustani speakers. Many Persian words entered the Hindustani lexicon due to the influence of the Mughal rulers of north India, who followed a very Persianised culture and also spoke Persian. Many Arabic words entered Hindustani via Persian, having previously been assimilated into the Persian language due to the influence of Arabs in the area. The dialect of Persian spoken by the Mughal ruling elite was known as 'Dari', which is the dialect of Persian spoken in modern-day Afghanistan. Hindustani thus developed naturally as the common language of north India. This article deals with the separate categories of Hindustani words and some of the common words found in the language.
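Hindustani words, apart from loans, derive from two linguistic categories: tadbhava words, inherited from Sanskrit through Prakrit, and deśaja (Desi) words, which are native to the subcontinent but non-Indo-Aryan in origin.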
According to the traditional categorization in Hindi (also found in other Indo-Aryan languages except Urdu) the loanwords are classed as tatsam (Hindi: तत्सम "as it is, same as therein") for Sanskrit loans and vides͟hī (Hindi: विदेशी "foreign, non-native") for non-Sanskrit loans,[5] such as those from Persian or English, respectively contrasting with tadbhava and deśaja words.
The most common words in Hindustani are tadbhavas.
In Hindustani, the pronoun āp (आप) "[one]self", originally used as a third person honorific plural, denotes respect or formality (politeness) and originates from Prakrit appā, which derived from Sanskrit ātman,[6] which refers to the higher self or level of consciousness.
The pronoun tū (तू) and its grammatically plural form tum (तुम) (also the second person honorific plural) denote informality, familiarity or intimacy and originate respectively from tuhuṃ and tumma from Prakrit tuṃ and its variant tumaṃ, which derived from Sanskrit tvam, nominative singular of युष्मद् yuṣmad (the base of the second person plural pronoun).[7] [8] In modern usage, tū is widely used to display a range of attitudes depending on the context, from extreme informality (impoliteness) to extreme intimacy to outright disrespect and even extreme reverence. Usage of tū in most contexts is considered highly offensive in the formal register except when addressing God as a display of spiritual intimacy. This is very similar to the usage of "thou" in archaic English and to the informal pronouns of many other Indo-European languages showing a T–V distinction.
One of the most common words in Hindustani, the copula hai (है) and its plural form haiṉ (हैं) − present forms of honā (होना, meaning "to be" and originating from Prakrit bhodi derived from Sanskrit bhavati "to happen")[9] − rather originate from the following developments:
Shortening of ahai occurred in Hindustani resulting in hai probably to fulfill the symmetry of the other grammatical forms of honā. Ahai can be found in some older works of Hindustani literature and its evidence can also be seen in other closely related Indo-Aryan languages such as Marathi (आहे āhe) or Sindhi (آهي āhe).
The verb jānā (जाना, "to go"), which originates from Prakrit jādi derived from Sanskrit yāti ("to move"; root yā), however has its perfective form originating from another Prakrit word gaya derived from Sanskrit gata, past participle of gacchati ("to go"; root gam or gacch),[10] for example, in gayā (गया, "went, gone").
The word ājā (आजा) has also been used in Northern India and Pakistan for "grandfather". It is derived from arya, meaning "sir" in this case. Jain nuns are addressed either as Aryika or Ajji.
The word dādā (दादा) also has a similar meaning which varies by region. It is used in some regions for "father", in other regions for "older brother", or even for "grandfather" in other regions. This word is an amalgam of two sources:
The word baṛā (बड़ा "older, bigger, greater") is derived from the Sanskrit vriddha through Prakrit vaḍḍha.
The term Desi words is used to describe the component of the lexicon in Indo-Aryan languages which is non-Indo-Aryan in origin, but native to other language families of the Indian subcontinent. Examples of Desi words in Hindustani include:[11] [12] loṭā (लोटा) "lota (water vessel)", kapās (कपास) "cotton", kauṛī (कौड़ी) "cowrie (shell money)", ṭhes (ठेस) "wound, injury", jhaṉḍā (झंडा) "flag", mukkā (मुक्का) "fist, punch", lakṛī (लकड़ी) "wood", ṭharrā (ठर्रा) "tharra (liquor)", čūhā (चूहा) "mouse, rat", čūlhā (चूल्हा) "stove, oven", pagṛī (पगड़ी) "turban", luṉgī (लुंगी) "lungi (sarong)", ghoṭālā (घोटाला) "scam", dāṉḍī (दांडी) "salt", jholā (झोला) "bag, satchel", ṭakkar (टक्कर) "crash, collision, confrontation", kākā (काका) "paternal uncle", uṭpaṭāṉg/ūṭpaṭāṉg (उटपटांग/ऊट-पटांग) "ludicrous", ḍabbā/ḍibbā (डब्बा/डिब्बा) "box, container" and jhuggī (झुग्गी) "hut"
Nouns: gaṛbaṛ (गड़बड़) "disorder, disturbance", dhaṛām (धड़ाम) "thud", bakbak (बक-बक) "chatter/chitter-chatter", khusur pusar (खुसुर-पुसर) "whisper", jhilmil (झिलमिल) "shimmer", ṭhakṭhak (ठक-ठक) "knock knock", khaṭpaṭ (खटपट) "quarrel, disagreement"
Verbs: khaṭkhaṭānā (खटखटाना) "to knock", gaḍgaḍānā (गडगडाना) "to rumble, to fuss", jagmagānā (जगमगाना) "to shine/glitter", hinhinānā (हिनहिनाना) "to neigh", phusphusānā (फुसफुसाना) "to whisper"
Adjectives and Adverbs: čaṭpaṭ (चट-पट) "in a jiffy", tharthar (थर-थर) "with jerky motion (characteristic of shaking or trembling)", čaṭpaṭā (चटपटा) "dextrous, spicy", čipčipā (चिपचिपा) "sticky, slimy", čiṛčiṛā (चिड़चिड़ा) "irritable", gaṛbaṛiyā (गड़बड़िया) "chaotic, messy"
Due to the language's status as a lingua franca, Hindustani's vocabulary has a large inventory of loanwords, the largest number of which are adopted from Punjabi. Punjabi borrowings often bear sound changes from the parent Prakrit and Sanskrit vocabulary which did not occur in Hindustani, particularly the preservation of short vowels in initial syllables and the gemination of the following consonant. A certain amount of vocabulary from other South Asian languages, Persian, Arabic, and English has been loaned indirectly into Hindustani through Punjabi. Other Indic languages which exist in a state of diglossia with Hindustani and are prone to mutual borrowing include Rajasthani, the Western Pahari languages, Haryanvi, Bhojpuri, Marathi, Nepali, and Gujarati. Besides these, common sources of loan words include those manually adopted from Classical Sanskrit, Classical Persian, Arabic, Chagatai Turkic, Portuguese and English, as well as Mandarin Chinese and French to a lesser extent.
Many Classical Sanskrit words which were not learned borrowings underwent phonetic alterations. In the vernacular form, these include the merger of Sanskrit श (śa) and ष (ṣa), ण (ṇa) and न (na) as well as ऋ (r̥) and रि (ri). Other common alterations were s͟h [/ʃ/] (श) becoming s [/s/] (स), v/w [/ʋ/, /w/] (व) becoming b [/b/] (ब) and y [/j/] (य) becoming j [/dʒ/] (ज). Short vowels were also sometimes introduced to break up consonant clusters. Such words in Hindi (and other Indo-Aryan languages except Urdu) are called ardhatatsam (Hindi: अर्धतत्सम "semi-tatsam").
Hindustani (ardhatatsam) | Hindustani (tatsam) | Meaning | Sanskrit | Corresponding Persian loan | |
---|---|---|---|---|---|
baras (बरस) | varṣ (Hindi: वर्ष) | year | वर्ष (varṣa) | sāl (साल), san (सन) | |
des (देस) | deśa (Hindi: देश) | country | देश (deśa) | mulk (मुल्क), vatan (वतन) | |
bāsī (बासी) | vāsī (Hindi: वासी) | inhabitant | वासी (vāsī) | bāśindā (बाशिन्दा) | |
jantar (जन्तर) | yantra (Hindi: यन्त्र) | device | यन्त्र (yantra) | ālā (आला) | |
rāt (रात) | rātri (Hindi: रात्रि) | night | रात्रि (rātri) | śab (शब), lail (लैल) | |
ādhā (आधा) | ardh (Hindi: अर्ध) | half | अर्ध (ardha) | nisf (निस्फ़), nem (नेम) | |
sūraj (सूरज) | sūrya (Hindi: सूर्य) | sun | सूर्य (sūrya) | śams (शम्स), āftāb (आफ़ताब) | |
pūrā (पूरा) | pūrṇ (Hindi: पूर्ण) | complete | पूर्ण (pūrṇa) | tamām (तमाम), mukammal (मुकम्मल) | |
ūncā/ūm̐cā (ऊंचा/ऊँचा) | ucc (Hindi: उच्च) | high/tall | उच्च (ucca) | buland (बुलन्द), aʾalā (आ'ला) | |
kām (काम) | karm (Hindi: कर्म) | action | कर्म (karma) | kār (कार) | |
nēm (नेम) | niyam (Hindi: नियम) | rule | नियम (niyama) | qāʾedā (क़ाएदा) | |
dakhin (दखिन) | dakṣiṇa (Hindi: दक्षिण) | south | दक्षिण (dakṣiṇa) | junūb (जुनूब) | |
nain (नैन) | nayan (Hindi: नयन) | eye | नयन (nayana) | caśm (चश्म) | |
lāj (लाज) | lajjā (Hindi: लज्जा) | shame | लज्जा (lajjā) | śarm (शर्म), hayā (हया) |
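The three consonant substitutions named before the table (s͟h becoming s, v becoming b, and y becoming j) can be illustrated with a small sketch. This is only an illustration on romanized input, not a model of the historical process: the `SOUND_CHANGES` table and the `apply_sound_changes` helper are names assumed here, and the sketch deliberately ignores the vowel insertion and cluster simplification that also shaped the ardhatatsam forms.

```python
import unicodedata

# Consonant substitutions described in the text, on romanized forms.
# This rule table is an illustrative simplification, not a full phonology.
SOUND_CHANGES = {
    "\u015b": "s",  # ś (श) becomes s (स)
    "v": "b",       # v (व) becomes b (ब)
    "y": "j",       # y (य) becomes j (ज)
}


def apply_sound_changes(word: str) -> str:
    """Apply the per-character substitutions to a romanized word."""
    # NFC normalization makes "s" + combining acute equal to precomposed "ś".
    normalized = unicodedata.normalize("NFC", word)
    return "".join(SOUND_CHANGES.get(ch, ch) for ch in normalized)


if __name__ == "__main__":
    # deś, vāsī, yantra (romanized tatsam forms from the table above)
    for tatsam in ("de\u015b", "v\u0101s\u012b", "yantra"):
        print(tatsam, "->", apply_sound_changes(tatsam))
```

Run on deś and vāsī, the sketch yields des and bāsī, matching the table. For yantra it yields jantra rather than the attested jantar, because the inserted short vowel that breaks up the consonant cluster is not modelled.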
Persian words which were not later artificially added were loaned from Classical Persian, the historical variety of the tenth, eleventh and twelfth centuries, which continued to be used as literary language and lingua franca under the Persianate dynasties of the Late Middle Ages and Early Modern Era and is not the same as Modern Persian (though the Dari Persian of Afghanistan is a direct descendant).
Persian loanwords in Hindustani are mainly borrowed nouns and adjectives as well as adverbs and conjunctions and some other parts of speech.
Hindustani | Meaning | Persian | Corresponding Sanskrit loan | |
---|---|---|---|---|
sāyā (साया) | shadow/shade | سایه (sāya) | čhāyā (छाया) | |
pares͟hān (परेशान) | anxious | پرِیشان (parēšān) | čintit (चिंतित) | |
hames͟hā (हमेशा) | always/forever | همِیشه (hamēša) | sadaiv (सदैव), sadā (सदा) | |
k͟hus͟hī (ख़ुशी) | happiness | خوشی (xušī) | ānand (आनंद), sukh (सुख) | |
sabzī (सब्ज़ी) | vegetable | سبزی (sabzī) | sāg (साग), s͟hāk (शाक) | |
mehrbān (मेहरबान) | kind | مهربان (meherbān) | dayālu (दयालु), karunāmaya (करुणामय کرُنامیَ) | |
agar (अगर) | if | اگر (agar) | yadi (यदि) | |
dīvār (दीवार) | wall | دیوار (dīwār) | bhīt (भीत) | |
darvāzā (दरवाज़ा) | door/gate | دروازه (darwāza) | dwār (द्वार) | |
andar (अंदर) | inside/in | اندر (andar) | bhītar (भीतर) | |
tāzā (ताज़ा) | fresh | تازه (tāza) | nirjar (निर्जर), jarhīn (जरहीन جرہین) | |
roz (रोज़) | day | رُوز (rōz) | din (दिन), diwas (दिवस دِوس) | |
s͟hahr (शहर) | city | شهر (šahr) | nagar (नगर) | |
hind (हिंद) | India | هند (hind) | bhārat (भारत) | |
ki (कि) | that (conjunction) | که (ki) | - | |
vāh (वाह) | wow | واه (wāh) | - |
From stems:
Present:
Hindustani | Meaning | Persian verb | Non-Persian alternative | |
---|---|---|---|---|
par (पर) | wing | پریدن (parīdan, "to fly") | paṉkh (पंख) | |
pasand (पसंद) | liked, liking | پسندیدن (pasandīdan, "to prefer") | čahit (चहित), čāhat (चाहत) | |
k͟hwāb/k͟hāb (ख़्वाब/ख़ाब) | dream | خوابیدن (xābīdan, "to sleep") | sapnā (सपना), swapna (स्वप्न) |
Past:
Hindustani | Meaning | Persian verb | Non-Persian alternative | |
---|---|---|---|---|
āmad (आमद) | arrival | آمدن (āmadan, "to come") | āgaman (आगमन) | |
s͟hikast (शिकस्त) | defeat, defeated | شکستن (šikastan, "to break") | parājay (पराजय), parājit (पराजित), hār (हार) | |
giraft (गिरफ़्त) | grip, gripped | گرفتن (giriftan, "to grab") | pakaṛ (पकड़), jabt (जब्त) |
From participles:
Present:
Hindustani | Meaning | Persian verb | Non-Persian alternative | |
---|---|---|---|---|
āyindā/āʾindā (आइन्दा) | future | آمدن (āmadan, "to come") | bhaviṣya (भविष्य), āgāmī (आगामी آگامی) | |
parindā (परिन्दा) | bird | پریدن (parīdan, "to fly") | pančhī (पंछी), pakṣī (पक्षी) | |
zindā (ज़िन्दा) | living, alive | زیستن (zīstan, "to live") | jīvit (जीवित), jīvant (जीवंत جیونت) |
Past:
Hindustani | Meaning | Persian verb | Non-Persian alternative | |
---|---|---|---|---|
bastā (बस्ता) | bag, sack | بستن (bastan, "to bind") | thailā (थैला) | |
pasandīdā (पसन्दीदा) | favorite | پسندیدن (pasandīdan, "to prefer") | priya (प्रिय) | |
murdā (मुर्दा) | dead | مردن (murdan, "to die") | mr̥t (मृत), hat (हत ہت) |
By adding noun suffix ـِش (-iš):
Hindustani | Meaning | Persian verb | Non-Persian alternative | |
---|---|---|---|---|
parvaris͟h (परवरिश) | upbringing, rearing | پروردن (parwardan, "to foster") | pālanpoṣaṇ (पालन-पोषण) | |
kos͟his͟h (कोशिश) | effort, attempt | کوشیدن (kōšīdan, "to attempt") | prayās (प्रयास) | |
varzis͟h (वर्ज़िश) | exercise | ورزیدن (warzīdan, "to exercise") | vyāyām (व्यायाम) | |
āzmāʾis͟h (आज़माइश) | trial, test | آزمودن (āzmūdan, "to test") | vičāraṇ (विचारण), parīkṣaṇ (परीक्षण) |
By forming composite words with Arabic:
Hindustani | Meaning | Persian affix | Arabic element | Non-Persian alternative | |
---|---|---|---|---|---|
k͟hūbsūrat (ख़ूबसूरत) | beautiful | خوب (xūb, "good") | صورت (sūrat, "appearance") | sundar (सुंदर) | |
darasal (दरअसल) | actually | در (dar, "at, in") | اصل (asl, "reality") | vastutah (वस्तुत:), vastabik (वास्तबिक واستبِک) | |
fīsad (फ़ीसद) | percent | صد (sad, "hundred") | فی (fī, "in, at") | pratis͟hat (प्रतिशत), s͟hatansh (शतांश ستانش) | |
rahmdil (रहमदिल) | compassionate | دل (dil, "heart") | رحم (rahm, "mercy") | kr̥pālū (कृपालु), saday (सदय) |
A substantial number of Hindustani verbs have been loaned from Punjabi; however, verb stems originating in less closely related languages are relatively rare. There are a few common verbs formed directly out of Persian stems (or nouns in some cases), listed below.
Hindustani verb | Verb meaning | Persian stem | Stem meaning | Non-Persian alternative | |
---|---|---|---|---|---|
k͟harīdnā (ख़रीदना) | to buy | خرید (xarīd) - noun | buy, purchase | kray karnā (क्रय करना), mol lenā (मोल लेना) | |
guzārnā (गुज़ारना) | to pass (transitive), to spend | گذار (guẕār) | letting | bitānā (बिताना) | |
navāznā (नवाज़ना) | to bestow, to patronize, to favor | نواز (nawāz) | playing, caressing | pradān karnā (प्रदान करना), arpit karnā (अर्पित करना), kr̥pā karnā (कृपा करना), sahāyatā denā (सहायता देना) | |
guzarnā (गुज़रना) | to pass (intransitive), to occur | گذر (guẕar) | passing | bītnā (बीतना) | |
farmānā (फ़रमाना) | to dictate, to say (formal) | فرما (farmā) | ordering, saying (formal) | āgyā karnā (आज्ञा करना), ādes͟h karnā (आदेश करना) | |
badalnā (बदलना) | to change | بدل (badl) - noun | substitute, change | parivartan karnā (परिवर्तन करना), vinimay karnā (विनिमय करना), palṭā denā (पलटा देना) | |
laraznā (लरज़ना) | to tremble | لرز (laraz) | shivering | kāṉpnā (कांपना) | |
guzrānnā (गुज़रानना) | to pass time, to present, to adduce | گذران (guẕarān) | passing time | prastut karnā (प्रस्तुत करना), sāmne rakhnā (सामने रखना) |
Some of the most commonly used words from Arabic, all entering the language through Persian, include vaqt (वक़्त) "time", qalam (क़लम) "pen", kitāb (किताब) "book", qarīb (क़रीब) "near", sahīh/sahī (सहीह/सही) "correct, right", g͟harīb (ग़रीब) "poor", amīr (अमीर) "rich", duniyā (दुनिया) "world", hisāb (हिसाब) "calculation", qudrat (क़ुदरत) "nature", nasīb (नसीब) "fate, luck, fortune", ajīb (अजीब) "strange, unusual", qānūn (क़ानून) "law", filhāl (फ़िलहाल) "currently", sirf (सिर्फ़) "only, mere", taqrīban (तक़रीबन) "close to, about", k͟habar (ख़बर) "news", ak͟hbār (अख़बार) "newspaper", qilā (क़िला) "fort", kursī (कुर्सी) "chair, seat", s͟harbat (शर्बत) "drink, beverage", muāf/māf (मुआफ़/माफ़) "forgiven, pardoned", zarūrī (ज़रूरी) "necessary", etc.[13]
There are a very small number of Turkic words in Hindustani, numbering as few as 24 according to some sources,[14] all entering the language through Persian. Other words attributed to Turkish, the most widely spoken Turkic language, are actually words which are common to Hindustani and Turkish but are of non-Turkic origins, mostly Perso-Arabic.[15] Both languages also share mutual loans from English. Most notably, some honorifics and surnames common in Hindustani are Turkic due to the influence of the ethnically Turkic Mughals; these include k͟hānam (ख़ानम), bājī (बाजी) "sister", and begam (बेगम). Common surnames include k͟hān (ख़ान), čug͟htāʾī (चुग़ताई), pās͟hā (पाशा), and arsalān (अर्सलान). Common Turkic words used in everyday Hindustani are qaiṉčī/qainčī (क़ैंची) "scissors", annā (अन्ना) "governess", tamg͟hā (तमग़ा) "stamp, medal", and čaqmaq (चक़मक़) "flint".
There are not many Chinese words that were loaned into Hindustani in spite of geographical proximity.
Hindustani | Meaning | Chinese/Sinitic | Note |
---|---|---|---|
čāy/čāʾe (चाय) | tea | 茶 (chá) | Derived through Persian چای (čāy) |
čīn (चीन) | China | 秦 (qín) | Derived through Sanskrit चीन (cīna) |
līčī (लीची) | lychee | 茘枝 (lìzhī) | |
A small number of Hindustani words were derived from Portuguese due to interaction with colonists and missionaries. These include the following:
Hindustani | Meaning | Portuguese |
---|---|---|
anannās/anānās (अनन्नास/अनानास) | pineapple | ananás |
pādrī (पाद्री) | priest | padre |
bālṭī (बाल्टी) | bucket | balde |
čābī (चाबी) | key | chave |
girjā (गिर्जा) | church | igreja |
almārī (अलमारी) | cupboard | armário |
botal (बोतल) | bottle | botelha |
aspatāl (अस्पताल) | hospital | hospital |
olandez/valandez (ओलंदेज़/वलंदेज़) | Dutch | holandês |
A few French loans exist in Hindustani resulting from French colonial settlements in India. Other French words such as s͟hemīz (शेमीज़) "chemise" and kūpan (कूपन) "coupon" have entered the language through English.
Hindustani | Meaning | French |
---|---|---|
kārtūs (कारतूस) | cartridge | cartouche |
restorāṉ (रेस्तोरां/रेस्तोराँ) | restaurant | restaurant |
Loanwords from English were borrowed through interaction with the British East India Company and later British rule. English-language education for the native administrative and richer classes during British rule accelerated the adoption of English vocabulary in Hindustani. Many technical and modern terms were and still are borrowed from English, such as ḍākṭar/ḍôkṭar (डाक्टर/डॉक्टर) "doctor", ṭaiksī (टैक्सी) "taxi", and kilomīṭar (किलोमीटर) "kilometer".
Some loanwords from English undergo a significant phonetic transformation. This can be done intentionally, in order to nativize words or to make them sound more or less "English", or it can happen naturally. Words often undergo a phonetic change to make them easier for native speakers to pronounce, while others change due to a lack of English education or incomplete knowledge of English phonetics, so that an alternate pronunciation becomes the accepted norm and overtakes the original as the most used pronunciation.
Hindustani | English |
---|---|
darjan (दर्जन)[16] | dozen |
tijorī (तिजोरी) | treasury |
satalta (सतलता) | subtlety |
māčis (माचिस) | match(es) |
godām (गोदाम) | godown |
bigul (बिगुल) | bugle |
raṉgrūṭ (रंगरूट) | recruit |
ṭamāṭar (टमाटर) | tomato |
kābīnā (काबीना) | cabinet |
ketlī (केतली) | kettle |
darāz (दराज़) | drawer(s) |
bam (बम) | bomb |
lālṭen (लालटेन) | lantern |
būčaṛ (बूचड़) | butcher |
ṭaṉkī (टंकी) | tank |
baksā (बक्सा) | box |
janvarī (जनवरी) | January |