Pakistan is a multilingual country with over 70 languages spoken as first languages.[1] [2] The majority of Pakistan's languages belong to the Indo-Iranian group of the Indo-European language family.[3] [4]
Urdu is the national language and the lingua franca of Pakistan, and while sharing official status with English, it is the preferred and dominant language used for inter-communication between different ethnic groups. Numerous regional languages are spoken as first languages by Pakistan's various ethnolinguistic groups. Languages with more than a million speakers each include Punjabi, Pashto, Sindhi, Saraiki, Urdu, Balochi, Hindko, Pahari-Pothwari and Brahui.[5] There are approximately 60 local languages with fewer than a million speakers.[6]
The 2022 edition of Ethnologue lists 77 established languages in Pakistan. Of these, 68 are indigenous and 9 are non-indigenous. In terms of their vitality, 4 are classified as 'institutional', 24 are 'developing', 30 are 'vigorous', 15 are 'in trouble', and 4 are 'dying'.
Sindh | Indo-Aryan | |
Khyber Pakhtunkwa | Iranian | |
Punjab, Sindh | Indo-Aryan | |
Balochistan, Punjab, Sindh | Iranian | |
Balochistan, Sindh | Iranian | |
Balochistan, Sindh | Iranian | |
Gilgit Baltistan | Sino-Tibetan | |
Khyber Pakhtunkwa | Indo-Aryan | |
Sindh | Indo-Aryan | |
Balochistan, Sindh | Dravidian | |
Gilgit Baltistan | Isolate | |
Khyber Pakhtunkwa | Indo-Aryan | |
Khyber Pakhtunkwa | Indo-Aryan | |
Khyber Pakhtunkwa | Iranian | |
Balochistan | Iranian | |
Sindh | Indo-Aryan | |
Gilgit Baltistan | Indo-Aryan | |
English | Federal co-official | Germanic |
Khyber Pakhtunkwa | Indo-Aryan | |
Khyber Pakhtunkwa | Indo-Aryan | |
Sindh | Indo-Aryan | |
Sindh | Indo-Aryan | |
Khyber Pakhtunkwa | Indo-Aryan | |
Sindh | Indo-Aryan | |
Azad Kashmir, Gilgit Baltistan, Khyber Pakhtunkwa, Punjab | Indo-Aryan | |
Sindh | Indo-Aryan | |
Balochistan | Iranian | |
Azad Kashmir, Khyber Pakhtunkwa | Indo-Aryan | |
Khyber Pakhtunkwa, Punjab | Indo-Aryan | |
Balochistan, Sindh | Indo-Aryan | |
Sindh | Indo-Aryan | |
Sindh | Indo-Aryan | |
Sindh | Indo-Aryan | |
Sindh | Indo-Aryan | |
Khyber Pakhtunkwa | Indo-Aryan | |
Khyber Pakhtunkwa | Indo-Aryan | |
Khyber Pakhtunkwa | Iranian | |
Azad Kashmir | Indo-Aryan | |
Khyber Pakhtunkwa | Iranian | |
Balochistan | Indo-Aryan | |
Gilgit Baltistan, Khyber Pakhtunkwa | Indo-Aryan | |
Khyber Pakhtunkwa | Indo-Aryan | |
Sindh | Indo-Aryan | |
Sindh | Indo-Aryan | |
Sindh | Indo-Aryan | |
Azad Kashmir | Indo-Aryan | |
Balochistan | Indo-Aryan | |
Sindh | Indo-Aryan | |
Khyber Pakhtunkwa | Indo-Aryan | |
Punjab, Sindh | Indo-Aryan | |
Punjab, Sindh | Indo-Aryan | |
Sindh | Indo-Aryan | |
Punjab, Sindh | Indo-Aryan | |
Khyber Pakhtunkwa | Iranian | |
Pahari-Pothwari | Azad Kashmir, Punjab | Indo-Aryan |
Throughout | Indo-Pakistani Sign Language | |
Khyber Pakhtunkwa | Indo-Aryan | |
Balochistan, Khyber Pakhtunkwa, Punjab | Iranian | |
Khyber Pakhtunkwa, Punjab | Iranian | |
Balochistan, Khyber Pakhtunkwa, Punjab | Iranian | |
Punjab | Indo-Aryan | |
Punjab | Indo-Aryan | |
Balochistan, Khyber Pakhtunkwa, Punjab, Sindh | Indo-Aryan | |
Khyber Pakhtunkwa | Iranian | |
Khyber Pakhtunkwa | Indo-Aryan | |
Azad Kashmir, Gilgit Baltistan, Khyber Pakhtunkwa | Indo-Aryan | |
Khyber Pakhtunkwa | Indo-Aryan | |
Balochistan, Sindh | Indo-Aryan | |
Sindh | Indo-Aryan | |
Sindh | Dravidian | |
Khyber Pakhtunkwa | Indo-Aryan | |
Throughout | Indo-Aryan | |
Khyber Pakhtunkwa | Indo-Aryan | |
Sindh | Indo-Aryan | |
Gilgit Baltistan, Khyber Pakhtunkwa | Iranian | |
Balochistan | Iranian | |
Khyber Pakhtunkwa | Iranian | |
1 | Punjabi | 36.98% | 38.78% | 44.15% | 48.17% | 56.39% | 57.08% | |
2 | Pashto | 18.15% | 18.24% | 15.42% | 13.35% | 8.47% | 8.16% | |
3 | Sindhi | 14.31% | 14.57% | 14.1% | 12.7% | 12.59% | 12.85% | |
4 | Saraiki | 12.00% | 12.19% | 10.53% | 9.54% | |||
5 | Urdu | 9.25% | 7.08% | 7.57% | 7.60% | 7.57% | 7.05% | |
6 | Balochi | 3.38% | 3.02% | 3.57% | 3.02% | 2.49% | 3.04% | |
7 | Hindko | 2.32% | 2.44% | |||||
8 | Brahui | 1.16% | 1.24% | |||||
9 | Others | 2.45% | 2.44% | 4.66% | 5.62% | 12.49% | 11.82% |
*Census data for the Pakistani administered territories of Gilgit Baltistan and Azad Kashmir not available as of 2024.
Urdu is the national language and lingua franca of Pakistan.[8] Although only about 9% of Pakistanis speak it as their first language, it is widely spoken and understood as a second language by the vast majority of Pakistanis.[9] [10]
Urdu was chosen as a symbol of unity for the new state of Pakistan in 1947, because it had already served as a lingua franca among Muslims in north and northwest British India.[11] It is written, spoken and used in all provinces/territories of Pakistan, and together with English as the main languages of instruction,[12] although the people from differing provinces may have different native languages.[13]
Urdu is taught as a compulsory subject up to higher secondary school in both English and Urdu medium school systems, which has produced millions of second-language Urdu speakers among people whose native language is one of the other languages of Pakistan – which in turn has led to the absorption of vocabulary from various regional Pakistani languages,[14] while some Urdu vocabularies has also been assimilated by Pakistan's regional languages.[15] [16]
See also: Pakistani English. English is a co-official language of Pakistan and is widely used in the executive, legislative and judicial branches as well as to some extent in the officer ranks of Pakistan's armed forces. Pakistan's Constitution and laws were written in English and are now being re-written in the local languages. It is also widely used in schools, colleges and universities as a medium of instruction. English is seen as the language of upward mobility, and its use is becoming more prevalent in upper social circles, where it is often spoken alongside native Pakistani languages. In 2015, it was announced that there were plans to promote Urdu in official business, but Pakistan's Minister of Planning Ahsan Iqbal stated, "Urdu will be a second medium of language and all official business will be bilingual." He also went on to say that English would be taught alongside Urdu in schools.[17]
See also: Punjabi dialects and languages. Punjabi is an Indo-Aryan language primarily spoken in the Punjab province of Pakistan, with the prominent dialect being the Majha dialect, written in the Shahmukhi script. Punjabi is the most widely spoken language in Pakistan. It is spoken as a first language by 38.78% of Pakistanis.[18] The language is spoken among a significant overseas diaspora, particularly in Canada, the United Kingdom, and the United States. Punjabi is unusual among the Indo-Aryan languages and the broader Indo-European language family in its usage of lexical tone.[19]
Pashto is an Iranian language spoken as a first language by more than 18.24% of Pakistanis, mainly in Khyber Pakhtunkhwa and in northern Balochistan as well as in ethnic Pashtun communities in the cities of Islamabad, Rawalpindi, Lahore, and most notably Karachi,[20] [21] [22] [23] which may have the largest Pashtun population of any city in the world.[24] There are three major dialect patterns within which the various individual dialects may be classified; these are the Pakhto variety of Northern (Peshawar) variety, the southern Pashto spoken in the vicinity of Quetta, and the Wanetsi or Tareeno variety of northern Balochistan.
Sindhi is an Indo-Aryan language spoken as a first language by almost 15% of Pakistanis, mostly in the Sindh province of Pakistan. The name "Sindhi" is derived from Sindhu, the original name of the Indus River.[25]
Like other languages of this family, Sindhi has passed through Old Indo-Aryan (Sanskrit) and Middle Indo-Aryan (Pali, secondary Prakrits, and Apabhramsha) stages of growth. 20th century Western scholars such as George Abraham Grierson believed that Sindhi descended specifically from the Vrācaḍa dialect of Apabhramsha (described by Markandeya as being spoken in Sindhu-deśa) but later work has shown this to be unlikely.[26] It entered the New Indo-Aryan stage around the 10th century CE.[27] [28]
The six major known dialects of the Sindhi language are Siroli, Vicholi, Lari, Thari, Lasi and Kutchi.[29]
Saraiki is an Indo-Aryan language of the Lahnda group, spoken in central and southeastern Pakistan, primarily in the southern part of the province of Punjab. Saraiki is to a high degree mutually intelligible with Standard Punjabi and shares with it a large portion of its vocabulary and morphology. At the same time in its phonology it is radically different (particularly in the lack of tones, the preservation of the voiced aspirates and the development of implosive consonants), and has important grammatical features in common with the Sindhi language spoken to the south.
Saraiki is the language of about 26 million people in Pakistan, ranging across southern Punjab, southern Khyber Pakhtunkhwa, and border regions of northern Sindh and eastern Balochistan.[30]
Balochi is an Iranian language spoken as a first language by about 3% of Pakistanis, mostly in the Balochistan province. Rakshani is the major dialect group in terms of numbers. Sarhaddi is a sub-dialect of Rakshani. Other sub-dialects are Kalati (Qalati), Chagai-Kharani and Panjguri. Eastern Hill Balochi or Northern Balochi is very different from the rest.
Hindko is a cover term for a diverse group of Lahnda dialects spoken in several discontinuous areas in northwestern Pakistan, primarily in the provinces of Khyber Pakhtunkhwa and Punjab.Hindko is mutually intelligible with Punjabi and Saraiki, and has more affinities with the latter than with the former. Differences with other Punjabi varieties are more pronounced in the morphology and phonology than in the syntax.The word Hindko, commonly used to refer to a number of Indo-Aryan dialects spoken in the neighbourhood of Pashto, likely originally meant "the Indian language" (in contrast to Pashto).[31] An alternative local name for this language group is Hindki.
Brahui is a Dravidian language spoken in the central part of Balochistan province. Brahui is spoken in the central part of Pakistani Balochistan, mainly in Kalat, Khuzdar and Mastung districts, but also in smaller numbers in neighboring districts, as well as in Afghanistan which borders Pakistani Balochistan; however, many members of the ethnic group no longer speak Brahui.
Other languages spoken by linguistic minorities include the languages listed below, with speakers ranging from a few hundred to tens of thousands. A few are highly endangered languages that may soon have no speakers at all.[32] The United Nations Educational, Scientific and Cultural Organization defines five levels of language endangerment between "safe" (not endangered) and "extinct":[33]
The list below includes the findings from the third edition of Atlas of the World's Languages in Danger (2010; formerly the Red Book of Endangered Languages), as well as the online edition of the aforementioned publication, both published by UNESCO.[34]
Balti | Vulnerable | Also spoken in: India | bft |
Bashkarik | Definitely endangered | gwc, xka | |
Badeshi | Critically endangered | bdz | |
Bateri | Definitely endangered | btv | |
Bhadravahi | Definitely endangered | Also spoken in: India | bhd |
Brahui | Vulnerable | Also spoken in: Afghanistan | brh |
Burushaski | Vulnerable | bsk | |
Chilisso | Severely endangered | clh | |
Dameli | Severely endangered | dml | |
Domaaki | Severely endangered | dmk | |
Gawar-Bati | Definitely endangered | Also spoken in: Afghanistan | gwt |
Gowro | Severely endangered | gwf | |
Jadgali | jdg | ||
Kalasha language | Severely endangered | Not to be confused with Kalasha-ala | kls |
Kalkoti | Severely endangered | ||
Kati (Kamkata-viri, Kata-vari, Kamviri) | Definitely endangered | Also spoken in: Afghanistan | bsh, xvi |
Khowar | Vulnerable | khw | |
Kundal Shahi | Definitely endangered | Also spoken in: India | |
Maiya | Vulnerable | mvy | |
Ormuri | Definitely endangered | Also spoken in: Afghanistan | oru |
Phalura | Definitely endangered | phl | |
Purik | Vulnerable | Also spoken in: India | prx |
Savi | Definitely endangered | Also spoken in: Afghanistan | sdg |
Spiti | Vulnerable | Also spoken in: India | spt |
Torwali | Definitely endangered | trw | |
Ushojo | Definitely endangered | ush | |
Wakhi | Definitely endangered | Also spoken in: China, Tajikistan, Afghanistan | wbl |
Yidgha | Definitely endangered | ydg | |
Zangskari | Definitely endangered | Also spoken in: India | zau |
Arabic is used as a religious language by Muslims. The Quran, Sunnah, Hadith and Muslim theology is taught in Arabic with Urdu translation. Arabic is taught as a religious language in mosques, schools, colleges, universities and madrassahs. A majority of Pakistan's Muslim population has had some form of formal or informal education in the reading, writing and pronunciation of Arabic as part of their religious education. However, Pakistanis are not Arabs and do not speak Arabic.[35]
Arabic is mentioned in the constitution of Pakistan. It declares in article 31 No. 2 that "The State shall endeavour, as respects the Muslims of Pakistan (a) to make the teaching of the Holy Quran and Islamiat compulsory, to encourage and facilitate the learning of Arabic language ..."[36]
The National Education Policy 2017 declares in article 3.7.4 that: "Arabic as compulsory part will be integrated in Islamiyat from Middle to Higher Secondary level to enable the students to understand the Holy Quran." Furthermore, it specifies in article 3.7.6: "Arabic as elective subject shall be offered properly at Secondary and Higher Secondary level with Arabic literature and grammar in its course to enable the learners to have command in the language." This law is also valid for private schools as it defines in article 3.7.12: "The curriculum in Islamiyat, Arabic and Moral Education of public sector will be adopted by the private institutions to make uniformity in the society."[37]
See main article: Persian language in South Asia.
See also: Persian and Urdu. Persian was the official of the region up until the late 19th century when the English passed several laws to replace it with local languages. Persian had a long history in the lands of Pakistan and was the cultural language of the erstwhile Mughal Empire, a continuation since the introduction of the language by Central Asian Turkic invaders who migrated into the Indian Subcontinent,[38] and the patronisation of it by the earlier Turko-Persian Delhi Sultanate. Persian was officially abolished as a language of administration with the arrival of the British: in Sindh in 1843 and in Punjab in 1849.
Today the eastern Dari dialect of Persian is spoken by refugees from Afghanistan and a small number of local Balochistani Hazara community. A larger number of Pakistani Hazaras speak Hazaragi dialect.[39] In the Madaklasht valley of Chitral, the Madaklashti dialect of Tajik Persian is spoken by the descendants of ironmongers from Badakhshan who settled there in the eighteenth century.
some Pakistanis are learning Mandarin to do business with companies from the People's Republic of China.[40]
Most of the languages of Pakistan belong to the Indo-Iranian branch of the Indo-European language family.[41] [42] The common ancestor of all of the languages in this family is called Proto-Indo-Iranian—also known as Common Aryan—which was spoken in approximately the late 3rd millennium BC. The three branches of the modern Indo-Iranian languages are Indo-Aryan, Iranian, and Nuristani. A fourth independent branch, Dardic, was previously posited, but recent scholarship in general places Dardic languages as archaic members of the Indo-Aryan branch.[43]
Majority of the languages spoken in eastern regions of Pakistan belong to the Indo-Aryan group.
Modern Indo-Aryan languages descend from Old Indo-Aryan languages such as early Vedic Sanskrit, through Middle Indo-Aryan languages (or Prakrits).[44] [45] [46] [47]
Some of the important languages in this family are dialect continuums. One of these is Lahnda,[48] and includes Saraiki (spoken mostly in southern Pakistani Punjab by about 26 million people), the diverse varieties of Hindko (with almost five million speakers in north-western Punjab and neighbouring regions of Khyber Pakhtunkhwa, especially Hazara), Pahari/Pothwari (3.5 million speakers in the Pothohar region of Punjab, Azad Kashmir and parts of Indian Jammu and Kashmir), Khetrani (20,000 speakers in Balochistan), and Inku (a possibly extinct language of Afghanistan).
Majority of the languages spoken in western regions of Pakistan belong to the Iranic group. There are several dialects continuums in this family as well: Balochi, which includes Eastern, Western and Southern Balochi;[49] and Pashto, and includes Northern, Central, and Southern Pashto.[50]
The following three languages of Pakistan are not part of the Indo-European language family:
See main article: Nastaliq and Urdu alphabet. Most languages of Pakistan are written in the Perso-Arabic script. The Mughal Empire adopted Persian as the court language during their rule over South Asia as did their predecessors, such as the Ghaznavids. During this time, the Nastaʿlīq style of the Perso-Arabic script came into widespread use in South Asia, and the influence remains to this day. In Pakistan, almost everything in Urdu is written in the script, concentrating the greater part of Nastaʿlīq usage in the world.The Urdu alphabet is a right-to-left alphabet. It is a modification of the Persian alphabet, which is itself a derivative of the Arabic alphabet. With 38 letters, the Urdu alphabet is typically written in the calligraphic Nasta'liq script.
Sindhi adopted a variant of the Persian alphabet as well, in the 19th century. The script is used in Pakistan today, albeit unlike most other native languages of Pakistan, the Naskh style is more common for Sindhi writing than the Nasta'liq style. It has a total of 52 letters, augmenting the Urdu with digraphs and eighteen new letters (Sindhi: ڄ ٺ ٽ ٿ ڀ ٻ ڙ ڍ ڊ ڏ ڌ ڇ ڃ ڦ ڻ ڱ ڳ ڪ) for sounds particular to Sindhi and other Indo-Aryan languages. Some letters that are distinguished in Arabic or Persian are homophones in Sindhi.
Balochi and Pashto are written in Perso-Arabic script. The Shahmukhī script, a variant of the Urdu alphabet, is used to write the Punjabi language in Pakistan.
Usually, bare transliterations of Urdu into Roman letters, Roman Urdu, omit many phonemic elements that have no equivalent in English or other languages commonly written in the Latin script. The National Language Authority of Pakistan has developed a number of systems with specific notations to signify non-English sounds, but these can only be properly read by someone already familiar with Urdu.
This is a series of maps which shows the distribution of different languages in Pakistan as of the 2017 Pakistan Census. These all refer to the mother tongues of individuals only.