Sindhi transliteration explained

Sindhi is a language broadly spoken by the people of the historical Sindh region in the Indo subcontinent. Modern Sindhi is written in an extended Perso-Arabic script in Sindh province of Pakistan[1] and (formally) in extended-Devanagari by Sindhis in partitioned India.[2] Historically, Sindhi was written in various forms of Landa scripts and various other Indic scripts.[3]

Sindhi Transliteration is essential to convert between Arabic and Devanagari so that speakers of both the countries can read the text of each other.[4] In modern day, Sindhi script colloquially just refers to the Perso-Arabic script since majority of Sindhis are from Pakistan. It is also important to note that the Sindhi script is not same as the Urdu-Shahmukhi script,[5] hence one cannot use script conversions like Hindi-Urdu Transliteration.

Technically, a direct one-to-one mapping or rule-based script conversion is not possible between Pakistani and Indian Sindhi, majorly since Devanagari is an abugida script and Arabic-Sindhi is an abjad script, and also other constraints like multiple similar characters from Perso-Arabic which map onto a single character in Devanagari.[6] Hence it is preferred to use dictionary-based or machine learning-based transliteration between the Sindhi scripts.[7] For colloquial usage in the digital space where writing Sindhi in Latin script is prevalent, Romanisation of Sindhi is used.[8]

In addition to Sindhi, there have been attempts to design Indo-Pakistani transliteration systems for digraphic languages like Punjabi (written in Gurmukhi in East Punjab and Shahmukhi in West Punjab), Saraiki (written in an extended-Shahmukhi script in Saraikistan and unofficially in Sindhi-Devanagari script in India) and Kashmiri (written in extended Perso-Arabic by Kashmiri Muslims and extended-Devanagari by Kashmiri Hindus).[9] [10] [11]

Consonants

The following table provides an approximate one-to-one mapping for modern Sindhi consonants, especially for computational purposes (lossless script conversion). Note that this direct script conversion will not yield correct spellings, but rather a readable text for both the readers.[12]

Sindhi Consonants! Sindhi Abjad !! Urdu-Shahmukhi[13] !! Roman !! Devanagari[14]
ڪ ک k
ک کھ kh
ق ق q क़
خ خ k͟h ख़
گ گ g
غ غ g͟h ग़
ڳ ڳ
گھ گھ gh
چ چ c
ڇ چھ ch
ج ج j
جھ جھ jh
ڄ ڄ
ز ز z ज़
ذ ذ z ज़
ض ض z ज़
ظ ظ z ज़
ژ ژ zh झ़
ٽ ٹ
ٺ ٹھ ṭh
ڏ ݙ
ڊ ڈ
ڍ ڈھ ḍh
ڙ ڑ ड़
ڙھ ڑھ ṛh ढ़
ت ت t
ٿ تھ th
ط ط t
د د d
ڌ دھ dh
ن ن n
ڻ ݨ
ڱ ن٘
ڃ ݩ ñ
پ پ p
ڦ پھ ph
ف ف f फ़
ب ب b
ٻ ٻ ॿ
ڀ بھ bh
م م m
ي ی y
ر ر r
ل ل l
ݪ لؕ
و و v
ش ش sh
ݜ سؕ ṣh
س س s
ص ص s
ث ث s
ه ہ h
ح ح h
ۃ ۃ h
ع ع ʿ ʿ

Single-letter Word Ligatures

Sindhi Urdu Roman Devanagari
۽ اَیں a͠i ऐं
۾ میں mẽ में

Numerals

Usage Numeral System
Pakistan Arabic numerals۰ ۱ ۲ ۳ ۴ ۵ ٦ ۷ ۸ ۹
International 0 1 2 3 4 5 6 7 8 9
India

Punctuations & Symbols

Script Period Question Mark Comma Semi-colon Slash Percent End of verse
۔ ؟ ، ؛ ؍ ٪ ۝
? , / %

Web Application for Sindhi Translitration

This Program is about auto Translitration of Sindhi script into asian Languages . with Artificial Intelligence features, actually transliteration programs may be on bases like letter based transliteration in which multiple alternates of any letter in second languages die and one remains..... some are word base transliterations in which as more as half million words should be first vowel-led and then would be written an other language, this is also time taken work. now this is A I time so this program called sindhi transliteration is one click solution to all multiple alternate letters.... for example if someone writes the program wil transliterate it accurately as کوکر خوراڪ کائي ٿو. Sindhi has three vowels which also are used as consonants like ا، و، ي. For example alif has two sounds like in ada ادا has two sounds one is of alif and an other is vowel of aa. Also, if someone writes سنڌ without zer zabar and peshu the program will convert it with vowels like .

Sindhi Trans[15] literation سنڌي ٽرانسلٽريشن

Notes and References

  1. Web site: 2016-10-03. Sindhi gains national status. 2021-07-02. The Express Tribune. en.
  2. Web site: Languages Included in the Eighth Schedule of the Indian }} ]. . 2018-04-09.
  3. Web site: Lakhani. Rakesh. 2013-06-06. Forgotten Sindhi Script – Waranki. 2021-07-02. Indigenous Sindhis. en.
  4. Web site: 2014-09-18. Transcending barriers: Software to break down the wall within the Sindhi language. 2021-07-02. The Express Tribune. en.
  5. News: The Imposition Of Urdu. 12 September 2015. NAWAIWAQT GROUP OF NEWSPAPERS. September 10, 2015.
  6. Leghari. Mehwish. U Rahman. Mutee. 2015-10-30. Towards Transliteration between Sindhi Scripts Using Roman Script. 3820433. en. Linguistics and Literature Review . 1 . 2 . 95–104. 10.32350/llr.12.03. free.
  7. Lehal. Gurpreet Singh. Saini. Tejinder Singh. December 2014. Sangam: A Perso-Arabic to Indic Script Machine Transliteration Model. en-us. 232–239.
  8. Sodhar. Irum Naz. Jalbani. Akhtar Hussain. Channa. Muhammad Ibrahim. Hakro. Dil Nawaz. 2021-04-01. Romanized Sindhi Rules for Text Communication. Mehran University Research Journal of Engineering and Technology. en. 40. 2. 298–304. 10.22581/muet1982.2102.04. 2021MURJE..40..298S . 2413-7219. free.
  9. Web site: Perso-Arabic To Indic Script Transliteration. 2021-04-07. sangam.learnpunjabi.org.
  10. Web site: Saraiki - Devanagari Machine Transliteration System - SDMTS. 2021-08-09. www.sanlp.org.
  11. Lawaye. Aadil. Kak. Aadil. Mehdi. Nali. January 2010. Building a Cross Script Kashmiri Converter: Issues and Solutions. Proceedings of Oriental COCOSDA.
  12. Pedersen. Thomas T.. Sindhi: Transliteration of Non-Roman Scripts. Transliteration of Non-Roman Scripts.
  13. Carefully note the differences. List also includes 4 implosive consonants from Saraiki alphabet (ڳ, ڄ, ݙ, ٻ)
  14. All the consonants with nuqta sign (dots below) denote adaptation for PersoArabic consonants
  15. Web site: Sindhi Translitration سنڌي ٽرانسلٽريشن، Sindhi Script Converter سنڌي لپي مٽائيندڙ . 2022-04-18 . fahad-memon.com/sindhi_transliteration.