Languages often outlive the scripts they are written in. Sindhi is unusual in that it has been written in four different scripts during recorded history, and at least two of them remain in active use today. The Perso-Arabic form that this keyboard supports is not the oldest, nor was its dominance inevitable. It became the standard in what is now Pakistan through a specific set of 19th-century decisions — and those decisions had consequences that are still visible in the Unicode Standard.
The Khudabadi Script
The oldest writing system with a continuous Sindhi literary tradition is Khudabadi — a syllabic script descended from the Landa merchants' script used across the Punjab and Sindh for commercial record-keeping. Khudabadi developed its own distinct letter forms by the 16th century and became associated primarily with the Sindhi Hindu merchant community, who used it for trade accounts, religious texts, and literature.
The name Khudabadi refers to Khudabad, a former capital of Sindh under the Kalhora rulers. The script's geographic spread followed trade routes — Sindhi merchants carried it to Rajasthan, Gujarat, and eventually to diaspora communities in East Africa, Southeast Asia, and beyond.
Khudabadi is endangered today. UNESCO lists it as such. It was encoded in Unicode 8.0 (2015) at block U+112B0–U+112FF, a recognition of its historical importance even as its active use contracts. The traders who kept it alive for centuries are now largely literate in Devanagari or Perso-Arabic, and the script is maintained primarily by scholars and community organisations rather than in everyday use.
Devanagari Sindhi
Sindhi in the Devanagari script exists primarily in India, where Hindu Sindhi communities settled after Partition in 1947. The Indian government recognised Sindhi as one of the 22 scheduled languages of the Indian Constitution, and state support has gone primarily to the Devanagari form.
Devanagari Sindhi required its own adaptations — the script needed additional characters to represent the implosive consonants that are as central to Sindhi phonology as they are in the Perso-Arabic form. The Unicode encodings for Devanagari additions for Sindhi are found in the Devanagari Extended block.
The divergence between Devanagari and Perso-Arabic Sindhi is not merely orthographic — it reflects the community divide created by Partition, where a single language was distributed between two countries and two script traditions. A Sindhi speaker in Ahmedabad and a Sindhi speaker in Hyderabad (Sindh) may speak mutually intelligible dialects but read in different scripts.
The British Standardisation: 1853
The Perso-Arabic form of Sindhi as it exists today was substantially shaped by a decision made in 1853 by a committee working under the direction of Sir Bartle Frere, the Chief Commissioner of Sindh under British India. Before this standardisation, Sindhi was written in Arabic script with varying conventions — some writers borrowed Urdu practice, others used Persian conventions, and the number of additional characters used for Sindhi-specific sounds varied from one printer or scribe to another.
The 1853 standardisation committed to a specific set of letter forms for the sounds that Arabic and Urdu did not cover. The decision to represent implosive consonants, aspirated consonants, and retroflex consonants with dedicated letter forms — rather than the diacritic-based system sometimes used in Arabic-script adaptations of South Asian languages — gave Sindhi its distinctive 52-letter alphabet.
This was not a purely linguistic decision. It was also an administrative one. The British government of Sindh needed a standardised orthography for official correspondence, for the courts, and for the schools it was establishing. The 1853 form became the official script for Sindhi administration under British rule, and its institutional backing helped it persist.
The Interlude: Devanagari in Sindh
Between 1868 and 1882, the British government of Bombay Presidency (which administered Sindh) replaced the Perso-Arabic script with Devanagari for official use. This was part of a broader effort by some British officials to reduce the influence of Arabic and Persian in the subcontinent and promote Sanskritic forms. Sindhi Hindu communities had no objection; Muslim Sindhi communities did.
The reversal in 1882 — returning official status to the Perso-Arabic form — came after sustained petition by Sindhi Muslim leaders and after practical evidence that Devanagari was not being adopted uniformly. The episode illustrated how Sindhi's script was already entangled with community identity before Partition made that entanglement explicit.
After Partition: 1947 and Beyond
The Partition of British India in 1947 split Sindh entirely into Pakistan. The Sindhi Hindu population — which had maintained Khudabadi and had used both Devanagari and Perso-Arabic — largely migrated to India. The Sindhi Muslim population, and the Perso-Arabic script tradition it predominantly used, remained in what became Pakistan's Sindh province.
In post-Partition Pakistan, Urdu became the national language and faced no rival. Sindhi's status in Sindh province was contested — Muhajir communities (Urdu-speaking migrants from India) and Sindhi-speaking locals frequently clashed over language policy, especially in education. Sindhi-medium schooling was preserved in rural Sindh but reduced in urban centres.
The result for the script was both continuity and stagnation. The 1853 standardisation remained the de facto standard, but Sindhi typography and printing technology lagged. While Urdu press culture flourished in Pakistan with major newspapers, major publishers, and eventually digital infrastructure, Sindhi had fewer resources and a smaller publishing ecosystem.
Into the Digital Age
The arrival of digital text processing presented the same problem for Sindhi that it presented for every script without institutional infrastructure: how to encode a large and complex character set when computing resources were expensive and standardisation was contested.
Early Sindhi digital text used proprietary encodings — custom fonts where the shape of a Sindhi letter was mapped to a Latin keyboard character. A document written in one Sindhi font was unreadable garbage in another. The same problem affected Urdu, but Urdu had more institutional support, more publishers, and more pressure to solve it.
The Unicode solution came in stages. The first broad Sindhi coverage arrived with Unicode 1.0 (1991) for the characters shared with Arabic and Urdu, but the Sindhi-specific characters were not all encoded until later versions, with key additions in Unicode 4.1 (2005) when the Arabic Supplement block was added, specifically to address underrepresented Arabic-script languages.
Today, when you type on the Sindhi keyboard on this site and produce text with U+067B (ٻ) or U+0680 (ڀ), you are using code points that exist because the Unicode Consortium recognised that Sindhi's 52-letter standard needed dedicated encoding. The technical infrastructure now exists. The challenge that remains is adoption — replacing the large body of legacy Sindhi text that uses incorrect Urdu substitutions with properly encoded text.
Where the Alphabet Stands Today
The 52-letter Perso-Arabic Sindhi alphabet taught in Sindhi-medium schools in Pakistan is the form this keyboard supports. It is the form used by Sindhi-language newspapers (including the major daily Kawish), by Sindh government communications, and by the Sindhi Language Authority, which is the statutory body responsible for the development and standardisation of Sindhi language and literature in Pakistan.
The script's position is stable in Pakistan but requires continued active support. Digital tools, Unicode-correct keyboards, and properly encoded fonts are part of that support infrastructure — which is the practical reason this site exists.
If you want to understand the technical side of how this alphabet maps onto the Unicode Standard — specifically the distinction between the Sindhi code points and the Urdu ones — the article on how Sindhi Unicode works covers that in detail.