Gurmukhī (Punjabi: ਗੁਰਮੁਖੀ, Punjabi pronunciation: [ˈɡʊɾᵊmʊkʰiː], Shahmukhi: گُرمُکھی) is an abugida developed from the Laṇḍā scripts, standardized and used by the second Sikh guru, Guru Angad (1504–1552). Commonly regarded as a Sikh script, Gurmukhi is used in Punjab, India as the official script of the Punjabi language.
|16th century CE-present|
|Khudabadi, Khojki, Mahajani, Multani|
|ISO 15924||Guru, 310 , Gurmukhi|
The primary scripture of Sikhism, the Guru Granth Sahib, is written in Gurmukhī, in various dialects and languages often subsumed under the generic title Sant Bhasha or saint language, in addition to other languages like Persian and various phases of Indo-Aryan languages.
Modern Gurmukhī has thirty-five original letters, hence its common alternative term paintī or "the thirty-five," plus six additional consonants, nine vowel diacritics, two diacritics for nasal sounds, one diacritic that geminates consonants and three subscript characters.
The Gurmukhī script is generally believed to have roots in the Proto-Sinaitic alphabet by way of the Brahmi script, which developed further into the Northwestern group (Sharada, or Śāradā, and its descendants, including Landa and Takri), the Central group (Nagari and its descendants, including Devanagari, Gujarati and Modi) and the Eastern group (evolved from Siddhaṃ, including Bangla, Tibetan, and some Nepali scripts), as well as several prominent writing systems of Southeast Asia and Sinhala in Sri Lanka, in addition to scripts used historically in Central Asia for extinct languages like Saka and Tocharian. Gurmukhi is derived from Sharada in the Northwestern group, of which it is the only major surviving member, with full modern currency. Notable features:
|Possible derivation of Gurmukhi from earlier writing systems.[note 1] The Greek alphabet, also descended from Phoenician, is included for comparison.|
Gurmukhi evolved in cultural and historical circumstances notably different from other regional scripts, for the purpose of recording scriptures of Sikhism, a far less Sanskritized cultural tradition than others of the subcontinent. This independence from the Sanskritic model allowed it the freedom to evolve unique orthographical features. These include:
and other features.
From the 10th century onwards, regional differences started to appear between the Sharada script used in Punjab, the Hill States (partly Himachal Pradesh) and Kashmir. Sharada proper was eventually restricted to very limited ceremonial use in Kashmir, as it grew increasingly unsuitable for writing the Kashmiri language. With the last known inscription dating to 1204 C.E., the early 13th century marks a milestone in the development of Sharada. The regional variety in Punjab continued to evolve from this stage through the 14th century; during this period it starts to appear in forms closely resembling Gurmukhī and other Landa scripts. By the 15th century, Sharada had evolved so considerably that epigraphists denote the script at this point by a special name, Devāśeṣa. Tarlochan Singh Bedi (1999) prefers the name Pritham Gurmukhī, or Proto-Gurmukhī.
The Sikh gurus adopted Proto-Gurmukhī to write the Guru Granth Sahib, the religious scriptures of the Sikhs. The Takri alphabet developed through the Devāśeṣa stage of the Sharada script from the 14th-18th centuries and is found mainly in the Hill States such as Chamba, Himachal Pradesh and surrounding areas, where it is called Chambeali. In Jammu Division, it developed into Dogri, which was a "highly imperfect" script later consciously influenced in part by Gurmukhi during the late 19th century, possibly to provide it an air of authority by having it resemble scripts already established in official and literary capacities, though not displacing Takri. The local Takri variants got the status of official scripts in some of the Punjab Hill States, and were used for both administrative and literary purposes until the 19th century. After 1948, when Himachal Pradesh was established as an administrative unit, the local Takri variants were replaced by Devanagari.
Meanwhile, the mercantile scripts of Punjab known as the Laṇḍā scripts were normally not used for literary purposes. Laṇḍā means alphabet "without tail", implying that the script did not have vowel symbols. In Punjab, there were at least ten different scripts classified as Laṇḍā, Mahajani being the most popular. The Laṇḍā scripts were used for household and trade purposes. In contrast to Laṇḍā, the use of vowel diacritics was made obligatory in Gurmukhī for increased accuracy and precision, due to the difficulties involved in deciphering words without vowel signs.
In the following epochs, Gurmukhī became the primary script for the literary writings of the Sikhs. Playing a significant role in Sikh faith and tradition, it expanded from its original use for Sikh scriptures and developed its own orthographical rules, spreading widely under the Sikh Empire and used by Sikh kings and chiefs of Punjab for administrative purposes. Also playing a major role in consolidating and standardizing the Punjabi language, it served as the main medium of literacy in Punjab and adjoining areas for centuries when the earliest schools were attached to gurdwaras. The first natively produced grammars of the Punjabi language were written in the 1860s in Gurmukhi. The Singh Sabha Movement of the late 19th century, a movement to revitalize Sikh institutions which had declined during colonial rule after the fall of the Sikh Empire, also advocated for the usage of the Gurmukhi script for mass media, with print media publications and Punjabi-language newspapers established in the 1880s. Later in the 20th century, after the struggle of the Punjabi Suba movement, from the founding of modern India in the 1940s to the 1960s, the script was given the authority as the official state script of the Punjab, India, where it is used in all spheres of culture, arts, education, and administration, with a firmly established common and secular character.
The prevalent view among Punjabi linguists is that as in the early stages the Gurmukhī letters were primarily used by the Guru's followers, Gurmukhs (literally, those who face, or follow, the Guru, as opposed to a Manmukh); the script thus came to be known as Gurmukhī, "the script of those guided by the Guru." Guru Angad is credited in the Sikh tradition with the creation and standardization of Gurmukhi script from earlier Śāradā-descended scripts native to the region. It is now the standard writing script for the Punjabi language in India. The original Sikh scriptures and most of the historic Sikh literature have been written in the Gurmukhi script.
Although the word Gurmukhī has been commonly translated as "from the Mouth of the Guru," the term used for the Punjabi script has somewhat different connotations. This usage of the term may have gained currency from the use of the script to record the utterances of the Sikh Gurus as scripture, which were often referred to as Gurmukhī, or from the mukh (face, or mouth) of the Gurus. Consequently, the script that was used to write the resulting scripture may have also been designated with the same name.
The Gurmukhī alphabet contains thirty-five base letters (akkhara, plural akkharā̃), traditionally arranged in seven rows of five letters each. The first three letters, or mātarā vāhak ("vowel carrier"), are distinct because they form the basis for vowels and are not consonants, or vianjan, like the remaining letters are, and except for the second letter aiṛā[note 2] are never used on their own; see § Vowel diacritics for further details. The pair of fricatives, or mūl varag ("base class"), share the row, which is followed by the next five sets of consonants, with the consonants in each row being homorganic, the rows arranged from the back (velars) to the front (labials) of the mouth, and the letters in the grid arranged by place and manner of articulation. The arrangement, or varaṇamālā, is completed with the antim ṭolī, literally "ending group." The names of most of the consonants are based on their reduplicative phonetic values, and the varaṇamālā is as follows:
|Occlusives →||Tenuis||Aspirates||Voiced Stops||Tonal||Nasals|
[ kə̀ ]
[ t͡ʃə̀ ]
[ ʈə̀ ]
[ t̪ə̀ ]
[ pə̀ ]
|Approximants and liquids|
The nasal letters ਙ /ŋəŋːaː/ and ਞ /ɲəɲːaː/ have become marginal as independent consonants in modern Gurmukhi. The sounds they represent occur most often as allophones of [n] in clusters with velars and palatals respectively.
The most characteristic feature of the Punjabi language is its tone system. The script has no separate symbol for tones, but they correspond to the tonal consonants that once represented voiced aspirates as well as older *h. To differentiate between consonants, the Punjabi tonal consonants of the fourth column, ਘ kà, ਝ cà, ਢ ṭà, ਧ tà, and ਭ pà, are often transliterated in the way of the voiced aspirate consonants gha, jha, ḍha, dha, and bha respectively, although Punjabi lacks these sounds. Tones in Punjabi can be either rising, neutral, or falling; in the pronunciation of the names of the Gurmukhī letters, they are at the beginning of the word and as such produce the falling tone, hence the grave accent (à) as opposed to the acute. The tone on the stem vowel changes to a rising one (á) and precedes the letter when it is in syllabic coda positions, and is falling when the letter in stem-medial positions after a short vowel and before a long vowel, and when the tonal letter follows the stem vowel. The letters now always represent unaspirated consonants, and are unvoiced in initial positions and voiced elsewhere.
In addition to the 35 original letters, there are six supplementary consonants in official usage, referred to as the navīn ṭolī or navīn varag, meaning "new group," created by placing a dot (bindī) at the foot (pair) of the consonant to create pair bindī consonants. These are not present in the Guru Granth Sahib or old texts. These are used most often for loanwords, though not exclusively,[note 3] and their usage is not always obligatory:
|ਸ਼||sasse pair bindī
[səsːeː pɛ:ɾᵊ bɪnd̪iː]
|ਖ਼||khakkhe pair bindī
[kʰəkʰːeː pɛ:ɾᵊ bɪnd̪iː]
|ਗ਼||gagge pair bindī
[gəgːeː pɛ:ɾᵊ bɪnd̪iː]
|ਜ਼||jajje pair bindī
[d͡ʒəd͡ʒːeː pɛ:ɾᵊ bɪnd̪iː]
|ਫ਼||phapphe pair bindī
[pʰəpʰːeː pɛ:ɾᵊ bɪnd̪iː]
|ਲ਼||lalle pair bindī
[ləlːeː pɛ:ɾᵊ bɪnd̪iː]
The character ਲ਼ /ɭ/, the only character not representing a fricative consonant, was only recently officially added to the Gurmukhī alphabet. It was not a part of the traditional orthography, as the distinctive phonological difference between /l/ and /ɭ/, while both native sounds, was not reflected in the script; however, its usage, while still currently not universal, has been noted along with the other letters of the group among the earliest Punjabi grammars produced. Previous usage of another glyph to represent this sound, [ਲ੍ਰ], has also been attested. Other characters, like the more recent [ਕ਼] /qə/, are also on rare occasion used unofficially, chiefly for transliterating old writings in Persian and Urdu, the knowledge of which is less relevant in modern times. The Shahmukhi alphabet equivalent for representing the sound is ࣇ, "lam with tah above."
Three "subscript" letters, called dutt akkhara ("joint letters") or pairī̃ akkhara ("letters at the foot") are utilised in modern Gurmukhī: forms of ਹ (ha), ਰ (ra), and ਵ (va).
The subscript ਰ (r) and ਵ (v) are used to make consonant clusters and behave similarly; subjoined ਹ (h) introduces tone.
|Subscript letter||Name, original form||Usage|
|For example, the letter ਪ(p) with a regular ਰ(r) following it would yield the word ਪਰ /pəɾᵊ/ ("but"), but with a subjoined ਰ would appear as ਪ੍ਰ- (/prə-/), resulting in a consonant cluster, as in the word ਪ੍ਰਬੰਧਕ (/pɾəbə́n̪d̪əkᵊ/, "managerial, administrative"), as opposed to ਪਰਬੰਧਕ /pəɾᵊbə́n̪d̪əkᵊ/, the Punjabi form of the word used in natural speech in less formal settings (the Punjabi reflex for Sanskrit /pɾə-/ is /pəɾ-/) . This subscript letter is commonly used in Punjabi for personal names, some native dialectal words, loanwords from other languages like English and Sanskrit, etc.|
|Used occasionally in Gurbani (Sikh religious scriptures) but rare in modern usage, it is largely confined to creating the cluster /sʋə-/ in words borrowed from Sanskrit, the reflex of which in Punjabi is /sʊ-/, e.g. Sanskrit ਸ੍ਵਪ੍ਨ /s̪ʋɐ́p.n̪ɐ/→Punjabi ਸੁਪਨਾ /'sʊpᵊna:/, "dream," cf. Hindi-Urdu /səpna:/.
For example, ਸ with a subscript ਵ would produce ਸ੍ਵ (sʋə-) as in the Sanskrit word ਸ੍ਵਰਗ (/sʋəɾəgə/, "heaven"), but followed by a regular ਵ would yield ਸਵ- (səʋ-) as in the common word ਸਵਰਗ (/səʋəɾəgᵊ/, "heaven"), borrowed earlier from Sanskrit but subsequently changed. The natural Punjabi reflex, ਸੁਰਗ /sʊɾəgᵊ/, is also used in everyday speech.
|The most common subscript, this character does not create consonant clusters, but serves as part of Punjabi's characteristic tone system, indicating a tone. It behaves the same way in its use as the regular ਹ(h) does in non-word-initial positions. The regular ਹ(h) is pronounced in stressed positions (as in ਆਹੋ āho "yes" and a few other common words), word-initially in monosyllabic words, and usually in other word-initial positions,[note 4] but not in other positions, where it instead changes the tone of the applicable adjacent vowel. The difference in usage is that the regular ਹ is used after vowels, and the subscript version is used when there is no vowel, and is attached to consonants.
For example, the regular ਹ is used after vowels as in ਮੀਂਹ (transcribed as mĩh (IPA: [míː]), "rain"). The subjoined ਹ(h) acts the same way but instead is used under consonants: ਚ(ch) followed by ੜ(ṛ) yields ਚੜ (caṛa), but not until the rising tone is introduced via a subscript ਹ(h) does it properly spell the word ਚੜ੍ਹ (cáṛa, "climb").
This character's function is similar to that of the udāt character (ੑ U+0A51), which occurs in older texts and indicates a rising tone.
In addition to the three standard subscript letters, another subscript character representing the subjoined /j/, the yakash or pairī̃ yayyā ( ੵ U+0A75), is utilized specifically in archaized sahaskritī-style writings in Sikh scripture, where it is found 268 times for word forms and inflections from older phases of Indo-Aryan, as in the examples ਰਖੵਾ /ɾəkʰːjaː/ "(to be) protected," ਮਿਥੵੰਤ /mɪt̪ʰjən̪t̪ə/ "deceiving," ਸੰਸਾਰਸੵ /sənsaːɾəsjə/ "of the world," ਭਿਖੵਾ /pɪ̀kʰːjaː/ "(act of) begging," etc. There is also a conjunct form of the letter yayyā, ਯ→੍ਯ, which functions similarly to the yakash, and is used exclusively for Sanskrit borrowings, and even then rarely. In addition, miniaturized versions of the letters ਚ, ਟ, ਤ, and ਨ are also found in limited use as subscript letters in Sikh scripture.
Only the subjoined /ɾ/ and /h/ are commonly used; usage of the subjoined /ʋ/ and conjoined forms of /j/, already rare, is increasingly scarce in modern contexts.
To express vowels (singular, sur), Gurmukhī, as an abugida, makes use of obligatory diacritics called lagā̃. Gurmukhī is similar to Brahmi scripts in that all consonants are followed by an inherent schwa sound. This inherent vowel sound can be changed by using dependent vowel signs which attach to a bearing consonant. In some cases, dependent vowel signs cannot be used – at the beginning of a word or syllable for instance – and so an independent vowel character is used instead.
Independent vowels are constructed using three bearer characters: ūṛā (ੳ), aiṛā (ਅ) and īṛī (ੲ). With the exception of aiṛā (which represents the vowel [ə]), the bearer consonants are never used without additional vowel signs.
|Vowel||Transcription||IPA||Closest English equivalent|
|a||[ə]||like a in about|
|ā||[aː]~[äː]||like a in car|
|i||[ɪ]||like i in it|
|ī||[iː]||like i in litre|
|u||[ʊ]||like u in put|
|ū||[uː]||like u in spruce|
|e||[eː]||like e in Chile|
|ai||[ɛː]~[əɪ]||like a in rap|
|o||[oː]||like o in more|
|au||[ɔː]~[əʊ]||like o in off|
Dotted circles represent the bearer consonant. Vowels are always pronounced after the consonant they are attached to. Thus, sihārī is always written to the left, but pronounced after the character on the right. When constructing the independent vowel for [oː], ūṛā takes an irregular form instead of using the usual hoṛā.
Gurmukhi orthography prefers vowel sequences over the use of semivowels ("y" or "w") intervocally and in syllable nuclei, as in the words ਦਿਸਾਇਆ disāiā "caused to be visible" rather than disāyā, ਦਿਆਰ diāra "cedar" rather than dyāra, and ਸੁਆਦ suāda "taste" rather than swāda, permitting vowels in hiatus.
In terms of tone orthography, the short vowels [ɪ] and [ʊ], when paired with [h] to yield /ɪh/ and /ʊh/, represent [é] and [ó] with high tones respectively, e.g. ਕਿਹੜਾ kihṛā (IPA: [kéːɽaː]) 'which,' ਦੁਹਰਾ duhrā (IPA: [d̪óːɾaː]) "repeat, reiterate, double." The compounding of [əɦ] with [ɪ] or [ʊ] yield [ɛ́] and [ɔ́] respectively, e.g. ਮਹਿੰਗਾ mahingā (IPA: [mɛ́ːŋgaː]) "expensive," ਵਹੁਟੀ vahuṭī (IPA: [wɔ́ːʈiː]) "bride."
The diacritics for gemination and nasalization are together referred to as lagākkhara ("applied letters").
The use of adhak ( ੱ ) (IPA: ['ə́d̪əkᵊ]) indicates that the following consonant is geminated, and is placed above the consonant preceding the geminated one. Consonant length is distinctive in the Punjabi language and the use of this diacritic can change the meaning of a word, for example:
|Without adhak||Transliteration||Meaning||With adhak||Transliteration||Meaning|
There is a tendency, especially in rural dialects, to geminate consonants following a long vowel (/a:/, /e:/, /i:/, /o:/, /u:/, /ɛ:/, /ɔː/) in the penult of a word, e.g. ਔਖਾ aukkhā "difficult," ਕੀਤੀ kīttī "did," ਪੋਤਾ pottā "grandson," ਪੰਜਾਬੀ panjābbī "Punjabi," ਹਾਕ hāka "call, shout," but plural ਹਾਕਾਂ hākkā̃. Except in this case, where this unmarked gemination is often etymologically rooted in archaic forms, and has become phonotactically regular, the usage of the adhak is obligatory.
Ṭippī ( ੰ ) and bindī ( ਂ ) are used for producing a nasal phoneme depending on the following obstruent or a nasal vowel at the end of a word. All short vowels are nasalized using ṭippī and all long vowels are nasalized using bindī except for dulaiṅkaṛ ( ੂ ), which uses ṭippi instead.
|Diacritic usage||Result||Examples (IPA)|
|Ṭippī on short vowel (/ə/, /ɪ/, /ʊ/), or long vowel /u:/[note 5], before a non-nasal consonant||Adds nasal consonant at same place of articulation as following consonant
(/ns/, /n̪t̪/, /ɳɖ/, /mb/, /ŋg/, /nt͡ʃ/ etc.)
|ਹੰਸ /ɦənsᵊ/ "goose"|
ਅੰਤ /ən̪t̪ᵊ/ "end"
ਗੰਢ /gə́ɳɖᵊ/ "knot"
ਅੰਬ /əmbᵊ/ "mango"
ਸਿੰਗ /sɪŋgᵊ/ "horn, antler"
ਕੁੰਜੀ / kʊɲd͡ʒiː/ "key"
ਗੂੰਜ /guːɲd͡ʒᵊ/ "rumble, echo"
ਲੂੰਬੜੀ /luːmbᵊɽiː/ "fox"
|Bindī over long vowel (/a:/, /e:/, /i:/, /o:/, /u:/[note 6], /ɛ:/, /ɔː/)
before a non-nasal consonant not including /h/
|Adds nasal consonant at same place of articulation as following consonant (/ns/, /n̪t̪/, /ɳɖ/, /mb/, /ŋg/, /ɲt͡ʃ/ etc.).
May also secondarily nasalize the vowel
|ਕਾਂਸੀ /kaːnsiː/ "bronze"|
ਕੇਂਦਰ /keːn̯d̯əɾᵊ/ "center, core, headquarters"
ਗੁਆਂਢੀ /gʊáːɳɖiː/ "neighbor"
ਚੌਂਕ /t͡ʃɔːŋkᵊ/ "crossroads, plaza"
ਸਾਂਝ /sáːɲd͡ʒᵊ/ "association" (act)
|Ṭippī over consonants followed by long vowel /u:/ (not stand-alone vowel ਊ),
at open syllable at end of word, or ending in /ɦ/
|Vowel nasalization||ਤੂੰ /t̪ũː/ "you"|
ਸਾਨੂੰ /saːnːũː/ "to us"
ਮੂੰਹ /mũːɦ/ "mouth"
|Ṭippī on short vowel before nasal consonant (/n̪/ or /m/)||Gemination of nasal consonant
Ṭippī is used to geminate nasal consonants instead of adhak
|ਇੰਨਾ /ɪn̪:a:/ "this much"|
ਕੰਮ /kəm:ᵊ/ "work"
|Bindī over long vowel (/a:/, /e:/, /i:/, /o:/, /u:/, /ɛ:/, /ɔː/),
at open syllable at end of word, or ending in /ɦ/
|Vowel nasalization||ਬਾਂਹ /bã́h/ "arm"|
ਮੈਂ /mɛ̃ː/ "I, me"
ਅਸੀਂ /əsĩː/ "we"
ਤੋਂ /t̪õː/ "from"
ਸਿਊਂ /sɪ.ũː/ "sew"
Older texts may follow other conventions.
The halanta ( ੍ U+0A4D) character is not used when writing Punjabi in Gurmukhī. However, it may occasionally be used in Sanskritised text or in dictionaries for extra phonetic information. When it is used, it represents the suppression of the inherent vowel.
The effect of this is shown below:
The visarga symbol (ਃ U+0A03) is used very occasionally in Gurmukhī. It can represent an abbreviation, as the period is used in English, though the period for abbreviation, like commas, exclamation points, and other Western punctuation, is freely used in modern Gurmukhī.
Gurmukhī has its own set of digits, which function exactly as in other versions of the Hindu–Arabic numeral system. These are used extensively in older texts. In modern contexts, they are sometimes replaced by standard Western Arabic numerals.
*In some Punjabi dialects, the word for three is ਤ੍ਰੈ trai (IPA: [t̪ɾɛː]).
Gurmukhī script was added to the Unicode Standard in October 1991 with the release of version 1.0.
Many sites still use proprietary fonts that convert Latin ASCII codes to Gurmukhī glyphs.
The Unicode block for Gurmukhī is U+0A00–U+0A7F:
Official Unicode Consortium code chart (PDF)
Panjab Digital Library has taken up digitization of all available manuscripts of Gurmukhī Script. The script has been in formal use since the 1500s, and a lot of literature written within this time period is still traceable. Panjab Digital Library has digitized over 5 million pages from different manuscripts and most of them are available online.
...the Guru Granth Sahib, written in a script particular to the Sikhs (Gurmukhi)...
The following Punjabi-language publications have been written on the origins of the Gurmukhī script:
|Wikimedia Commons has media related to Gurmukhi.|