See W. Geiger, Pali Literature and Language (tr., rev. ed. 1968).
Pali (ISO 15919/ALA-LC: Pāḷi) is a Middle Indo-Aryan language or prakrit of India. It is best known as the language of the earliest extant Buddhist canon, the Pāi Tipitaka or Pāi Canon, and as the liturgical language of Theravada Buddhism. Pali has since been written in a variety of scripts, from the Brahmic family scripts through to a romanised form devised with the research and contributions of Robert Caesar Childers and T. W. Rhys Davids, both of the Pali Text Society.
The word Pali itself signifies "line" or "(canonical) text", and this name for the language seems to have its origins in commentarial traditions, wherein the "" (in the sense of the line of original text quoted) was distinguished from the commentary or the vernacular following after it on the manuscript page. As such, the name of the language has caused some debate among scholars of all ages; the spelling of the name also varies, being found with both long "ā" [ɑː] and short "a" [a], and also with either a retroflex [ɭ] or non-retroflex [l] "l" sound. To this day, there is no single, standard spelling of the term; all four spellings can be found in textbooks. R.C. Childers translates the word as "series" and states that the language "bears the epithet in consequence of the perfection of its grammatical structure.
Pali is a literary language of the Prakrit language family. When the canonical texts were written down in Sri Lanka in the first century BCE, Pali stood close to a living language; this is not the case for the commentaries. Despite excellent scholarship on this problem, there is persistent confusion as to the inter-relation of to the vernacular spoken in the ancient kingdom of Magadha (which most scholars agree to have been located around modern-day Bihār, though some have recently claimed that it may have gotten that name after the Ashokan era and that ancient Magadha may have possibly been in the northwest of ancient India, in Baluchistan).
Pali as a Middle Indo-Aryan language is different from Sanskrit not so much with regard to the time of its origin than as to its dialectal base, since a number of its morphological and lexical features betray the fact that it is not a direct continuation of Vedic Sanskrit; rather it descends from a dialect (or a number of dialects) which was (/were), despite many similarities, different from .
Pali was considered by early Buddhists to be linguistically similar to Old Magadhi or even a direct continuation of that language. Many Theravada sources refer to the Pali language as “Magadhan” or the “language of Magadha.” This seems to be problematic, as the later form of Magadhi of Asoka's inscriptions (3rd century BC) is an Eastern Indian language whereas Pali most closely resembles Western Indian inscriptions. Ancient Magadha may, however, have been in the West of ancient India after all. There are many remarkable analogies between Pali and Ardhamagadhi (Half Magadhi), an old form of Magadhi preserved in ancient Jain texts. Ardhamagadhi differs from the eastern Prakrit of Ashokan inscriptions on similar points as Pali. For example, Ardhamagadhi too does not change r into l, and in the noun inflexion it shows the ending -o instead of the eastern Prakritic -e at least in many metrical places. This similarity is not accidental, since Mahavira, the 24th Tirthankara of Jainism preached in the same area (Magadha) as Buddha Gotama.
T.W. Rhys Davids in his book Buddhist India, and Wilhelm Geiger in his book Pali Literature and Language suggested that Pali may have originated as a form of lingua franca or common language of culture among people who used differing dialects in North India, used at the time of the Buddha and employed by him. Another scholar states that at that time it was "a refined and elegant vernacular of all Aryan-speaking people. Modern scholarship has not arrived at a consensus on the issue; there are a variety of conflicting theories with supporters and detractors (taking as given that Magadha was an eastern district). After the death of the Buddha, Pali may have evolved among Buddhists out of the language of the Buddha as a new artificial language. Bhikkhu Bodhi, summarizing the current state of scholarship, states that the language is "closely related to the language (or, more likely, the various regional dialects) that the Buddha himself spoke." He goes on to write:
Scholars regard this language as a hybrid showing features of several Prakrit dialects used around the third century BCE, subjected to a partial process of Sanskritization. While the language is not identical with any the Buddha himself would have spoken, it belongs to the same broad linguistic family as those he might have used and originates from the same conceptual matrix. This language thus reflects the thought-world that the Buddha inherited from the wider Indian culture into which he was born, so that its words capture the subtle nuances of that thought-world.
Whatever the relationship of the Buddha's speech to Pali, the Canon was eventually transcribed and preserved entirely in it, while the commentarial tradition that accompanied it (according to the information provided by Buddhaghosa) was translated into Sinhalese and preserved in local languages for several generations. R.C. Childers, who held to the theory that Pali was Old Magadhi, wrote: "Had Gautama never preached, it is unlikely that Magadhese would have been distinguished from the many other vernaculars of Hindustan, except perhaps by an inherent grace and strength which make it a sort of Tuscan among the Prakrits.
However Pali was ultimately supplanted in India by Sanskrit as a literary and religious language following the formulation of Classical Sanskrit by the scholar Panini. In Sri Lanka, Pali is thought to have entered into a period of decline ending around the 4th or 5th Century (as Sanskrit rose in prominence), but ultimately survived. The work of Buddhaghosa was largely responsible for its reemergence as an important scholarly language in Buddhist thought. The Visuddhimagga and the other commentaries that Buddhaghosa compiled codified and condensed the Sinhalese commentarial tradition that had been preserved and expanded in Sri Lanka since the 3rd Century BCE.
Today Pali is studied mainly to gain access to Buddhist scriptures, and is frequently chanted in a ritual context. The secular literature of Pali historical chronicles, medical texts, and inscriptions, is also of great historical importance. The great centers of Pali learning remain in the Theravada nations of South-East Asia: Myanmar (Burma), Sri Lanka, Thailand, Laos and Cambodia. Since the 19th century, various societies for the revival of Pali studies in India have promoted awareness of the language and its literature, perhaps most notably the Maha Bodhi Society founded by Anagarika Dhammapala.
In Europe, the Pali Text Society has been a major force in promoting the study of Pali by Western scholars since its founding in 1881. Based in the United Kingdom, the society publishes romanized Pali editions, along with many English translations of these sources. In 1869, the first Pali Dictionary was published using the research of Robert Caesar Childers, one of the founding members of the Pali Text Society. It was the first Pali translated text in English and was published in 1872. Childers's Dictionary later received the Volney Prize in 1876.
The Pali Text Society was in part founded to compensate for the very low level of funds allocated to Indology in late 19th century England; incongruously, the English were not nearly so robust in Sanskrit and Prakrit language studies as Germany, Russia and even Denmark—a situation that many would say continues to this day. Without the inspiration of colonial holdings such as the former British occupation of Sri Lanka and Burma, institutions such as the Danish Royal Library have built up major collections of Pali manuscripts, and major traditions of Pali studies.
Virtually every word in has cognates in the other Prakritic "Middle Indo-Aryan languages", e.g., the Jain Prakrits. The relationship to earlier Sanskrit (e.g., Vedic language) is less direct and more complicated. Historically, influence between Pali and Sanskrit has been felt in both directions. The Pali language's resemblance to Sanskrit is often exaggerated by comparing it to later Sanskrit compositions which were written centuries after Sanskrit ceased to be a living language, and are influenced by developments in Middle Indic, including the direct borrowing of a portion of the Middle Indic lexicon; whereas, a good deal of later Pali technical terminology has been borrowed from the vocabulary of equivalent disciplines in Sanskrit, either directly or with certain phonological adaptations.
Post-canonical Pali also possesses a few loan-words from local languages where Pali was used (e.g. Sri Lankans adding Sinhalese words to Pali). These usages differentiate the Pali found in the from later compositions such as the Pali commentaries on the canon and folklore (e.g., the stories of the Jātaka commentaries), and comparative study (and dating) of texts on the basis of such loan-words is now a specialized field unto itself.
Pali was not exclusively used to convey the teachings of the Buddha, as can be deduced from the existence of a number of secular texts, such as books of medical science/instruction, in Pali. However, scholarly interest in the language has been focused upon religious and philosophical literature, because of the unique window it opens on one phase in the development of Buddhism.
Although Sanskrit was said, in brahmanical tradition, to be the unchanging language spoken by the gods, in which each word had an inherent significance, this view of language was not shared in the early Buddhist tradition, in which words were only conventional and mutable signs. Neither the Buddha nor his early followers shared the brahmans' reverence for the Vedic language or its sacred texts. This view of language naturally extended to Pali, and may have contributed to its usage (as an approximation or standardization of local Middle Indic dialects) in place of Sanskrit. However, by the time of the compilation of the Pali commentaries (4th or 5th century), Pali was regarded as the natural language, the root language of all beings.
Comparable to Ancient Egyptian, Latin or Hebrew in the mystic traditions of the West, Pali recitations were often thought to have a supernatural power (which could be attributed to their meaning, the character of the reciter, or the qualities of the language itself), and in the early strata of Buddhist literature we can already see Pali s used as charms, e.g. against the bite of snakes. Many people in Theravada cultures still believe that taking a vow in Pali has a special significance, and, as one example of the supernatural power assigned to chanting in the language, the recitation of the vows of are believed to alleviate the pain of childbirth in Sri Lanka. In Thailand, the chanting of a portion of the is believed to be beneficial to the recently departed, and this ceremony routinely occupies as much as seven working days. Interestingly, there is nothing in the latter text that relates to this subject, and the origins of the custom are unclear.
|High||i [i] ī [iː]||u [u] ū [uː]|
|Mid||e [e], [eː]||a [ɐ]||o [o], [oː]|
Long and short vowels are only contrastive in open syllables; in closed syllables, all vowels are always short. Short and long e and o are in complementary distribution: the short variants occur only in closed syllables, the long variants occur only in open syllables. Short and long e and o are therefore not distinct phonemes.
A sound called anusvāra (Skt.; Pali: nigghahita), represented by the letter (ISO 15919) or (ALA-LC) in romanization, and by a raised dot in most traditional alphabets, originally marked the fact that the preceding vowel was nasalized. That is, , and represented [ã ], [ĩ ] and [ũ ]. In many traditional pronunciations, however, the anusvāra is pronounced more strongly, like the velar nasal [ŋ ], so that these sounds are pronounced instead [ãŋ ], [ĩŋ ] and [ũŋ ]. However pronounced, never follows a long vowel; ā, ī and ū are converted to the corresponding short vowels when is added to a stem ending in a long vowel, e.g. becomes , not , becomes , not *.
|Place of articulation||Manner of articulation|
|Velars||k [k]||kh [kʰ]||g [ɡ]||gh [ɡʱ]||ṅ [ŋ]|
|Palatals||c [tʃ]||ch[tʃʰ]||j [dʒ]||jh [dʒʱ]||ñ [ɲ]||y [j]|
|Retroflex||ṭ [ʈ]||ṭh [ʈʰ]||ḍ [ɖ]||ḍh [ɖʱ]||ṇ [ɳ]||r[ɻ]||ḷ [ɭ]||ḷh [ɭʱ]|
|Dentals||t [t̪]||th [t̪ʰ]||d [d̪]||dh [d̪ʱ]||n [n̪]|
|Alveolars||l [l]||s [s]|
|Bilabials||p [p]||ph [pʰ]||b [b]||bh [bʱ]||m [m]|
The sounds listed above, except for ṅ, ḷ and ḷh are distinct phonemes in Pali. ṅ only occurs before velar stops. ḷ and ḷh are allophones of ḍ and ḍh when they occur singly between vowels.
|Masculine (loka- "world")||Neuter (yāna- "carriage")|
|Ablative||lokā (lokamhā, lokasmā; lokato)||yānā (yānamhā, yānasmā; yānato)|
|Dative||lokassa (lokāya)||yānassa (yānāya)|
|Feminine (kathā- "story")|
|Masculine (isi- "seer")||Neuter (akkhi- "eye")|
|Nominative||isi||isayo, isī||akkhi,||akkhī, akkhīni|
|Instrumental||isinā||isihi, isīhi||akkhinā||akkhihi, akkhīhi|
|Ablative||isinā, isito||akkhinā, akkhito|
|Genitive||isissa, isino||akkhissa, akkhino|
|Locative||isisu, isīsu||akkhisu, akkhīsu|
|Masculine (bhikkhu- "monk")||Neuter (cakkhu- "eye")|
|Genitive||bhikkhussa, bhikkhuno||cakkhussa, cakkhuno|
Element for element gloss
The three compounds in the first line literally mean:
The literal meaning is therefore: "The dharmas have mind as their leader, mind as their chief, are made of/by mind. If [someone] either speaks or acts with a corrupted mind, from that [cause] suffering goes after him, as the wheel [of a cart follows] the foot of a draught animal."
A slightly freer translation by Acharya Buddharakkhita
The connections were sufficiently well-known that technical terms from Sanskrit were easily converted into Pali by a set of conventional phonological transformations. These transformations mimicked a subset of the phonological developments that had occurred in Proto-Pali. Because of the prevalence of these transformations, it is not always possible to tell whether a given Pali word is a part of the old Prakrit lexicon, or a transformed borrowing from Sanskrit. The existence of a Sanskrit word regularly corresponding to a Pali word is not always secure evidence of the Pali etymology, since, in some cases, artificial Sanskrit words were created by back-formation from Prakrit words.
The following phonological processes are not intended as an exhaustive description of the historical changes which produced Pali from its Old Indic ancestor, but rather are a summary of the most common phonological equations between Sanskrit and Pali, with no claim to completeness.
In Sri Lanka, Pali texts were recorded in Sinhala script. Other local scripts, most prominently Khmer, Burmese, and in modern times Thai (since 1893), Devanāgarī and Mongolian have been used to record Pali.
Since the 19th Century, Pali has also been written in the Roman script. An alternate scheme devised by Frans Velthuis allows for typing without diacritics using plain-ASCII methods, but is much less readable than the standard Rhys Davids system (see below).
The Pali alphabetical order is as follows:
ḷh, although a single sound, is written with ligature of ḷ and h.
There are several fonts to use for Pali transliteration. However, older ASCII fonts such as Leedsbit PaliTranslit, Times_Norman, Times_CSX+, Skt Times, Vri RomanPali CN/CB etc., are not recommendable since they are not compatible with one another and technically out of date. On the contrary, fonts based on the Unicode standard are recommended because Unicode seems to be the future for all fonts and also because they are easily portable to one another.
However, not all Unicode fonts contain the necessary characters. To properly display all the diacritic marks used for romanized Pali (or for that matter, Sanskrit), a Unicode font must contain the following character ranges:
* Basic Latin: U+0000 – U+007F
* Latin-1 Supplement: U+0080 – U+00FF
* Latin Extended-A: U+0100 – U+017F
* Latin Extended-B: U+0180 – U+024F
* Latin Extended Additional: U+1E00 – U+1EFF
The Pali Text Society recommends VU-Times and Gandhari Unicode for Windows and Linux Computers. And The Tibetan & Himalayan Digital Library recommends Times Ext Roman, and provides links to several of other Unicode diacritic fonts usable for typing Pali together with ratings and installation instructions. Moreover, an English Buddhist monk titled Bhikkhu Pesala provides some Pali Unicode fonts he has designed himself here, and some Pali keyboards for Windows XP here Further, the font section of Alanwood's Unicode Resources have links to several general purpose fonts that can be used for Pali typing if they cover the character ranges above.
The Velthuis scheme was originally developed in 1991 by Frans Velthuis for use with his "devnag" Devanāgarī font, designed for the TeX typesetting system. This system of representing Pali diacritical marks has been used in some websites and discussion lists.
The following table compares various conventional renderings and shortcut key assignments:
|character||ASCII rendering||character name||Unicode number||key combination||HTML code|