The Sinhala script is an abugida script used in Sri Lanka to write the official language Sinhala and also sometimes the liturgical languages Pali and Sanskrit. Being a member of the Brahmic family of scripts, the Sinhala script can trace its ancestry back more than 2000 years.
Sinhala is often considered two alphabets, or an alphabet with another alphabet, due to the presence of two different sets of letters. The core set, known as the (Pure Sinhala, ශුද්ධ සිංහල) or (alphabet එළු හෝඩිය), can represent all native phonemes. In order to render Sanskrit and Pali words, an extended set, the (Mixed Sinhala, මිශ්ර සිංහල), is available.
Most of the Sinhala letters are curlicues; straight lines are almost completely absent from the alphabet. This is because Sinhala used to be written on dried palm leaves, which would split along the veins on writing straight lines. This was undesirable, and therefore, the round shapes were preferred.
The core set of letters forms the alphabet (Pure Sinhala, ශුද්ධ සිංහල), which is a subset of the alphabet (Mixed Sinhala, මිශ්ර සිංහල). This 'pure' alphabet contains all the graphemes necessary to write Eḷu (classical Sinhala) as described in the classical grammar Sidatsan̆garā (1300 AD). This is the reason why this set is also called Eḷu hōdiya ('Eḷu alphabet' එළු හෝඩිය).
The definition of the two sets is thus a historic one. Out of pure coincidence, the phoneme inventory of present day colloquial Sinhala is such that yet again the śuddha alphabet suffices as a good representation of the sounds.
All native phonemes of the Sinhala spoken today can be represented in , while in order to render special Sanskrit and Pali sounds, one can fall back on . This is most notably necessary for the graphemes for the Middle Indic phonemes that the Sinhalese language lost during its history, such as aspirates.
Sinhalese had special symbols to represent numerals, which were in use until the beginning of the [19th] century. This system is now superseded by Arabic numerals.
Neither the Sinhala numerals nor U+0DF4 ෴ Sinhala punctuation kunddaliya is in general use today. The kunddaliya was formerly used as a full stop; it is included for scholarly use. The Sinhala numerals are not presently encoded.
Today, the alphabet is used by approx. 16,000,000 people to write the Sinhalese language in very diverse contexts, such as newspapers, TV commercials, government announcements, graffiti, and schoolbooks.
Sinhala is the main language written in this alphabet, but rare instances of Sri Lanka Malay written in this script are recorded.
While a phoneme can be represented by more than one grapheme, each grapheme can be pronounced in only one way. This means that the actual pronunciation of a word is always clear from its orthographic form.
The voiceless affricate (ච [t͡ʃa]) is not included in the śuddha set by purists since it does not occur in the main text of the Sidatsan̆garā. The Sidatsan̆garā does use it in examples though, so this sound did exist in Eḷu. In any case, it is needed for the representation of modern Sinhala.
The basic shapes of these consonants carry an inherent /a/ unless this is replaced by another vowel or removed by the hal kirīma.
| Stops | |||||||||||
| voiceless | voiced | ||||||||||
| unicode | translit. | IPA | unicode | translit. | IPA | ||||||
| velar | ක | 0D9A | ka | [ka] | ග | 0D9C | ga | [ga] | velar | ||
| retroflex | ට | 0DA7 | ṭa | [ʈa] | ඩ | 0DA9 | ḍa | [ɖa] | retroflex | ||
| dental | ත | 0DAD | ta | [ta] | ද | 0DAF | da | [da] | dental | ||
| labial | ප | 0DB4 | pa | [pa] | බ | 0DB6 | ba | [ba] | labial | ||
| Other graphemes | |||||||||||
| unicode | translit. | IPA | unicode | translit. | IPA | ||||||
| fricatives | ස | 0DC3 | sa | [sa] | හ | 0DC4 | ha | [ha] | fricatives | ||
| affricates | (ච) | (0DA0) | (ca) | ([t͡ʃa]) | ජ | 0DA2 | ja | [ʤa] | affricates | ||
| nasals | ම | 0DB8 | ma | [ma] | න | 0DB1 | na | [na] | nasals | ||
| liquid | ල | 0DBD | la | [la] | ර | 0DBB | ra | [ra] | liquid | ||
| glide | ව | 0DC0 | va | [ʋa] | ය | 0DBA | ya | [ja] | glide | ||
| retroflex | ණ | 0DAB | ṇa | [na] | ළ | 0DC5 | ḷa | [la] | retroflex | ||
Vowels come in two shapes: independent and diacritic. The independent shape is used when a vowel does not follow a consonant, e.g. at the beginning of a word. The diacritic shape is used when a vowel follows a consonant. Depending on the vowel, the diacritic can attach at several places. The diacritic for attaches above the consonant, the diacritic for attaches below, the diacritic for <ā> follows, while the diacritic for The prenasalized consonants resemble their plain counterparts. The Anusvara (often called binduva 'zero' ) is represented by one small circle ං (unicode 0D82), and the Visarga (technically part of the miśra alphabet) by two ඃ (unicode 0D83). The inherent vowel can be removed by a special diacritic, the hal kirīma, which varies in shape according to the consonant it attaches to. Both are represented in the image on the right side. The first one is the most common one, while the second one is used for letters ending at the top left corner.
There are six additional vocalic diacritics in the miśra alphabet. The two diphthongs are quite common, while the syllabic ṛ is much rarer, and the syllabic ḷ is all but obsolete. They are almost exclusively found in loanwords from Sanskrit. The miśra <ṛ> can be also be written with śuddha Note that the transliteration of both ළ ්and ෟ is <ḷ>. This is not very problematic since the second one is extremely scarce.
Since the extra miśra letters are phonetically not distinguishable from the śuddha letters, proceeding in the same way would lead to confusion. Names of miśra letters are normally made up of the names of two śuddha letters pronounced as one word. The first one indicates the sound, the second one the shape. For example, the aspirated ඛ (kh) is called kayanna bayanna. kayanna indicates the sound, while bayanna indicates the shape: ඛ (kh) is similar in shape to බ (b). Another method is to qualify the miśra aspirates by mahāprāna (ඛ: mahāprāna kayanna) and the miśra retroflexes by mūrdhaja (ළ: mūrdhaja layanna).
Certain combinations of graphemes trigger special ligatures. Special signs exist for an ර (r) following a consonant (inverted arch underneath), a ර (r) preceding a consonant (loop above) and a ය (y) following a consonant (half a ය on the right). Furthermore, very frequent combinations are often written in one stroke, like ddh, kv or kś. If this is the case, the first consonant is not marked with a hal kirīma.
The image on the left shows she glyph for śrī, which is composed of the letter ś with the vowel ī marked above and a ligature indicating the r below. The image on the right shows ligatures of ද(d)+ය(y) and ක(k)+ෂි (ṣi) on the Political science course advertisement.
The diacritic for Sinhala transliteration can be done in analogy to Devanāgarī transliteration.
A problem is the transliteration of /අැ/, not found in Devanāgarī. This is <ä> in the German tradition of Wilhelm Geiger, and <æ> in the Anglophone tradition (e.g. James Gair). Layman's transliterations in Sri Lanka normally follow neither of these. Vowels are transliterated according to English spelling equivalences, which can yield a variety of spellings for a number of phonemes. /ī/ for instance can be A transliteration pattern peculiar to Sinhala (and Tamil), and facilitated by the absence of phonemic aspirates, is the use of The Unicode range for Sinhala is U+0D80–U+0DFF. Grey areas indicate non-assigned code points.
This character allocation has been adopted in Sri Lanka as the Standard SLS1134.
Generally speaking, Sinhala support is less developed than support for Devanāgarī for instance. A recurring problem is the rendering of diacritics which precede the consonant and diacritic signs which come in different shapes, like the one for for example. Sinhala does not come built in with Windows XP, unlike Tamil and Hindi. However, all versions of Windows Vista come with Sinhala support by default, and do not require external fonts to be installed to read Sinhalese script. For Linux, the scim input method selector allows to use Sinhala script in applications like terminals or web browsers.
* * * * * *
Vowels
short
long
independent
diacritic
independent
diacritic
අ
0D85
a
[a]
inherent
a
ආ
0D86
ā
[aː]
ා
0DCF
ā
[aː]
එ
0D91
e
[e]
ෙ
0DD9
e
[e]
ඒ
0D92
ē
[eː]
ේ
0DDA
ē
[eː]
ඉ
0D89
i
[i]
ි
0DD2
i
[i]
ඊ
0D8A
ī
[iː]
ී
0DD3
ī
[iː]
ඔ
0D94
o
[o]
ො
0DDC
o
[o]
ඕ
0D95
ō
[oː]
ෝ
0DDD
ō
[oː]
උ
0D8B
u
[u]
ු
0DD4
u
[u]
ඌ
0D8C
ū
[uː]
ූ
0DD6
ū
[uː]
ඇ
0D87
æ/ä
[æ]
ැ
0DD0
æ
[æ]
ඈ
0D88
ǣ
[æː]
ෑ
0DD1
ǣ
[æː]
Prenasalized consonants
Prenasalized consonants
nasal
obstruent
prenasalized
consonant
unicode
translit.
IPA
velar
ඞ
ග
ඟ
0D9F
n̆ga
[ⁿga]
velar
retroflex
ණ
ඩ
ඬ
0DAC
n̆ḍa
[ⁿɖa]
retroflex
dental
න
ද
ඳ
0DB3
n̆da
[ⁿda]
dental
labial
ම
බ
ඹ
0DB9
m̆ba
[mba]
labial
Non-vocalic diacritics
Miśra set
The miśra alphabet is a superset of śuddha. It adds letters for aspirates, retroflexes and sibilants, which are not phonemic in today's Sinhala, but which are necessary to represent non-native words, like loanwords from Sanskrit, Pali or English. The use of the extra letters is mainly a question of prestige. From a purely phonemic point of view, there is no benefit in using them, and they can be replaced by a (sequence of) śuddha letters as follows: For the miśra aspirates, the replacement is the plain śuddha counterpart, for the miśra retroflex liquids the corresponding śuddha coronal liquid, for the sibilants, . ඤ (ñ) and ඥ (gn) cannot be represented by śuddha graphemes, but are only found in less than 10 words each. ෆ fa can be represented by ප pa with a Latin
Extra miśra stops
voiceless
voiced
unicode
translit.
IPA
unicode
translit.
IPA
velar
ඛ
0D9B
kha
[ka]
ඝ
0D9D
gha
[ga]
velar
retroflex
ඨ
0DA8
ṭha
[ʈa]
ඪ
0DAA
ḍha
[ɖa]
retroflex
dental
ථ
0DAE
tha
[ta]
ධ
0DB0
dha
[da]
dental
labial
ඵ
0DB5
pha
[pa]
භ
0DB7
bha
[ba]
labial
Other additional miśra graphemes
unicode
translit.
IPA
unicode
translit.
IPA
sibilants
ශ
0DC1
śa
[sa]
ෂ
0DC2
ṣa
[sa]
sibilants
aspirate affricates
ඡ
0DA1
cha
[t͡ʃa]
ඣ
0DA3
jha
[ʤa]
aspirate affricates
nasals
ඤ
0DA4
ña
[ɲa]
ඥ
0DA5
gna
[gna]
nasals
other
ඞ
0D9E
ṅa
[ŋa]
ෆ
0DC6
fa
other
other
ඦ
0DA6
n̆ja
[nʤa]
fප
n/a
fa
other
Vocalic diacritics
independent
diacritic
independent
diacritic
diphthongs
ඓ
0D93
ai
[ai]
ෛ
0DDB
ai
[ai]
ඖ
0D96
au
[au]
ෞ
0DDC
au
[au]
diphthongs
syllabic r
ඍ
0D8D
ṛ
[ur]
ෘ
0DD8
ṛ
ඎ
0D8E
ṝ
[ruː]
ෲ
0DF2
ṝ
syllabic r
syllabic l
ඏ
0D8F
ḷ
[li]
ෟ
0DDF
ḷ
[li]
ඐ
0D90
ḹ
[liː]
ෳ
0DF3
ḹ
[liː]
syllabic l
Names of the graphemes
The letters of the English alphabet have more or less arbitrary names, e.g. em for the letter Ligatures
Similarities to other scripts
Sinhala is one of the Brahmic scripts, and thus shares many similarites with other members of the family, such as the Tamil script and Devanāgarī. As a general example, /a/ is the inherent vowel in all three scripts. Other similarities include the diacritic for Sinhala transliteration
for the voiceless dental stop, and the use of Sinhala in Unicode
Computer support
Online resources
Notes
References
*External links
This article is licensed under the GNU Free Documentation License.
Last updated on Sunday October 05, 2008 at 12:23:53 PDT (GMT -0700)
View this article at Wikipedia.org - Edit this article at Wikipedia.org - Donate to the Wikimedia Foundation