Khmer (ភាសាខ្មែរ), or Cambodian, is the language of the Khmer people and the official language of Cambodia. It is the second most widely spoken Austroasiatic language, with speakers in the tens of millions. Khmer has been considerably influenced by Sanskrit and Pali, especially in the royal and religious registers, through the vehicles of Hinduism and Buddhism. It is also the earliest recorded and earliest written language of the Mon-Khmer family, predating Mon and by a significant margin Vietnamese. As a result of geographic proximity, the Khmer language has affected, and also been affected by, Thai, Lao, Vietnamese and Cham which all form a sprachbund in peninsular Southeast Asia.
Khmer has its own script, an abugida known in Khmer as Aksar Khmer.
Khmer differs from neighboring languages such as Thai, Lao and Vietnamese in that it is not a tonal language. All its main dialects that are mutually intelligible:
Khmer is classified as a member of the Eastern branch of the Mon-Khmer language family, itself a subdivision of the larger Austro-Asiatic language group, which has representatives in a large swath of land from Northeast India down through Southeast Asia to the Malay Peninsula and its islands. As such, its closest relatives are the languages of the Pearic, Bahnaric, and Katuic families spoken by the hill tribes of the region. The Vietic languages have also been classified as belonging to this family.
| Labial | Coronal | Palatal | Velar | Glottal | |
| Aspirated plosive | pʰ | tʰ | cʰ | kʰ | |
| Unaspirated plosive | p | t | c | k | ʔ |
| Implosive | ɓ | ɗ | |||
| Nasal | m | n | ɲ | ŋ | |
| Liquid | r l | ||||
| Fricative | s | h | |||
| Approximant | ʋ | j | |||
The consonants /f/, /ʃ/, /z/ and /ɡ/ may occasionally occur in foreign words from, for example, French and other recent introductions. These consonants do not appear in the chart above because they are not Khmer consonants per se and the sounds do not occur in any Khmer words. These non-native sounds are only heard by speakers familiar with the originating language and have no corresponding symbol in the Khmer script, although combinations of letters otherwise unpronounceable are used to represent these sounds when necessary. In the speech of those who are not bilingual, these sounds are approximated with natively occurring phonemes:
| Foreign Sound (IPA) | Khmer Representation | Khmer Approximation (IPA) |
|---|---|---|
| /ɡ/ | ហ្គ | /k/ |
| /ʃ/ | ហ្ស | /s/ |
| /f/ | ហ្វ | /h/ or /pʰ/ |
| /z/ | ហ្ស | /s/ |
| Long vowels | iː | eː | ɛː | ɨː | əː | aː | uː | oː | ɔː | |
|---|---|---|---|---|---|---|---|---|---|---|
| Short vowels | i | e | ɨ | ə | ɐ | a | u | o | ||
| Long diphthongs | iə | ei | ɐe | ɨə | əɨ | ɐə | ao | uə | ou | ɔə |
| Short diphthongs | eə̆ | uə̆ | oə̆ |
| p | ɓ | t | ɗ | c | k | ʔ | m | n | ɲ | ŋ | j | l | r | s | h | ʋ | |
| p | pʰt- | pɗ- | pʰc | pʰk- | pʔ- | pʰn- | pʰɲ- | pʰŋ- | pʰj- | pʰl- | pr- | ps- | |||||
| t | tʰp- | tɓ | tʰk- | tʔ- | tʰm- | tʰn- | tʰŋ- | tʰj- | tʰl- | tr- | tʰʋ | ||||||
| c | cʰp- | cɓ | cʰk- | cʔ- | cʰm- | cʰn- | cʰŋ- | cʰl- | cr- | cʰʋ- | |||||||
| k | kʰp- | kɓ | kʰt- | kɗ- | kʰc | kʔ- | kʰm- | kʰn- | kʰɲ- | kŋ- | kʰj- | kʰl- | kr- | ks- | kʰʋ- | ||
| s | sp- | sɓ | st- | sɗ- | sk- | sʔ- | sm- | sn- | sɲ- | sŋ- | sl- | sr- | sʋ | ||||
| ʔ | ʔʋ- | ||||||||||||||||
| m | mt- | mɗ- | mc | mʔ- | mʰn- | mʰɲ- | ml- | mr- | ms- | mh- | |||||||
| l | lp- | lɓ | lk- | lʔ- | lm- | lŋ- | lh- | lʋ- | |||||||||
Syllables begin with one of these consonants or consonant clusters, followed by one of the vowel nuclei. When the vowel nucleus is short, there has to be a final consonant. can exist in a syllable coda. /h/ and /ʋ/ become [ç] and [w] respectively. The most common word structure in Khmer is a full syllable as described above, preceded by an unstressed, “minor” syllable that has a consonant-vowel (CV) structure CV-, CrV-, CVN- or CrVN- (N is any nasal in the Khmer inventory). Words can also be made up of two full syllables. The vowel in these preceding syllables is usually reduced in conversation to [ə], however in careful or formal speech and in TV and radio, they are always clearly articulated.
Words with three or more syllables exist, particularly those pertaining to science, the arts, and religion. These words are loanwords, usually derived from Pali, Sanskrit, or more recently, French.
Khmer is generally a Subject Verb Object (SVO) language with prepositions. Although primarily an isolating language, lexical derivation by means of prefixes and infixes is common. Adjectives, demonstratives and numerals follow their noun:
ស្រីស្អាតនោះ /srəj sʔaːt nuh/ (girl pretty that) = that pretty girl
The noun has no grammatical gender or singular/plural distinction. Plurality can be marked by postnominal particles, numerals, or by doubling the adjective, which can also serve to intensify the adjective:
ឆ្កែធំ /cʰkae tʰom/ (dog large) = large dog
ឆ្កែធំណាស់ /cʰkae tʰom nass/ (dog large very) = large dogs or a very large dog
ឆ្កែពីរ /cʰkae piː/ (dog two) = two dogs
Classifying particles for use between numerals and nouns exist although are not obligatory as in, for example, Thai. As is typical of most East Asian languages, the verb does not inflect at all; tense and aspect can be shown by particles and adverbs or understood by context. Verbs are negated by putting "/min/", "/pum/" or "/ʔɐt/" before them and "/teː/" at the end of the sentence or clause.
ខ្ញុំជឿ /kʰɲom cɨə/ - I believe
ខ្ញុំមិនជឿទេ /kʰɲom min cɨə teː/ - I don't believe
Northern Khmer, the dialect spoken in Thailand, is referred to in Khmer as Khmer Surin and, although it only began divergence from standard Khmer within the last 200 years, is considered by some linguists to be a separate language. This is due to its distinct accent influenced by the surrounding tonal language, Thai, lexical differences and its phonemic differences in both vowels and distribution of consonants. Final "r", which has become silent in other dialects of Khmer, is pronounced in Northern Khmer.
Western Khmer, also called Cardamom Khmer, spoken by a small, isolated population in the Cardamom mountain range extending from Cambodia into Thailand, although little studied, is unique in that it maintains a definite system of vocal register that has all but disappeared in other dialects of modern Khmer.
A notable characteristic of Phnom Penh casual speech is merging or complete elision of syllables, considered by speakers from other regions as a "relaxed" pronunciation. For instance, "Phnom Penh" will sometimes be shortened to "m'Penh". Another characteristic of Phnom Penh speech is observed in words with an "r" either as an initial consonant or as the second member of a consonant cluster (as in the English word "bread"). The "r", trilled or flapped in other dialects, is either pronounced as an uvular trill (similar to French) or not pronounced at all. This alters the quality of any preceding consonant causing a harder, more emphasized pronunciation. Another unique result is that the syllable is spoken with a low-rising or "dipping" tone much like the "hỏi" tone in Northern Vietnamese. For example, some people pronounce /trəj/ (meaning "fish") as /təj/, the "r" is dropped and the vowel begins by dipping much lower in tone than standard speech and then rises, effectively doubling its length. Another example is the word /riən/ ("study, learn"). It is pronounced /ʀiən/, with the "uvular r" and the same intonation described above.
As an example, the word for "to eat" used between intimates or in reference to animals is /siː/. Used in polite reference to commoners, it's /ɲam/. When used of those of higher social status, it's /pisa/ or . For monks the word is /cʰan/ and for royals, /saoj/.
The numbers are:
| 0 | ០ | សូន្យ | (son) | /soːu̯n/ |
| 1 | ១ | មួយ | (muŏy) | /muːə̯j/ |
| 2 | ២ | ពីរ | (pi) | /piː/ |
| 3 | ៣ | បី | (bei) | /ɓəj/ |
| 4 | ៤ | បួន | (buŏn) | /ɓuːə̯n/ |
| 5 | ៥ | ប្រាំ | (prăm) | /pram/ |
| 6 | ៦ | ប្រាំមូយ | (prăm muŏy) | |
| 7 | ៧ | ប្រាំពីរ | (prăm pi) | (also ) |
| 8 | ៨ | ប្រាំបី | (prăm bei) | |
| 9 | ៩ | ប្រាំបួន | (prăm buŏn) | |
| 10 | ១០ | ដប់ | (dâp) | /ɗɑp/ |
| 100 | ១០០ | មួយរយ | (muŏy rôy) | |
| 1,000 | ១០០០ | មួយពាន់ | (muŏy péan) | |
| 10,000 | ១០០០០ | មួយម៉ឺន | (muŏy mein) | |
| 100,000 | ១០០០០០ | មួយសែន | (muŏy sên) | |
| 1,000,000 | ១០០០០០០ | មួយលាន | (muŏy léan) |