SAMPA
Wikipedia, the free encyclopedia - Cite This SourceThe Speech Assessment Methods Phonetic Alphabet (SAMPA) is a computer-readable phonetic script using 7-bit printable ASCII characters, based on the International Phonetic Alphabet (IPA).
It was originally developed in the late 1980s for six European languages by the EEC ESPRIT information technology research and development program. As many symbols as possible have been taken over from the IPA; where this is not possible, other signs that are available are used, e.g. [@] for schwa (IPA [ə]), [2] for the vowel sound found in French deux (IPA [ø]), and [9] for the vowel sound found in French neuf (IPA [œ]).
Today, officially, SAMPA has been developed for all the sounds of the following languages:
Features of SAMPA
SAMPA was developed in the late 1980s in the European Commission funded ESPRIT project 2589 "Speech Assessment Methods" (SAM), hence "SAM Phonetic Alphabet" in order to facilitate email data exchange and computational processing of transcriptions in phonetics and speech technology.SAMPA is a partial encoding of the IPA. The first version of SAMPA was the union of the sets of phoneme codes for Danish, Dutch, English, French, German and Italian; later versions extended SAMPA to cover other European languages. Since SAMPA is based on phoneme inventories, each SAMPA table is valid only in the language it was created for. In order to make this IPA encoding technique universally applicable, X-SAMPA was created, which provides one single table without language-specific differences.
SAMPA was devised as a hack to work around the inability of text encodings to represent IPA symbols. Consequently, as Unicode support for IPA symbols becomes more widespread, the necessity for a separate, computer-readable system for representing the IPA in ASCII decreases. However, Unicode is oriented towards rendering in print media and text input relies on specific keyboard encodings or input devicess. For this reason, SAMPA and X-SAMPA are still widely used in computational phonetics and in speech technology.
See also
- A concise version of SAMPA chart for English sounds.
- A more complete SAMPA chart of the sounds found in most of the European languages.
- Kirshenbaum, sometimes called ASCII-IPA is another ASCII phonetic alphabet.
- IPA, International Phonetic Alphabet
- X-SAMPA, a language-independent notation similar to SAMPA, but covering the entire IPA repertoire.
- CXS, an unofficial, extended version of X-SAMPA used for language construction
References
- Ranchhod, Elisabeth & J. Mamede, Nuno (2002). Advances in Natural Language Processing: Third International Conference, PorTAL 2002, Faro, Portugal, June 23-26, 2002. Proceedings (Lecture Notes in Computer Science). (1st ed.). Springer. ISBN 3-540-43829-7.
- L. DeMiller, Anna & Rettig, James (2000). Linguistics: A Guide to the Reference Literature (2nd ed.). Libraries Unlimited. ISBN 1-56308-619-0.
- Lamberts, Koen & Goldstone, Rob (2004). Handbook of Cognition. Sage Publications Ltd. ISBN 0-7619-7277-3.
External links
- SAMPA computer readable phonetic alphabet
- Phonemic notation of English in SAMPA
- SAMPA for Scots
- converter from (german) written text to SAMPA and IPA (Ajax-application)
Wikipedia, the free encyclopedia © 2001-2006 Wikipedia contributors (Disclaimer)
This article is licensed under the GNU Free Documentation License.
Last updated on Tuesday January 08, 2008 at 03:46:34 PST (GMT -0800)
View this article at Wikipedia.org - Edit this article at Wikipedia.org - Donate to the Wikimedia Foundation