Precomposed character
Wikipedia, the free encyclopedia - Cite This SourceA precomposed character (alternatively decomposable character) is a Unicode entity that can be decomposed into an equivalent string of several other characters. Typically, a precomposed character is decomposed into the main character and a combining diacritical mark.
The precomposed characters are included in the character set to aid computer systems with incomplete Unicode support, where decomposed equivalent characters may render incorrectly.
Similarly, ligatures are precompositions of their constituent letters or graphemes.
For example, the two strings
- ḱṷṓn (U+006B U+0301 U+0075 U+032D U+006F U+0304 U+0301 U+006E) and
- ḱṷṓn (U+1E31 U+1E77 U+1E53 U+006E)
OpenType has the ccmp "feature tag" to define glyphs that are compositions or decompositions involving combining characters.
In theory, most Chinese characters as encoded by Han unification and similar schemes could be treated as precomposed characters, since they can be reduced (decomposed) to their constituent strokes and ideograph descriptions, though Unicode does not take this approach that would certainly be on the cutting edge of text storage and layout. Such an approach could potentially reduce the number of characters in the character set from tens of thousands to just a few hundred.
See also
External links
- Free Idg Serif, a derivative of the FreeSerif font with added declarations of precomposed characters.
Wikipedia, the free encyclopedia © 2001-2006 Wikipedia contributors (Disclaimer)
This article is licensed under the GNU Free Documentation License.
Last updated on Tuesday July 15, 2008 at 13:40:56 PDT (GMT -0700)
View this article at Wikipedia.org - Edit this article at Wikipedia.org - Donate to the Wikimedia Foundation