Character entity reference

Wikipedia, the free encyclopedia - Cite This Source

In the markup languages SGML, HTML, XHTML and XML, a character entity reference is a reference to a particular kind of named entity that has been predefined or explicitly declared in a Document Type Definition (DTD). The replacement text of the entity consists of a single character from the Universal Character Set/Unicode. The purpose of a character entity reference is to provide a way to refer to a character that is not universally encodable.

Actually, in XML at least, the term "character entity reference" is incorrect. XML has two relevant concepts:

  • a "predefined entity reference" is a reference to one of the special characters denoted by <, >, &, ", or ';
  • while a "character reference" (or "numeric character reference") is a construct such as   or   that refers to a character by means of its numeric Unicode codepoint.

Although in popular usage character references are often called "entity references" or even "entities", this usage is wrong. A character reference is a reference to a character, not to an entity.

See also

External links



Wikipedia, the free encyclopedia © 2001-2006 Wikipedia contributors (Disclaimer)
This article is licensed under the GNU Free Documentation License.
Last updated on Monday February 25, 2008 at 06:39:44 PST (GMT -0800)
View this article at Wikipedia.org - Edit this article at Wikipedia.org - Donate to the Wikimedia Foundation