Xml escape characters. 2 Line and Paragraph Separator, U+2028 .
Xml escape characters 3. Support for control codes Characters in the compatibility area (i. Wenn wirklich Escapes XML 1. For example, the control code ESC (Escape) U+001B would be represented by either the  (hexadecimal) or  (decimal) Numeric Character References. In XML 1. 2 Line and Paragraph Separator, U+2028 Verwendung in XHTML: Die Verwendung von benannten Zeichenreferenzen in Dokumenten, die als XML verarbeitet werden, wird problematisch, wenn die Entities extern (nicht im Dokument selbst) definiert werden und die XML-Prozessoren die externen Dateien nicht lesen. those with a "compatibility formatting tag" in field 5 of the database -- marked by field 5 beginning with a "<") are not allowed. 1 documents is indicated by the version number information in the XML declaration at the start of each document. Characters which have a font or compatibility decomposition (i. Dann werden die Entity-Referenzen nicht durch die entsprechenden Zeichen ersetzt. You can use a character escape to represent any Unicode character in HTML, XHTML or XML using only ASCII characters. with character code greater than #xF900 and less than #xFFFE) are not allowed in XML names. For example, named character references may be referred to as character entity references. 0 processors must continue to reject documents that contain new characters in XML names, new line-end conventions, and references to control characters. Except for Line and Paragraph Separator, or the Byte Order Mark, it is acceptable for browsers and similar user agents to ignore the presence of discouraged characters in HTML or XML. e. Different specifications give different names to these constructs. . [ HTML4. Escape all additional characters as follows: Each additional character is converted to UTF-8 [RFC3629] as one or more bytes. You can use a character escape to represent any Unicode character in HTML, XHTML or XML using only ASCII characters. 01 ] adds to these the form feed character (U+000C), but that character cannot be used in any XHTML version. It is up to authoring tools to ensure proper conversion between these characters and equivalent markup where it exists. 1, if you need to represent a control code explicitly the simplest alternative is to use an NCR (numeric character reference). The resulting bytes are escaped with the URI escaping mechanism (that is, converted to %HH, where HH Characters in the compatibility area (i. The ASCII symbols and punctuation marks, along with a fairly large group of Unicode symbol characters, are excluded from names because they are more useful as delimiters in contexts where XML names are used outside XML documents; providing this group gives those contexts hard guarantees about what cannot be part of an XML name. The character #x037E, The XML and specifications define white space as a combination of one or more of the following characters: U+0020 SPACE, carriage return (U+000D), line feed (U+000A), or tab (U+0009). The distinction between XML 1. We have chosen to use names for this article that are used for HTML5. 0 and XML 1.