[html5] r1668 - /
whatwg at whatwg.org
whatwg at whatwg.org
Thu May 22 04:48:37 PDT 2008
Author: ianh
Date: 2008-05-22 04:48:37 -0700 (Thu, 22 May 2008)
New Revision: 1668
Modified:
index
source
Log:
[c] (0) Make entities not be allowed to use non-unicode characters
Modified: index
===================================================================
--- index 2008-05-22 11:27:38 UTC (rev 1667)
+++ index 2008-05-22 11:48:37 UTC (rev 1668)
@@ -44510,13 +44510,23 @@
<td>LATIN CAPITAL LETTER Y WITH DIAERESIS ('Ÿ')
</table>
+ <!-- this is the same as the equivalent list in the input stream
+ section, except it has 0x0000 included in the first range. -->
+
+ <p>Otherwise, if the number is in the range 0x0000 to 0x0008, <!--
+ space characters allowed -->
+ 0x000E to 0x001F, <!-- ASCII allowed
+ --> 0x007F
+ <!--to 0x0084, (0x0085 NEL not allowed), 0x0086--> to 0x009F, 0xD800 to
+ 0xDFFF <!-- surrogates not allowed -->, 0xFDD0 to 0xFDDF, or is one of
+ 0xFFFE, 0xFFFF, 0x1FFFE, 0x1FFFF, 0x2FFFE, 0x2FFFF, 0x3FFFE, 0x3FFFF,
+ 0x4FFFE, 0x4FFFF, 0x5FFFE, 0x5FFFF, 0x6FFFE, 0x6FFFF, 0x7FFFE, 0x7FFFF,
+ 0x8FFFE, 0x8FFFF, 0x9FFFE, 0x9FFFF, 0xAFFFE, 0xAFFFF, 0xBFFFE, 0xBFFFF,
+ 0xCFFFE, 0xCFFFF, 0xDFFFE, 0xDFFFF, 0xEFFFE, 0xEFFFF, 0xFFFFE, 0xFFFFF,
+ 0x10FFFE, or 0x10FFFF, or is higher than 0x10FFFF, then this is a <a
+ href="#parse1">parse error</a>; return a character token for the U+FFFD
+ REPLACEMENT CHARACTER character instead.</p>
- <p>Otherwise, if the number is zero, if the number is higher than
- 0x10FFFF, or if it's one of the surrogate characters (characters in the
- range 0xD800 to 0xDFFF), then this is a <a href="#parse1">parse
- error</a>; return a character token for the U+FFFD REPLACEMENT CHARACTER
- character instead.</p>
-
<p>Otherwise, return a character token for the Unicode character whose
code point is that number.</p>
Modified: source
===================================================================
--- source 2008-05-22 11:27:38 UTC (rev 1667)
+++ source 2008-05-22 11:48:37 UTC (rev 1668)
@@ -41870,9 +41870,18 @@
<tr><td>0x9F <td>U+0178 <td>LATIN CAPITAL LETTER Y WITH DIAERESIS ('Ÿ')
</table>
- <p>Otherwise, if the number is zero, if the number is higher than
- 0x10FFFF, or if it's one of the surrogate characters (characters
- in the range 0xD800 to 0xDFFF), then this is a <span>parse
+ <!-- this is the same as the equivalent list in the input stream
+ section, except it has 0x0000 included in the first range. -->
+ <p>Otherwise, if the number is in the range 0x0000 to 0x0008, <!--
+ space characters allowed --> 0x000E to 0x001F, <!-- ASCII allowed
+ --> 0x007F <!--to 0x0084, (0x0085 NEL not allowed), 0x0086--> to
+ 0x009F, 0xD800 to 0xDFFF <!-- surrogates not allowed -->, 0xFDD0
+ to 0xFDDF, or is one of 0xFFFE, 0xFFFF, 0x1FFFE, 0x1FFFF, 0x2FFFE,
+ 0x2FFFF, 0x3FFFE, 0x3FFFF, 0x4FFFE, 0x4FFFF, 0x5FFFE, 0x5FFFF,
+ 0x6FFFE, 0x6FFFF, 0x7FFFE, 0x7FFFF, 0x8FFFE, 0x8FFFF, 0x9FFFE,
+ 0x9FFFF, 0xAFFFE, 0xAFFFF, 0xBFFFE, 0xBFFFF, 0xCFFFE, 0xCFFFF,
+ 0xDFFFE, 0xDFFFF, 0xEFFFE, 0xEFFFF, 0xFFFFE, 0xFFFFF, 0x10FFFE, or
+ 0x10FFFF, or is higher than 0x10FFFF, then this is a <span>parse
error</span>; return a character token for the U+FFFD REPLACEMENT
CHARACTER character instead.</p>
More information about the Commit-Watchers
mailing list