[html5] r8128 - [e] (0) Hide redundant rows since otherwise it'd be two parse errors for no reas [...]

whatwg at whatwg.org whatwg at whatwg.org
Mon Aug 5 11:15:33 PDT 2013


Author: ianh
Date: 2013-08-05 11:15:31 -0700 (Mon, 05 Aug 2013)
New Revision: 8128

Modified:
   complete.html
   index
   source
Log:
[e] (0) Hide redundant rows since otherwise it'd be two parse errors for no reason.
Fixing https://www.w3.org/Bugs/Public/show_bug.cgi?id=22873
Affected topics: HTML Syntax and Parsing

Modified: complete.html
===================================================================
--- complete.html	2013-08-05 18:06:40 UTC (rev 8127)
+++ complete.html	2013-08-05 18:15:31 UTC (rev 8128)
@@ -86151,6 +86151,7 @@
    to aid in application development.</li>
 
   </ol><h5 id=preprocessing-the-input-stream><span class=secno>12.2.2.5 </span>Preprocessing the input stream</h5>
+<!--CLEANUP-->
 
   <p>The <dfn id=input-stream>input stream</dfn> consists of the characters pushed
   into it as the <a href=#the-input-byte-stream>input byte stream</a> is decoded or from the
@@ -88719,6 +88720,7 @@
     </dl><p>Consume as many characters as match the range of characters given above (<a href=#ascii-hex-digits>ASCII hex
     digits</a> or <a href=#ascii-digits>ASCII digits</a>).</p>
 
+<!--CLEANUP-->
     <p>If no characters match the range, then don't consume any
     characters (and unconsume the U+0023 NUMBER SIGN character and, if
     appropriate, the X character). This is a <a href=#parse-error>parse
@@ -88740,9 +88742,9 @@
 
     <table id=table-charref-overrides><thead><tr><th>Number <th colspan=2>Unicode character
      <tbody><tr><td>0x00 <td>U+FFFD <td>REPLACEMENT CHARACTER
-      <tr><td>0x0D <td>U+000D <td>CARRIAGE RETURN (CR)
+      <!-- <tr><td>0x0D <td>U+000D <td>CARRIAGE RETURN (CR) -->
       <tr><td>0x80 <td>U+20AC <td>EURO SIGN (€)
-      <tr><td>0x81 <td>U+0081 <td><control>
+      <!-- <tr><td>0x81 <td>U+0081 <td><control> -->
       <tr><td>0x82 <td>U+201A <td>SINGLE LOW-9 QUOTATION MARK (‚)
       <tr><td>0x83 <td>U+0192 <td>LATIN SMALL LETTER F WITH HOOK (ƒ)
       <tr><td>0x84 <td>U+201E <td>DOUBLE LOW-9 QUOTATION MARK („)
@@ -88754,10 +88756,10 @@
       <tr><td>0x8A <td>U+0160 <td>LATIN CAPITAL LETTER S WITH CARON (Š)
       <tr><td>0x8B <td>U+2039 <td>SINGLE LEFT-POINTING ANGLE QUOTATION MARK (‹)
       <tr><td>0x8C <td>U+0152 <td>LATIN CAPITAL LIGATURE OE (Œ)
-      <tr><td>0x8D <td>U+008D <td><control>
+      <!-- <tr><td>0x8D <td>U+008D <td><control> -->
       <tr><td>0x8E <td>U+017D <td>LATIN CAPITAL LETTER Z WITH CARON (Ž)
-      <tr><td>0x8F <td>U+008F <td><control>
-      <tr><td>0x90 <td>U+0090 <td><control>
+      <!-- <tr><td>0x8F <td>U+008F <td><control> -->
+      <!-- <tr><td>0x90 <td>U+0090 <td><control> -->
       <tr><td>0x91 <td>U+2018 <td>LEFT SINGLE QUOTATION MARK (‘)
       <tr><td>0x92 <td>U+2019 <td>RIGHT SINGLE QUOTATION MARK (’)
       <tr><td>0x93 <td>U+201C <td>LEFT DOUBLE QUOTATION MARK (“)
@@ -88770,7 +88772,7 @@
       <tr><td>0x9A <td>U+0161 <td>LATIN SMALL LETTER S WITH CARON (š)
       <tr><td>0x9B <td>U+203A <td>SINGLE RIGHT-POINTING ANGLE QUOTATION MARK (›)
       <tr><td>0x9C <td>U+0153 <td>LATIN SMALL LIGATURE OE (œ)
-      <tr><td>0x9D <td>U+009D <td><control>
+      <!-- <tr><td>0x9D <td>U+009D <td><control> -->
       <tr><td>0x9E <td>U+017E <td>LATIN SMALL LETTER Z WITH CARON (ž)
       <tr><td>0x9F <td>U+0178 <td>LATIN CAPITAL LETTER Y WITH DIAERESIS (Ÿ)
     </table><p>Otherwise, if the number is in the range 0xD800 to 0xDFFF<!--
@@ -88781,11 +88783,10 @@
     <p>Otherwise, return a character token for the Unicode character
     whose code point is that number.
 
-    <!-- this is the same as the equivalent list in the input stream
-    section -->
+    <!-- this is the same as the equivalent list in the input stream section, except U+000D is not allowed -->
     Additionally, if the number is in the range 0x0001 to 0x0008, <!--
-    HT, LF allowed --> <!-- U+000B is in the next list --> <!-- FF, CR
-    allowed --> 0x000E to 0x001F, <!-- ASCII allowed --> 0x007F <!--to
+    HT, LF allowed --> <!-- U+000B is in the next list --> <!-- FF
+    allowed --> 0x000D to 0x001F, <!-- ASCII allowed --> 0x007F <!--to
     0x0084, (0x0085 NEL not allowed), 0x0086--> to 0x009F, 0xFDD0 to
     0xFDEF, or is one of 0x000B, 0xFFFE, 0xFFFF, 0x1FFFE, 0x1FFFF,
     0x2FFFE, 0x2FFFF, 0x3FFFE, 0x3FFFF, 0x4FFFE, 0x4FFFF, 0x5FFFE,

Modified: index
===================================================================
--- index	2013-08-05 18:06:40 UTC (rev 8127)
+++ index	2013-08-05 18:15:31 UTC (rev 8128)
@@ -86151,6 +86151,7 @@
    to aid in application development.</li>
 
   </ol><h5 id=preprocessing-the-input-stream><span class=secno>12.2.2.5 </span>Preprocessing the input stream</h5>
+<!--CLEANUP-->
 
   <p>The <dfn id=input-stream>input stream</dfn> consists of the characters pushed
   into it as the <a href=#the-input-byte-stream>input byte stream</a> is decoded or from the
@@ -88719,6 +88720,7 @@
     </dl><p>Consume as many characters as match the range of characters given above (<a href=#ascii-hex-digits>ASCII hex
     digits</a> or <a href=#ascii-digits>ASCII digits</a>).</p>
 
+<!--CLEANUP-->
     <p>If no characters match the range, then don't consume any
     characters (and unconsume the U+0023 NUMBER SIGN character and, if
     appropriate, the X character). This is a <a href=#parse-error>parse
@@ -88740,9 +88742,9 @@
 
     <table id=table-charref-overrides><thead><tr><th>Number <th colspan=2>Unicode character
      <tbody><tr><td>0x00 <td>U+FFFD <td>REPLACEMENT CHARACTER
-      <tr><td>0x0D <td>U+000D <td>CARRIAGE RETURN (CR)
+      <!-- <tr><td>0x0D <td>U+000D <td>CARRIAGE RETURN (CR) -->
       <tr><td>0x80 <td>U+20AC <td>EURO SIGN (€)
-      <tr><td>0x81 <td>U+0081 <td><control>
+      <!-- <tr><td>0x81 <td>U+0081 <td><control> -->
       <tr><td>0x82 <td>U+201A <td>SINGLE LOW-9 QUOTATION MARK (‚)
       <tr><td>0x83 <td>U+0192 <td>LATIN SMALL LETTER F WITH HOOK (ƒ)
       <tr><td>0x84 <td>U+201E <td>DOUBLE LOW-9 QUOTATION MARK („)
@@ -88754,10 +88756,10 @@
       <tr><td>0x8A <td>U+0160 <td>LATIN CAPITAL LETTER S WITH CARON (Š)
       <tr><td>0x8B <td>U+2039 <td>SINGLE LEFT-POINTING ANGLE QUOTATION MARK (‹)
       <tr><td>0x8C <td>U+0152 <td>LATIN CAPITAL LIGATURE OE (Œ)
-      <tr><td>0x8D <td>U+008D <td><control>
+      <!-- <tr><td>0x8D <td>U+008D <td><control> -->
       <tr><td>0x8E <td>U+017D <td>LATIN CAPITAL LETTER Z WITH CARON (Ž)
-      <tr><td>0x8F <td>U+008F <td><control>
-      <tr><td>0x90 <td>U+0090 <td><control>
+      <!-- <tr><td>0x8F <td>U+008F <td><control> -->
+      <!-- <tr><td>0x90 <td>U+0090 <td><control> -->
       <tr><td>0x91 <td>U+2018 <td>LEFT SINGLE QUOTATION MARK (‘)
       <tr><td>0x92 <td>U+2019 <td>RIGHT SINGLE QUOTATION MARK (’)
       <tr><td>0x93 <td>U+201C <td>LEFT DOUBLE QUOTATION MARK (“)
@@ -88770,7 +88772,7 @@
       <tr><td>0x9A <td>U+0161 <td>LATIN SMALL LETTER S WITH CARON (š)
       <tr><td>0x9B <td>U+203A <td>SINGLE RIGHT-POINTING ANGLE QUOTATION MARK (›)
       <tr><td>0x9C <td>U+0153 <td>LATIN SMALL LIGATURE OE (œ)
-      <tr><td>0x9D <td>U+009D <td><control>
+      <!-- <tr><td>0x9D <td>U+009D <td><control> -->
       <tr><td>0x9E <td>U+017E <td>LATIN SMALL LETTER Z WITH CARON (ž)
       <tr><td>0x9F <td>U+0178 <td>LATIN CAPITAL LETTER Y WITH DIAERESIS (Ÿ)
     </table><p>Otherwise, if the number is in the range 0xD800 to 0xDFFF<!--
@@ -88781,11 +88783,10 @@
     <p>Otherwise, return a character token for the Unicode character
     whose code point is that number.
 
-    <!-- this is the same as the equivalent list in the input stream
-    section -->
+    <!-- this is the same as the equivalent list in the input stream section, except U+000D is not allowed -->
     Additionally, if the number is in the range 0x0001 to 0x0008, <!--
-    HT, LF allowed --> <!-- U+000B is in the next list --> <!-- FF, CR
-    allowed --> 0x000E to 0x001F, <!-- ASCII allowed --> 0x007F <!--to
+    HT, LF allowed --> <!-- U+000B is in the next list --> <!-- FF
+    allowed --> 0x000D to 0x001F, <!-- ASCII allowed --> 0x007F <!--to
     0x0084, (0x0085 NEL not allowed), 0x0086--> to 0x009F, 0xFDD0 to
     0xFDEF, or is one of 0x000B, 0xFFFE, 0xFFFF, 0x1FFFE, 0x1FFFF,
     0x2FFFE, 0x2FFFF, 0x3FFFE, 0x3FFFF, 0x4FFFE, 0x4FFFF, 0x5FFFE,

Modified: source
===================================================================
--- source	2013-08-05 18:06:40 UTC (rev 8127)
+++ source	2013-08-05 18:15:31 UTC (rev 8128)
@@ -96144,6 +96144,7 @@
 
 
   <h5>Preprocessing the input stream</h5>
+<!--CLEANUP-->
 
   <p>The <dfn>input stream</dfn> consists of the characters pushed
   into it as the <span>input byte stream</span> is decoded or from the
@@ -99103,6 +99104,7 @@
     <p>Consume as many characters as match the range of characters given above (<span>ASCII hex
     digits</span> or <span>ASCII digits</span>).</p>
 
+<!--CLEANUP-->
     <p>If no characters match the range, then don't consume any
     characters (and unconsume the U+0023 NUMBER SIGN character and, if
     appropriate, the X character). This is a <span>parse
@@ -99127,9 +99129,9 @@
       <tr><th>Number <th colspan=2>Unicode character
      <tbody>
       <tr><td>0x00 <td>U+FFFD <td>REPLACEMENT CHARACTER
-      <tr><td>0x0D <td>U+000D <td>CARRIAGE RETURN (CR)
+      <!-- <tr><td>0x0D <td>U+000D <td>CARRIAGE RETURN (CR) -->
       <tr><td>0x80 <td>U+20AC <td>EURO SIGN (&#x20AC;)
-      <tr><td>0x81 <td>U+0081 <td><control>
+      <!-- <tr><td>0x81 <td>U+0081 <td><control> -->
       <tr><td>0x82 <td>U+201A <td>SINGLE LOW-9 QUOTATION MARK (&#x201A;)
       <tr><td>0x83 <td>U+0192 <td>LATIN SMALL LETTER F WITH HOOK (&#x0192;)
       <tr><td>0x84 <td>U+201E <td>DOUBLE LOW-9 QUOTATION MARK (&#x201E;)
@@ -99141,10 +99143,10 @@
       <tr><td>0x8A <td>U+0160 <td>LATIN CAPITAL LETTER S WITH CARON (&#x0160;)
       <tr><td>0x8B <td>U+2039 <td>SINGLE LEFT-POINTING ANGLE QUOTATION MARK (&#x2039;)
       <tr><td>0x8C <td>U+0152 <td>LATIN CAPITAL LIGATURE OE (&#x0152;)
-      <tr><td>0x8D <td>U+008D <td><control>
+      <!-- <tr><td>0x8D <td>U+008D <td><control> -->
       <tr><td>0x8E <td>U+017D <td>LATIN CAPITAL LETTER Z WITH CARON (&#x017D;)
-      <tr><td>0x8F <td>U+008F <td><control>
-      <tr><td>0x90 <td>U+0090 <td><control>
+      <!-- <tr><td>0x8F <td>U+008F <td><control> -->
+      <!-- <tr><td>0x90 <td>U+0090 <td><control> -->
       <tr><td>0x91 <td>U+2018 <td>LEFT SINGLE QUOTATION MARK (&#x2018;)
       <tr><td>0x92 <td>U+2019 <td>RIGHT SINGLE QUOTATION MARK (&#x2019;)
       <tr><td>0x93 <td>U+201C <td>LEFT DOUBLE QUOTATION MARK (&#x201C;)
@@ -99157,7 +99159,7 @@
       <tr><td>0x9A <td>U+0161 <td>LATIN SMALL LETTER S WITH CARON (&#x0161;)
       <tr><td>0x9B <td>U+203A <td>SINGLE RIGHT-POINTING ANGLE QUOTATION MARK (&#x203A;)
       <tr><td>0x9C <td>U+0153 <td>LATIN SMALL LIGATURE OE (&#x0153;)
-      <tr><td>0x9D <td>U+009D <td><control>
+      <!-- <tr><td>0x9D <td>U+009D <td><control> -->
       <tr><td>0x9E <td>U+017E <td>LATIN SMALL LETTER Z WITH CARON (&#x017E;)
       <tr><td>0x9F <td>U+0178 <td>LATIN CAPITAL LETTER Y WITH DIAERESIS (&#x0178;)
     </table>
@@ -99170,11 +99172,10 @@
     <p>Otherwise, return a character token for the Unicode character
     whose code point is that number.
 
-    <!-- this is the same as the equivalent list in the input stream
-    section -->
+    <!-- this is the same as the equivalent list in the input stream section, except U+000D is not allowed -->
     Additionally, if the number is in the range 0x0001 to 0x0008, <!--
-    HT, LF allowed --> <!-- U+000B is in the next list --> <!-- FF, CR
-    allowed --> 0x000E to 0x001F, <!-- ASCII allowed --> 0x007F <!--to
+    HT, LF allowed --> <!-- U+000B is in the next list --> <!-- FF
+    allowed --> 0x000D to 0x001F, <!-- ASCII allowed --> 0x007F <!--to
     0x0084, (0x0085 NEL not allowed), 0x0086--> to 0x009F, 0xFDD0 to
     0xFDEF, or is one of 0x000B, 0xFFFE, 0xFFFF, 0x1FFFE, 0x1FFFF,
     0x2FFFE, 0x2FFFF, 0x3FFFE, 0x3FFFF, 0x4FFFE, 0x4FFFF, 0x5FFFE,




More information about the Commit-Watchers mailing list