[html5] r942 - /

whatwg at whatwg.org whatwg at whatwg.org
Thu Jun 21 17:00:02 PDT 2007


Author: ianh
Date: 2007-06-21 16:57:48 -0700 (Thu, 21 Jun 2007)
New Revision: 942

Modified:
   index
   source
Log:
[ciot] (2) 
 now turns into a U+000A LINE FEED character

Modified: index
===================================================================
--- index	2007-06-21 08:52:55 UTC (rev 941)
+++ index	2007-06-21 23:57:48 UTC (rev 942)
@@ -34779,12 +34779,12 @@
 
     <p>If one or more characters match the range, then take them all and
      interpret the string of characters as a number (either hexadecimal or
-     decimal as appropriate).
+     decimal as appropriate).</p>
 
-    <p>If that number is in the range 128 to 159 (0x80 to 0x9F), then this is
-     a <a href="#parse">parse error</a>. In the following table, find the row
-     with that number in the first column, and return a character token for
-     the Unicode character given in the second column of that row.</p>
+    <p>If that number is one of the numbers in the first column of the
+     following table, then this is a <a href="#parse">parse error</a>. Find
+     the row with that number in the first column, and return a character
+     token for the Unicode character given in the second column of that row.</p>
 
     <table>
      <thead>
@@ -34795,6 +34795,13 @@
 
      <tbody>
       <tr>
+       <td>0x0D
+
+       <td>U+000A
+
+       <td>LINE FEED (LF)
+
+      <tr>
        <td>0x80
 
        <td>U+20AC
@@ -35051,13 +35058,6 @@
     </div>
   </dl>
 
-  <p class=big-issue>It seems browsers convert CRs to LFs even as entities.
-   Should we also do that? If so, we should remove the CRs in the tree
-   construction phase.</p>
-  <!-- IE is not such a browser; you
-  can prove that by comparing id="&#xA;X" to id="&#xD;X" in terms of
-  getElementById('\nX') vs '\rX'. -->
-
   <h4 id=tree-construction><span class=secno>8.2.4. </span><dfn
    id=tree-construction0>Tree construction</dfn></h4>
 
@@ -35114,7 +35114,7 @@
   <dl class=switch>
    <dt>A character token that <em>is</em> one of one of U+0009 CHARACTER
     TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM
-    FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
+    FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
 
    <dd>
     <p><a href="#append" title="append a character">Append that character</a>
@@ -35411,7 +35411,7 @@
 
    <dt>A character token that is not one of one of U+0009 CHARACTER
     TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM
-    FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
+    FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
 
    <dt>An end-of-file token
 
@@ -35446,7 +35446,7 @@
 
    <dt>A character token that is one of one of U+0009 CHARACTER TABULATION,
     U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF),
-    U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
+    <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
 
    <dd>
     <p><a href="#append" title="append a character">Append that character</a>
@@ -35454,7 +35454,7 @@
 
    <dt>A character token that is <em>not</em> one of U+0009 CHARACTER
     TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM
-    FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
+    FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
 
    <dt>A start tag token
 
@@ -36090,7 +36090,7 @@
       <dl class=switch>
        <dt>A character token that is one of one of U+0009 CHARACTER
         TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
-        FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
+        FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
 
        <dd>
         <p><a href="#append" title="append a character">Append the
@@ -36146,7 +36146,7 @@
 
        <dt>A character token that is <em>not</em> one of U+0009 CHARACTER
         TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
-        FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
+        FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
 
        <dt>Any other start tag token
 
@@ -36171,7 +36171,7 @@
       <dl class=switch>
        <dt>A character token that is one of one of U+0009 CHARACTER
         TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
-        FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
+        FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
 
        <dd>
         <p><a href="#append" title="append a character">Append the
@@ -36397,7 +36397,7 @@
 
        <dt>A character token that is one of one of U+0009 CHARACTER
         TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
-        FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
+        FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
 
        <dt>A comment token
 
@@ -36426,7 +36426,7 @@
       <dl class=switch>
        <dt>A character token that is one of one of U+0009 CHARACTER
         TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
-        FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
+        FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
 
        <dd>
         <p><a href="#append" title="append a character">Append the
@@ -37455,7 +37455,7 @@
       <dl class=switch>
        <dt>A character token that is one of one of U+0009 CHARACTER
         TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
-        FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
+        FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
 
        <dd>
         <p><a href="#append" title="append a character">Append the
@@ -37685,7 +37685,7 @@
       <dl class=switch>
        <dt>A character token that is one of one of U+0009 CHARACTER
         TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
-        FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
+        FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
 
        <dd>
         <p><a href="#append" title="append a character">Append the
@@ -38131,7 +38131,7 @@
       <dl class=switch>
        <dt>A character token that is one of one of U+0009 CHARACTER
         TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
-        FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
+        FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
 
        <dd>
         <p>Process the token as it would be processed if the <a
@@ -38178,7 +38178,7 @@
       <dl class=switch>
        <dt>A character token that is one of one of U+0009 CHARACTER
         TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
-        FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
+        FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
 
        <dd>
         <p><a href="#append" title="append a character">Append the
@@ -38247,7 +38247,7 @@
       <dl class=switch>
        <dt>A character token that is one of one of U+0009 CHARACTER
         TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
-        FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
+        FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
 
        <dd>
         <p><a href="#append" title="append a character">Append the
@@ -38307,7 +38307,7 @@
 
    <dt>A character token that is one of one of U+0009 CHARACTER TABULATION,
     U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF),
-    U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
+    <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
 
    <dd>
     <p>Process the token as it would be processed in <a href="#the-main0">the
@@ -38315,7 +38315,7 @@
 
    <dt>A character token that is <em>not</em> one of U+0009 CHARACTER
     TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM
-    FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
+    FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
 
    <dt>A start tag token
 

Modified: source
===================================================================
--- source	2007-06-21 08:52:55 UTC (rev 941)
+++ source	2007-06-21 23:57:48 UTC (rev 942)
@@ -32290,18 +32290,19 @@
 
     <p>If one or more characters match the range, then take them all
     and interpret the string of characters as a number (either
-    hexadecimal or decimal as appropriate).
+    hexadecimal or decimal as appropriate).</p>
 
-    <p>If that number is in the range 128 to 159 (0x80 to 0x9F), then
-    this is a <span>parse error</span>. In the following table, find
-    the row with that number in the first column, and return a
-    character token for the Unicode character given in the second
-    column of that row.</p>
+    <p>If that number is one of the numbers in the first column of the
+    following table, then this is a <span>parse error</span>. Find the
+    row with that number in the first column, and return a character
+    token for the Unicode character given in the second column of that
+    row.</p>
 
     <table>
      <thead>
       <tr><th>Number <th colspan=2>Unicode character
      <tbody>
+      <tr><td>0x0D <td>U+000A <td>LINE FEED (LF)
       <tr><td>0x80 <td>U+20AC <td>EURO SIGN ('&#x20AC')
       <tr><td>0x81 <td>U+FFFD <td>REPLACEMENT CHARACTER
       <tr><td>0x82 <td>U+201A <td>SINGLE LOW-9 QUOTATION MARK ('&#x201A')
@@ -32382,12 +32383,8 @@
 
   </dl>
 
-  <p class="big-issue">It seems browsers convert CRs to LFs even as
-  entities. Should we also do that? If so, we should remove the CRs in
-  the tree construction phase.</p> <!-- IE is not such a browser; you
-  can prove that by comparing id="&#xA;X" to id="&#xD;X" in terms of
-  getElementById('\nX') vs '\rX'. -->
 
+
   <h4><dfn>Tree construction</dfn></h4>
 
   <p>The input to the tree construction stage is a sequence of tokens
@@ -32442,7 +32439,7 @@
 
    <dt>A character token that <em>is</em> one of one of U+0009
    CHARACTER TABULATION, U+000A LINE FEED (LF), U+000B LINE
-   TABULATION, U+000C FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or
+   TABULATION, U+000C FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or
    U+0020 SPACE</dt>
    <dd>
     <p><span title="append a character">Append that character</span>
@@ -32589,7 +32586,7 @@
    <dt>An end tag token</dt>
    <dt>A character token that is not one of one of U+0009 CHARACTER
    TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
-   FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE</dt>
+   FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE</dt>
    <dt>An end-of-file token</dt>
    <dd>
 
@@ -32627,7 +32624,7 @@
 
    <dt>A character token that is one of one of U+0009 CHARACTER
    TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
-   FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE</dt>
+   FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE</dt>
    <dd>
     <p><span title="append a character">Append that character</span>
     to the <code>Document</code> node.</p>
@@ -32635,7 +32632,7 @@
 
    <dt>A character token that is <em>not</em> one of U+0009 CHARACTER
    TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
-   FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE</dt>
+   FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE</dt>
    <dt>A start tag token</dt>
    <dt>An end tag token</dt>
    <dt>An end-of-file token</dt>
@@ -33241,7 +33238,7 @@
 
        <dt>A character token that is one of one of U+0009
        CHARACTER TABULATION, U+000A LINE FEED (LF), U+000B LINE
-       TABULATION, U+000C FORM FEED (FF), U+000D CARRIAGE RETURN (CR),
+       TABULATION, U+000C FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),-->
        or U+0020 SPACE</dt>
        <dd>
         <p><span title="append a character">Append the character</span> to
@@ -33307,7 +33304,7 @@
 
        <dt>A character token that is <em>not</em> one of U+0009 CHARACTER
        TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
-       FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE</dt>
+       FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE</dt>
        <dt>Any other start tag token</dt>
        <dd>
 
@@ -33336,7 +33333,7 @@
 
        <dt>A character token that is one of one of U+0009
        CHARACTER TABULATION, U+000A LINE FEED (LF), U+000B LINE
-       TABULATION, U+000C FORM FEED (FF), U+000D CARRIAGE RETURN (CR),
+       TABULATION, U+000C FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),-->
        or U+0020 SPACE</dt>
        <dd>
         <p><span title="append a character">Append the character</span> to
@@ -33570,7 +33567,7 @@
 
        <dt>A character token that is one of one of U+0009
        CHARACTER TABULATION, U+000A LINE FEED (LF), U+000B LINE
-       TABULATION, U+000C FORM FEED (FF), U+000D CARRIAGE RETURN (CR),
+       TABULATION, U+000C FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),-->
        or U+0020 SPACE</dt>
        <dt>A comment token</dt>
        <dt>A start tag whose tag name is one of: "head", "link", "style"</dt>
@@ -33602,7 +33599,7 @@
 
        <dt>A character token that is one of one of U+0009
        CHARACTER TABULATION, U+000A LINE FEED (LF), U+000B LINE
-       TABULATION, U+000C FORM FEED (FF), U+000D CARRIAGE RETURN (CR),
+       TABULATION, U+000C FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),-->
        or U+0020 SPACE</dt>
        <dd>
         <p><span title="append a character">Append the character</span> to
@@ -34694,7 +34691,7 @@
 
        <dt>A character token that is one of one of U+0009
        CHARACTER TABULATION, U+000A LINE FEED (LF), U+000B LINE
-       TABULATION, U+000C FORM FEED (FF), U+000D CARRIAGE RETURN (CR),
+       TABULATION, U+000C FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),-->
        or U+0020 SPACE</dt>
        <dd>
         <p><span title="append a character">Append the character</span> to
@@ -34933,7 +34930,7 @@
 
        <dt>A character token that is one of one of U+0009
        CHARACTER TABULATION, U+000A LINE FEED (LF), U+000B LINE
-       TABULATION, U+000C FORM FEED (FF), U+000D CARRIAGE RETURN (CR),
+       TABULATION, U+000C FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),-->
        or U+0020 SPACE</dt>
        <dd>
         <p><span title="append a character">Append the character</span> to
@@ -35430,7 +35427,7 @@
 
        <dt>A character token that is one of one of U+0009
        CHARACTER TABULATION, U+000A LINE FEED (LF), U+000B LINE
-       TABULATION, U+000C FORM FEED (FF), U+000D CARRIAGE RETURN (CR),
+       TABULATION, U+000C FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),-->
        or U+0020 SPACE</dt>
        <dd>
         <p>Process the token as it would be processed if the
@@ -35483,7 +35480,7 @@
 
        <dt>A character token that is one of one of U+0009
        CHARACTER TABULATION, U+000A LINE FEED (LF), U+000B LINE
-       TABULATION, U+000C FORM FEED (FF), U+000D CARRIAGE RETURN (CR),
+       TABULATION, U+000C FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),-->
        or U+0020 SPACE</dt>
        <dd>
         <p><span title="append a character">Append the character</span> to
@@ -35556,7 +35553,7 @@
 
        <dt>A character token that is one of one of U+0009
        CHARACTER TABULATION, U+000A LINE FEED (LF), U+000B LINE
-       TABULATION, U+000C FORM FEED (FF), U+000D CARRIAGE RETURN (CR),
+       TABULATION, U+000C FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),-->
        or U+0020 SPACE</dt>
        <dd>
         <p><span title="append a character">Append the character</span> to
@@ -35624,7 +35621,7 @@
 
    <dt>A character token that is one of one of U+0009 CHARACTER
    TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
-   FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE</dt>
+   FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE</dt>
    <dd>
     <p>Process the token as it would be processed in <span>the main
     phase</span>.</p>
@@ -35632,7 +35629,7 @@
 
    <dt>A character token that is <em>not</em> one of U+0009 CHARACTER
    TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
-   FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE</dt>
+   FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE</dt>
    <dt>A start tag token</dt>
    <dt>An end tag token</dt>
    <dd>




More information about the Commit-Watchers mailing list