[html5] r942 - /
whatwg at whatwg.org
whatwg at whatwg.org
Thu Jun 21 17:00:02 PDT 2007
Author: ianh
Date: 2007-06-21 16:57:48 -0700 (Thu, 21 Jun 2007)
New Revision: 942
Modified:
index
source
Log:
[ciot] (2) 
 now turns into a U+000A LINE FEED character
Modified: index
===================================================================
--- index 2007-06-21 08:52:55 UTC (rev 941)
+++ index 2007-06-21 23:57:48 UTC (rev 942)
@@ -34779,12 +34779,12 @@
<p>If one or more characters match the range, then take them all and
interpret the string of characters as a number (either hexadecimal or
- decimal as appropriate).
+ decimal as appropriate).</p>
- <p>If that number is in the range 128 to 159 (0x80 to 0x9F), then this is
- a <a href="#parse">parse error</a>. In the following table, find the row
- with that number in the first column, and return a character token for
- the Unicode character given in the second column of that row.</p>
+ <p>If that number is one of the numbers in the first column of the
+ following table, then this is a <a href="#parse">parse error</a>. Find
+ the row with that number in the first column, and return a character
+ token for the Unicode character given in the second column of that row.</p>
<table>
<thead>
@@ -34795,6 +34795,13 @@
<tbody>
<tr>
+ <td>0x0D
+
+ <td>U+000A
+
+ <td>LINE FEED (LF)
+
+ <tr>
<td>0x80
<td>U+20AC
@@ -35051,13 +35058,6 @@
</div>
</dl>
- <p class=big-issue>It seems browsers convert CRs to LFs even as entities.
- Should we also do that? If so, we should remove the CRs in the tree
- construction phase.</p>
- <!-- IE is not such a browser; you
- can prove that by comparing id="
X" to id="
X" in terms of
- getElementById('\nX') vs '\rX'. -->
-
<h4 id=tree-construction><span class=secno>8.2.4. </span><dfn
id=tree-construction0>Tree construction</dfn></h4>
@@ -35114,7 +35114,7 @@
<dl class=switch>
<dt>A character token that <em>is</em> one of one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM
- FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
+ FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
<dd>
<p><a href="#append" title="append a character">Append that character</a>
@@ -35411,7 +35411,7 @@
<dt>A character token that is not one of one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM
- FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
+ FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
<dt>An end-of-file token
@@ -35446,7 +35446,7 @@
<dt>A character token that is one of one of U+0009 CHARACTER TABULATION,
U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF),
- U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
+ <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
<dd>
<p><a href="#append" title="append a character">Append that character</a>
@@ -35454,7 +35454,7 @@
<dt>A character token that is <em>not</em> one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM
- FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
+ FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
<dt>A start tag token
@@ -36090,7 +36090,7 @@
<dl class=switch>
<dt>A character token that is one of one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
- FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
+ FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
<dd>
<p><a href="#append" title="append a character">Append the
@@ -36146,7 +36146,7 @@
<dt>A character token that is <em>not</em> one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
- FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
+ FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
<dt>Any other start tag token
@@ -36171,7 +36171,7 @@
<dl class=switch>
<dt>A character token that is one of one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
- FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
+ FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
<dd>
<p><a href="#append" title="append a character">Append the
@@ -36397,7 +36397,7 @@
<dt>A character token that is one of one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
- FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
+ FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
<dt>A comment token
@@ -36426,7 +36426,7 @@
<dl class=switch>
<dt>A character token that is one of one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
- FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
+ FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
<dd>
<p><a href="#append" title="append a character">Append the
@@ -37455,7 +37455,7 @@
<dl class=switch>
<dt>A character token that is one of one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
- FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
+ FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
<dd>
<p><a href="#append" title="append a character">Append the
@@ -37685,7 +37685,7 @@
<dl class=switch>
<dt>A character token that is one of one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
- FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
+ FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
<dd>
<p><a href="#append" title="append a character">Append the
@@ -38131,7 +38131,7 @@
<dl class=switch>
<dt>A character token that is one of one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
- FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
+ FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
<dd>
<p>Process the token as it would be processed if the <a
@@ -38178,7 +38178,7 @@
<dl class=switch>
<dt>A character token that is one of one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
- FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
+ FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
<dd>
<p><a href="#append" title="append a character">Append the
@@ -38247,7 +38247,7 @@
<dl class=switch>
<dt>A character token that is one of one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
- FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
+ FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
<dd>
<p><a href="#append" title="append a character">Append the
@@ -38307,7 +38307,7 @@
<dt>A character token that is one of one of U+0009 CHARACTER TABULATION,
U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF),
- U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
+ <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
<dd>
<p>Process the token as it would be processed in <a href="#the-main0">the
@@ -38315,7 +38315,7 @@
<dt>A character token that is <em>not</em> one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM
- FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE
+ FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
<dt>A start tag token
Modified: source
===================================================================
--- source 2007-06-21 08:52:55 UTC (rev 941)
+++ source 2007-06-21 23:57:48 UTC (rev 942)
@@ -32290,18 +32290,19 @@
<p>If one or more characters match the range, then take them all
and interpret the string of characters as a number (either
- hexadecimal or decimal as appropriate).
+ hexadecimal or decimal as appropriate).</p>
- <p>If that number is in the range 128 to 159 (0x80 to 0x9F), then
- this is a <span>parse error</span>. In the following table, find
- the row with that number in the first column, and return a
- character token for the Unicode character given in the second
- column of that row.</p>
+ <p>If that number is one of the numbers in the first column of the
+ following table, then this is a <span>parse error</span>. Find the
+ row with that number in the first column, and return a character
+ token for the Unicode character given in the second column of that
+ row.</p>
<table>
<thead>
<tr><th>Number <th colspan=2>Unicode character
<tbody>
+ <tr><td>0x0D <td>U+000A <td>LINE FEED (LF)
<tr><td>0x80 <td>U+20AC <td>EURO SIGN ('€')
<tr><td>0x81 <td>U+FFFD <td>REPLACEMENT CHARACTER
<tr><td>0x82 <td>U+201A <td>SINGLE LOW-9 QUOTATION MARK ('‚')
@@ -32382,12 +32383,8 @@
</dl>
- <p class="big-issue">It seems browsers convert CRs to LFs even as
- entities. Should we also do that? If so, we should remove the CRs in
- the tree construction phase.</p> <!-- IE is not such a browser; you
- can prove that by comparing id="
X" to id="
X" in terms of
- getElementById('\nX') vs '\rX'. -->
+
<h4><dfn>Tree construction</dfn></h4>
<p>The input to the tree construction stage is a sequence of tokens
@@ -32442,7 +32439,7 @@
<dt>A character token that <em>is</em> one of one of U+0009
CHARACTER TABULATION, U+000A LINE FEED (LF), U+000B LINE
- TABULATION, U+000C FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or
+ TABULATION, U+000C FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or
U+0020 SPACE</dt>
<dd>
<p><span title="append a character">Append that character</span>
@@ -32589,7 +32586,7 @@
<dt>An end tag token</dt>
<dt>A character token that is not one of one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
- FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE</dt>
+ FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE</dt>
<dt>An end-of-file token</dt>
<dd>
@@ -32627,7 +32624,7 @@
<dt>A character token that is one of one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
- FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE</dt>
+ FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE</dt>
<dd>
<p><span title="append a character">Append that character</span>
to the <code>Document</code> node.</p>
@@ -32635,7 +32632,7 @@
<dt>A character token that is <em>not</em> one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
- FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE</dt>
+ FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE</dt>
<dt>A start tag token</dt>
<dt>An end tag token</dt>
<dt>An end-of-file token</dt>
@@ -33241,7 +33238,7 @@
<dt>A character token that is one of one of U+0009
CHARACTER TABULATION, U+000A LINE FEED (LF), U+000B LINE
- TABULATION, U+000C FORM FEED (FF), U+000D CARRIAGE RETURN (CR),
+ TABULATION, U+000C FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),-->
or U+0020 SPACE</dt>
<dd>
<p><span title="append a character">Append the character</span> to
@@ -33307,7 +33304,7 @@
<dt>A character token that is <em>not</em> one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
- FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE</dt>
+ FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE</dt>
<dt>Any other start tag token</dt>
<dd>
@@ -33336,7 +33333,7 @@
<dt>A character token that is one of one of U+0009
CHARACTER TABULATION, U+000A LINE FEED (LF), U+000B LINE
- TABULATION, U+000C FORM FEED (FF), U+000D CARRIAGE RETURN (CR),
+ TABULATION, U+000C FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),-->
or U+0020 SPACE</dt>
<dd>
<p><span title="append a character">Append the character</span> to
@@ -33570,7 +33567,7 @@
<dt>A character token that is one of one of U+0009
CHARACTER TABULATION, U+000A LINE FEED (LF), U+000B LINE
- TABULATION, U+000C FORM FEED (FF), U+000D CARRIAGE RETURN (CR),
+ TABULATION, U+000C FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),-->
or U+0020 SPACE</dt>
<dt>A comment token</dt>
<dt>A start tag whose tag name is one of: "head", "link", "style"</dt>
@@ -33602,7 +33599,7 @@
<dt>A character token that is one of one of U+0009
CHARACTER TABULATION, U+000A LINE FEED (LF), U+000B LINE
- TABULATION, U+000C FORM FEED (FF), U+000D CARRIAGE RETURN (CR),
+ TABULATION, U+000C FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),-->
or U+0020 SPACE</dt>
<dd>
<p><span title="append a character">Append the character</span> to
@@ -34694,7 +34691,7 @@
<dt>A character token that is one of one of U+0009
CHARACTER TABULATION, U+000A LINE FEED (LF), U+000B LINE
- TABULATION, U+000C FORM FEED (FF), U+000D CARRIAGE RETURN (CR),
+ TABULATION, U+000C FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),-->
or U+0020 SPACE</dt>
<dd>
<p><span title="append a character">Append the character</span> to
@@ -34933,7 +34930,7 @@
<dt>A character token that is one of one of U+0009
CHARACTER TABULATION, U+000A LINE FEED (LF), U+000B LINE
- TABULATION, U+000C FORM FEED (FF), U+000D CARRIAGE RETURN (CR),
+ TABULATION, U+000C FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),-->
or U+0020 SPACE</dt>
<dd>
<p><span title="append a character">Append the character</span> to
@@ -35430,7 +35427,7 @@
<dt>A character token that is one of one of U+0009
CHARACTER TABULATION, U+000A LINE FEED (LF), U+000B LINE
- TABULATION, U+000C FORM FEED (FF), U+000D CARRIAGE RETURN (CR),
+ TABULATION, U+000C FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),-->
or U+0020 SPACE</dt>
<dd>
<p>Process the token as it would be processed if the
@@ -35483,7 +35480,7 @@
<dt>A character token that is one of one of U+0009
CHARACTER TABULATION, U+000A LINE FEED (LF), U+000B LINE
- TABULATION, U+000C FORM FEED (FF), U+000D CARRIAGE RETURN (CR),
+ TABULATION, U+000C FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),-->
or U+0020 SPACE</dt>
<dd>
<p><span title="append a character">Append the character</span> to
@@ -35556,7 +35553,7 @@
<dt>A character token that is one of one of U+0009
CHARACTER TABULATION, U+000A LINE FEED (LF), U+000B LINE
- TABULATION, U+000C FORM FEED (FF), U+000D CARRIAGE RETURN (CR),
+ TABULATION, U+000C FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),-->
or U+0020 SPACE</dt>
<dd>
<p><span title="append a character">Append the character</span> to
@@ -35624,7 +35621,7 @@
<dt>A character token that is one of one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
- FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE</dt>
+ FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE</dt>
<dd>
<p>Process the token as it would be processed in <span>the main
phase</span>.</p>
@@ -35632,7 +35629,7 @@
<dt>A character token that is <em>not</em> one of U+0009 CHARACTER
TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
- FORM FEED (FF), U+000D CARRIAGE RETURN (CR), or U+0020 SPACE</dt>
+ FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE</dt>
<dt>A start tag token</dt>
<dt>An end tag token</dt>
<dd>
More information about the Commit-Watchers
mailing list