[html5] r1738 - /
whatwg at whatwg.org
whatwg at whatwg.org
Tue Jun 10 14:31:34 PDT 2008
Author: ianh
Date: 2008-06-10 14:31:27 -0700 (Tue, 10 Jun 2008)
New Revision: 1738
Modified:
index
source
Log:
[eacgiowt] (2) Remove U+000B LINE TABULATION (VT) from the list of characters treated like a space character in HTML, for consistency with CSS. This is incompatible with IE7, but apparently IE has all kinds of complications here and in any case IE8 doesn't treat U+000B as a space character.
Modified: index
===================================================================
--- index 2008-06-10 21:19:58 UTC (rev 1737)
+++ index 2008-06-10 21:31:27 UTC (rev 1738)
@@ -2362,13 +2362,7 @@
<dd>
<p>Implementations must support some version of the Media Queries
- language. However, when applying the rules of the Media Queries
- specification to media queries found in content attributes of <a
- href="#html-elements">HTML elements</a>, user agents must act as if all
- U+000B LINE TABULATION characters in the attribute were in fact U+0020
- SPACE characters. This is required to provide a consistent processing of
- <a href="#space" title="space character">space characters</a> in HTML.
- <a href="#refsMQ">[MQ]</a></p>
+ language. <a href="#refsMQ">[MQ]</a></p>
</dl>
<p>This specification does not require support of any particular network
@@ -4479,8 +4473,8 @@
<p>The <dfn id=space title="space character">space characters</dfn>, for
the purposes of this specification, are U+0020 SPACE, U+0009 CHARACTER
- TABULATION (tab), U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
- FORM FEED (FF), and U+000D CARRIAGE RETURN (CR).
+ TABULATION (tab), U+000A LINE FEED (LF), U+000C FORM FEED (FF), and U+000D
+ CARRIAGE RETURN (CR).
<p>Some of the micro-parsers described below follow the pattern of having
an <var title="">input</var> variable that holds the string being parsed,
@@ -8348,11 +8342,11 @@
<p>An <dfn id=ascii-compatible>ASCII-compatible character encoding</dfn> is
one that is a superset of US-ASCII (specifically, ANSI_X3.4-1968) for
- bytes in the range 0x09 - 0x0D, 0x20, 0x21, 0x22, 0x26, 0x27, 0x2C - 0x3F,
- 0x41 - 0x5A, and 0x61 - 0x7A<!-- is that list ok? do any character
- sets we want to support do things outside that range? -->.
- <!-- XXX
- #refs RFC1345 ? -->
+ bytes in the set 0x09, 0x0A, 0x0C, 0x0D, 0x20 - 0x22, 0x26, 0x27, 0x2C -
+ 0x3F, 0x41 - 0x5A, and 0x61 - 0x7A<!-- is that list ok? do any
+ character sets we want to support do things outside that range?
+ -->.
+ <!-- XXX #refs RFC1345 ? -->
<p>Authors should not use JIS_X0212-1990, x-JIS0208, and encodings based on
EBCDIC. Authors should not use UTF-32. Authors must not use the CESU-8,
@@ -33399,14 +33393,16 @@
<p>Otherwise, if any of the first <var title="">n</var> bytes of the
resource are in one of the following byte ranges:</p>
<!-- This byte list is based on RFC 2046 Section 4.1.2. Characters
- in the range 0x00-0X1F, with the exception of 0x09 - 0x0D (ASCII
- for TAB, LF, VT, FF, and CR), and character 0x1B (reportedly used
- by some encodings as a shift escape), are invalid. Thus, if we see
- them, we assume it's not text. -->
+ in the range 0x00-0x1F, with the exception of 0x09, 0x0A, 0x0C,
+ 0x0D (ASCII for TAB, LF, FF, and CR), and character 0x1B
+ (reportedly used by some encodings as a shift escape), are
+ invalid. Thus, if we see them, we assume it's not text. -->
<ul class=brief>
<li> 0x00 - 0x08
+ <li> 0x0B
+
<li> 0x0E - 0x1A
<li> 0x1C - 0x1F
@@ -33506,10 +33502,10 @@
whitespace to be skipped when sniffing for a type signature.</p>
<p>If the <var title="">index<sub>stream</sub></var>th byte of the
- stream is one of 0x09 (ASCII TAB), 0x0A (ASCII LF), 0x0B (ASCII
- VT), 0x0C (ASCII FF), 0x0D (ASCII CR), or 0x20 (ASCII space), then
- increment only the <var title="">index<sub>stream</sub></var> to
- the next byte in the byte stream.</p>
+ stream is one of 0x09 (ASCII TAB), 0x0A (ASCII LF), 0x0C (ASCII
+ FF), 0x0D (ASCII CR), or 0x20 (ASCII space), then increment only
+ the <var title="">index<sub>stream</sub></var> to the next byte in
+ the byte stream.</p>
<p>Otherwise, increment only the <var
title="">index<sub>pattern</sub></var> to the next byte in the
@@ -33972,16 +33968,16 @@
such match is found, return nothing.</p>
<li>
- <p>Skip any U+0009, U+000A, U+000B, U+000C, U+000D, or U+0020 characters
- that immediately follow the word 'charset' (there might not be any).
+ <p>Skip any U+0009, U+000A, U+000C, U+000D, or U+0020 characters that
+ immediately follow the word 'charset' (there might not be any).
<li>
<p>If the next character is not a U+003D EQUALS SIGN ('='), return
nothing.
<li>
- <p>Skip any U+0009, U+000A, U+000B, U+000C, U+000D, or U+0020 characters
- that immediately follow the equals sign (there might not be any).
+ <p>Skip any U+0009, U+000A, U+000C, U+000D, or U+0020 characters that
+ immediately follow the equals sign (there might not be any).
<li>
<p>Process the next character as follows:</p>
@@ -34010,7 +34006,7 @@
<dd>
<p>Return the string from this character to the first U+0009, U+000A,
- U+000B, U+000C, U+000D, U+0020, or U+003B character or the end of <var
+ U+000C, U+000D, U+0020, or U+003B character or the end of <var
title="">s</var>, whichever comes first.
</dl>
</ol>
@@ -41956,10 +41952,9 @@
of the string "<code title=""></</code>" (U+003C LESS-THAN SIGN, U+002F
SOLIDUS) followed by characters that case-insensitively<!--ASCII--> match
the tag name of the element followed by one of U+0009 CHARACTER
- TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM
- FEED (FF), U+0020 SPACE, U+003E GREATER-THAN SIGN (>), or U+002F SOLIDUS
- (/), unless that string is part of an <a href="#escaping"
- title=syntax-escape>escaping text span</a>.
+ TABULATION, U+000A LINE FEED (LF), U+000C FORM FEED (FF), U+0020 SPACE,
+ U+003E GREATER-THAN SIGN (>), or U+002F SOLIDUS (/), unless that string is
+ part of an <a href="#escaping" title=syntax-escape>escaping text span</a>.
<p>An <dfn id=escaping title=syntax-escape>escaping text span</dfn> is a
span of <a href="#text2" title=syntax-text>text</a> that starts with an <a
@@ -42329,15 +42324,15 @@
<dt>A sequence of bytes starting with: 0x3C, 0x4D or 0x6D, 0x45 or
0x65, 0x54 or 0x74, 0x41 or 0x61, and finally one of 0x09, 0x0A,
- 0x0B, 0x0C, 0x0D, 0x20, 0x2F (case-insensitive ASCII '<meta'
- followed by a space or slash)
+ 0x0C, 0x0D, 0x20, 0x2F (case-insensitive ASCII '<meta' followed by
+ a space or slash)
<dd>
<ol>
<li>
<p>Advance the <var title="">position</var> pointer so that it
- points at the next 0x09, 0x0A, 0x0B, 0x0C, 0x0D, 0x20, or 0x2F
- byte (the one in sequence of characters matched above).
+ points at the next 0x09, 0x0A, 0x0C, 0x0D, 0x20, or 0x2F byte (the
+ one in sequence of characters matched above).
<li>
<p><a href="#get-an" title=concept-get-attributes-when-sniffing>Get
@@ -42387,9 +42382,9 @@
<ol>
<li>
<p>Advance the <var title="">position</var> pointer so that it
- points at the next 0x09 (ASCII TAB), 0x0A (ASCII LF), 0x0B (ASCII
- VT), 0x0C (ASCII FF), 0x0D (ASCII CR), 0x20 (ASCII space), or 0x3E
- (ASCII '>') byte.
+ points at the next 0x09 (ASCII TAB), 0x0A (ASCII LF), 0x0C (ASCII
+ FF), 0x0D (ASCII CR), 0x20 (ASCII space), or 0x3E (ASCII '>')
+ byte.
<li>
<p>Repeatedly <a href="#get-an"
@@ -42427,9 +42422,9 @@
<ol>
<li>
<p>If the byte at <var title="">position</var> is one of 0x09 (ASCII
- TAB), 0x0A (ASCII LF), 0x0B (ASCII VT), 0x0C (ASCII FF), 0x0D (ASCII
- CR), 0x20 (ASCII space), or 0x2F (ASCII '/') then advance <var
- title="">position</var> to the next byte and redo this substep.
+ TAB), 0x0A (ASCII LF), 0x0C (ASCII FF), 0x0D (ASCII CR), 0x20 (ASCII
+ space), or 0x2F (ASCII '/') then advance <var title="">position</var>
+ to the next byte and redo this substep.
<li>
<p>If the byte at <var title="">position</var> is 0x3E (ASCII '>'),
@@ -42451,8 +42446,8 @@
<dd>Advance <var title="">position</var> to the next byte and jump to
the step below labeled <em>value</em>.
- <dt>If it is 0x09 (ASCII TAB), 0x0A (ASCII LF), 0x0B (ASCII VT), 0x0C
- (ASCII FF), 0x0D (ASCII CR), or 0x20 (ASCII space)
+ <dt>If it is 0x09 (ASCII TAB), 0x0A (ASCII LF), 0x0C (ASCII FF), 0x0D
+ (ASCII CR), or 0x20 (ASCII space)
<dd>Jump to the step below labeled <em>spaces</em>.
@@ -42484,9 +42479,9 @@
<li>
<p><em>Spaces.</em> If the byte at <var title="">position</var> is one
- of 0x09 (ASCII TAB), 0x0A (ASCII LF), 0x0B (ASCII VT), 0x0C (ASCII
- FF), 0x0D (ASCII CR), or 0x20 (ASCII space) then advance <var
- title="">position</var> to the next byte, then, repeat this step.
+ of 0x09 (ASCII TAB), 0x0A (ASCII LF), 0x0C (ASCII FF), 0x0D (ASCII
+ CR), or 0x20 (ASCII space) then advance <var title="">position</var>
+ to the next byte, then, repeat this step.
<li>
<p>If the byte at <var title="">position</var> is <em>not</em> 0x3D
@@ -42499,9 +42494,9 @@
<li>
<p><em>Value.</em> If the byte at <var title="">position</var> is one
- of 0x09 (ASCII TAB), 0x0A (ASCII LF), 0x0B (ASCII VT), 0x0C (ASCII
- FF), 0x0D (ASCII CR), or 0x20 (ASCII space) then advance <var
- title="">position</var> to the next byte, then, repeat this step.
+ of 0x09 (ASCII TAB), 0x0A (ASCII LF), 0x0C (ASCII FF), 0x0D (ASCII
+ CR), or 0x20 (ASCII space) then advance <var title="">position</var>
+ to the next byte, then, repeat this step.
<li>
<p>Process the byte at <var title="">position</var> as follows:</p>
@@ -42562,8 +42557,8 @@
<p>Process the byte at <var title="">position</var> as follows:</p>
<dl class=switch>
- <dt>If it is 0x09 (ASCII TAB), 0x0A (ASCII LF), 0x0B (ASCII VT), 0x0C
- (ASCII FF), 0x0D (ASCII CR), 0x20 (ASCII space), or 0x3E (ASCII '>')
+ <dt>If it is 0x09 (ASCII TAB), 0x0A (ASCII LF), 0x0C (ASCII FF), 0x0D
+ (ASCII CR), 0x20 (ASCII space), or 0x3E (ASCII '>')
<dd>Abort the "get an attribute" algorithm. The attribute's name is
the value of <var title="">attribute name</var> and its value is the
@@ -43595,8 +43590,6 @@
<li>U+000A LINE FEED (LF)
- <li>U+000B LINE TABULATION
-
<li>U+000C FORM FEED (FF)</li>
<!--<li>U+000D CARRIAGE RETURN (CR)</li>-->
@@ -43660,8 +43653,6 @@
<dt>U+000A LINE FEED (LF)
- <dt>U+000B LINE TABULATION
-
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
@@ -43708,8 +43699,6 @@
<dt>U+000A LINE FEED (LF)
- <dt>U+000B LINE TABULATION
-
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
@@ -43767,8 +43756,6 @@
<dt>U+000A LINE FEED (LF)
- <dt>U+000B LINE TABULATION
-
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
@@ -43833,8 +43820,6 @@
<dt>U+000A LINE FEED (LF)
- <dt>U+000B LINE TABULATION
-
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
@@ -43887,8 +43872,6 @@
<dt>U+000A LINE FEED (LF)
- <dt>U+000B LINE TABULATION
-
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
@@ -44001,8 +43984,6 @@
<dt>U+000A LINE FEED (LF)
- <dt>U+000B LINE TABULATION
-
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
@@ -44065,8 +44046,6 @@
<dt>U+000A LINE FEED (LF)
- <dt>U+000B LINE TABULATION
-
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
@@ -44321,8 +44300,6 @@
<dt>U+000A LINE FEED (LF)
- <dt>U+000B LINE TABULATION
-
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
@@ -44346,8 +44323,6 @@
<dt>U+000A LINE FEED (LF)
- <dt>U+000B LINE TABULATION
-
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
@@ -44384,8 +44359,6 @@
<dt>U+000A LINE FEED (LF)
- <dt>U+000B LINE TABULATION
-
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
@@ -44421,8 +44394,6 @@
<dt>U+000A LINE FEED (LF)
- <dt>U+000B LINE TABULATION
-
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
@@ -44470,8 +44441,6 @@
<dt>U+000A LINE FEED (LF)
- <dt>U+000B LINE TABULATION
-
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
@@ -44584,8 +44553,6 @@
<dt>U+000A LINE FEED (LF)
- <dt>U+000B LINE TABULATION
-
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
@@ -44635,8 +44602,6 @@
<dt>U+000A LINE FEED (LF)
- <dt>U+000B LINE TABULATION
-
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
@@ -44749,8 +44714,6 @@
<dt>U+000A LINE FEED (LF)
- <dt>U+000B LINE TABULATION
-
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
@@ -44836,8 +44799,6 @@
<dt>U+000A LINE FEED (LF)
- <dt>U+000B LINE TABULATION
-
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
@@ -45615,7 +45576,7 @@
<dl class=switch>
<dt>A character token that is one of one of U+0009 CHARACTER TABULATION,
- U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF),
+ U+000A LINE FEED (LF), U+000C FORM FEED (FF),
<!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
<dd>
@@ -45933,7 +45894,7 @@
the comment token.</p>
<dt>A character token that is one of one of U+0009 CHARACTER TABULATION,
- U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF),
+ U+000A LINE FEED (LF), U+000C FORM FEED (FF),
<!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
<dd>
@@ -46000,7 +45961,7 @@
<dl class=switch>
<dt>A character token that is one of one of U+0009 CHARACTER TABULATION,
- U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF),
+ U+000A LINE FEED (LF), U+000C FORM FEED (FF),
<!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
<dd>
@@ -46070,7 +46031,7 @@
<dl class=switch>
<dt>A character token that is one of one of U+0009 CHARACTER TABULATION,
- U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF),
+ U+000A LINE FEED (LF), U+000C FORM FEED (FF),
<!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
<dd>
@@ -46347,7 +46308,7 @@
title="insertion mode: in head">in head</a>".</p>
<dt>A character token that is one of one of U+0009 CHARACTER TABULATION,
- U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF),
+ U+000A LINE FEED (LF), U+000C FORM FEED (FF),
<!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
<dt>A comment token
@@ -46388,7 +46349,7 @@
<dl class=switch>
<dt>A character token that is one of one of U+0009 CHARACTER TABULATION,
- U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF),
+ U+000A LINE FEED (LF), U+000C FORM FEED (FF),
<!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
<dd>
@@ -47623,7 +47584,7 @@
<dl class=switch>
<dt>A character token that is one of one of U+0009 CHARACTER TABULATION,
- U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF),
+ U+000A LINE FEED (LF), U+000C FORM FEED (FF),
<!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
<dd>
@@ -47871,7 +47832,7 @@
<dl class=switch>
<dt>A character token that is one of one of U+0009 CHARACTER TABULATION,
- U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF),
+ U+000A LINE FEED (LF), U+000C FORM FEED (FF),
<!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
<dd>
@@ -48568,7 +48529,7 @@
<dl class=switch>
<dt>A character token that is one of one of U+0009 CHARACTER TABULATION,
- U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF),
+ U+000A LINE FEED (LF), U+000C FORM FEED (FF),
<!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
<dd>
@@ -48632,7 +48593,7 @@
<dl class=switch>
<dt>A character token that is one of one of U+0009 CHARACTER TABULATION,
- U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF),
+ U+000A LINE FEED (LF), U+000C FORM FEED (FF),
<!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
<dd>
@@ -48728,7 +48689,7 @@
<dl class=switch>
<dt>A character token that is one of one of U+0009 CHARACTER TABULATION,
- U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF),
+ U+000A LINE FEED (LF), U+000C FORM FEED (FF),
<!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
<dd>
@@ -48801,7 +48762,7 @@
<dt>A DOCTYPE token
<dt>A character token that is one of one of U+0009 CHARACTER TABULATION,
- U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF),
+ U+000A LINE FEED (LF), U+000C FORM FEED (FF),
<!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
<dt>A start tag whose tag name is "html"
@@ -48843,7 +48804,7 @@
<dt>A DOCTYPE token
<dt>A character token that is one of one of U+0009 CHARACTER TABULATION,
- U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF),
+ U+000A LINE FEED (LF), U+000C FORM FEED (FF),
<!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE
<dt>A start tag whose tag name is "html"
Modified: source
===================================================================
--- source 2008-06-10 21:19:58 UTC (rev 1737)
+++ source 2008-06-10 21:31:27 UTC (rev 1738)
@@ -613,13 +613,7 @@
<dd>
<p>Implementations must support some version of the Media Queries
- language. However, when applying the rules of the Media Queries
- specification to media queries found in content attributes of
- <span>HTML elements</span>, user agents must act as if all U+000B
- LINE TABULATION characters in the attribute were in fact U+0020
- SPACE characters. This is required to provide a consistent
- processing of <span title="space character">space
- characters</span> in HTML. <a href="#refsMQ">[MQ]</a></p>
+ language. <a href="#refsMQ">[MQ]</a></p>
</dd>
@@ -2890,8 +2884,8 @@
<p>The <dfn title="space character">space characters</dfn>, for the
purposes of this specification, are U+0020 SPACE, U+0009 CHARACTER
- TABULATION (tab), U+000A LINE FEED (LF), U+000B LINE TABULATION,
- U+000C FORM FEED (FF), and U+000D CARRIAGE RETURN (CR).</p>
+ TABULATION (tab), U+000A LINE FEED (LF), U+000C FORM FEED (FF), and
+ U+000D CARRIAGE RETURN (CR).</p>
<p>Some of the micro-parsers described below follow the pattern of
having an <var title="">input</var> variable that holds the string
@@ -6700,10 +6694,10 @@
<p>An <dfn>ASCII-compatible character encoding</dfn> is one that is
a superset of US-ASCII (specifically, ANSI_X3.4-1968) for bytes in
- the range 0x09 - 0x0D, 0x20, 0x21, 0x22, 0x26, 0x27, 0x2C - 0x3F,
- 0x41 - 0x5A, and 0x61 - 0x7A<!-- is that list ok? do any character
- sets we want to support do things outside that range? -->. <!-- XXX
- #refs RFC1345 ? --></p>
+ the set 0x09, 0x0A, 0x0C, 0x0D, 0x20 - 0x22, 0x26, 0x27, 0x2C -
+ 0x3F, 0x41 - 0x5A, and 0x61 - 0x7A<!-- is that list ok? do any
+ character sets we want to support do things outside that range?
+ -->. <!-- XXX #refs RFC1345 ? --></p>
<p>Authors should not use JIS_X0212-1990, x-JIS0208, and encodings
based on EBCDIC. Authors should not use UTF-32. Authors must not use
@@ -31212,13 +31206,14 @@
of the resource are in one of the following byte ranges:</p>
<!-- This byte list is based on RFC 2046 Section 4.1.2. Characters
- in the range 0x00-0X1F, with the exception of 0x09 - 0x0D (ASCII
- for TAB, LF, VT, FF, and CR), and character 0x1B (reportedly used
- by some encodings as a shift escape), are invalid. Thus, if we see
- them, we assume it's not text. -->
+ in the range 0x00-0x1F, with the exception of 0x09, 0x0A, 0x0C,
+ 0x0D (ASCII for TAB, LF, FF, and CR), and character 0x1B
+ (reportedly used by some encodings as a shift escape), are
+ invalid. Thus, if we see them, we assume it's not text. -->
<ul class="brief">
<li> 0x00 - 0x08 </li>
+ <li> 0x0B </li>
<li> 0x0E - 0x1A </li>
<li> 0x1C - 0x1F </li>
</ul>
@@ -31337,8 +31332,8 @@
<p>If the <var title="">index<sub>stream</sub></var>th byte
of the stream is one of 0x09 (ASCII TAB), 0x0A (ASCII LF),
- 0x0B (ASCII VT), 0x0C (ASCII FF), 0x0D (ASCII CR), or 0x20
- (ASCII space), then increment only the <var
+ 0x0C (ASCII FF), 0x0D (ASCII CR), or 0x20 (ASCII space),
+ then increment only the <var
title="">index<sub>stream</sub></var> to the next byte in
the byte stream.</p>
@@ -31730,14 +31725,14 @@
that are a case-insensitive<!-- XXX ASCII--> match for the word
'charset'. If no such match is found, return nothing.</p>
- <li><p>Skip any U+0009, U+000A, U+000B, U+000C, U+000D, or U+0020
+ <li><p>Skip any U+0009, U+000A, U+000C, U+000D, or U+0020
characters that immediately follow the word 'charset' (there might
not be any).</p></li>
<li><p>If the next character is not a U+003D EQUALS SIGN ('='),
return nothing.</p></li>
- <li><p>Skip any U+0009, U+000A, U+000B, U+000C, U+000D, or U+0020
+ <li><p>Skip any U+0009, U+000A, U+000C, U+000D, or U+0020
characters that immediately follow the equals sign (there might not
be any).</p></li>
@@ -31765,8 +31760,8 @@
<dt>Otherwise</dt>
<dd><p>Return the string from this character to the first U+0009,
- U+000A, U+000B, U+000C, U+000D, U+0020, or U+003B character or
- the end of <var title="">s</var>, whichever comes first.</dd>
+ U+000A, U+000C, U+000D, U+0020, or U+003B character or the end of
+ <var title="">s</var>, whichever comes first.</dd>
</dl>
@@ -39566,10 +39561,9 @@
LESS-THAN SIGN, U+002F SOLIDUS) followed by characters that
case-insensitively<!--ASCII--> match the tag name of the element
followed by one of U+0009 CHARACTER TABULATION, U+000A LINE FEED
- (LF), U+000B LINE TABULATION, U+000C FORM FEED (FF), U+0020 SPACE,
- U+003E GREATER-THAN SIGN (>), or U+002F SOLIDUS (/), unless that
- string is part of an <span title="syntax-escape">escaping text
- span</span>.</p>
+ (LF), U+000C FORM FEED (FF), U+0020 SPACE, U+003E GREATER-THAN SIGN
+ (>), or U+002F SOLIDUS (/), unless that string is part of an <span
+ title="syntax-escape">escaping text span</span>.</p>
<p>An <dfn title="syntax-escape">escaping text span</dfn> is a span
of <span title="syntax-text">text</span> that starts with an <span
@@ -39962,14 +39956,14 @@
</dd>
- <dt>A sequence of bytes starting with: 0x3C, 0x4D or 0x6D, 0x45 or 0x65, 0x54 or 0x74, 0x41 or 0x61, and finally one of 0x09, 0x0A, 0x0B, 0x0C, 0x0D, 0x20, 0x2F (case-insensitive ASCII '<meta' followed by a space or slash)</dt>
+ <dt>A sequence of bytes starting with: 0x3C, 0x4D or 0x6D, 0x45 or 0x65, 0x54 or 0x74, 0x41 or 0x61, and finally one of 0x09, 0x0A, 0x0C, 0x0D, 0x20, 0x2F (case-insensitive ASCII '<meta' followed by a space or slash)</dt>
<dd>
<ol>
<li><p>Advance the <var title="">position</var> pointer so
- that it points at the next 0x09, 0x0A, 0x0B, 0x0C, 0x0D,
- 0x20, or 0x2F byte (the one in sequence of characters matched
+ that it points at the next 0x09, 0x0A, 0x0C, 0x0D, 0x20, or
+ 0x2F byte (the one in sequence of characters matched
above).</p></li>
<li><p><span title="concept-get-attributes-when-sniffing">Get
@@ -40016,8 +40010,8 @@
<li><p>Advance the <var title="">position</var> pointer so
that it points at the next 0x09 (ASCII TAB), 0x0A (ASCII LF),
- 0x0B (ASCII VT), 0x0C (ASCII FF), 0x0D (ASCII CR), 0x20
- (ASCII space), or 0x3E (ASCII '>') byte.</p></li>
+ 0x0C (ASCII FF), 0x0D (ASCII CR), 0x20 (ASCII space), or 0x3E
+ (ASCII '>') byte.</p></li>
<li><p>Repeatedly <span
title="concept-get-attributes-when-sniffing">get an
@@ -40064,10 +40058,10 @@
<ol>
<li><p>If the byte at <var title="">position</var> is one of 0x09
- (ASCII TAB), 0x0A (ASCII LF), 0x0B (ASCII VT), 0x0C (ASCII FF),
- 0x0D (ASCII CR), 0x20 (ASCII space), or 0x2F (ASCII '/') then
- advance <var title="">position</var> to the next byte and redo
- this substep.</p></li>
+ (ASCII TAB), 0x0A (ASCII LF), 0x0C (ASCII FF), 0x0D (ASCII CR),
+ 0x20 (ASCII space), or 0x2F (ASCII '/') then advance <var
+ title="">position</var> to the next byte and redo this
+ substep.</p></li>
<li><p>If the byte at <var title="">position</var> is 0x3E (ASCII
'>'), then abort the "get an attribute" algorithm. There isn't
@@ -40089,9 +40083,8 @@
<dd>Advance <var title="">position</var> to the next byte and
jump to the step below labeled <em>value</em>.</dd>
- <dt>If it is 0x09 (ASCII TAB), 0x0A (ASCII LF), 0x0B (ASCII
- VT), 0x0C (ASCII FF), 0x0D (ASCII CR), or 0x20 (ASCII
- space)</dt>
+ <dt>If it is 0x09 (ASCII TAB), 0x0A (ASCII LF), 0x0C (ASCII
+ FF), 0x0D (ASCII CR), or 0x20 (ASCII space)</dt>
<dd>Jump to the step below labeled <em>spaces</em>.</dd>
@@ -40127,9 +40120,9 @@
<li><p><em>Spaces.</em> If the byte at <var
title="">position</var> is one of 0x09 (ASCII TAB), 0x0A (ASCII
- LF), 0x0B (ASCII VT), 0x0C (ASCII FF), 0x0D (ASCII CR), or 0x20
- (ASCII space) then advance <var title="">position</var> to the
- next byte, then, repeat this step.</p></li>
+ LF), 0x0C (ASCII FF), 0x0D (ASCII CR), or 0x20 (ASCII space) then
+ advance <var title="">position</var> to the next byte, then,
+ repeat this step.</p></li>
<li><p>If the byte at <var title="">position</var> is
<em>not</em> 0x3D (ASCII '='), abort the "get an attribute"
@@ -40142,9 +40135,9 @@
<li><p><em>Value.</em> If the byte at <var
title="">position</var> is one of 0x09 (ASCII TAB), 0x0A (ASCII
- LF), 0x0B (ASCII VT), 0x0C (ASCII FF), 0x0D (ASCII CR), or 0x20
- (ASCII space) then advance <var title="">position</var> to the
- next byte, then, repeat this step.</p></li>
+ LF), 0x0C (ASCII FF), 0x0D (ASCII CR), or 0x20 (ASCII space) then
+ advance <var title="">position</var> to the next byte, then,
+ repeat this step.</p></li>
<li><p>Process the byte at <var title="">position</var> as
follows:</p>
@@ -40219,9 +40212,9 @@
<dl class="switch">
- <dt>If it is 0x09 (ASCII TAB), 0x0A (ASCII LF), 0x0B (ASCII
- VT), 0x0C (ASCII FF), 0x0D (ASCII CR), 0x20 (ASCII space), or
- 0x3E (ASCII '>')</dt>
+ <dt>If it is 0x09 (ASCII TAB), 0x0A (ASCII LF), 0x0C (ASCII
+ FF), 0x0D (ASCII CR), 0x20 (ASCII space), or 0x3E (ASCII
+ '>')</dt>
<dd>Abort the "get an attribute" algorithm. The attribute's
name is the value of <var title="">attribute name</var> and its
@@ -41186,7 +41179,6 @@
<ul class="brief">
<li>U+0009 CHARACTER TABULATION</li>
<li>U+000A LINE FEED (LF)</li>
- <li>U+000B LINE TABULATION</li>
<li>U+000C FORM FEED (FF)</li>
<!--<li>U+000D CARRIAGE RETURN (CR)</li>-->
<li>U+0020 SPACE</li>
@@ -41245,7 +41237,6 @@
<dt>U+0009 CHARACTER TABULATION</dt>
<dt>U+000A LINE FEED (LF)</dt>
- <dt>U+000B LINE TABULATION</dt>
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
<dt>U+0020 SPACE</dt>
@@ -41286,7 +41277,6 @@
<dt>U+0009 CHARACTER TABULATION</dt>
<dt>U+000A LINE FEED (LF)</dt>
- <dt>U+000B LINE TABULATION</dt>
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
<dt>U+0020 SPACE</dt>
@@ -41337,7 +41327,6 @@
<dt>U+0009 CHARACTER TABULATION</dt>
<dt>U+000A LINE FEED (LF)</dt>
- <dt>U+000B LINE TABULATION</dt>
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
<dt>U+0020 SPACE</dt>
@@ -41395,7 +41384,6 @@
<dt>U+0009 CHARACTER TABULATION</dt>
<dt>U+000A LINE FEED (LF)</dt>
- <dt>U+000B LINE TABULATION</dt>
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
<dt>U+0020 SPACE</dt>
@@ -41443,7 +41431,6 @@
<dt>U+0009 CHARACTER TABULATION</dt>
<dt>U+000A LINE FEED (LF)</dt>
- <dt>U+000B LINE TABULATION</dt>
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
<dt>U+0020 SPACE</dt>
@@ -41553,7 +41540,6 @@
<dt>U+0009 CHARACTER TABULATION</dt>
<dt>U+000A LINE FEED (LF)</dt>
- <dt>U+000B LINE TABULATION</dt>
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
<dt>U+0020 SPACE</dt>
@@ -41615,7 +41601,6 @@
<dt>U+0009 CHARACTER TABULATION</dt>
<dt>U+000A LINE FEED (LF)</dt>
- <dt>U+000B LINE TABULATION</dt>
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
<dt>U+0020 SPACE</dt>
@@ -41880,7 +41865,6 @@
<dt>U+0009 CHARACTER TABULATION</dt>
<dt>U+000A LINE FEED (LF)</dt>
- <dt>U+000B LINE TABULATION</dt>
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
<dt>U+0020 SPACE</dt>
@@ -41904,7 +41888,6 @@
<dt>U+0009 CHARACTER TABULATION</dt>
<dt>U+000A LINE FEED (LF)</dt>
- <dt>U+000B LINE TABULATION</dt>
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
<dt>U+0020 SPACE</dt>
@@ -41939,7 +41922,6 @@
<dt>U+0009 CHARACTER TABULATION</dt>
<dt>U+000A LINE FEED (LF)</dt>
- <dt>U+000B LINE TABULATION</dt>
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
<dt>U+0020 SPACE</dt>
@@ -41972,7 +41954,6 @@
<dt>U+0009 CHARACTER TABULATION</dt>
<dt>U+000A LINE FEED (LF)</dt>
- <dt>U+000B LINE TABULATION</dt>
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
<dt>U+0020 SPACE</dt>
@@ -42020,7 +42001,6 @@
<dt>U+0009 CHARACTER TABULATION</dt>
<dt>U+000A LINE FEED (LF)</dt>
- <dt>U+000B LINE TABULATION</dt>
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
<dt>U+0020 SPACE</dt>
@@ -42125,7 +42105,6 @@
<dt>U+0009 CHARACTER TABULATION</dt>
<dt>U+000A LINE FEED (LF)</dt>
- <dt>U+000B LINE TABULATION</dt>
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
<dt>U+0020 SPACE</dt>
@@ -42169,7 +42148,6 @@
<dt>U+0009 CHARACTER TABULATION</dt>
<dt>U+000A LINE FEED (LF)</dt>
- <dt>U+000B LINE TABULATION</dt>
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
<dt>U+0020 SPACE</dt>
@@ -42274,7 +42252,6 @@
<dt>U+0009 CHARACTER TABULATION</dt>
<dt>U+000A LINE FEED (LF)</dt>
- <dt>U+000B LINE TABULATION</dt>
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
<dt>U+0020 SPACE</dt>
@@ -42362,7 +42339,6 @@
<dt>U+0009 CHARACTER TABULATION</dt>
<dt>U+000A LINE FEED (LF)</dt>
- <dt>U+000B LINE TABULATION</dt>
<dt>U+000C FORM FEED (FF)</dt>
<!--<dt>U+000D CARRIAGE RETURN (CR)</dt>-->
<dt>U+0020 SPACE</dt>
@@ -42852,9 +42828,8 @@
<dl class="switch">
<dt>A character token that is one of one of U+0009 CHARACTER
- TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
- FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020
- SPACE</dt>
+ TABULATION, U+000A LINE FEED (LF), U+000C FORM FEED (FF),
+ <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE</dt>
<dd>
<p>Ignore the token.</p>
</dd>
@@ -43036,9 +43011,8 @@
</dd>
<dt>A character token that is one of one of U+0009 CHARACTER
- TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
- FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020
- SPACE</dt>
+ TABULATION, U+000A LINE FEED (LF), U+000C FORM FEED (FF),
+ <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE</dt>
<dd>
<p>Ignore the token.</p>
</dd>
@@ -43107,10 +43081,9 @@
<dl class="switch">
- <dt>A character token that is one of one of U+0009
- CHARACTER TABULATION, U+000A LINE FEED (LF), U+000B LINE
- TABULATION, U+000C FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),-->
- or U+0020 SPACE</dt>
+ <dt>A character token that is one of one of U+0009 CHARACTER
+ TABULATION, U+000A LINE FEED (LF), U+000C FORM FEED (FF),
+ <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE</dt>
<dd>
<p>Ignore the token.</p> <!-- :-( -->
</dd>
@@ -43186,10 +43159,9 @@
<dl class="switch">
- <dt>A character token that is one of one of U+0009
- CHARACTER TABULATION, U+000A LINE FEED (LF), U+000B LINE
- TABULATION, U+000C FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),-->
- or U+0020 SPACE</dt>
+ <dt>A character token that is one of one of U+0009 CHARACTER
+ TABULATION, U+000A LINE FEED (LF), U+000C FORM FEED (FF),
+ <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE</dt>
<dd>
<p><span title="insert a character">Insert the character</span> into
the <span>current node</span>.</p>
@@ -43479,10 +43451,9 @@
</dd>
- <dt>A character token that is one of one of U+0009
- CHARACTER TABULATION, U+000A LINE FEED (LF), U+000B LINE
- TABULATION, U+000C FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),-->
- or U+0020 SPACE</dt>
+ <dt>A character token that is one of one of U+0009 CHARACTER
+ TABULATION, U+000A LINE FEED (LF), U+000C FORM FEED (FF),
+ <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE</dt>
<dt>A comment token</dt>
<dt>A start tag whose tag name is one of: "link", "meta", "noframes", "style"</dt>
<dd>
@@ -43523,10 +43494,9 @@
<dl class="switch">
- <dt>A character token that is one of one of U+0009
- CHARACTER TABULATION, U+000A LINE FEED (LF), U+000B LINE
- TABULATION, U+000C FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),-->
- or U+0020 SPACE</dt>
+ <dt>A character token that is one of one of U+0009 CHARACTER
+ TABULATION, U+000A LINE FEED (LF), U+000C FORM FEED (FF),
+ <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE</dt>
<dd>
<p><span title="insert a character">Insert the character</span> into
the <span>current node</span>.</p>
@@ -44830,10 +44800,9 @@
<dl class="switch">
- <dt>A character token that is one of one of U+0009
- CHARACTER TABULATION, U+000A LINE FEED (LF), U+000B LINE
- TABULATION, U+000C FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),-->
- or U+0020 SPACE</dt>
+ <dt>A character token that is one of one of U+0009 CHARACTER
+ TABULATION, U+000A LINE FEED (LF), U+000C FORM FEED (FF),
+ <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE</dt>
<dd>
<p>If the <span>current table</span> is <span>tainted</span>, then
@@ -45098,10 +45067,9 @@
<dl class="switch">
- <dt>A character token that is one of one of U+0009
- CHARACTER TABULATION, U+000A LINE FEED (LF), U+000B LINE
- TABULATION, U+000C FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),-->
- or U+0020 SPACE</dt>
+ <dt>A character token that is one of one of U+0009 CHARACTER
+ TABULATION, U+000A LINE FEED (LF), U+000C FORM FEED (FF),
+ <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE</dt>
<dd>
<p><span title="insert a character">Insert the character</span> into
the <span>current node</span>.</p>
@@ -45828,10 +45796,9 @@
<dl class="switch">
- <dt>A character token that is one of one of U+0009
- CHARACTER TABULATION, U+000A LINE FEED (LF), U+000B LINE
- TABULATION, U+000C FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),-->
- or U+0020 SPACE</dt>
+ <dt>A character token that is one of one of U+0009 CHARACTER
+ TABULATION, U+000A LINE FEED (LF), U+000C FORM FEED (FF),
+ <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE</dt>
<dd>
<p>Process the token <span>using the rules for</span> the "<span
title="insertion mode: in body">in body</span>" <span>insertion
@@ -45896,10 +45863,9 @@
<dl class="switch">
- <dt>A character token that is one of one of U+0009
- CHARACTER TABULATION, U+000A LINE FEED (LF), U+000B LINE
- TABULATION, U+000C FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),-->
- or U+0020 SPACE</dt>
+ <dt>A character token that is one of one of U+0009 CHARACTER
+ TABULATION, U+000A LINE FEED (LF), U+000C FORM FEED (FF),
+ <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE</dt>
<dd>
<p><span title="insert a character">Insert the character</span> into
the <span>current node</span>.</p>
@@ -45999,10 +45965,9 @@
<!-- due to rules in the "in frameset" mode, this can't be entered in the fragment case -->
<dl class="switch">
- <dt>A character token that is one of one of U+0009
- CHARACTER TABULATION, U+000A LINE FEED (LF), U+000B LINE
- TABULATION, U+000C FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),-->
- or U+0020 SPACE</dt>
+ <dt>A character token that is one of one of U+0009 CHARACTER
+ TABULATION, U+000A LINE FEED (LF), U+000C FORM FEED (FF),
+ <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE</dt>
<dd>
<p><span title="insert a character">Insert the character</span> into
the <span>current node</span>.</p>
@@ -46075,9 +46040,8 @@
<dt>A DOCTYPE token</dt>
<dt>A character token that is one of one of U+0009 CHARACTER
- TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
- FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020
- SPACE</dt>
+ TABULATION, U+000A LINE FEED (LF), U+000C FORM FEED (FF),
+ <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE</dt>
<dt>A start tag whose tag name is "html"</dt>
<dd>
<p>Process the token <span>using the rules for</span> the "<span
@@ -46116,9 +46080,8 @@
<dt>A DOCTYPE token</dt>
<dt>A character token that is one of one of U+0009 CHARACTER
- TABULATION, U+000A LINE FEED (LF), U+000B LINE TABULATION, U+000C
- FORM FEED (FF), <!--U+000D CARRIAGE RETURN (CR),--> or U+0020
- SPACE</dt>
+ TABULATION, U+000A LINE FEED (LF), U+000C FORM FEED (FF),
+ <!--U+000D CARRIAGE RETURN (CR),--> or U+0020 SPACE</dt>
<dt>A start tag whose tag name is "html"</dt>
<dd>
<p>Process the token <span>using the rules for</span> the "<span
More information about the Commit-Watchers
mailing list