[html5] r3332 - [] (0) Elaborate on the rules for ASCII-compatible encodings (see last checkin). [...]
whatwg at whatwg.org
whatwg at whatwg.org
Sun Jun 28 03:53:15 PDT 2009
Author: ianh
Date: 2009-06-28 03:53:13 -0700 (Sun, 28 Jun 2009)
New Revision: 3332
Modified:
index
source
Log:
[] (0) Elaborate on the rules for ASCII-compatible encodings (see last checkin). (credit: pt)
Modified: index
===================================================================
--- index 2009-06-28 10:10:34 UTC (rev 3331)
+++ index 2009-06-28 10:53:13 UTC (rev 3332)
@@ -1705,13 +1705,13 @@
the set 0x09, 0x0A, 0x0C, 0x0D, 0x20 - 0x22, 0x26, 0x27, 0x2C -
0x3F, 0x41 - 0x5A, and 0x61 - 0x7A<!-- is that list ok? do any
character sets we want to support do things outside that range?
- -->, ignoring cases where those bytes would be part of multibyte
- sequences. <a href=#refsRFC1345>[RFC1345]</a></p>
+ -->, ignoring the second and later bytes of multibyte sequences. <a href=#refsRFC1345>[RFC1345]</a></p>
<p class=note>This includes such exotic encodings as Shift_JIS and
variants of ISO-2022, even though it is possible for bytes like 0x70
to be part of longer sequences that are unrelated to their
- interpretation as ASCII.</p>
+ interpretation as ASCII. It excludes such encodings as UTF-7,
+ UTF-16, HZ-GB-2312, GSM03.38, and EBCDIC variants.</p>
<!--
We'll have to change that if anyone comes up with a way to have a
Modified: source
===================================================================
--- source 2009-06-28 10:10:34 UTC (rev 3331)
+++ source 2009-06-28 10:53:13 UTC (rev 3332)
@@ -730,13 +730,14 @@
the set 0x09, 0x0A, 0x0C, 0x0D, 0x20 - 0x22, 0x26, 0x27, 0x2C -
0x3F, 0x41 - 0x5A, and 0x61 - 0x7A<!-- is that list ok? do any
character sets we want to support do things outside that range?
- -->, ignoring cases where those bytes would be part of multibyte
- sequences. <a href="#refsRFC1345">[RFC1345]</a></p>
+ -->, ignoring the second and later bytes of multibyte sequences. <a
+ href="#refsRFC1345">[RFC1345]</a></p>
<p class="note">This includes such exotic encodings as Shift_JIS and
variants of ISO-2022, even though it is possible for bytes like 0x70
to be part of longer sequences that are unrelated to their
- interpretation as ASCII.</p>
+ interpretation as ASCII. It excludes such encodings as UTF-7,
+ UTF-16, HZ-GB-2312, GSM03.38, and EBCDIC variants.</p>
<!--
We'll have to change that if anyone comes up with a way to have a
More information about the Commit-Watchers
mailing list