[html5] r3332 - [] (0) Elaborate on the rules for ASCII-compatible encodings (see last checkin). [...]

whatwg at whatwg.org whatwg at whatwg.org
Sun Jun 28 03:53:15 PDT 2009


Author: ianh
Date: 2009-06-28 03:53:13 -0700 (Sun, 28 Jun 2009)
New Revision: 3332

Modified:
   index
   source
Log:
[] (0) Elaborate on the rules for ASCII-compatible encodings (see last checkin). (credit: pt)

Modified: index
===================================================================
--- index	2009-06-28 10:10:34 UTC (rev 3331)
+++ index	2009-06-28 10:53:13 UTC (rev 3332)
@@ -1705,13 +1705,13 @@
   the set 0x09, 0x0A, 0x0C, 0x0D, 0x20 - 0x22, 0x26, 0x27, 0x2C -
   0x3F, 0x41 - 0x5A, and 0x61 - 0x7A<!-- is that list ok? do any
   character sets we want to support do things outside that range?
-  -->, ignoring cases where those bytes would be part of multibyte
-  sequences. <a href=#refsRFC1345>[RFC1345]</a></p>
+  -->, ignoring the second and later bytes of multibyte sequences. <a href=#refsRFC1345>[RFC1345]</a></p>
 
   <p class=note>This includes such exotic encodings as Shift_JIS and
   variants of ISO-2022, even though it is possible for bytes like 0x70
   to be part of longer sequences that are unrelated to their
-  interpretation as ASCII.</p>
+  interpretation as ASCII. It excludes such encodings as UTF-7,
+  UTF-16, HZ-GB-2312, GSM03.38, and EBCDIC variants.</p>
 
   <!--
    We'll have to change that if anyone comes up with a way to have a

Modified: source
===================================================================
--- source	2009-06-28 10:10:34 UTC (rev 3331)
+++ source	2009-06-28 10:53:13 UTC (rev 3332)
@@ -730,13 +730,14 @@
   the set 0x09, 0x0A, 0x0C, 0x0D, 0x20 - 0x22, 0x26, 0x27, 0x2C -
   0x3F, 0x41 - 0x5A, and 0x61 - 0x7A<!-- is that list ok? do any
   character sets we want to support do things outside that range?
-  -->, ignoring cases where those bytes would be part of multibyte
-  sequences. <a href="#refsRFC1345">[RFC1345]</a></p>
+  -->, ignoring the second and later bytes of multibyte sequences. <a
+  href="#refsRFC1345">[RFC1345]</a></p>
 
   <p class="note">This includes such exotic encodings as Shift_JIS and
   variants of ISO-2022, even though it is possible for bytes like 0x70
   to be part of longer sequences that are unrelated to their
-  interpretation as ASCII.</p>
+  interpretation as ASCII. It excludes such encodings as UTF-7,
+  UTF-16, HZ-GB-2312, GSM03.38, and EBCDIC variants.</p>
 
   <!--
    We'll have to change that if anyone comes up with a way to have a




More information about the Commit-Watchers mailing list