[html5] r957 - /

whatwg at whatwg.org
Sat Jun 23 02:11:02 PDT 2007


Author: ianh
Date: 2007-06-23 02:06:58 -0700 (Sat, 23 Jun 2007)
New Revision: 957

Modified:
   index
   source
Log:
[ac] (1) Not having any encoding information at all is a bug if you're not using ASCII.

Modified: index
===================================================================
--- index	2007-06-23 08:55:02 UTC (rev 956)
+++ index	2007-06-23 09:06:58 UTC (rev 957)
@@ -390,7 +390,7 @@
           directives</a>
 
          <li><a href="#charset"><span class=secno>3.7.5.4. </span>Specifying
-          and establishing the document's character encoding</a>
+          the document's character encoding</a>
         </ul>
 
        <li><a href="#the-style"><span class=secno>3.7.6. </span>The
@@ -7662,8 +7662,8 @@
    <dd>
   </dl>
 
-  <h5 id=charset><span class=secno>3.7.5.4. </span>Specifying and
-   establishing the document's character encoding</h5>
+  <h5 id=charset><span class=secno>3.7.5.4. </span>Specifying the document's
+   character encoding</h5>
 
   <p>The <code><a href="#meta0">meta</a></code> element may also be used to
    provide UAs with character encoding information for <a href="#html5"
@@ -7689,17 +7689,23 @@
 
   <p>If the document does not start with a BOM, and if its encoding is not
    explicitly given by <a href="#content-type8"
-   title=Content-Type>Content-Type metadata</a>, the character encoding used
-   must be a superset of US-ASCII (specifically, ANSI_X3.4-1968) for bytes in
-   the range 0x09 - 0x0D, 0x20, 0x21, 0x22, 0x26, 0x27, 0x2C - 0x3F, 0x41 -
-   0x5A, and 0x61 - 0x7A. <!-- XXX #refs RFC1345 ? -->
+   title=Content-Type>Content-Type metadata</a>, then the character encoding
+   used must be a superset of US-ASCII (specifically, ANSI_X3.4-1968) for
+   bytes in the range 0x09 - 0x0D, 0x20, 0x21, 0x22, 0x26, 0x27, 0x2C - 0x3F,
+   0x41 - 0x5A, and 0x61 - 0x7A<!-- XXX #refs RFC1345 ? -->
    <!-- is that list ok? do
   any character sets we want to support do things outside that range?
-  -->
-   Authors should not use UTF-32. Authors must not use the CESU-8, UTF-7,
+  -->,
+   and, in addition, if that encoding isn't US-ASCII itself, then the
+   encoding must be specified using a <code><a href="#meta0">meta</a></code>
+   element with a <code title=attr-meta-charset><a
+   href="#charset0">charset</a></code> attribute.
+
+  <p>Authors should not use UTF-32. Authors must not use the CESU-8, UTF-7,
    BOCU-1 and SCSU encodings. <a href="#refsCESU8">[CESU8]</a> <a
    href="#refsUTF7">[UTF7]</a> <a href="#refsBOCU1">[BOCU1]</a> <a
-   href="#refsSCSU">[SCSU]</a>
+   href="#refsSCSU">[SCSU]</a></p>
+  <!-- XXX ref UTF-32 -->
 
   <p>In XHTML, the XML declaration should be used for inline character
    encoding information, if necessary.

Modified: source
===================================================================
--- source	2007-06-23 08:55:02 UTC (rev 956)
+++ source	2007-06-23 09:06:58 UTC (rev 957)
@@ -6248,8 +6248,7 @@
   </dl>
 
 
-  <h5 id="charset">Specifying and establishing the document's
-  character encoding</h5>
+  <h5 id="charset">Specifying the document's character encoding</h5>
 
   <p>The <code>meta</code> element may also be used to provide UAs
   with character encoding information for <span
@@ -6279,15 +6278,19 @@
 
   <p>If the document does not start with a BOM, and if its encoding is
   not explicitly given by <span title="Content-Type">Content-Type
-  metadata</span>, the character encoding used must be a superset of
-  US-ASCII (specifically, ANSI_X3.4-1968) for bytes in the range 0x09
-  - 0x0D, 0x20, 0x21, 0x22, 0x26, 0x27, 0x2C - 0x3F, 0x41 - 0x5A, and
-  0x61 - 0x7A. <!-- XXX #refs RFC1345 ? --> <!-- is that list ok? do
+  metadata</span>, then the character encoding used must be a superset
+  of US-ASCII (specifically, ANSI_X3.4-1968) for bytes in the range
+  0x09 - 0x0D, 0x20, 0x21, 0x22, 0x26, 0x27, 0x2C - 0x3F, 0x41 - 0x5A,
+  and 0x61 - 0x7A<!-- XXX #refs RFC1345 ? --> <!-- is that list ok? do
   any character sets we want to support do things outside that range?
-  --> Authors should not use UTF-32. Authors must not use the CESU-8,
+  -->, and, in addition, if that encoding isn't US-ASCII itself, then
+  the encoding must be specified using a <code>meta</code> element
+  with a <code title="attr-meta-charset">charset</code> attribute.</p>
+
+  <p>Authors should not use UTF-32. Authors must not use the CESU-8,
   UTF-7, BOCU-1 and SCSU encodings. <a href="#refsCESU8">[CESU8]</a>
   <a href="#refsUTF7">[UTF7]</a> <a href="#refsBOCU1">[BOCU1]</a> <a
-  href="#refsSCSU">[SCSU]</a></p>
+  href="#refsSCSU">[SCSU]</a></p> <!-- XXX ref UTF-32 -->
 
   <p>In XHTML, the XML declaration should be used for inline character
   encoding information, if necessary.</p>
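
As an illustration of the requirement this revision adds, here is a minimal sketch of a conforming document. It assumes the file has no BOM, is served without Content-Type encoding metadata, and is saved as windows-1252. The choice of windows-1252 and the page content are hypothetical, not taken from the commit. Because that encoding is a superset of US-ASCII but is not US-ASCII itself, the new text requires a meta element with a charset attribute:

```html
<!DOCTYPE html>
<html>
 <head>
  <!-- Assumed encoding for this sketch: the file is saved as windows-1252,
       with no BOM and no encoding in the Content-Type metadata. -->
  <meta charset="windows-1252">
  <title>Café menu</title>
 </head>
 <body>
  <!-- The non-ASCII characters here are why the declaration is needed. -->
  <p>Crème brûlée</p>
 </body>
</html>
```

A document containing only US-ASCII bytes under the same conditions would not need the declaration under the revised wording.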



