[html5] r3368 - [] (0) Clarify the encoding aliases mess.

whatwg at whatwg.org whatwg at whatwg.org
Tue Jul 7 01:26:16 PDT 2009


Author: ianh
Date: 2009-07-07 01:26:15 -0700 (Tue, 07 Jul 2009)
New Revision: 3368

Modified:
   index
   source
Log:
[] (0) Clarify the encoding aliases mess.

Modified: index
===================================================================
--- index	2009-07-07 04:41:56 UTC (rev 3367)
+++ index	2009-07-07 08:26:15 UTC (rev 3368)
@@ -4976,14 +4976,30 @@
   <p class=example>For instance, "GB_2312-80" and "g.b.2312(80)" are
   considered equivalent names.</p>
 
-  <hr><p>When a user agent would otherwise use an encoding specified by a
-  label given in the first column of the following table to either
-  convert content to Unicode characters or convert Unicode characters
-  to bytes, it must instead use the encoding given in the cell in the
-  second column of the same row. When a byte or sequence of bytes is
-  treated differently due to this encoding aliasing, it is said to
-  have been <dfn id=misinterpreted-for-compatibility>misinterpreted for compatibility</dfn>.</p>
+<!-- this bit will be replaced by actual alias registrations in due course -->
 
+  <p>In addition, user agents must support the aliases given in the
+  following table, so that labels from the first column are treated as
+  equivalent to the labels given in the corresponding cell from the
+  second column on the same row.</p>
+
+  <table><caption>Additional character encoding aliases</caption>
+   <thead><tr><th> Alias <th> Corresponding encoding <th> References
+   <tbody><tr><td> x-sjis <td> Windows-31J <td>
+         <a href=#refsSHIFTJIS>[SHIFTJIS]</a>
+         <a href=#refsWIN31J>[WIN31J]</a>
+    <tr><td> Windows-932 <td> Windows-31J <td>
+         <a href=#refsWIN31J>[WIN31J]</a>
+    <tr><td> x-x-big5 <td> Big5 <td>
+         <a href=#refsBIG5>[BIG5]</a>
+   </table><!-- end of bit that will be replaced by actual alias registrations in due course --><hr><p>When a user agent would otherwise use an encoding given in the
+  first column of the following table to either convert content to
+  Unicode characters or convert Unicode characters to bytes, it must
+  instead use the encoding given in the cell in the second column of
+  the same row. When a byte or sequence of bytes is treated
+  differently due to this encoding aliasing, it is said to have been
+  <dfn id=misinterpreted-for-compatibility>misinterpreted for compatibility</dfn>.</p>
+
   <table><caption>Character encoding overrides</caption>
    <thead><tr><th> Input encoding <th> Replacement encoding <th> References
    <tbody><!-- how about EUC-JP? --><tr><td> EUC-KR <td> Windows-949 <td>
@@ -5016,8 +5032,6 @@
     <tr><td> US-ASCII <td> Windows-1252 <td>
          <a href=#refsRFC1345>[RFC1345]</a>
          <a href=#refsWIN1252>[WIN1252]</a>
-    <tr><td> x-x-big5 <td> Big5 <td>
-         <a href=#refsBIG5>[BIG5]</a>
    </table><p class=note>The requirement to treat certain encodings as other
   encodings according to the table above is a <a href=#willful-violation>willful
   violation</a> of the W3C Character Model specification, motivated

Modified: source
===================================================================
--- source	2009-07-07 04:41:56 UTC (rev 3367)
+++ source	2009-07-07 08:26:15 UTC (rev 3368)
@@ -4660,15 +4660,39 @@
   <p class="example">For instance, "GB_2312-80" and "g.b.2312(80)" are
   considered equivalent names.</p>
 
+<!-- this bit will be replaced by actual alias registrations in due course -->
+
+  <p>In addition, user agents must support the aliases given in the
+  following table, so that labels from the first column are treated as
+  equivalent to the labels given in the corresponding cell from the
+  second column on the same row.</p>
+
+  <table>
+   <caption>Additional character encoding aliases</caption>
+   <thead>
+    <tr> <th> Alias <th> Corresponding encoding <th> References
+   <tbody>
+    <tr> <td> x-sjis <td> Windows-31J <td>
+         <a href="#refsSHIFTJIS">[SHIFTJIS]</a>
+         <a href="#refsWIN31J">[WIN31J]</a>
+    <tr> <td> Windows-932 <td> Windows-31J <td>
+         <a href="#refsWIN31J">[WIN31J]</a>
+    <tr> <td> x-x-big5 <td> Big5 <td>
+         <a href="#refsBIG5">[BIG5]</a>
+   </tbody>
+  </table>
+
+<!-- end of bit that will be replaced by actual alias registrations in due course -->
+
   <hr>
 
-  <p>When a user agent would otherwise use an encoding specified by a
-  label given in the first column of the following table to either
-  convert content to Unicode characters or convert Unicode characters
-  to bytes, it must instead use the encoding given in the cell in the
-  second column of the same row. When a byte or sequence of bytes is
-  treated differently due to this encoding aliasing, it is said to
-  have been <dfn>misinterpreted for compatibility</dfn>.</p>
+  <p>When a user agent would otherwise use an encoding given in the
+  first column of the following table to either convert content to
+  Unicode characters or convert Unicode characters to bytes, it must
+  instead use the encoding given in the cell in the second column of
+  the same row. When a byte or sequence of bytes is treated
+  differently due to this encoding aliasing, it is said to have been
+  <dfn>misinterpreted for compatibility</dfn>.</p>
 
   <table>
    <caption>Character encoding overrides</caption>
@@ -4706,8 +4730,6 @@
     <tr> <td> US-ASCII <td> Windows-1252 <td>
          <a href="#refsRFC1345">[RFC1345]</a>
          <a href="#refsWIN1252">[WIN1252]</a>
-    <tr> <td> x-x-big5 <td> Big5 <td>
-         <a href="#refsBIG5">[BIG5]</a>
    </tbody>
   </table>
 




More information about the Commit-Watchers mailing list