[html5] r5733 - [e] (0) Define 'overlong form'. Fixing http://www.w3.org/Bugs/Public/show_bug.cg [...]

whatwg at whatwg.org whatwg at whatwg.org
Fri Dec 31 21:46:18 PST 2010


Author: ianh
Date: 2010-12-31 21:46:17 -0800 (Fri, 31 Dec 2010)
New Revision: 5733

Modified:
   complete.html
   index
   source
Log:
[e] (0) Define 'overlong form'.
Fixing http://www.w3.org/Bugs/Public/show_bug.cgi?id=11369

Modified: complete.html
===================================================================
--- complete.html	2011-01-01 05:28:37 UTC (rev 5732)
+++ complete.html	2011-01-01 05:46:17 UTC (rev 5733)
@@ -3604,7 +3604,7 @@
 
   <dl class=switch><dt>One byte in the range FE to FF</dt>
 
-   <dt>Overlong forms (e.g. F0 80 80 A0)</dt>
+   <dt><a href=#overlong-form title="overlong form">Overlong forms</a> (e.g. F0 80 80 A0)</dt>
 
    <dt>One byte in the range C0 to C1, followed by one byte in the range 80 to BF</dt>
 
@@ -3638,7 +3638,12 @@
 
    <dd>Each byte must be replaced with a U+FFFD REPLACEMENT CHARACTER.</dd>
 
-  </dl><p class=example>For example, the byte string "41 98 BA 42 E2 98
+  </dl><p>For the purposes of the above requirements, an <dfn id=overlong-form>overlong
+  form</dfn> in UTF-8 is a sequence that encodes a codepoint using
+  more bytes than the minimum needed to encode that codepoint in
+  UTF-8.</p>
+
+  <p class=example>For example, the byte string "41 98 BA 42 E2 98
   43 E2 98 BA E2 98" would be converted to the string
   "A��B�C☺�".</p>
 

Modified: index
===================================================================
--- index	2011-01-01 05:28:37 UTC (rev 5732)
+++ index	2011-01-01 05:46:17 UTC (rev 5733)
@@ -3583,7 +3583,7 @@
 
   <dl class=switch><dt>One byte in the range FE to FF</dt>
 
-   <dt>Overlong forms (e.g. F0 80 80 A0)</dt>
+   <dt><a href=#overlong-form title="overlong form">Overlong forms</a> (e.g. F0 80 80 A0)</dt>
 
    <dt>One byte in the range C0 to C1, followed by one byte in the range 80 to BF</dt>
 
@@ -3617,7 +3617,12 @@
 
    <dd>Each byte must be replaced with a U+FFFD REPLACEMENT CHARACTER.</dd>
 
-  </dl><p class=example>For example, the byte string "41 98 BA 42 E2 98
+  </dl><p>For the purposes of the above requirements, an <dfn id=overlong-form>overlong
+  form</dfn> in UTF-8 is a sequence that encodes a codepoint using
+  more bytes than the minimum needed to encode that codepoint in
+  UTF-8.</p>
+
+  <p class=example>For example, the byte string "41 98 BA 42 E2 98
   43 E2 98 BA E2 98" would be converted to the string
   "A��B�C☺�".</p>
 

Modified: source
===================================================================
--- source	2011-01-01 05:28:37 UTC (rev 5732)
+++ source	2011-01-01 05:46:17 UTC (rev 5733)
@@ -2592,7 +2592,7 @@
 
    <dt>One byte in the range FE to FF</dt>
 
-   <dt>Overlong forms (e.g. F0 80 80 A0)</dt>
+   <dt><span title="overlong form">Overlong forms</span> (e.g. F0 80 80 A0)</dt>
 
    <dt>One byte in the range C0 to C1, followed by one byte in the range 80 to BF</dt>
 
@@ -2628,6 +2628,11 @@
 
   </dl>
 
+  <p>For the purposes of the above requirements, an <dfn>overlong
+  form</dfn> in UTF-8 is a sequence that encodes a codepoint using
+  more bytes than the minimum needed to encode that codepoint in
+  UTF-8.</p>
+
   <p class="example">For example, the byte string "41 98 BA 42 E2 98
   43 E2 98 BA E2 98" would be converted to the string
   "A&#xFFFD;&#xFFFD;B&#xFFFD;C&#x263A;&#xFFFD;".</p>




More information about the Commit-Watchers mailing list