[html5] r5733 - [e] (0) Define 'overlong form'. Fixing http://www.w3.org/Bugs/Public/show_bug.cg [...]
whatwg at whatwg.org
whatwg at whatwg.org
Fri Dec 31 21:46:18 PST 2010
Author: ianh
Date: 2010-12-31 21:46:17 -0800 (Fri, 31 Dec 2010)
New Revision: 5733
Modified:
complete.html
index
source
Log:
[e] (0) Define 'overlong form'.
Fixing http://www.w3.org/Bugs/Public/show_bug.cgi?id=11369
Modified: complete.html
===================================================================
--- complete.html 2011-01-01 05:28:37 UTC (rev 5732)
+++ complete.html 2011-01-01 05:46:17 UTC (rev 5733)
@@ -3604,7 +3604,7 @@
<dl class=switch><dt>One byte in the range FE to FF</dt>
- <dt>Overlong forms (e.g. F0 80 80 A0)</dt>
+ <dt><a href=#overlong-form title="overlong form">Overlong forms</a> (e.g. F0 80 80 A0)</dt>
<dt>One byte in the range C0 to C1, followed by one byte in the range 80 to BF</dt>
@@ -3638,7 +3638,12 @@
<dd>Each byte must be replaced with a U+FFFD REPLACEMENT CHARACTER.</dd>
- </dl><p class=example>For example, the byte string "41 98 BA 42 E2 98
+ </dl><p>For the purposes of the above requirements, an <dfn id=overlong-form>overlong
+ form</dfn> in UTF-8 is a sequence that encodes a codepoint using
+ more bytes than the minimum needed to encode that codepoint in
+ UTF-8.</p>
+
+ <p class=example>For example, the byte string "41 98 BA 42 E2 98
43 E2 98 BA E2 98" would be converted to the string
"A��B�C☺�".</p>
Modified: index
===================================================================
--- index 2011-01-01 05:28:37 UTC (rev 5732)
+++ index 2011-01-01 05:46:17 UTC (rev 5733)
@@ -3583,7 +3583,7 @@
<dl class=switch><dt>One byte in the range FE to FF</dt>
- <dt>Overlong forms (e.g. F0 80 80 A0)</dt>
+ <dt><a href=#overlong-form title="overlong form">Overlong forms</a> (e.g. F0 80 80 A0)</dt>
<dt>One byte in the range C0 to C1, followed by one byte in the range 80 to BF</dt>
@@ -3617,7 +3617,12 @@
<dd>Each byte must be replaced with a U+FFFD REPLACEMENT CHARACTER.</dd>
- </dl><p class=example>For example, the byte string "41 98 BA 42 E2 98
+ </dl><p>For the purposes of the above requirements, an <dfn id=overlong-form>overlong
+ form</dfn> in UTF-8 is a sequence that encodes a codepoint using
+ more bytes than the minimum needed to encode that codepoint in
+ UTF-8.</p>
+
+ <p class=example>For example, the byte string "41 98 BA 42 E2 98
43 E2 98 BA E2 98" would be converted to the string
"A��B�C☺�".</p>
Modified: source
===================================================================
--- source 2011-01-01 05:28:37 UTC (rev 5732)
+++ source 2011-01-01 05:46:17 UTC (rev 5733)
@@ -2592,7 +2592,7 @@
<dt>One byte in the range FE to FF</dt>
- <dt>Overlong forms (e.g. F0 80 80 A0)</dt>
+ <dt><span title="overlong form">Overlong forms</span> (e.g. F0 80 80 A0)</dt>
<dt>One byte in the range C0 to C1, followed by one byte in the range 80 to BF</dt>
@@ -2628,6 +2628,11 @@
</dl>
+ <p>For the purposes of the above requirements, an <dfn>overlong
+ form</dfn> in UTF-8 is a sequence that encodes a codepoint using
+ more bytes than the minimum needed to encode that codepoint in
+ UTF-8.</p>
+
<p class="example">For example, the byte string "41 98 BA 42 E2 98
43 E2 98 BA E2 98" would be converted to the string
"A��B�C☺�".</p>
More information about the Commit-Watchers
mailing list