[html5] r6650 - [e] (0) Define 'Unicode code point'. Fixing http://www.w3.org/Bugs/Public/show_b [...]
whatwg at whatwg.org
whatwg at whatwg.org
Thu Oct 6 16:34:49 PDT 2011
Author: ianh
Date: 2011-10-06 16:34:47 -0700 (Thu, 06 Oct 2011)
New Revision: 6650
Modified:
complete.html
index
source
Log:
[e] (0) Define 'Unicode code point'.
Fixing http://www.w3.org/Bugs/Public/show_bug.cgi?id=13676
Modified: complete.html
===================================================================
--- complete.html 2011-10-06 23:30:14 UTC (rev 6649)
+++ complete.html 2011-10-06 23:34:47 UTC (rev 6650)
@@ -3370,15 +3370,19 @@
<p>The term <dfn id=unicode-character>Unicode character</dfn> is used to mean a <i title="">Unicode scalar value</i> (i.e. any Unicode code point that
is not a surrogate code point). <a href=#refsUNICODE>[UNICODE]</a></p>
- <p>The term <dfn id=character>character</dfn>, when not qualified as
- <em>Unicode</em> character, means a <a href=#unicode-character>Unicode character</a>
- where possible, or a surrogate code point when not: when an
- algorithm that processes strings is defined in terms of characters,
- a pair of <a href=#code-unit title="code unit">code units</a> consisting of a
- high surrogate followed by a low surrogate must be treated as a
- single character, but isolated surrogates must each be treated as a
- single character also.</p>
+ <p>The term <dfn id=unicode-code-point>Unicode code point</dfn> means a <a href=#unicode-character>Unicode
+ character</a> where possible, and an isolated surrogate code
+ point when not. When a conformance requirement is defined in terms
+ of characters or Unicode code points, a pair of <a href=#code-unit title="code
+ unit">code units</a> consisting of a high surrogate followed by a
+ low surrogate must be treated as the single code point represented
+ by the surrogate pair, but isolated surrogates must each be treated
+ as the single code point with the value of the surrogate.</p>
+ <p>In this specification, the term <dfn id=character>character</dfn>, when not
+ qualified as <em>Unicode</em> character, is synonymous with the term
+ <a href=#unicode-code-point>Unicode code point</a>.</p>
+
<p>The <dfn id=code-point-length>code-point length</dfn> of a string is the number of
<a href=#code-unit title="code unit">code units</a> in that string.</p>
Modified: index
===================================================================
--- index 2011-10-06 23:30:14 UTC (rev 6649)
+++ index 2011-10-06 23:34:47 UTC (rev 6650)
@@ -3370,15 +3370,19 @@
<p>The term <dfn id=unicode-character>Unicode character</dfn> is used to mean a <i title="">Unicode scalar value</i> (i.e. any Unicode code point that
is not a surrogate code point). <a href=#refsUNICODE>[UNICODE]</a></p>
- <p>The term <dfn id=character>character</dfn>, when not qualified as
- <em>Unicode</em> character, means a <a href=#unicode-character>Unicode character</a>
- where possible, or a surrogate code point when not: when an
- algorithm that processes strings is defined in terms of characters,
- a pair of <a href=#code-unit title="code unit">code units</a> consisting of a
- high surrogate followed by a low surrogate must be treated as a
- single character, but isolated surrogates must each be treated as a
- single character also.</p>
+ <p>The term <dfn id=unicode-code-point>Unicode code point</dfn> means a <a href=#unicode-character>Unicode
+ character</a> where possible, and an isolated surrogate code
+ point when not. When a conformance requirement is defined in terms
+ of characters or Unicode code points, a pair of <a href=#code-unit title="code
+ unit">code units</a> consisting of a high surrogate followed by a
+ low surrogate must be treated as the single code point represented
+ by the surrogate pair, but isolated surrogates must each be treated
+ as the single code point with the value of the surrogate.</p>
+ <p>In this specification, the term <dfn id=character>character</dfn>, when not
+ qualified as <em>Unicode</em> character, is synonymous with the term
+ <a href=#unicode-code-point>Unicode code point</a>.</p>
+
<p>The <dfn id=code-point-length>code-point length</dfn> of a string is the number of
<a href=#code-unit title="code unit">code units</a> in that string.</p>
Modified: source
===================================================================
--- source 2011-10-06 23:30:14 UTC (rev 6649)
+++ source 2011-10-06 23:34:47 UTC (rev 6650)
@@ -2248,15 +2248,19 @@
is not a surrogate code point). <a
href="#refsUNICODE">[UNICODE]</a></p>
- <p>The term <dfn>character</dfn>, when not qualified as
- <em>Unicode</em> character, means a <span>Unicode character</span>
- where possible, or a surrogate code point when not: when an
- algorithm that processes strings is defined in terms of characters,
- a pair of <span title="code unit">code units</span> consisting of a
- high surrogate followed by a low surrogate must be treated as a
- single character, but isolated surrogates must each be treated as a
- single character also.</p>
+ <p>The term <dfn>Unicode code point</dfn> means a <span>Unicode
+ character</span> where possible, and an isolated surrogate code
+ point when not. When a conformance requirement is defined in terms
+ of characters or Unicode code points, a pair of <span title="code
+ unit">code units</span> consisting of a high surrogate followed by a
+ low surrogate must be treated as the single code point represented
+ by the surrogate pair, but isolated surrogates must each be treated
+ as the single code point with the value of the surrogate.</p>
+ <p>In this specification, the term <dfn>character</dfn>, when not
+ qualified as <em>Unicode</em> character, is synonymous with the term
+ <span>Unicode code point</span>.</p>
+
<p>The <dfn>code-point length</dfn> of a string is the number of
<span title="code unit">code units</span> in that string.</p>
More information about the Commit-Watchers
mailing list