[html5] r6650 - [e] (0) Define 'Unicode code point'. Fixing http://www.w3.org/Bugs/Public/show_b [...]

Thu Oct 6 16:34:49 PDT 2011

Author: ianh
Date: 2011-10-06 16:34:47 -0700 (Thu, 06 Oct 2011)
New Revision: 6650

Modified:
   complete.html
   index
   source
Log:
[e] (0) Define 'Unicode code point'.
Fixing http://www.w3.org/Bugs/Public/show_bug.cgi?id=13676

Modified: complete.html
===================================================================

--- complete.html	2011-10-06 23:30:14 UTC (rev 6649)
+++ complete.html	2011-10-06 23:34:47 UTC (rev 6650)
@@ -3370,15 +3370,19 @@
   <p>The term <dfn id=unicode-character>Unicode character</dfn> is used to mean a <i title="">Unicode scalar value</i> (i.e. any Unicode code point that
   is not a surrogate code point). <a href=#refsUNICODE>[UNICODE]</a></p>
 
-  <p>The term <dfn id=character>character</dfn>, when not qualified as
-  <em>Unicode</em> character, means a <a href=#unicode-character>Unicode character</a>
-  where possible, or a surrogate code point when not: when an
-  algorithm that processes strings is defined in terms of characters,
-  a pair of <a href=#code-unit title="code unit">code units</a> consisting of a
-  high surrogate followed by a low surrogate must be treated as a
-  single character, but isolated surrogates must each be treated as a
-  single character also.</p>
+  <p>The term <dfn id=unicode-code-point>Unicode code point</dfn> means a <a href=#unicode-character>Unicode
+  character</a> where possible, and an isolated surrogate code
+  point when not. When a conformance requirement is defined in terms
+  of characters or Unicode code points, a pair of <a href=#code-unit title="code
+  unit">code units</a> consisting of a high surrogate followed by a
+  low surrogate must be treated as the single code point represented
+  by the surrogate pair, but isolated surrogates must each be treated
+  as the single code point with the value of the surrogate.</p>
 
+  <p>In this specification, the term <dfn id=character>character</dfn>, when not
+  qualified as <em>Unicode</em> character, is synonymous with the term
+  <a href=#unicode-code-point>Unicode code point</a>.</p>
+
   <p>The <dfn id=code-point-length>code-point length</dfn> of a string is the number of
   <a href=#code-unit title="code unit">code units</a> in that string.</p>
 

Modified: index
===================================================================
--- index	2011-10-06 23:30:14 UTC (rev 6649)
+++ index	2011-10-06 23:34:47 UTC (rev 6650)
@@ -3370,15 +3370,19 @@
   <p>The term <dfn id=unicode-character>Unicode character</dfn> is used to mean a <i title="">Unicode scalar value</i> (i.e. any Unicode code point that
   is not a surrogate code point). <a href=#refsUNICODE>[UNICODE]</a></p>
 
-  <p>The term <dfn id=character>character</dfn>, when not qualified as
-  <em>Unicode</em> character, means a <a href=#unicode-character>Unicode character</a>
-  where possible, or a surrogate code point when not: when an
-  algorithm that processes strings is defined in terms of characters,
-  a pair of <a href=#code-unit title="code unit">code units</a> consisting of a
-  high surrogate followed by a low surrogate must be treated as a
-  single character, but isolated surrogates must each be treated as a
-  single character also.</p>
+  <p>The term <dfn id=unicode-code-point>Unicode code point</dfn> means a <a href=#unicode-character>Unicode
+  character</a> where possible, and an isolated surrogate code
+  point when not. When a conformance requirement is defined in terms
+  of characters or Unicode code points, a pair of <a href=#code-unit title="code
+  unit">code units</a> consisting of a high surrogate followed by a
+  low surrogate must be treated as the single code point represented
+  by the surrogate pair, but isolated surrogates must each be treated
+  as the single code point with the value of the surrogate.</p>
 
+  <p>In this specification, the term <dfn id=character>character</dfn>, when not
+  qualified as <em>Unicode</em> character, is synonymous with the term
+  <a href=#unicode-code-point>Unicode code point</a>.</p>
+
   <p>The <dfn id=code-point-length>code-point length</dfn> of a string is the number of
   <a href=#code-unit title="code unit">code units</a> in that string.</p>
 

Modified: source
===================================================================
--- source	2011-10-06 23:30:14 UTC (rev 6649)
+++ source	2011-10-06 23:34:47 UTC (rev 6650)
@@ -2248,15 +2248,19 @@
   is not a surrogate code point). <a
   href="#refsUNICODE">[UNICODE]</a></p>
 
-  <p>The term <dfn>character</dfn>, when not qualified as
-  <em>Unicode</em> character, means a <span>Unicode character</span>
-  where possible, or a surrogate code point when not: when an
-  algorithm that processes strings is defined in terms of characters,
-  a pair of <span title="code unit">code units</span> consisting of a
-  high surrogate followed by a low surrogate must be treated as a
-  single character, but isolated surrogates must each be treated as a
-  single character also.</p>
+  <p>The term <dfn>Unicode code point</dfn> means a <span>Unicode
+  character</span> where possible, and an isolated surrogate code
+  point when not. When a conformance requirement is defined in terms
+  of characters or Unicode code points, a pair of <span title="code
+  unit">code units</span> consisting of a high surrogate followed by a
+  low surrogate must be treated as the single code point represented
+  by the surrogate pair, but isolated surrogates must each be treated
+  as the single code point with the value of the surrogate.</p>
 
+  <p>In this specification, the term <dfn>character</dfn>, when not
+  qualified as <em>Unicode</em> character, is synonymous with the term
+  <span>Unicode code point</span>.</p>
+
   <p>The <dfn>code-point length</dfn> of a string is the number of
   <span title="code unit">code units</span> in that string.</p>