[html5] r7607 - [e] (0) Clarify how language codes are passed on to other tools. Fixing https:// [...]

whatwg at whatwg.org whatwg at whatwg.org
Sat Dec 29 16:40:09 PST 2012


Author: ianh
Date: 2012-12-29 16:40:08 -0800 (Sat, 29 Dec 2012)
New Revision: 7607

Modified:
   complete.html
   index
   source
Log:
[e] (0) Clarify how language codes are passed on to other tools.
Fixing https://www.w3.org/Bugs/Public/show_bug.cgi?id=17977
Affected topics: HTML

Modified: complete.html
===================================================================
--- complete.html	2012-12-30 00:04:21 UTC (rev 7606)
+++ complete.html	2012-12-30 00:40:08 UTC (rev 7607)
@@ -248,7 +248,7 @@
 
   <header class=head id=head><p><a class=logo href=http://www.whatwg.org/><img alt=WHATWG height=101 src=/images/logo width=101></a></p>
    <hgroup><h1 class=allcaps>HTML</h1>
-    <h2 class="no-num no-toc">Living Standard — Last Updated 29 December 2012</h2>
+    <h2 class="no-num no-toc">Living Standard — Last Updated 30 December 2012</h2>
    </hgroup><dl><dt><strong>Web developer edition:</strong></dt>
     <dd><strong><a href=http://developers.whatwg.org/>http://developers.whatwg.org/</a></strong></dd>
     <dt>Multiple-page version:</dt>
@@ -11055,20 +11055,23 @@
   languages, the language of the node is unknown, and the
   corresponding language tag is the empty string.</p>
 
-  <p>If the resulting value is not a recognized language tag, then it
-  must be treated as an unknown language having the given language
-  tag, distinct from all other languages. For the purposes of
-  round-tripping or communicating with other services that expect
-  language tags, user agents should pass unknown language tags
-  through unmodified.</p>
+  <p>If the resulting value is not a recognized language tag, then it must be treated as an unknown
+  language having the given language tag, distinct from all other languages. For the purposes of
+  round-tripping or communicating with other services that expect language tags, user agents should
+  pass unknown language tags through unmodified, and tagged as being BCP 47 language tags, so that
+  subsequent services do not interpret the data as another type of language description. <a href=#refsBCP47>[BCP47]</a></p>
 
-  <p class=example>Thus, for instance, an element with <code title="">lang="xyzzy"</code> would be matched by the selector <code title="">:lang(xyzzy)</code> (e.g. in CSS), but it would not be
-  matched by <code title="">:lang(abcde)</code>, even though both are
-  equally invalid. Similarly, if a Web browser and screen reader
-  working in unison communicated about the language of the element,
-  the browser would tell the screen reader that the language was
-  "xyzzy", even if it knew it was invalid, just in case the screen
-  reader actually supported a language with that tag after all.</p>
+  <p class=example>Thus, for instance, an element with <code title="">lang="xyzzy"</code> would be
+  matched by the selector <code title="">:lang(xyzzy)</code> (e.g. in CSS), but it would not be
+  matched by <code title="">:lang(abcde)</code>, even though both are equally invalid. Similarly, if
+  a Web browser and screen reader working in unison communicated about the language of the element,
+  the browser would tell the screen reader that the language was "xyzzy", even if it knew it was
+  invalid, just in case the screen reader actually supported a language with that tag after all.
+  Even if the screen reader supported both BCP 47 and another syntax for encoding language names,
+  and in that other syntax the string "xyzzy" was a way to denote the Belarusian language, it would
+  be <em>incorrect</em> for the screen reader to then start treating text as Belarusian, because
+  "xyzzy" is not how Belarusian is described in BCP 47 codes (BCP 47 uses the code "be" for
+  Belarusian).</p>
 
   <p>If the resulting value is the empty string, then it must be
   interpreted as meaning that the language of the node is explicitly

Modified: index
===================================================================
--- index	2012-12-30 00:04:21 UTC (rev 7606)
+++ index	2012-12-30 00:40:08 UTC (rev 7607)
@@ -248,7 +248,7 @@
 
   <header class=head id=head><p><a class=logo href=http://www.whatwg.org/><img alt=WHATWG height=101 src=/images/logo width=101></a></p>
    <hgroup><h1 class=allcaps>HTML</h1>
-    <h2 class="no-num no-toc">Living Standard — Last Updated 29 December 2012</h2>
+    <h2 class="no-num no-toc">Living Standard — Last Updated 30 December 2012</h2>
    </hgroup><dl><dt><strong>Web developer edition:</strong></dt>
     <dd><strong><a href=http://developers.whatwg.org/>http://developers.whatwg.org/</a></strong></dd>
     <dt>Multiple-page version:</dt>
@@ -11055,20 +11055,23 @@
   languages, the language of the node is unknown, and the
   corresponding language tag is the empty string.</p>
 
-  <p>If the resulting value is not a recognized language tag, then it
-  must be treated as an unknown language having the given language
-  tag, distinct from all other languages. For the purposes of
-  round-tripping or communicating with other services that expect
-  language tags, user agents should pass unknown language tags
-  through unmodified.</p>
+  <p>If the resulting value is not a recognized language tag, then it must be treated as an unknown
+  language having the given language tag, distinct from all other languages. For the purposes of
+  round-tripping or communicating with other services that expect language tags, user agents should
+  pass unknown language tags through unmodified, and tagged as being BCP 47 language tags, so that
+  subsequent services do not interpret the data as another type of language description. <a href=#refsBCP47>[BCP47]</a></p>
 
-  <p class=example>Thus, for instance, an element with <code title="">lang="xyzzy"</code> would be matched by the selector <code title="">:lang(xyzzy)</code> (e.g. in CSS), but it would not be
-  matched by <code title="">:lang(abcde)</code>, even though both are
-  equally invalid. Similarly, if a Web browser and screen reader
-  working in unison communicated about the language of the element,
-  the browser would tell the screen reader that the language was
-  "xyzzy", even if it knew it was invalid, just in case the screen
-  reader actually supported a language with that tag after all.</p>
+  <p class=example>Thus, for instance, an element with <code title="">lang="xyzzy"</code> would be
+  matched by the selector <code title="">:lang(xyzzy)</code> (e.g. in CSS), but it would not be
+  matched by <code title="">:lang(abcde)</code>, even though both are equally invalid. Similarly, if
+  a Web browser and screen reader working in unison communicated about the language of the element,
+  the browser would tell the screen reader that the language was "xyzzy", even if it knew it was
+  invalid, just in case the screen reader actually supported a language with that tag after all.
+  Even if the screen reader supported both BCP 47 and another syntax for encoding language names,
+  and in that other syntax the string "xyzzy" was a way to denote the Belarusian language, it would
+  be <em>incorrect</em> for the screen reader to then start treating text as Belarusian, because
+  "xyzzy" is not how Belarusian is described in BCP 47 codes (BCP 47 uses the code "be" for
+  Belarusian).</p>
 
   <p>If the resulting value is the empty string, then it must be
   interpreted as meaning that the language of the node is explicitly

Modified: source
===================================================================
--- source	2012-12-30 00:04:21 UTC (rev 7606)
+++ source	2012-12-30 00:40:08 UTC (rev 7607)
@@ -11203,22 +11203,24 @@
   languages, the language of the node is unknown, and the
   corresponding language tag is the empty string.</p>
 
-  <p>If the resulting value is not a recognized language tag, then it
-  must be treated as an unknown language having the given language
-  tag, distinct from all other languages. For the purposes of
-  round-tripping or communicating with other services that expect
-  language tags, user agents should pass unknown language tags
-  through unmodified.</p>
+  <p>If the resulting value is not a recognized language tag, then it must be treated as an unknown
+  language having the given language tag, distinct from all other languages. For the purposes of
+  round-tripping or communicating with other services that expect language tags, user agents should
+  pass unknown language tags through unmodified, and tagged as being BCP 47 language tags, so that
+  subsequent services do not interpret the data as another type of language description. <a
+  href="#refsBCP47">[BCP47]</a></p>
 
-  <p class="example">Thus, for instance, an element with <code
-  title="">lang="xyzzy"</code> would be matched by the selector <code
-  title="">:lang(xyzzy)</code> (e.g. in CSS), but it would not be
-  matched by <code title="">:lang(abcde)</code>, even though both are
-  equally invalid. Similarly, if a Web browser and screen reader
-  working in unison communicated about the language of the element,
-  the browser would tell the screen reader that the language was
-  "xyzzy", even if it knew it was invalid, just in case the screen
-  reader actually supported a language with that tag after all.</p>
+  <p class="example">Thus, for instance, an element with <code title="">lang="xyzzy"</code> would be
+  matched by the selector <code title="">:lang(xyzzy)</code> (e.g. in CSS), but it would not be
+  matched by <code title="">:lang(abcde)</code>, even though both are equally invalid. Similarly, if
+  a Web browser and screen reader working in unison communicated about the language of the element,
+  the browser would tell the screen reader that the language was "xyzzy", even if it knew it was
+  invalid, just in case the screen reader actually supported a language with that tag after all.
+  Even if the screen reader supported both BCP 47 and another syntax for encoding language names,
+  and in that other syntax the string "xyzzy" was a way to denote the Belarusian language, it would
+  be <em>incorrect</em> for the screen reader to then start treating text as Belarusian, because
+  "xyzzy" is not how Belarusian is described in BCP 47 codes (BCP 47 uses the code "be" for
+  Belarusian).</p>
 
   <p>If the resulting value is the empty string, then it must be
   interpreted as meaning that the language of the node is explicitly




More information about the Commit-Watchers mailing list