<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
<html>
<head>
<meta content="text/html;charset=ISO-8859-1" http-equiv="Content-Type">
</head>
<body bgcolor="#ffffff" text="#000000">
I hadn't thought of that one ;-) (in Dutch there are no native words
with umlauts, only some of German or Scandinavian descent).<br>
My question was about char-sets that contain both a trema version and a
(seperate) umlaut version of the same character. Are there any?<br>
<br>
cheers,<br>
Sander<br>
<br>
<br>
Kristof Zelechovski schreef:
<blockquote cite="mid:000701c7b509$ed3253a0$1a01080a@POCZTOWIEC"
type="cite">
<pre wrap="">Only the vowel U can have either but I have not seen a valid example of
&utrema;. The orthography "ambigüe" has recently been changed to "ambiguë"
for consistency. Polish "nauka" (science) and German "beurteilen" would
make good candidates but the national rules of orthography do not allow this
distinction because Slavic languages do not have diphthongs except in
borrowed words and it would cause ambiguity in German (cf. "geübt").
(Incidentally, this leads to bad pronunciation often encountered even in
Polish media.)
Cheers
Chris
-----Original Message-----
From: Sander [<a class="moz-txt-link-freetext" href="mailto:html5@zoid.nl">mailto:html5@zoid.nl</a>]
Sent: Friday, June 22, 2007 9:26 PM
To: Kristof Zelechovski
Subject: Re: [whatwg] Entity parsing
Kristof Zelechovski schreef:
</pre>
<blockquote type="cite">
<pre wrap="">A dieresis is not an umlaut so I have to bite my tongue each time I write
</pre>
</blockquote>
<pre wrap=""><!---->or
</pre>
<blockquote type="cite">
<pre wrap="">read nonsense like ï. It feels like lying. Umlaut means "mixed", a
dieresis means "standalone". Those are very different things, and "I" can
never gets mixed so there is no ambiguïty. Since "umlaut" is borrowed
</pre>
</blockquote>
<pre wrap=""><!---->from
</pre>
<blockquote type="cite">
<pre wrap="">German, I can see no problem in borrowing "tréma" from French. I
</pre>
</blockquote>
<pre wrap=""><!---->personally
</pre>
<blockquote type="cite">
<pre wrap="">prefer "&itrema;" to "&idier;" because of readability, but I would not
insist on that.
</pre>
</blockquote>
<pre wrap=""><!---->
"In professional typography, umlaut dots are usually a bit closer to the
letter's body than the dots of the trema. In handwriting, however, no
distinction is visible between the two. This is also true for most
computer fonts and encodings."
[<a class="moz-txt-link-freetext" href="http://en.wikipedia.org/wiki/Umlaut_(diacritic)">http://en.wikipedia.org/wiki/Umlaut_(diacritic)</a>]
Are there any char-sets that have both umlaut and trema variations of
characters? If so, both entities could exist.
cheers,
Sander
PS: I'd go for "&itrema;" instead of "&idier;" as well as the term
"trema" is also the one that's used in Dutch.
</pre>
</blockquote>
</body>
</html>