[whatwg] Comments on Web Forms 2.0
Henri Sivonen
hsivonen at iki.fi
Sun Jan 2 06:16:25 PST 2005
On Dec 28, 2004, at 18:53, fantasai wrote:
> Ian Hickson wrote:
>> On Tue, 28 Dec 2004, Henri Sivonen wrote:
>>> According to http://www.unicode.org/faq/utf_bom.html#38 a data
>>> format or protocol may choose to ignore the BOM in the middle of a
>>> string.
>> HTML doesn't choose that, though, so that isn't relevant to us.
>
> It would be if the HTML document in question passes through a processor
> that takes advantage of this allowance. You could of course encode it
> as a numerical entity.
Expecting NCRs to allow characters to be smuggled is unsafe, because
clueful processing converts NCRs to straight characters.
>>> Anyway, I'm still uncomfortable with using a deprecated character
>>> that has a very special other meaning as a magic marker in WF 2.0.
>> I'm not overjoyed with it myself, but I haven't got any better ideas.
>> The current system works quite well, and certainly works better than
>> the "[]" prefix that I first suggested.
>
> That's questionable. At least the [] was visible so you could tell it
> was there.
> I have a strong suspicion that editing invisible characters is more
> error-prone
> than editing visible ones. And the idea of a disappearing invisible
> character
> seems like it would be a bit bizarre to explain to the average person.
Indeed.
Possible other magic marker characters:
ASCII visible, easy to use with the US keyboard and reasonable with
European keyboards
U+007C VERTICAL LINE
U+007E TILDE
U+005E CIRCUMFLEX ACCENT
Non-ASCII visible, available in legacy fonts, can be typed using Mac kb
layouts:
U+2022 BULLET
--
Henri Sivonen
hsivonen at iki.fi
http://iki.fi/hsivonen/
More information about the whatwg-whatwg.org
mailing list