[whatwg] Charset sniffing from XML prolog

Kartikaya Gupta lists.whatwg at stakface.com
Wed Oct 7 21:26:14 PDT 2009


> In this case, though, it's totally my fault: I just forgot that I had 
> the HTML5 parser turned on locally.  Turning that off, I do get UTF-8, 
> because of 
> http://hg.mozilla.org/mozilla-central/file/603759afc77a/parser/htmlparser/src/nsParser.cpp#l2553 
> and following.  That code is just bogus, in my somewhat biased 
> opinion.... ;)

So is this behavior getting axed or specced? The site that relies on it is http://bell.mobi/primary. It's not as noticeable in the English-locale version, but if you switch to a French locale you get a bunch of French text encoded as UTF-8. Browsers with the prolog sniffing will render it fine, but others will show garbage.
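
For illustration, the kind of prolog sniffing under discussion might look roughly like the sketch below. This is not Gecko's code and not anything from a spec: the function name, the 1024-byte limit, and the windows-1252 fallback are all assumptions made up for the example.

```python
import re

# Hypothetical pattern: an XML declaration at the very start of the
# document that carries an encoding="..." pseudo-attribute.
_XML_DECL = re.compile(
    rb'^<\?xml[^>]*?\bencoding\s*=\s*["\']([A-Za-z][A-Za-z0-9._-]*)["\']'
)


def sniff_prolog_encoding(head: bytes, fallback: str = "windows-1252") -> str:
    """Return the encoding named in the XML prolog, or the fallback.

    Only the first 1024 bytes are inspected; that limit and the fallback
    value are assumptions for this sketch, not taken from any browser.
    """
    match = _XML_DECL.match(head[:1024])
    if match:
        return match.group(1).decode("ascii").lower()
    return fallback


# A page served without a charset, but with an XML prolog naming UTF-8.
page = (
    '<?xml version="1.0" encoding="UTF-8"?>\n'
    "<html><body>Français</body></html>"
).encode("utf-8")

# With prolog sniffing the French text decodes correctly.
print(page.decode(sniff_prolog_encoding(page)))

# Without it, a windows-1252 default turns the same bytes into garbage.
print(page.decode("windows-1252"))
```

The two print calls show the difference I mean: the same bytes read fine when the prolog is honored and come out mangled when the decoder falls back to a legacy default.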

I'd be happier not having to change my code to deal with this website, since it will occasionally show garbage even in UTF-8.

kats


