[whatwg] Default encoding to UTF-8?

Anne van Kesteren annevk at opera.com
Tue Apr 3 12:08:41 PDT 2012

On Tue, 03 Apr 2012 13:59:25 +0200, Henri Sivonen <hsivonen at iki.fi> wrote:
> On Wed, Jan 4, 2012 at 12:34 AM, Leif Halvard Silli
> <xn--mlform-iua at xn--mlform-iua.no> wrote:
>>> A solution that would border on reasonable would be decoding as
>>> US-ASCII up to the first non-ASCII byte
>> Thus possibly prescan of more than 1024 bytes?
> I didn't mean a prescan.  I meant proceeding with the real parse and
> switching decoders in midstream. This would have the complication of
> also having to change the encoding the document object reports to
> JavaScript in some cases.

On IRC (#whatwg) zcorpan pointed out this would break URLs where entities  
are used to encode non-ASCII code points in the query component.

Anne van Kesteren

More information about the whatwg mailing list