[whatwg] Internal character encoding declaration
Henri Sivonen
hsivonen at iki.fi
Sun Mar 12 06:46:13 PST 2006
On Mar 12, 2006, at 00:49, Henri Sivonen wrote:
>> Encoding errors are easy parse errors. (Emit U+FFFD on bogus data.)
>
> Except for the ISO-8859-* family the easy error recovery should be
> emitting the characters according to the corresponding Windows-*
> family superset.
But those aren't strictly encoding errors. One more try:
For ISO-8859-* family encodings that have a corresponding Windows-*
family superset (e.g. Windows-1252 for ISO-8859-1) the UA must use
the Windows-* family superset decoder instead of the ISO-8859-*
family decoder. However, any bytes in the 0x80–0x9F (inclusive) are
easy parse errors.
--
Henri Sivonen
hsivonen at iki.fi
http://hsivonen.iki.fi/
More information about the whatwg
mailing list