[whatwg] Internal character encoding declaration

Henri Sivonen hsivonen at iki.fi
Sun Mar 12 06:46:13 PST 2006


On Mar 12, 2006, at 00:49, Henri Sivonen wrote:

>> Encoding errors are easy parse errors. (Emit U+FFFD on bogus data.)
>
> Except for the ISO-8859-* family the easy error recovery should be  
> emitting the characters according to the corresponding Windows-*  
> family superset.

But those aren't strictly encoding errors. One more try:

For ISO-8859-* family encodings that have a corresponding Windows-*
family superset (e.g. Windows-1252 for ISO-8859-1), the UA must use
the Windows-* family superset decoder instead of the ISO-8859-*
family decoder. However, any bytes in the range 0x80–0x9F (inclusive)
are easy parse errors.
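
To illustrate the rule, here is a minimal Python sketch, not a
definitive implementation. The names decode_with_superset and
WINDOWS_SUPERSET are made up for this example, and the table lists
only the ISO-8859-1 case mentioned above. It assumes Python's built-in
codecs and substitutes U+FFFD for any byte the Windows codec leaves
undefined.

```python
# Hypothetical table: declared label -> Windows-* superset decoder.
# Only the pair named in the message is listed; a real UA would carry more.
WINDOWS_SUPERSET = {
    "iso-8859-1": "windows-1252",
}


def decode_with_superset(data: bytes, label: str):
    """Decode data declared as `label`, returning (text, parse_error_offsets).

    If the label has a Windows-* superset, that decoder is used instead,
    and every byte in 0x80-0x9F (inclusive) is recorded as a parse error.
    """
    declared = label.lower()
    codec = WINDOWS_SUPERSET.get(declared, declared)

    parse_errors = []
    if codec != declared:
        # The superset decoder still produces characters for these bytes,
        # but each occurrence is flagged as an (easy) parse error.
        parse_errors = [i for i, b in enumerate(data) if 0x80 <= b <= 0x9F]

    # Bytes undefined in the chosen codec become U+FFFD.
    text = data.decode(codec, errors="replace")
    return text, parse_errors


text, errors = decode_with_superset(b"caf\xe9 \x93hi\x94", "ISO-8859-1")
print(text, errors)
```

Here 0xE9 decodes normally, while 0x93 and 0x94 are decoded as curly
quotes by the Windows-1252 decoder and reported at offsets 5 and 8.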

-- 
Henri Sivonen
hsivonen at iki.fi
http://hsivonen.iki.fi/




