[whatwg] ISO-8859-* and the C1 control range

Kristof Zelechovski giecrilj at stegny.2a.pl
Tue Jun 5 01:38:37 PDT 2007



And why not:?

    2c) If the declared encoding was ISO-8859-2, replace that  
character with the character that you get by casting the code point  
into a byte and decoding it as Windows-1250.

Am I missing something?
Chris

-----Original Message-----
From: whatwg-bounces at lists.whatwg.org
[mailto:whatwg-bounces at lists.whatwg.org] On Behalf Of Henri Sivonen
Sent: Tuesday, June 05, 2007 9:19 AM
To: WHATWG List
Subject: Re: [whatwg] ISO-8859-* and the C1 control range

On May 29, 2007, at 13:13, Henri Sivonen wrote:

Based on the behavior of Minefield and Opera 9.20, the following  
seems to be the least Charmod violating and least quirky approach  
that could possibly work:

1) Decode the byte stream using a decoder for whatever encoding was  
declared, even ISO-8859-1 or ISO-8859-11, according to ftp:// 
ftp.unicode.org/Public/MAPPINGS/.
2) If a character in the decoded character stream is in the C1 code  
point range, this is a document conformance violation.
    2a) If the declared encoding was ISO-8859-1, replace that  
character with the character that you get by casting the code point  
into a byte and decoding it as Windows-1252.
    2b) If the declared encoding was ISO-8859-11, replace that  
character with the character that you get by casting the code point  
into a byte and decoding it as Windows-874.






More information about the whatwg mailing list