[whatwg] Superset encodings [Re: ISO-8859-* and the C1 control range]

Øistein E. Andersen liszt at coq.no
Thu Oct 22 14:58:38 PDT 2009


On 22 Oct 2009, at 22:45, Philip Taylor wrote:
> On Thu, Oct 22, 2009 at 9:23 PM, Øistein E. Andersen <liszt at coq.no>  
> wrote:
>> On 22 Oct 2009, at 17:15, NARUSE, Yui wrote:
>>> Finally, Why ISO 2022 series is discouraged is not clear.
>> We agree on this point.
> The string "숍訊昱穿" encoded as ISO-2022-KR is the bytes 0e 3c  
> 73  63 72
> 69 70 74 3e. A UA that doesn't support ISO-2022-KR (e.g. Chrome, when
> I last checked) will decode it as Windows-1252 and get the string
> "<script>", which is bad. [...]

If that is the reason, at least HZ encoding would seem to be affected  
as well.  Explicitly discouraging a more or less random subset of the  
problematic encdodings without providing rationale makes it difficult  
to assess whether or not other, somewhat similar, encodings should be  
avoided as well, which was the main issue I wanted to raise.

-- 
Øistein E. Andersen


More information about the whatwg mailing list