[whatwg] Superset encodings [Re: ISO-8859-* and the C1 control range]

Øistein E. Andersen liszt at coq.no
Wed Oct 21 14:39:00 PDT 2009


On 19 Oct 2009, at 05:52, Ian Hickson wrote:

> I've noted your e-mail here [...] and moved the whole thing out of  
> the spec.

That does not seem to apply to the last part of the original e-mail,  
quoted below.

Øistein E. Andersen



> Other character encoding issues:
> --------------------------------
>
> ASCII-compatibility:
> The note in ‘2.1.5 Character encodings’ seems to say that ‘variants  
> of ISO-2022’ (presumably including common ones like ISO-2022-CN,  
> ISO-2022KR and ISO-2022-JP) are ASCII-compatible, whereas HZ-GB-2312  
> is not, and I cannot find anything in Section 2.1.5 that would  
> explain this difference.
>
>
> Discouraged encodings:
> ‘4.2.5.5 Specifying the document's character encoding’ advises  
> against certain encodings.  (Incidentally, this advice probably  
> deserves not to be ‘hidden’ in a section nominally reserved for  
> character encoding *declaration* issues.)  In particular:
>
>> Authors should not use JIS-X-0208 (JIS_C6226-1983), JIS-X-0212  
>> (JIS_X0212-1990), encodings based on ISO-2022, and encodings based  
>> on EBCDIC.
>
> It is not clear what this means (e.g., the character set  
> JIS_C6226-1983 in any encoding, or only when encoded alone according  
> to RFC1345 as described above); the list of discouraged encodings  
> seems conspicuously short if it is supposed to be complete; and the  
> lack of rationale makes it difficult to understand why these  
> encodings are considered particularly harmful (JIS_C6226-1983 v.  
> JIS_C6226-1978 or ISO-2022 v. HZ, to mention but two at least  
> initially puzzling cases).  It might be better to say *why*  
> particular encodings are better avoided, whether or not the list of  
> discouraged encodings be presented as definitive.
>
> Minor grammar detail in 4.2.5.5:
>> Conformance checkers may advise against authors using legacy  
>> encodings.
>
> This is ambiguous.  It should probably be ‘advise against authors’  
> using legacy encodings’  or better ‘advise authors against using  
> legacy encodings’.


More information about the whatwg mailing list