[whatwg] Parsing entities

Simon Pieters zcorpan at hotmail.com
Mon Aug 14 14:18:28 PDT 2006


Hi,

From: Ian Hickson <ian at hixie.ch>
>On Mon, 14 Aug 2006, Simon Pieters wrote:
> >
> > I guess that for compat with IE and the Web[1] we have to treat
> > "R&eacutesum&eacute" as if it were "Résumé". So how do we
> > handle "¬i;"? When the parser has come as far as "&not" it can't
> > return U+00AC yet because it could well be "∉". But when it has
> > reached "¬i;" then it can't be "∉", thus it returns U+00AC, but
> > then you also have to reparse the "i;", right? Unless I'm mistaken the
> > spec doesn't say anything about that.
>
>Section 8.2.3.1 "Tokenising entities", under "Anything else", covers this:
>"Consume the maximum number of characters possible, with the consumed
>characters case-sensitively matching one of the identifiers in the first
>column of the entities table".

I've read that several times. But it doesn't say that "i;" in "¬i;" is 
going to be reparsed. ("noti" isn't an entity, but "not" and "notin" are.)

Regards,
Simon Pieters





More information about the whatwg mailing list