[Implementors] Entities (part of detailed review)
t.broyer at gmail.com
Thu Aug 2 03:17:18 PDT 2007
[CC'ing implementors at whatwg.org]
2007/8/1, Henri Sivonen:
> On Aug 1, 2007, at 15:28, Thomas Broyer wrote:
> > Can someone remind me why this hasn't be done with a third "Is
> > semi-colon required" column?
> If anything, the current table suggests a sensible implementation
> approach that works together with the parsing algorithm prose.
Just to say that I've updated Twintsam's tokenizer  to use an "is
missing semi-colon recoverable"-column approach. It now passes all 519
tokenizer/entities.test tests .
It might not be the fastest approach but given that the HtmlEntities
class is public, I find it cleaner (entities are only exposed as names
without semi-colons, the "third colon" is internal to the library and
its HTML5 tokenizing algorithm). The HtmlEntities' class internals
could eventually be refactored though to use the "two columns"
 HTML5 library for .NET 2.0 written in C#. Most of the code is
about six-month old, I'm working on it to align with the latest
current draft. It can be found at http://code.google.com/p/twintsam/
More information about the Implementors