[whatwg] Test cases for parsing spec (Was: Re: Providing Better Tools)

Sam Ruby rubys at intertwingly.net
Wed Dec 6 07:22:35 PST 2006

Anne van Kesteren wrote:
> On Wed, 06 Dec 2006 15:13:26 +0100, Sam Ruby <rubys at intertwingly.net> 
> wrote:
>> Count me in.  This is actually closer to the reason why I 
>> originally subscribed to this list.  If given a few tests, I could 
>> convert them into a useful form, and this form could serve as a model 
>> for future tests.
>> My original interest was to write a replacement for Python's sgmllib, 
>> i.e., one that was not based on the theoretical ideal of how SGML 
>> vocabularies work, but one based on the practical notion of how HTML 
>> is actually parsed.
> The HTMLTokenizer for such a project is mostly finished already:
>   http://code.google.com/p/html5lib/
> (As in, it actually emits the tokens it has to. I'm quite happy about it!)
> James Graham has been working on the Tree Construction part of the 
> process (called HTMLParser in parser.py) and Lachlan Hunt is working on 
> an HTMLInputStream class which handles some of the specifics needed for 
> the input stream.

I have no interest in participating in a project without test cases.

On the bright side, the license chosen for that work is fine, and -- if 
there are test cases -- I have no interest in duplicating others' work.

- Sam Ruby
