[whatwg] Article: Growing pains afflict HTML5 standardization

Nils Dagsson Moskopp nils-dagsson-moskopp at dieweltistgarnichtso.net
Mon Jul 12 06:39:20 PDT 2010


Mike Wilcox <mike at mikewilcox.net> schrieb am Mon, 12 Jul 2010 07:44:07
-0500:

> That's a little different. Google purposely uses unstandardized,
> incorrect HTML in ways that still render in a browser in order to
> make it more difficult for screen scrapers. They also "break it" in a
> different way every week.

Assuming this is true (which I find difficult to believe), wouldn't a
screen scraper based on the HTML5 parsing algorithm defeat this
purpose ?

Greetings,
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 230 bytes
Desc: not available
URL: <http://lists.whatwg.org/pipermail/whatwg-whatwg.org/attachments/20100712/21a6c0d0/attachment-0002.pgp>


More information about the whatwg mailing list