[whatwg] [html5] tags, elements and generated DOM
Ian Hickson
ian at hixie.ch
Fri Feb 24 15:06:54 PST 2006
On Thu, 7 Apr 2005, Henri Sivonen wrote:
>
> If you mean: Is it worth going down the road of requiring getElementById
> to support arbitrary characters in ids?
>
> Perhaps not. Depends on what is implemented.
I expect the spec to (when I get to gEBI) say that all characters must be
supported. It's easier to implement, for one.
> If you mean: Is it worth going down the road of requiring getElementById
> to consider idness based on factors other that the ID type in a DTD?
>
> Definitely yes. Browsers don't process DTDs and the W3C already allows
> the DOM impl to use info other than the DTD to figure out which
> attributes count as ids.
Yeah, that's already water long under the bridge.
On Thu, 7 Apr 2005, Henri Sivonen wrote:
>
> I don't think SGML validation is part of What WG conformance
> requirements. I thought Hixie has specifically said he doesn't bother
> with DTDs.
Indeed. WHATWG will happily host schemas and DTDs if people write them
(see, e.g., syntax.whatwg.org; fantasai maintains that). However, none of
them will be given any sort of preferential status. They're just
implementations.
On Thu, 7 Apr 2005, Henri Sivonen wrote:
>
> I am very hostile towards the idea of requiring UAs to implement any XML
> parsing features that are in the realm of the XML 1.0 spec but that the
> XML 1.0 spec does not require. This means processing the DTD beyond
> checking the internal subset for well-formedness.
>
> I would rather suggest that What WG specs explicitly discourage people
> from using a doctype on the XHTML side and point out that authors should
> not expect UAs to process the DTD.
>
> Those who want to use entities for input, should parse and reserialize
> as UTF-8 in their own lair and not expose their entity references (or
> parochial legacy encodings) to the public network.
The spec has text to this effect in places now; let me know if you have
more specific text you'd like to see. I don't want to be too strong, since
if you're using XML, exactly how you do so is the problem of the XML spec,
not the Web Apps / XHTML5 spec.
On Thu, 7 Apr 2005, Henri Sivonen wrote:
>
> Ideally, UAs would know nothing of that particular doctype and would
> trigger the standards mode because there is a doctype that is not on the
> list of doctypes that triggers the quirks mode or the almost standards
> mode.
Currently the spec says you trigger quirks mode if there isn't a <!DOCTYPE
HTML>, and you feel like it (both conditions must be met). I may make this
more specific in due course; do people think we should?
I'm also tempted to just codify many parts of quirks mode; do people want
to do that instead?
> > Or at the very least use something that would not confuse people into
> > thinking that it is an application of SGML or XML.
>
> Do you want to replace "NONSGML" with "THIS-IS-NOT-SGML"?
Well, we're now down to just <!DOCTYPE HTML>, since that was the shortest
I could make it while still triggering strict mode (well, <!DOCTYPEHTML>
worked too, but that just looks silly).
Suggestions on shortening it are welcome.
I really wish we could ditch the DOCTYPE altogether. The number of pages
that use quirks mode is just sad.
On Fri, 8 Apr 2005, Henri Sivonen wrote:
> On Apr 8, 2005, at 11:05, Jim Ley wrote:
>
> > The proposed string that MUST appear as the first line of a WHAT-WG
> > document is... please do not call it a doctype unless it is a doctype,
> > see even people on the list are confused by using this!
>
> Well, it could be defined as a tag soup construct called "doctype",
> which is neither an SGML doctype nor an XML doctype. :-)
Indeed, it's a DOCTYPE token, and it generates (in the right place) a
DOCTYPE DOM node. It's not an XML or SGML DOCTYPE per se...
--
Ian Hickson U+1047E )\._.,--....,'``. fL
http://ln.hixie.ch/ U+263A /, _.. \ _\ ;`._ ,.
Things that are impossible just take longer. `._.-(,_..'--(,_..'`-.;.'
More information about the whatwg
mailing list