[whatwg] HTML as a text format: Should <title> be optional?
Brian Campbell
Brian.P.Campbell at dartmouth.edu
Wed Sep 23 13:30:27 PDT 2009
On Jun 4, 2009, at 6:42 PM, Ian Hickson wrote:
> On Fri, 17 Apr 2009, Øistein E. Andersen wrote:
>>
>> A <title> is usually a good idea, but is it really necessary to
>> require
>> this for conformance? After all, a <title> is not something which an
>> author is likely to forget, and leaving it out has no unexpected
>> consequences.
>
> Leaving it out has a pretty important consequence, it breaks user
> interfaces that need to refer to the document, e.g. "bookmarks"
> features
> in browsers.
HTML documents sometimes occur in places in which they are not
independently bookmarkable, such as in an HTML email, or embedded in
an Atom or RSS feed.
They also sometimes occur in places in which it's generally not
expected that someone will bookmark them, nor particularly easy to do
so, such as within an iframe which is being used for a gadget or some
sort of sandboxed content that isn't really independent of the
containing page.
> On Sat, 18 Apr 2009, Randy Drielinger wrote:
>>
>> If you're converting from a textfile, title could refer to the
>> filename.
>>
>> If it's an automated process, it can be added by default.
>>
>> If it's manual, they'll have to remember the short html5 doctype
>> and the
>> title element.
>
> It does indeed seem easy to include it.
Yes, but sometimes you don't have a reasonable value for the title.
Say you are converting some word processing document format to HTML
(or spreadsheet, or slides, or anything of the sort); what would you
use if there isn't any sort of heading to take the title from?
There are several options (you could use the filename, or look for
anything heading like, or use the first couple of words), but if a
<title> element is missing, then the user agent could do the same.
> On Sat, 18 Apr 2009, Øistein E. Andersen wrote:
>>
>> It could, but chances are that the original filename would
>> typically be
>> less useful than the URL, which is what most browsers use when the
>> <title> element is omitted, so this rather sounds like an argument
>> against forcing authors to include a <title>.
>
> I don't see why this would be the case. In practice, however, if one
> is at
> a loss as to what to use for the <title>, but one has an <h1>, then I
> would recommend using the <h1>'s contents.
This has the problem of duplicating content, which may get out of
sync. And why can't the user agent just extract the title from the
heading if it needs it rather than the generator?
>> Yes, my concern is that a validator should be useful as an authoring
>> tool and not overwhelm the author with spurious errors. As I see it,
>> leaving out <title> is very much like leaving out a paragraph of text
>> and not something that should matter for validation.
>
> As it affects user interfaces, and since the cost of including a
> <title>
> is so low, I think it makes sense to continue to make it required.
As it's something that affects the user interface in a fairly visible
way, it's not likely that many authors will neglect to use it if it's
useful to do so. However, in some cases it is not possible to include
a title, or there isn't any really useful title to include, so I'm not
sure why it should be required in those cases.
This seems somewhat analogous to the <img alt> case; there are some
times when you don't have any useful text to put in there, and putting
in auto-generated placeholder text just to conform seems less than
useful. The number of pages unhelpfully titled "Untitled" or "Untitled
page" seems to confirm this.
There are other cases in which a title just isn't all that useful,
like an iframe; the title will be invisible, and it's pretty uncommon
for people to open that iframe in a new tab and then bookmark it from
there (unless maybe they happen to be web geeks like us). For example,
see Google Gadgets <http://www.google.com/webmasters/gadgets/>, or
iframe sandboxes used for isolating untrusted content while still
being inline in the page.
So, my recommendation is that <title> be made optional; perhaps a
validator could issue a warning if you leave it off, but there are
perfectly valid cases of wanting to produce an HTML document that
doesn't have any sort of meaningful title or for which a title will
never be seen or used, it doesn't seem likely that people will forget
it in cases in which it's useful, and right now it is sometimes being
filled with useless values like "Untitled" that actually get in the
way of a UA computing a better value (such as the URL or the top level
heading).
-- Brian Campbell
More information about the whatwg
mailing list