[whatwg] Speech input element

Doug Schepers doug at schepers.cc
Tue May 18 10:44:42 PDT 2010


Hi, Bjorn-

Bjorn Bringert wrote (on 5/17/10 9:05 AM):
> Back in December there was a discussion about web APIs for speech
> recognition and synthesis that saw a decent amount of interest
> (http://lists.whatwg.org/pipermail/whatwg-whatwg.org/2009-December/thread.html#24281).
> Based on that discussion, we would like to propose a simple API for
> speech recognition, using a new<input type="speech">  element. An
> informal spec of the new API, along with some sample apps and use
> cases can be found at:
> http://docs.google.com/Doc?docid=0AaYxrITemjbxZGNmZzc5cHpfM2Ryajc5Zmhx&hl=en.
>
> It would be very helpful if you could take a look and share your
> comments. Our next steps will be to implement the current design, get
> some feedback from web developers, continue to tweak, and seek
> standardization as soon it looks mature enough and/or other vendors
> become interested in implementing it.

This is important work, thanks for taking it on and bringing it to a 
wider discussion forum.  Here's a couple of other venues you might also 
consider discussing it, above and beyond discussion on the WHATWG list:

* W3C just launched a new Audio Incubator Group (Audio XG), as a forum 
to discuss various aspects of audio on the Web.  The Audio XG is not 
intended to produce Recommendation-track specifications like this 
(though they will likely prototype and write a draft spec for a 
read-write audio API), but it could serve a role in helping work out use 
cases and requirements, reviewing specs, and so forth.  I'm not totally 
sure that this is relevant to your interests, but I thought I would 
bring it up.

* The Voice Browser Working Group is very interested in bringing their 
work and experience into the graphical browser world, so you should work 
with them or get their input.  As I understand it, some of them plan to 
join the Audio XG, too (specifically to talk about speech synthesis in 
the larger context), so that might be one forum to have some 
conversations.  VoiceXML is rather different than X/HTML or the browser 
DOM, and the participants in the VBWG don't necessarily have the right 
experience in graphical browser approaches, so I think there's an 
opportunity for good conversation and cross-pollination here.

[1] http://www.w3.org/2005/Incubator/audio/
[2] http://www.w3.org/Voice/

Regards-
-Doug



More information about the whatwg mailing list