[whatwg] Web API for speech recognition and synthesis
bringert at google.com
Wed Dec 2 11:17:58 PST 2009
I agree that being able to capture and upload audio to a server would
be useful for a lot of applications, and it could be used to do speech
recognition. However, for a web app developer who just wants to
develop an application that uses speech input and/or output, it
doesn't seem very convenient, since it requires server-side
infrastructure that is very costly to develop and run. A
speech-specific API in the browser gives browser implementors the
option to use on-device speech services provided by the OS, or
server-side speech synthesis/recognition.
On Wed, Dec 2, 2009 at 6:23 PM, Diogo Resende <dresende at thinkdigital.pt> wrote:
> I missunderstood too. It would be great to have the ability to access
> the microphone and record+upload or stream sound to the web server.
> On Wed, 2009-12-02 at 10:04 -0800, Jonas Sicking wrote:
>> On Wed, Dec 2, 2009 at 9:17 AM, Bjorn Bringert <bringert at google.com> wrote:
>> > speech API intended for use by web apps. That is, only web apps that
>> > use the speech API would have speech support. But it should be
>> > possible to use such an API to write browser extensions (using
>> > Greasemonkey, Chrome extensions etc) that allow speech control of the
>> > browser and speech synthesis of web page contents. Doing it the other
>> > way around seems like it would reduce the flexibility for web app
>> > developers.
>> Hmm.. I guess I misunderstood your original proposal.
>> Do you want the browser to expose an API that converts speech to text?
>> Or do you want the browser to expose access to the microphone so that
>> If the former, could you describe your use cases in more detail?
>> / Jonas
Google UK Limited, Registered Office: Belgrave House, 76 Buckingham
Palace Road, London, SW1W 9TQ
Registered in England Number: 3977902
More information about the whatwg