Web Speech API

The Web Speech API is not consistently supported in all popular browsers but the simplicity with which it makes conversion of speech to text is very impressive.

The Web Speech API has two parts: SpeechSynthesis (Text-to-Speech), and SpeechRecognition (Asynchronous Speech Recognition)

Samples from Google and Mozilla show proof of the rich possibilities

On Chrome, using Speech Recognition on a web page involves a server-based recognition engine.

Pages hosted on HTTPS do not need to ask repeatedly for permission to use the microphone, whereas HTTP hosted pages do.

The API also supports multiple global languages

There is a caveat when using this API: it only works when you have an active internet connection. The reason for this is that the API doesn’t do its magic inside the browser. It will send a recording coming from your microphone to a server where it will analyse the sound and try to recognise the words being said. Once that server recognises some words or sentences, it will respond with a string of text

Some sample apps:
Hisaab
Dictation
HTML5 API demos

Comments