Speech to Text

The IBM Watson Speech to Text service uses speech recognition to convert Arabic, English, Spanish, French, Brazilian Portuguese, Japanese, and Mandarin speech into text.

Watson Speech to Text supports .wav, .opus, and .flac files up to 200 MB.
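The format and size limits can be checked locally before a file is uploaded. The following is a minimal sketch, not part of the service: the function name check_audio_file is hypothetical, and the 200 MB limit is assumed to mean 200 * 1024 * 1024 bytes.

```python
from pathlib import Path

# Taken from the text above: accepted extensions and the size ceiling.
SUPPORTED_EXTENSIONS = {".wav", ".opus", ".flac"}
# Assumption: "MB" is read here as 2**20 bytes; the page does not say which unit it uses.
MAX_BYTES = 200 * 1024 * 1024


def check_audio_file(path):
    """Return a list of problems; an empty list means the file looks acceptable.

    Hypothetical helper for illustration only. It checks the file name and
    size, not whether the audio content is actually valid.
    """
    p = Path(path)
    problems = []
    # Compare the extension case-insensitively (.FLAC is treated like .flac).
    if p.suffix.lower() not in SUPPORTED_EXTENSIONS:
        problems.append(f"unsupported extension: {p.suffix or '(none)'}")
    if not p.is_file():
        problems.append("file not found")
    elif p.stat().st_size > MAX_BYTES:
        problems.append(f"file is larger than {MAX_BYTES} bytes")
    return problems
```

An empty list from check_audio_file("interview.flac") would mean the file passes both checks; any other result names what to fix before uploading.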

Transcribe Audio

The returned result includes the recognized text, word alternatives, and spotted keywords. Some models can detect multiple speakers, but speaker detection may slow down transcription.
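As an illustration of how those three parts of a result might be pulled apart, here is a minimal sketch that works on an already-parsed JSON response. The field names (results, alternatives, transcript, word_alternatives, keywords_result) are assumptions about the response layout and should be checked against the service's API reference; summarize_result is a hypothetical helper, and the sample data is hand-written, not real service output.

```python
def summarize_result(response):
    """Collect transcript text, word alternatives, and keyword hit counts.

    Hypothetical helper: `response` is assumed to be a dict with a
    "results" list, each entry holding "alternatives", and optionally
    "word_alternatives" and "keywords_result".
    """
    transcript_parts = []
    word_alternatives = []
    keyword_hits = {}

    for result in response.get("results", []):
        # Use the first alternative as the recognized text for this segment.
        alternatives = result.get("alternatives", [])
        if alternatives:
            transcript_parts.append(alternatives[0].get("transcript", "").strip())

        # Each time slot lists candidate words that could fill it.
        for slot in result.get("word_alternatives", []):
            candidates = [alt["word"] for alt in slot.get("alternatives", [])]
            word_alternatives.append(candidates)

        # Count how many times each requested keyword was spotted.
        for keyword, matches in result.get("keywords_result", {}).items():
            keyword_hits[keyword] = keyword_hits.get(keyword, 0) + len(matches)

    return {
        "transcript": " ".join(transcript_parts),
        "word_alternatives": word_alternatives,
        "keywords": keyword_hits,
    }


# Hand-written sample in the assumed layout, for illustration only.
sample = {
    "results": [
        {
            "alternatives": [{"transcript": "hello world ", "confidence": 0.9}],
            "word_alternatives": [
                {
                    "start_time": 0.0,
                    "end_time": 0.4,
                    "alternatives": [
                        {"word": "hello", "confidence": 0.9},
                        {"word": "hollow", "confidence": 0.1},
                    ],
                }
            ],
            "keywords_result": {
                "world": [{"start_time": 0.5, "end_time": 0.9, "confidence": 0.8}]
            },
        }
    ]
}

summary = summarize_result(sample)
print(summary["transcript"])
print(summary["keywords"])
```

Keeping only the first alternative per segment is a simplification; a caller that needs lower-ranked hypotheses could keep the whole alternatives list instead.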
