Questions tagged [speech-synthesis]

Speech synthesis is the artificial production of human speech.

Speech synthesis is the artificial production of human speech. A computer system used for this purpose is called a speech synthesizer, and can be implemented in software or hardware. A text-to-speech (TTS) system converts normal language text into speech; other systems render symbolic linguistic representations like phonetic transcriptions into speech.

Synthesized speech can be created by concatenating pieces of recorded speech that are stored in a database. Systems differ in the size of the stored speech units; a system that stores phones or diphones provides the largest output range, but may lack clarity. For specific usage domains, the storage of entire words or sentences allows for high-quality output. Alternatively, a synthesizer can incorporate a model of the vocal tract and other human voice characteristics to create a completely "synthetic" voice output.

The quality of a speech synthesizer is judged by its similarity to the human voice and by its ability to be understood. An intelligible text-to-speech program allows people with visual impairments or reading disabilities to listen to written works on a home computer. Many computer operating systems have included speech synthesizers since the early 1990s.

For more info, take a look at: http://en.wikipedia.org/wiki/Speech_synthesis

442 questions
62
votes
13 answers

Getting the list of voices in speechSynthesis (Web Speech API)

Following HTML shows empty array in console on first click:
Mehdi
  • 4,202
  • 5
  • 20
  • 36
58
votes
12 answers

Chrome Speech Synthesis with longer texts

I am getting a problem when trying to use Speech Synthesis API in Chrome 33. It works perfectly with a shorter text, but if I try longer text, it just stops in the middle. After it has stopped once like that, the Speech Synthesis does not work…
Andrey Shchekin
  • 21,101
  • 19
  • 94
  • 162
34
votes
9 answers

SpeechSynthesis API onend callback not working

I'm using the Speech Synthesis API on Google Chrome v34.0.1847.131. The API is implemented in Chrome starting in v33. The text-to-speech works for the most part, except when assigning a callback to onend. For instance, the following code: var…
27
votes
9 answers

pyttsx: No module named 'engine'

I'm trying to install TTS package by using this. Everything was okay until I tried to execute the following command: import pyttsx I got back this error: File "/usr/local/lib/python3.4/dist-packages/pyttsx/__init__.py", line 18, in module
26
votes
1 answer

SpeechSynthesis.speak (in Web Speech API) always stops after a few seconds in Google Chrome

When using the speak function in the Web Speech API, in Chrome the speaking stops abruptly after a few seconds, in the middle of the text given to it, in a seemingly random place (without reaching the end). This only happens in Chrome (works well on…
25
votes
1 answer

speechSynthesis.getVoices() returns empty array on Windows

I am making a Chrome Extension, in which I am using Speech Synthesis. When I type speechSynthesis.getVoices() in the console I get an Array of 21 different voices. Great! When I console.log() the same line within my javascript code, I get an Empty…
Melvin Abraham
  • 2,870
  • 5
  • 19
  • 33
21
votes
3 answers

Google Cloud Text-to-speech word timestamps

I'm generating speech through Google Cloud's text-to-speech API and I'd like to highlight words as they are spoken. Is there a way of getting timestamps for spoken words or sentences?
user2248702
  • 2,741
  • 7
  • 41
  • 69
21
votes
7 answers

JavaScript speechSynthesis.speak() without user activation is no longer allowed since M71

I used speechSynthesis API in this way: speechSynthesis.speak(new SpeechSynthesisUtterance("hello world")); But right now I get error after update Google Chrome: [Deprecation] speechSynthesis.speak() without user activation is no longer allowed…
stepozer
  • 1,143
  • 1
  • 10
  • 22
20
votes
2 answers

Android Text-To-Speech API Sounds Robotic

I'm learning android development for the first time and my goal is to create a simple Hello World application that takes in some text, and reads them out loud. I've based my code off an example I found and here's my code: class MainFeeds :…
20
votes
2 answers

Can Web Speech API be used in conjunction with Web Audio API?

Is it possible to use the synthesised speech from Web Speech API as a SourceNode inside Web Audio API's audio context?
zya
  • 830
  • 11
  • 25
19
votes
1 answer

Why does a SpeechSynthesisUtterance sometimes not fire an 'end' event in Chromium-based browsers?

In both Chrome (v72, W10) and Opera, the following snippet very occasionally does not seem to run the attached end listener to the SpeechSynthesisUtterance, maybe 1 out of 50 times the snippet is run. (Sorry, in the original version of this, it…
Snow
  • 3,820
  • 3
  • 13
  • 39
19
votes
4 answers

The right way to use SSML with Web Speech API

Web Speech API specification says: text attribute This attribute specifies the text to be synthesized and spoken for this utterance. This may be either plain text or a complete, well-formed SSML document. For speech synthesis engines that…
Andrey Shchekin
  • 21,101
  • 19
  • 94
  • 162
17
votes
5 answers

System.Speech.Synthesis hangs with high CPU on 2012 R2

I have an asp.net MVC application that has a controller action that takes a string as input and sends a response wav file of the synthesized speech. Here is a simplified example: public async Task Speak(string text) { …
16
votes
1 answer

Is there a way to convert speech directly into SSML?

Just as one is able to use various speech-to-text 'dictation' tools to convert spoken word into its corresponding text, I would like to know if there are similar such tools for converting spoken word into its corresponding SSML. That is, it will…
16
votes
6 answers

Google's text-to speech engine voices?

Most of you probably know the text-to-speech synthesizer of google translate, as you can access programmatically here btw: http://translate.google.com/translate_tts?tl=en&q=text My impression was it's sometimes using espeak, but in the major…
1
2 3
29 30