How does Google Speech API's SingleUtterance
work? According to the docs, it is Google's way of determining when a speaker has spoken a single utterance. I understand what it does, but I would like to know how? Does the API simply wait for a certain duration of "speechless" audio? If so, how long a duration of voiceless audio will trigger the end of an utterance?
Does it have some other sort of AI algorithm that helps determine when someone has stopped speaking?
Thanks