Questions tagged [transcription]

160 questions
16
votes
3 answers

How to punctuate youtube transcripts?

On Youtube, I can download the CC transcript for a video but the transcript does not contain punctuation. How can I punctuate the transcript automatically?
William
  • 761
  • 2
  • 10
  • 27
8
votes
5 answers

Music Transcription of Wav files in Java

I have project about music transcription using Java, the thing is,, I have created an applet that records sound and saves it into a WAV file, the player should only play the piece he/she would like to transcribe, after that I am stuck at the point…
Xtremeaiy
  • 149
  • 1
  • 2
6
votes
0 answers

Why does the `Start-Transcript` cmdlet insert newlines even when I use Write-Host -NoNewline?

From Start-Transcript: The Start-Transcript cmdlet creates a record of all or part of a PowerShell session to a text file. The transcript includes all command that the user types and all output that appears on the…
VA systems engineer
  • 2,856
  • 2
  • 14
  • 38
6
votes
1 answer

Which romanization standard should be used to improve ICU4j transliteration for Arabic-Latin?

We have a requirement to transliterate Arabic text to Latin characters(without diacritical marks) and display them to users. We are currently using IBM ICU4j for this. The API doesn't trasliterate well the Arabic text into proper readable latin…
Kamlesh Sharma
  • 222
  • 1
  • 7
  • 23
6
votes
0 answers

Start-Transcript and Logging Batch File Output

I have a function in a PowerShell module which creates a log file and starts a transcript using that file (see below). When running a PowerShell script, this works great and captures all output. When running a PowerShell script which calls a batch…
Xangrim
  • 133
  • 1
  • 1
  • 9
6
votes
1 answer

Tracking HTML5 Video Closed Captions Visibility State

I am attempting to determine if the TextTrack element of a html video is currently showing or hidden. I took a look at the html specifications and at first glance, the texttracks.mode property would work perfectly according to:…
Modulo
  • 61
  • 5
5
votes
3 answers

Transcribing mp3 to text (python) --> "RIFF id" error

I am trying to turn mp3 file to text, but my code returns the error outlined below. Any help is appreciated! This is a sample mp3 file. And below is what I have tried: import speech_recognition as sr print(sr.__version__) r =…
5
votes
1 answer

Google Speech API Single Utterance

How does Google Speech API's SingleUtterance work? According to the docs, it is Google's way of determining when a speaker has spoken a single utterance. I understand what it does, but I would like to know how? Does the API simply wait for a certain…
Harry Stuart
  • 1,781
  • 2
  • 24
  • 39
5
votes
0 answers

How to automatically transcribe a Skype meeting, correctly attributed to each participant?

Assuming each participant agrees to the recording and transcription of the Skype call, is there a way to transcribe the meeting (either live or offline or both) such that it produces a text transcript where each spoken text is correctly attributed…
user3898238
  • 957
  • 3
  • 11
  • 25
4
votes
2 answers

Use CMU's sphinx4 to transcribe non-digits data

I am recently working on using CMU's sphinx4 for transcription and eventually forced alignment, i.e. aligning audio with its transcript. I found a project called AutoCap that basically did what I wanted to develop. So, I installed it but it did not…
4
votes
1 answer

Real time voice transcription using Dart

I have problem with real-time voice transcription. In my application people can make a video call each other according to their language choice, I want to show subtitles of conversations in real time to users. I used Agora SDK for video call…
Omer Ciftci
  • 917
  • 1
  • 9
  • 16
4
votes
0 answers

Boto3 does not support transcribe service

I'm using Boto3 package in order to transcribe audio files using aws transcribe service I'm following AWS official example but getting this error: botocore.exceptions.UnknownServiceError: Unknown service: 'transcribe' I'm using latest boto3 package…
YardenST
  • 5,119
  • 2
  • 33
  • 54
4
votes
1 answer

Ansi-coloured file editing

Context Have some transcript files from terminal interaction, obtained using traditional Unix command "script". Those transcripts contain lots of control character (like backspace when editing shell commands), and lots of color code sequences as…
Stéphane Gourichon
  • 6,493
  • 4
  • 37
  • 48
3
votes
2 answers

How to transcribe audio without using external APIs?

I would prefer not to use Amazon, Google etc, so how would I use my own computer (macOS) to get a time-stamped transcription of mp3s and videos? Preferably on the command line. So I could do something like this transcribe -o oliver_twist.srt…
cannyboy
  • 24,180
  • 40
  • 146
  • 252
3
votes
0 answers

Problem with reading entire audio in SpeechRecognition because of silent moments

I'm having problems with transcribing an entire audio with SpeechRecognition using Google recognizer API. Even though my audio is correctly read, just the first sentence of it is detected and transcribed. That's because my audio file has many…
1
2 3
10 11