Audio file to text file python

Question

I want to convert an audio file (e.g. ".mp3") to a text file. I have tried different approaches such as pyspeech and speech recognition, but neither gave me a result. Is there another way to do this? Any help would be appreciated!

Possible duplicate: https://stackoverflow.com/questions/12455069/how-to-input-and-process-audio-files-to-convert-to-text-via-pyspeech-or-dragonfl unfortunately without a valid answer. — ρss, Sep 15 '15 at 13:22
@ρss, Thanks for mentioning but there is also no solid answer — Mulagala, Sep 15 '15 at 13:24

score 6 · Answer 1 · answered Sep 15 '15 at 15:25

Did you try https://pypi.python.org/pypi/SpeechRecognition/ ? That sounds like exactly what you want.

I also found the CMU Sphinx project via this blog. It has Python bindings too (as mentioned in the article).

The other item I found was Google's Speech to Text API. You might want to check that out too. Here's a decent tutorial on this subject:

http://codeabitwiser.com/2014/09/python-google-speech-api/

Mike Driscoll

SpeechRecognition looks like it will do what you want. Note that most of its engines are external speech-to-text services on the web, so you'll generally need internet access and, for most services, an API key. Textract (http://textract.readthedocs.org/en/latest/) provides a simple, uniform interface for text extraction and uses SpeechRecognition for handling audio files. – ViennaMike Dec 23 '15 at 18:18

score 2 · Answer 2 · answered Sep 11 '19 at 05:47

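import speech_recognition as sr

print(sr.__version__)
r = sr.Recognizer()

# AudioFile reads WAV, AIFF, or FLAC; convert an .mp3 to one of these first.
file_audio = sr.AudioFile('file_audio.wav')

# Load the whole file into an AudioData object.
with file_audio as source:
    audio_text = r.record(source)

print(type(audio_text))
# Send the audio to Google's web speech API (needs internet) and print the transcript.
print(r.recognize_google(audio_text))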
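
The snippet above expects a WAV file, while the question is about ".mp3". A minimal sketch of the full path from MP3 to a text file could look like the following. It assumes pydub (with ffmpeg installed) for the conversion and SpeechRecognition's Google web recognizer for the transcript; the function names and the file name "file_audio.mp3" are hypothetical. The language argument may also help with the Portuguese question in the comments (for example "pt-BR"), though I have not tested recognition quality.

```python
from pathlib import Path


def text_path_for(audio_path):
    """Return the .txt path that sits next to the given audio file."""
    return Path(audio_path).with_suffix(".txt")


def mp3_to_wav(mp3_path):
    """Convert an MP3 to WAV with pydub (assumes ffmpeg is installed)."""
    from pydub import AudioSegment  # third-party: pip install pydub

    wav_path = Path(mp3_path).with_suffix(".wav")
    AudioSegment.from_mp3(str(mp3_path)).export(str(wav_path), format="wav")
    return wav_path


def transcribe(wav_path, language="en-US"):
    """Send the WAV audio to Google's web recognizer (needs internet)."""
    import speech_recognition as sr  # third-party: pip install SpeechRecognition

    recognizer = sr.Recognizer()
    with sr.AudioFile(str(wav_path)) as source:
        audio = recognizer.record(source)  # read the whole file
    return recognizer.recognize_google(audio, language=language)


def audio_to_text_file(mp3_path, language="en-US"):
    """Convert, transcribe, and write the transcript next to the MP3."""
    text = transcribe(mp3_to_wav(mp3_path), language=language)
    out_path = text_path_for(mp3_path)
    out_path.write_text(text, encoding="utf-8")
    return out_path


# Example call (hypothetical file name):
# audio_to_text_file("file_audio.mp3")
```

The third-party imports sit inside the functions so the file still loads when only one of the two libraries is installed. For long recordings it is probably safer to split the audio into chunks before sending it to a web service.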

kamran kausar

Do you know if there is a workaround for other languages (Portuguese) using Sphinx? – Nov 22 '19 at 14:39

score 0 · Answer 3 · answered Sep 21 '21 at 06:46

Way 1: convert the audio file to raw bytes (0s and 1s), either with https://github.com/jiaaro/pydub or directly with f = open("test.mp3", "rb"); first16bytes = f.read(16)

Way 2: speech-to-text converters, e.g. transcribe to English or another language with pip libraries such as SpeechRecognition and pydub (but I think this isn't what you asked for).

Way 3: convert the mp3 to JSON. If anyone has done this, please share.
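
To make way 1 concrete, here is a small sketch using only the standard library. It reads the first 16 bytes and does a rough check for an MP3 header (an ID3v2 tag or an MPEG frame sync). The function names are hypothetical, and the check is a heuristic rather than a full format validator. Note that these bytes are encoded audio, not spoken words, so this alone does not produce a transcript.

```python
def read_header(path, size=16):
    """Return the first `size` raw bytes of a file."""
    with open(path, "rb") as f:
        return f.read(size)


def looks_like_mp3(header):
    """Rough heuristic: ID3v2 tag, or an MPEG frame sync (11 set bits)."""
    if header[:3] == b"ID3":
        return True
    return len(header) >= 2 and header[0] == 0xFF and (header[1] & 0xE0) == 0xE0


# Example call (hypothetical file name):
# print(looks_like_mp3(read_header("test.mp3")))
```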
