OpenAI Whisper is returning the transcription in English instead of the native language

Question

When I use the OpenAI Whisper model on Hindi audio, it returns the transcription in English instead of Hindi.

How do I get the output in Hindi itself? Is there a setting that can be changed?

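import whisper
model = whisper.load_model("base")  # model size is not stated in the question; "base" is a placeholder
audio = whisper.pad_or_trim(whisper.load_audio("audio.mp3"))  # placeholder path; pads or trims to 30 s
mel = whisper.log_mel_spectrogram(audio).to(model.device)
options = whisper.DecodingOptions(language="hi")  # ask the decoder for Hindi
result = whisper.decode(model, mel, options)
print(result.text)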

Result:

If you are using the `base` model, try `large` or `medium`. The wrong-script issue for Hindi has already been reported here: https://github.com/openai/whisper/discussions/118 – Gokul NC, Oct 08 '22 at 14:40
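To check whether a given model size really fixes the script problem mentioned in the comment above, it can help to measure how much of the output is Devanagari. The following is only a rough sketch: `devanagari_ratio` and `compare_models` are hypothetical helper names, the model names passed to `compare_models` are just examples, and the ratio only looks at letters (combining vowel signs are ignored).

```python
def devanagari_ratio(text: str) -> float:
    """Fraction of alphabetic characters that lie in the Devanagari block (U+0900..U+097F)."""
    letters = [ch for ch in text if ch.isalpha()]
    if not letters:
        return 0.0
    in_block = sum(1 for ch in letters if "\u0900" <= ch <= "\u097F")
    return in_block / len(letters)


def compare_models(file_path: str, names=("base", "medium")) -> dict:
    """Hypothetical helper: transcribe one file with several model sizes
    and report the Devanagari ratio of each result."""
    import whisper  # imported lazily; requires the openai-whisper package

    ratios = {}
    for name in names:
        model = whisper.load_model(name)
        text = model.transcribe(file_path, language="hi")["text"]
        ratios[name] = devanagari_ratio(text)
    return ratios
```

A ratio close to 0 suggests the model produced Latin-script (English or romanized) text, while a ratio close to 1 suggests Hindi in its native script.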

Parzival · Answer 1 · 2023-04-29T15:15:07.100


I recommend loading your model as, for example, `base.hi`.

Edit: Here's an example:

model = whisper.load_model('base.hi')
result = model.transcribe(file_path)
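One caveat: as far as I can tell, `whisper.available_models()` lists only multilingual checkpoints and English-only `.en` variants, so a name such as `base.hi` may be rejected by `load_model`. A possible alternative, sketched here under that assumption, is to load a multilingual model and pass the language and task explicitly to `transcribe`. The helper names `hindi_options` and `transcribe_hindi` and the default `"medium"` are illustrative choices, not part of the original answer.

```python
def hindi_options(fp16: bool = False) -> dict:
    """Decoding options that request Hindi output rather than an English translation."""
    return {
        "language": "hi",      # skip language auto-detection
        "task": "transcribe",  # "translate" would produce English text instead
        "fp16": fp16,          # False avoids the half-precision warning on CPU
    }


def transcribe_hindi(file_path: str, model_name: str = "medium") -> str:
    """Hypothetical helper: transcribe an audio file as Hindi with a multilingual model."""
    import whisper  # imported lazily; requires the openai-whisper package

    model = whisper.load_model(model_name)
    result = model.transcribe(file_path, **hindi_options())
    return result["text"]
```

Calling `transcribe_hindi(file_path)` then returns the transcription text; whether it comes out in Devanagari can still depend on the model size.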

edited Apr 29 '23 at 15:15

answered Apr 29 '23 at 14:14


You might want to explain why you recommend that, how it is supposed to help. – Yunnosch Apr 29 '23 at 14:23
Thanks, added a more elaborate example. – Parzival Apr 29 '23 at 15:15
Thanks. But an example neither explains why you recommend it nor how it is supposed to help. – Yunnosch Apr 30 '23 at 17:07
