Speech-to-text in Obsidian using OpenAI Whisper Service

The same people who brought us chatGPT have also made an AI speech-to-text service that is rather amazing. It understands multiple languages, cost-effective & fast.

In this article, I profile an Obsidian plugin by https://twitter.com/nikdanilov using the Whisper Service

3 Likes

This is a really fantastic plugin! Thanks for this. Would you have any idea how I can transcribe longer recordings I already made in .mp4 format?

Good question. I would suggest posting it as a feature request at the developer’s GitHub repo for this plugin: Issues · nikdanilov/whisper-obsidian-plugin · GitHub

Another option is this plugin, which seems to do what you want but I am unsure if there are additional charges. GitHub - djmango/obsidian-transcription: Obsidian plugin to create high-quality transcriptions from markdown linked audio files.

I hope you find a good solution, I see a few have been interested in the same thing.

1 Like

Many thanks! I will post the feature request.

regarding the /obsidian-transcription: This requires signing up to https://scribe.gambitengine.com which I do not exactly understand why this is required as direct use of OpenAI API is possible to access Whisper (I just don’t know how).

1 Like

You don’t know how to do what?

Many thanks for your question. I did not know how to upload files to Whisper directly using my personal API.

In the meantime, I have found it amazingly easy to install Whisper locally on my Mac and to run transcriptions in Terminal via this instruction. Install Whisper.cpp on your Mac in 5mn and transcribe all your podcasts for free!.

I would love to see this as an Obsidian plugin, but in the mean time have found a good alternative solution.

1 Like