The problems I often have with these are:
- they transcribe only your own audio recordings, and only once you put them in a note.
- they deliver a full transcription (usually more than I need, which clutters my notes with unnecessary text).
I want to transcribe on the go while listening to podcasts/clips, and only the part that interests me (the last x seconds).
On Android, you can use Momento for podcast clip-to-text transcription.
Pressing a button transcribes the last 20 seconds right away. The transcription can then be shared with the Obsidian app to create a note from it.
As a podcast player I find it a bit limited, but it has a Spotify interface, and one for readwise.io as well.