I found this so far: https://github.com/KoljaB/RealtimeSTT
Maybe I can modify it to use whisper api.
I found this so far: https://github.com/KoljaB/RealtimeSTT
Maybe I can modify it to use whisper api.
Dunno, but this guy (all about ai) builds one with 'faster-whisper', so perhaps you can get a few pointers there? I believe he chunks the Audio on silence. He have a few other speech2x videos. Have fun. https://youtu.be/k6nIxWGdrS4
Here is an alternative Piped link(s):
https://piped.video/k6nIxWGdrS4
Piped is a privacy-respecting open-source alternative frontend to YouTube.
I'm open-source; check me out at GitHub.
Just stumbled upon this speedy one: https://github.com/sanchit-gandhi/whisper-jax
And this one for word precision time marks: https://github.com/m-bain/whisperX
Don't have knowledge to answer your question but you could check how home assistant does it, I think that should point you to the right direction.
All about open source! Feel free to ask questions, and share news, and interesting stuff!
Community icon from opensource.org, but we are not affiliated with them.