Faster Whisper + Silero VAD + DeepL

for vocals extraction, replace spleeter with demucs, inspired from: https://github.com/EtienneAb3d/WhisperHallu

several enhancements for Japanese from somewhere on internet

How to use

to open the notebook in Google Colab.
Run the Setup Whisper cell.
Upload your input audio to either the runtime itself, Google Drive, or a file hosting service with direct download links.
Set the audio_path and language variables, and then run the Run Whisper cell. (Note: Audio path is set automatically if you use the Upload cell)
Once it's done, the notebook will automatically download the generated SRT file.

~~bonus: youtube version inspired from https://github.com/ArthurFDLR/whisper-youtube~~ (currently not working)

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
LICENSE		LICENSE
README.md		README.md
WhisperWithVAD.ipynb		WhisperWithVAD.ipynb
whisper_youtube.ipynb		whisper_youtube.ipynb