ComfyUI-WhisperX

★ 62

音频转写SRT字幕多说话人分离多引擎翻译

基于whisperX和translators的ComfyUI自定义节点，用于音频转写并导出带时间轴的SRT字幕，支持多说话人分离与多引擎翻译，一键在ComfyUI流程中生成字幕与翻译。

💡 将音频自动转写并导出带时间轴和翻译的SRT字幕文件

🍴 18 Forks💻 Python🔄 2025-04-01

📦

网盘下载

复制链接后前往夸克网盘下载

https://pan.quark.cn/s/b6135d9bd930

📦 requirements.txt

srt
faster-whisper
transformers
pandas
nltk
pyannote.audio
ffmpeg-python
translators

📄 README

a comfyui cuatom node for audio subtitling based on whisperX and translators

Features

export srt file for subtitle was supported

translate was supported by translators with huge number engine

mutiple speaker diarization was supported by pyannote-audio

huge comfyui custom nodes can merge in whisperx

make sure ffmpeg is worked in your commandline

for Linux

apt update
apt install ffmpeg

for Windows,you can install ffmpeg by WingetUI automatically

then!

git clone https://github.com/AIFSH/ComfyUI-WhisperX.git
cd ComfyUI-WhisperX
pip install -r requirements.txt

weights will be downloaded from huggingface automatically! if you in china,make sure your internet attach the huggingface

or if you still struggle with huggingface, you may try follow hf-mirror to config your env.

to speaker diarization, you need!

Create access token at hf.co/settings/tokens.