ComfyUI-WhisperX

ComfyUI-WhisperX
★ 62

音频转写SRT字幕多说话人分离多引擎翻译
基于whisperX和translators的ComfyUI自定义节点,用于音频转写并导出带时间轴的SRT字幕,支持多说话人分离与多引擎翻译,一键在ComfyUI流程中生成字幕与翻译。
💡 将音频自动转写并导出带时间轴和翻译的SRT字幕文件
🍴 18 Forks💻 Python🔄 2025-04-01
📦
网盘下载
复制链接后前往夸克网盘下载
https://pan.quark.cn/s/b6135d9bd930
📦 requirements.txt
srt
faster-whisper
transformers
pandas
nltk
pyannote.audio
ffmpeg-python
translators
📄 README

ComfyUI-WhisperX

a comfyui cuatom node for audio subtitling based on whisperX and translators

Features

  • export srt file for subtitle was supported
  • translate was supported by translators with huge number engine
  • mutiple speaker diarization was supported by pyannote-audio
  • huge comfyui custom nodes can merge in whisperx
  • How to use

    make sure ffmpeg is worked in your commandline

    for Linux

    apt update
    apt install ffmpeg

    for Windows,you can install ffmpeg by WingetUI automatically

    then!

    git clone https://github.com/AIFSH/ComfyUI-WhisperX.git
    cd ComfyUI-WhisperX
    pip install -r requirements.txt

    weights will be downloaded from huggingface automatically! if you in china,make sure your internet attach the huggingface

    or if you still struggle with huggingface, you may try follow hf-mirror to config your env.

    to speaker diarization, you need!

  • Accept pyannote/segmentation-3.0 user conditions
  • Accept pyannote/speaker-diarization-3.1 user conditions
  • Create access token at hf.co/settings/tokens.
  • Tutorial

    Demo

    WeChat Group && Donate

    Thanks

  • whisperX
  • translators