ComfyUI-IF_AI_WishperSpeechNode

★ 43

文本转语音Whisper即时训练快速推理

ComfyUI 节点，基于 Whisper 的 TTS，支持用短音频即时训练自定义语音并快速推理，支持可选 torch_Compile 加速，提高训练与合成效率。

💡 用短录音在ComfyUI中快速训练并生成自定义语音。

🍴 13 Forks💻 Python🔄 2025-03-09

🔗 GitHub 原文

📦

网盘下载

复制链接后前往夸克网盘下载

https://pan.quark.cn/s/9671236b7e59

📦 requirements.txt

requests
scipy
nltk
WhisperSpeech
librosa
huggingface_hub
webdataset

📄 README

ComfyUI-IF_AI_WishperSpeechNode

A Convenient and Fast Text-to-Speech Application with Whisper Speech

This repository hosts a Text-to-Speech (TTS) application that leverages Whisper Speech for voice synthesis, allowing users to train a voice model on-the-fly. It is built on ComfyUI and supports rapid training and inference processes.

Features

On-the-fly Voice Training: Train a custom voice model using a short audio recording.

Fast Inference: Optional support for torch_Compile to enhance performance during inference and training.

Installation

Git clone the repository to your custom_nodes folder

pip install -r requirements.txt

IF dlib troubles try this workarounds

DEDICATED ENV

1-.VIA PIP

activate env

“`bash

pip install cmake

pip install dlib

“`

2-. VIA CLONING DLIB REPO

“`

git clone https://github.com/davisking/dlib.git

cd dlib

“`

activate env

“`

python.exe setup.py install

“`

if nothing works try this with the terminal as admin

3-.VIA CONDA PKG

1-.Activate the env

on conda env

conda install -c conda-forge dlib

on micromamba env

micromamba install -c conda-forge dlib

Portable ENV

1-.VIA PIP

Open terminal as admin

“`

H:\ComfyUI_windows_portable\python_embeded\python.exe -m pip install cmake

H:\ComfyUI_windows_portable\python_embeded\python.exe -m pip install dlib

“`

2-. VIA CLONING DLIB REPO

“`

git clone https://github.com/davisking/dlib.git

cd dlib

H:\ComfyUI_windows_portable\python_embeded\python.exe setup.py install

“`