ComfyUI-Muyan-TTS

ComfyUI-Muyan-TTS
★ 2

语音合成TTSComfyUI插件Muyan-TTS
在ComfyUI中集成Muyan-TTS,提供文本到语音的快速合成节点,简化语音合成流程并便于部署。
💡 在ComfyUI流水线中将文本快速转换为语音用于配音与原型测试。
🍴 1 Forks💻 Python🔄 2025-05-08
📦
网盘下载
复制链接后前往夸克网盘下载
https://pan.quark.cn/s/a9fb3a59e10c
📦 requirements.txt
transformers>=4.41.2,<=4.49.0,!=4.46.0,!=4.46.1,!=4.46.2,!=4.46.3,!=4.47.0,!=4.47.1,!=4.48.0
fire
sse-starlette
datasets<=3.2.0,>=2.16.0
accelerate<=1.2.1,>=0.34.0
peft<=0.12.0,>=0.11.1
trl>=0.8.6,<=0.9.6
tokenizers>=0.19.0,<=0.21.0
numpy>=1.23.4,<2.0.0
scipy
librosa>=0.9.2
sentencepiece
einops
tiktoken
av>=11
numba>=0.56.4
pytorch-lightning>2.0
torchaudio
ffmpeg-python
vllm==0.6.4.post1
nltk
wordsegment
g2p_en
gradio<=5.21.0,>=4.38.0
x_transformers
deepspeed>=0.9.3
matplotlib
tyro<0.9.0
modelscope
num2words
📄 README

ComfyUI-Muyan-TTS

Make Muyan-TTS avialbe in ComfyUI.

Muyan-TTS is a trainable TTS model designed for podcast applications within a $50,000 budget, which is pre-trained on over 100,000 hours of podcast audio data, enabling zero-shot TTS synthesis with high-quality voice generation. Furthermore, Muyan-TTS supports speaker adaptation with dozens of minutes of target speech, making it highly customizable for individual voices.

Installation

  • Make sure you have ComfyUI installed
  • Clone this repository into your ComfyUI’s custom_nodes directory:
  • cd ComfyUI/custom_nodes
    git clone https://github.com/Yuan-ManX/ComfyUI-Muyan-TTS.git

  • Install dependencies:
  • cd ComfyUI-Muyan-TTS
    pip install -r requirements.txt

    Model

    Model Download

    | Models | Links |

    |——-|——-|

    | Muyan-TTS | huggingface \| modelscope |

    | Muyan-TTS-SFT | huggingface \| modelscope |

    Additionally, you need to download the weights of chinese-hubert-base.