comfyui-kokoro

★ 67

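Text-to-speech, Kokoro-onnx, speaker synthesis, ComfyUI nodes

Custom nodes that integrate Kokoro TTS into ComfyUI, calling kokoro-onnx for text-to-speech. They can generate and merge speaker styles and embeddings, which makes it easy to synthesize new voices quickly and produce speech output in batches.

💡 Quickly generate and merge custom speaker voices in ComfyUI.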

🍴 13 Forks · 💻 Python · 🔄 2026-02-23

📦 requirements.txt

kokoro-onnx==0.3.6
onnxruntime
numpy>=2.0.2
requests
tqdm

📄 README

Comfy UI Kokoro

Kokoro TTS nodes for ComfyUI, wrapping kokoro-onnx, which is based on hexgrad/Kokoro-82M.

Note: the picture in the README is also a workflow; download it and drop it into ComfyUI.

Install

Install via ComfyUI Manager (author: stavsap).

To install manually, clone the repo into ComfyUI's custom_nodes folder:

git clone https://github.com/stavsap/comfyui-kokoro.git

Then cd into comfyui-kokoro and install the requirements:

pip install -r requirements.txt

Finally, restart ComfyUI.

The ONNX model and speaker metadata are downloaded automatically on the first run.
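The first-run download described above can be pictured as a "fetch only if missing" step. The following is a minimal sketch, not the repository's actual code: the function name `ensure_downloaded` is hypothetical, the URL and destination path are supplied by the caller, and it uses the standard library's `urllib` instead of the `requests` and `tqdm` packages listed in requirements.txt.

```python
import shutil
import urllib.request
from pathlib import Path


def ensure_downloaded(url: str, dest: Path) -> bool:
    """Download url to dest unless dest already exists.

    Returns True if a download happened, False if the existing file was reused.
    (Hypothetical helper for illustration only.)
    """
    dest = Path(dest)
    if dest.exists():
        # A previous run already fetched the file; skip the download.
        return False

    dest.parent.mkdir(parents=True, exist_ok=True)

    # Write to a temporary ".part" file first so an interrupted download
    # is never mistaken for a complete model file on the next run.
    tmp = dest.with_name(dest.name + ".part")
    with urllib.request.urlopen(url) as response, open(tmp, "wb") as out:
        shutil.copyfileobj(response, out)
    tmp.replace(dest)
    return True
```

Calling the helper a second time with the same destination returns False without touching the network, which is the behaviour that makes only the first run slow.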

If you are using the Windows portable version and run into dependency issues, see this video: https://youtu.be/O6qYvUN7vIc?si=loCoMD5ou2eeGL30

Nodes

Currently, there are 3 nodes that can be combined into a TTS workflow.

Kokoro Speaker

Selects one of the supported speakers.

Kokoro Speaker Combiner

Combines two given speakers into a new speaker.

weight: a value in the range [0, 1] that sets the weight of speaker_a.

Example:

A weight of 0.7 yields a blend of 70% speaker_a and 30% speaker_b.
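The weighting rule above can be sketched as a linear blend. This is an illustration under stated assumptions, not the node's actual implementation: it assumes each speaker is represented as a NumPy array of the same shape, and the function name `combine_speakers` is hypothetical.

```python
import numpy as np


def combine_speakers(speaker_a: np.ndarray, speaker_b: np.ndarray, weight: float) -> np.ndarray:
    """Linearly blend two speaker arrays (hypothetical helper).

    weight is the share of speaker_a; speaker_b receives 1 - weight.
    """
    if not 0.0 <= weight <= 1.0:
        raise ValueError("weight must be between 0 and 1")
    if speaker_a.shape != speaker_b.shape:
        raise ValueError("speaker arrays must have the same shape")
    return weight * speaker_a + (1.0 - weight) * speaker_b


# Toy 4-dimensional "speakers"; real speaker arrays are much larger.
speaker_a = np.array([1.0, 0.0, 2.0, 4.0])
speaker_b = np.array([0.0, 1.0, 2.0, 0.0])

# 70% speaker_a, 30% speaker_b
blended = combine_speakers(speaker_a, speaker_b, weight=0.7)
print(blended)
```

With weight set to 1 the result equals speaker_a, and with weight set to 0 it equals speaker_b, which matches the range described for the node.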

Kokoro Generate

speaker: the input speaker.

speed: sets the speech speed.

lang: sets the language; any language supported by Kokoro can be used.
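The three inputs above correspond closely to the arguments of kokoro-onnx's `Kokoro.create` call. The sketch below shows one way they could be forwarded; it is an assumption about how such a wrapper might look, not the node's real code. The helper names `load_engine` and `generate_speech` are hypothetical, and the model and voices file paths must be supplied by the caller.

```python
def load_engine(model_path: str, voices_path: str):
    """Create a kokoro-onnx engine from local model and voices files.

    The import is deferred so that generate_speech below can be used
    with any object that exposes a compatible create() method.
    """
    from kokoro_onnx import Kokoro

    return Kokoro(model_path, voices_path)


def generate_speech(engine, text: str, speaker, speed: float = 1.0, lang: str = "en-us"):
    """Forward node-style inputs to engine.create (hypothetical helper).

    Returns the (samples, sample_rate) pair produced by the engine.
    """
    if not text.strip():
        raise ValueError("text must not be empty")
    samples, sample_rate = engine.create(text, voice=speaker, speed=speed, lang=lang)
    return samples, sample_rate
```

In use, the engine would be created once with `load_engine` and reused for every call to `generate_speech`. The `speaker` argument is passed through unchanged, so it can be whatever the engine accepts as a voice.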

Available Voices

All supported voices can be found here.

Use Cases:

TTS: Text To Speech, generate voice from text.

Lip Sync: sync the lips in videos.

License

This repo

kokoro-onnx: MIT

kokoro model: Apache 2.0

Credits

Kokoro TTS Engine

ComfyUI

ComfyUI-BS_Kokoro-onnx

ComfyUI-KokoroTTS