comfyui-kokoro

comfyui-kokoro
★ 67

文本转语音Kokoro-onnx说话人合成ComfyUI 节点
在 ComfyUI 中集成 Kokoro TTS 的自定义节点,调用 kokoro-onnx 进行文本到语音,可生成并合并说话人风格与嵌入,便于快速合成新声线与批量语音输出。
💡 在 ComfyUI 中快速生成并合并自定义说话人语音。
🍴 13 Forks💻 Python🔄 2026-02-23
📦
网盘下载
复制链接后前往夸克网盘下载
https://pan.quark.cn/s/9671236b7e59
📦 requirements.txt
kokoro-onnx==0.3.6
onnxruntime
numpy>=2.0.2
requests
tqdm
📄 README

Comfy UI Kokoro

Buy Me A Coffee

Kokoro TTS nodes, wrapping this kokoro onnx that is based on hexgrad/Kokoro-82M.

note: This picture is also a workflow, just download and drop it into comfy.

Install

Install Via ComfyUI Manager, by stavsap.

Or

Clone the repo into custom_nodes folder

git clone https://github.com/stavsap/comfyui-kokoro.git

Then cd into comfyui-kokoro, and install requirements.

pip install -r requirements.txt 

And finally reboot Comfy.

The onnx model and speakers meta-data will be automatically downloaded on the first run.

If using windows portable version and experience issues with dependencies, check the following:

[](https://youtu.be/O6qYvUN7vIc?si=loCoMD5ou2eeGL30)

Nodes

Currently, there are 3 nodes that can be combined for TTS workflow.

Kokoro Speaker

Select supported speakers.

Kokoro Speaker Combiner

Combiner node to combine 2 given speakers to new speaker.

  • weight: [1, 0], select the weight of speaker a.
  • Example:

    weight == 0.7 will result in strength of 70% of speaker_a and 30% of speaker_b.

    Kokoro Generate

  • speaker: input a speaker
  • speed: set the speach speed.
  • lang: set the language, what ever is supported by kokoro.
  • Available Voices

    All supported voices can be found here.

    Use Cases:

  • TTS: Text To Speach, generate voice from test.
  • Lip Sync: sync lips of videos
  • License

  • This repo
  • kokoro-onnx: MIT
  • kokoro model: Apache 2.0
  • Credits

  • Kokoro TTS Engine
  • ComfyUI
  • ComfyUI-BS_Kokoro-onnx
  • ComfyUI-KokoroTTS