torch numpy soundfile transformers huggingface_hub nltk>=3.8.0 snac==0.1.0 librosa>=0.10.2

This project adds high-quality Text-to-Speech capabilities to ComfyUI using the Orpheus TTS model. Create natural-sounding voices with emotional expressions, multilingual support, and audio effects.
Clone this repository into your ComfyUI’s custom_nodes directory:
cd ComfyUI/custom_nodes
git clone https://github.com/ShmuelRonen/ComfyUI-Orpheus-TTS.git
pip install torch numpy soundfile transformers huggingface_hub nltk snac
For WSL 2, you may need to install directly from the GitHub repository:
pip install git+https://github.com/hubertsiuzdak/snac.git
.exe installer (e.g., sox-14.4.2-win32.exe)C:\Program Files (x86)\sox-14-4-2\)sudo apt-get update
sudo apt-get install sox
brew install sox
After installing all required components, restart ComfyUI to load the extension.
To access private models on Hugging Face, create a file named hf_config.json in the extension directory and insert your HF Token KEY:
“`json
{
“token”: “YOUR_HUGGING_FACE_TOKEN_HERE”
}
“`
Your token will be used to authenticate with Hugging Face when downloading models. This is only required if you’re using private models or if you need higher rate limits.
Loads the required models for Orpheus TTS.
Inputs:
snac_model_path (optional): Path to SNAC model (default: “hubertsiuzdak/snac_24khz”)orpheus_model_path (optional): Path to Orpheus model (default: “canopylabs/orpheus-3b-0.1-ft”)Outputs:
model: Model reference to be passed to the generate nodeGenerates speech from text input.
Inputs:
model: Model reference from the loader nodetext: The text to convert to speechvoice: Voice style to use (tara, leah, jess, leo, dan, mia, zac, zoe, etc.)language (optional): Language for multilingual output (en, fr, es, etc.)max_chunk_size (optional): Maximum chunk size for long text processingOutputs:
audio: Audio data to be passed to preview or effects nodesApplies high-quality audio processing to the generated speech.
Inputs:
audio: Audio data from the generate nodepitch_shift: Semitone adjustment (-12 to +12)speed_factor: Playback speed modifier (0.5x to 2.0x)sox_path (optional): Custom path to SoX executablegain_db (optional): Volume adjustment in decibelsuse_limiter (optional): Enable/disable limiter for positive gainnormalize_audio (optional): Enable/disable audio normalizationadd_reverb (optional): Enable/disable reverb effectreverb_amount (optional): Reverb intensityreverb_room_scale (optional): Size of virtual spaceadd_echo (optional): Enable/disable echo effectecho_delay (optional): Time between echo repetitionsecho_decay (optional): How quickly echo fadesOutputs:
audio: Processed audio dataLooking at the README section you provided, I’ll expand it to include information about the different element position options, including the new pipe feature:
You can add expressive elements to the speech by inserting these tags:
– Natural laughter – Light, subtle laughter – Exhaling with emotion – Clearing throat – Subtle nasal sound – Low, grumbling sound – Tired exhale – Sudden intake of breathThe Element Position dropdown provides different ways to add these paralinguistic elements to your text:
“`
I can’t believe it!
“`
“`
Input: “That’s amazing!”
Output: “That’s amazing!
“`
“`
Input: “I need to get back to work.”
Output: “
“`
“`
Input: “I can’t believe it! | That’s the funniest thing | I’ve heard all day.”
Element: laugh
Output: “I can’t believe it!
“`
I can't believe it! <laugh> That's the funniest thing I've heard all day.
<sigh> But now I need to get back to work.
Input: "Did you hear that? | It's hilarious! | I can't stop laughing!"
Element: laugh
Result: "Did you hear that? <laugh> It's hilarious! <laugh> I can't stop laughing!"
<gasp> What was that? <pause> Did you hear something? <sigh> Maybe I'm just tired.
gain_db to increase or decrease volume without distortionnormalize_audio to automatically balance levelsReverb adds a sense of space to your audio. Here are some suggested settings:
Echo creates repeating sound reflections. Good settings to try:
This extension has been tested and works on:
Different environments may require specific setup steps:
pip install git+https://github.com/hubertsiuzdak/snac.git to ensure compatibilityIf you encounter issues with SoX:
C:\Program Files (x86)\sox-14-4-2\sox.exe"C:\Program Files (x86)\sox-14-4-2\sox.exe" --version“`bash
sox –version
“`
“`bash
sudo apt-get update
sudo apt-get install sox
“`
This extension uses the following models:
This project uses models with their own licenses:
Please consult these licenses for usage terms and restrictions.