accelerate>=1.12.0
huggingface_hub
modelscope
nagisa==0.2.11
soynlp==0.0.493
DyNet38==2.2
numpy
soundfile
torch>=2.0.0
torchaudio>=2.0.0
transformers>=4.57.0



ComfyUI custom nodes for Qwen3-ASR (Automatic Speech Recognition). This pack focuses on simple, reliable speech-to-text and subtitle workflows with local model caching and long-audio support.
https://github.com/user-attachments/assets/70d05cb2-9653-448a-ad6f-16868996b61e
Folders used: ComfyUI/models/Qwen3-ASR/ (models) and ComfyUI/output/ComfyUI-QwenASR/ (output).
Tip: in ComfyUI search, type ASR to find these nodes quickly.
1) Install the custom node:
cd ComfyUI/custom_nodes
git clone https://github.com/1038lab/ComfyUI-QwenASR.git
2) Install dependencies:
cd ComfyUI/custom_nodes/ComfyUI-QwenASR
pip install -r requirements.txt
3) Restart ComfyUI.
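If the nodes do not appear after the restart, it can help to check that the dependencies actually landed in the Python environment ComfyUI uses. The following is a minimal sketch, not part of this pack: `installed_versions` is a hypothetical helper, and the package names are a subset of those in requirements.txt.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_versions(packages):
    """Map each distribution name to its installed version, or None if missing."""
    result = {}
    for name in packages:
        try:
            result[name] = version(name)
        except PackageNotFoundError:
            result[name] = None
    return result


if __name__ == "__main__":
    # A subset of the names listed in requirements.txt.
    wanted = ["accelerate", "huggingface_hub", "modelscope", "soundfile",
              "torch", "torchaudio", "transformers"]
    for name, ver in installed_versions(wanted).items():
        print(f"{name}: {ver or 'NOT INSTALLED'}")
```

Run it with the same interpreter that launches ComfyUI; a portable or venv install may use a different Python than the one on your PATH.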
Supported:
Qwen/Qwen3-ASR-1.7B
Qwen/Qwen3-ASR-0.6B
Qwen/Qwen3-ForcedAligner-0.6B (for subtitles)
Downloaded models are stored in:
ComfyUI/models/Qwen3-ASR/
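If you would rather fetch a model ahead of time, something along these lines may work. It is a sketch under assumptions: `model_dir` and `predownload` are hypothetical helpers, and the one-subfolder-per-model layout (named after the last part of the repo id) is a guess, not something this README confirms. `snapshot_download` is the standard huggingface_hub call.

```python
from pathlib import Path


def model_dir(comfy_root, repo_id):
    """Assumed layout: <comfy_root>/models/Qwen3-ASR/<model name>."""
    return Path(comfy_root) / "models" / "Qwen3-ASR" / repo_id.split("/")[-1]


def predownload(comfy_root, repo_id):
    """Download a model repo from Hugging Face into the assumed folder."""
    # Imported lazily so the path helper works without huggingface_hub installed.
    from huggingface_hub import snapshot_download

    target = model_dir(comfy_root, repo_id)
    target.mkdir(parents=True, exist_ok=True)
    snapshot_download(repo_id=repo_id, local_dir=str(target))
    return target
```

For example, `predownload("ComfyUI", "Qwen/Qwen3-ASR-0.6B")` would place the files under ComfyUI/models/Qwen3-ASR/Qwen3-ASR-0.6B. If the node does not pick the model up, compare against the folder it creates on its own first download.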
You can edit config.json in the repo root to change defaults (e.g. default model, source)
or to add/remove model repo entries.
Example:
{
  "defaults": {
    "source": "ModelScope",
    "repo_id": "Qwen/Qwen3-ASR-0.6B"
  }
}
Tip: If you are in mainland China, using ModelScope as the source is usually faster and more reliable.
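The defaults can also be changed from a script instead of by hand. The sketch below is illustrative only: `set_defaults` is a hypothetical helper, and it assumes config.json has the "defaults" block with "source" and "repo_id" keys shown in the example above. Any other keys in the file are left untouched.

```python
import json
from pathlib import Path


def set_defaults(config_path, source=None, repo_id=None):
    """Update the "defaults" block of config.json and write the file back."""
    path = Path(config_path)
    config = json.loads(path.read_text(encoding="utf-8"))
    # Create the block if it is missing rather than failing.
    defaults = config.setdefault("defaults", {})
    if source is not None:
        defaults["source"] = source
    if repo_id is not None:
        defaults["repo_id"] = repo_id
    path.write_text(
        json.dumps(config, indent=2, ensure_ascii=False) + "\n",
        encoding="utf-8",
    )
    return config
```

A call such as `set_defaults("config.json", source="ModelScope")` switches the download source while keeping the current default model. Restart ComfyUI afterwards so the node rereads the file.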
If you keep models outside the default folder, add the parent directory to ComfyUI’s extra_model_paths.yaml.
This node will also search those paths for Qwen3-ASR models.
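To see which model folders exist under a set of candidate parent directories, a check like the following may be useful. It is a rough sketch, not the node's actual lookup logic: `find_model_dirs` is a hypothetical helper, and it assumes each parent directory contains a Qwen3-ASR folder with one subfolder per model.

```python
from pathlib import Path


def find_model_dirs(base_dirs, folder_name="Qwen3-ASR"):
    """Return model subfolders found under <base>/<folder_name> for each base dir."""
    found = []
    for base in base_dirs:
        root = Path(base) / folder_name
        if not root.is_dir():
            # Skip parents that do not contain the expected folder.
            continue
        found.extend(sorted(p for p in root.iterdir() if p.is_dir()))
    return found
```

Pass it the default ComfyUI/models directory plus any parent directories you listed in extra_model_paths.yaml. An empty result for a directory suggests the folder name or nesting differs from what is assumed here.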
STT
LoadAudio → ASR (QwenASR) → ShowText
Subtitles
LoadAudio → Subtitle (QwenASR) → ShowText / SaveText
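For post-processing outside ComfyUI, timed segments can be turned into a subtitle file with a few lines of Python. This is an illustration under assumptions: the README does not state which format the Subtitle node writes, SRT is used here only because it is widely supported, and the `(start, end, text)` tuple shape and both helper names are hypothetical.

```python
def srt_timestamp(seconds):
    """Format a time in seconds as an SRT timestamp, HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, ms = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(segments):
    """Build SRT text from (start_seconds, end_seconds, text) tuples."""
    blocks = []
    for index, (start, end, text) in enumerate(segments, start=1):
        blocks.append(
            f"{index}\n{srt_timestamp(start)} --> {srt_timestamp(end)}\n{text.strip()}\n"
        )
    # A blank line separates consecutive cues.
    return "\n".join(blocks)
```

For instance, `to_srt([(0, 1.25, "Hello")])` yields a single cue running from 00:00:00,000 to 00:00:01,250.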
local_model_path.