# FL DiffVSR Requirements
# Core dependencies for Stream-DiffVSR video super-resolution

# Core ML frameworks
torch>=2.0.0
torchvision>=0.15.0

# Diffusers and transformers
diffusers>=0.21.0
transformers>=4.30.0

# Model loading
safetensors>=0.3.0
huggingface_hub>=0.16.0
accelerate>=0.20.0

# Image processing
Pillow>=9.0.0
numpy>=1.20.0

# Optional: Memory efficient attention (recommended)
xformers>=0.0.20

# Tensor operations
einops>=0.6.0
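
Before loading the nodes, it can help to confirm that the installed packages meet the minimum versions listed above. The following is a minimal sketch, not part of the project: the helper names (`version_tuple`, `meets_minimum`, `check_environment`) are hypothetical, and the version parsing is deliberately simplified (it ignores pre-release and local suffixes such as `rc1` or `+cu118`).

```python
from importlib import metadata

# Minimum versions copied from the requirements list above.
MINIMUMS = {
    "torch": "2.0.0",
    "torchvision": "0.15.0",
    "diffusers": "0.21.0",
    "transformers": "4.30.0",
    "safetensors": "0.3.0",
    "huggingface_hub": "0.16.0",
    "accelerate": "0.20.0",
    "Pillow": "9.0.0",
    "numpy": "1.20.0",
    "einops": "0.6.0",
}


def version_tuple(text):
    """Turn '2.1.0+cu118' into (2, 1, 0); stops at the first non-numeric part."""
    parts = []
    for piece in text.split("+")[0].split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break
        parts.append(int(digits))
    return tuple(parts)


def meets_minimum(installed, minimum):
    """Compare numeric release parts, padding the shorter version with zeros."""
    a, b = version_tuple(installed), version_tuple(minimum)
    width = max(len(a), len(b))
    a += (0,) * (width - len(a))
    b += (0,) * (width - len(b))
    return a >= b


def check_environment(minimums=MINIMUMS):
    """Return {package: status} where status is 'ok', 'missing', or 'too old (...)'."""
    report = {}
    for name, minimum in minimums.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            report[name] = "missing"
            continue
        if meets_minimum(installed, minimum):
            report[name] = "ok"
        else:
            report[name] = f"too old ({installed} < {minimum})"
    return report


if __name__ == "__main__":
    for name, status in check_environment().items():
        print(f"{name}: {status}")
```

`xformers` is left out of the table because the requirements mark it as optional; add it to `MINIMUMS` to check it as well.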

Diffusion-based video super-resolution nodes for ComfyUI powered by Stream-DiffVSR. Upscale videos 4x with temporal coherence for smooth, artifact-free results.

[arXiv:2512.23709](https://arxiv.org/abs/2512.23709)
[Patreon](https://www.patreon.com/Machinedelusions)
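
To make the 4x upscale factor concrete, the sketch below computes the output resolution for a given input and estimates how much memory a batch of output frames would occupy. The memory estimate assumes frames are stored as float32 RGB values, which is how ComfyUI image batches are typically held; the README does not confirm this for these nodes, and the function names are hypothetical.

```python
SCALE = 4  # upscale factor stated in the description above


def output_size(width, height, scale=SCALE):
    """Return the (width, height) of an upscaled frame."""
    return width * scale, height * scale


def batch_bytes(num_frames, width, height, channels=3, bytes_per_value=4):
    """Bytes needed for a frame batch, assuming float32 RGB storage (an assumption)."""
    return num_frames * width * height * channels * bytes_per_value


w, h = output_size(480, 270)
print(w, h)  # 1920 1080
print(round(batch_bytes(8, w, h) / 2**20))  # about 190 (MiB for 8 output frames)
```

This only accounts for the frames themselves; model weights and intermediate activations add to the total.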

| Node | Description |
|------|-------------|
| FL DiffVSR Load Model | Downloads and loads Stream-DiffVSR model from HuggingFace |
| FL DiffVSR Upscale | Upscales video frames with temporal coherence |


Search for "FL DiffVSR" in ComfyUI Manager and install it. To install manually instead, run:

    cd ComfyUI/custom_nodes
    git clone https://github.com/filliptm/ComfyUI-FL-DiffVSR.git
    cd ComfyUI-FL-DiffVSR
    pip install -r requirements.txt


FL DiffVSR Load Model parameters:

| Parameter | Options | Description |
|-----------|---------|-------------|
| precision | auto, fp32, fp16, bf16 | Model precision (auto selects fp16 on GPU) |
| device | auto, cuda, cpu | Target device for inference |
| enable_xformers | true/false | Enable memory-efficient attention |

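
The `auto` options above can be read as a small resolution step that runs before the model is loaded. The sketch below shows one way such a step could work; it is not the node's actual code. The function names are hypothetical, and the fp32 fallback on CPU is an assumption (the table only states that `auto` selects fp16 on GPU).

```python
VALID_DEVICES = ("auto", "cuda", "cpu")
VALID_PRECISIONS = ("auto", "fp32", "fp16", "bf16")


def resolve_device(device, cuda_available):
    """Map the 'device' option to a concrete device name."""
    if device not in VALID_DEVICES:
        raise ValueError(f"unknown device: {device!r}")
    if device == "auto":
        return "cuda" if cuda_available else "cpu"
    if device == "cuda" and not cuda_available:
        raise RuntimeError("device='cuda' requested but no CUDA GPU is available")
    return device


def resolve_precision(precision, device):
    """Map the 'precision' option to a concrete precision name."""
    if precision not in VALID_PRECISIONS:
        raise ValueError(f"unknown precision: {precision!r}")
    if precision == "auto":
        # fp16 on GPU is from the table; fp32 on CPU is an assumption.
        return "fp16" if device == "cuda" else "fp32"
    return precision


device = resolve_device("auto", cuda_available=False)
print(device, resolve_precision("auto", device))  # cpu fp32
```

In a real setup, `cuda_available` would come from `torch.cuda.is_available()`, and the resulting name would be mapped to `torch.float32`, `torch.float16`, or `torch.bfloat16`.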

FL DiffVSR Upscale parameters:

| Parameter | Default | Description |
|-----------|---------|-------------|
| inference_steps | 4 | Denoising steps (higher = better quality, slower) |
| guidance_scale | 0.0 | CFG scale (0 = no guidance) |
| chunk_size | 8 | Frames per batch (lower = less VRAM) |
| prompt | "" | Optional text guidance |
| negative_prompt | "" | Optional negative prompt |
| seed | -1 | Random seed (-1 for random) |

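
Two of the parameters above, `chunk_size` and `seed`, describe simple mechanics that a short example can illustrate. The sketch below splits a frame sequence into batches of at most `chunk_size` frames and resolves a seed of -1 to a random value. It is an illustration only: the function names are hypothetical, and the actual node may carry temporal state between chunks, which this sketch does not model.

```python
import random


def chunk_ranges(num_frames, chunk_size=8):
    """Return (start, end) index pairs covering num_frames in batches of chunk_size."""
    if chunk_size < 1:
        raise ValueError("chunk_size must be at least 1")
    return [
        (start, min(start + chunk_size, num_frames))
        for start in range(0, num_frames, chunk_size)
    ]


def resolve_seed(seed=-1):
    """Return the given seed, or a random 32-bit seed when seed is -1."""
    return random.randrange(2**32) if seed == -1 else seed


print(chunk_ranges(20, 8))  # [(0, 8), (8, 16), (16, 20)]
print(resolve_seed(1234))   # 1234
```

A smaller `chunk_size` produces more, shorter batches, which is why it trades speed for lower peak VRAM.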

The Stream-DiffVSR model downloads automatically to `ComfyUI/models/stream_diffvsr/` on first use (~2 GB).

| Model | Source | Size |
|-------|--------|------|
| Stream-DiffVSR | Jamichsu/Stream-DiffVSR | ~2 GB |

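
If the automatic download is not possible (for example on a machine without internet access at inference time), the model can be fetched ahead of time. The sketch below uses `snapshot_download` from `huggingface_hub` with the repository and target directory named above. Whether the node expects the files directly in this folder or in a subfolder is not stated in the README, so treat the layout as an assumption; `model_dir` and `ensure_model` are hypothetical helper names.

```python
from pathlib import Path

REPO_ID = "Jamichsu/Stream-DiffVSR"  # source repository from the table above


def model_dir(comfy_root):
    """Directory the README names as the download location."""
    return Path(comfy_root) / "models" / "stream_diffvsr"


def ensure_model(comfy_root):
    """Download the model if the target directory is missing or empty."""
    target = model_dir(comfy_root)
    if target.is_dir() and any(target.iterdir()):
        return target  # something is already there; skip the download
    # Imported lazily so the path helper works without huggingface_hub installed.
    from huggingface_hub import snapshot_download

    snapshot_download(repo_id=REPO_ID, local_dir=str(target))
    return target
```

Calling `ensure_model("ComfyUI")` from the directory that contains the ComfyUI checkout would fetch roughly 2 GB on the first run and return immediately afterwards.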

License: Apache 2.0