ComfyUI-FlowMatching-Upscaler

★ 59

渐进式放大flow-matching流式重噪声化skip残差融合

为Flow-matching模型（如Qwen Image、Flux2）提供渐进式放大：逐步倍增分辨率，进行流式一致的重新噪声化与去噪，并融合skip残差以保留构图与细节。

💡 将Flow-matching模型输出的潜变量渐进放大至高分辨率并保持构图。

🍴 2 Forks💻 Python🔄 2026-01-29

🔗 GitHub 原文

📦

网盘下载

复制链接后前往夸克网盘下载

https://pan.quark.cn/s/2df45d172dc1

📦 requirements.txt

#
Core
numerical
stack
(matches
the
versions
bundled
with
the
reference
ComfyUI
environment).
numpy>=2.0
torch>=2.1
einops>=0.6
#
Frontend/web
dependencies
already
present
in
ComfyUI,
included
here
for
completeness.
aiohttp>=3.9

📄 README

ComfyUI Flow Matching Upscaler

Overview

This repository ships a progressive upscaling node designed for flow-matching

models such as Qwen Image and Flux2, alongside a powerful model patcher (DyPE)

to enable high-resolution generation.

Flow Matching Progressive Upscaler: Implements the approach outlined in

docs/Approach.pdf. It incrementally doubles resolution, re-noises the

latent in a flow-consistent manner, denoises with the selected sampler, and

blends skip residuals to preserve composition.

DyPE for Qwen Image: A model patch node that extends Qwen Image’s

spatial rotary embeddings. It allows the diffusion model to stay coherent

far beyond its native training resolution by applying Dynamic Position

Extrapolation (DyPE).

DyPE for Flux2: A model patch node for Flux2 that applies the same DyPE

approach while respecting Flux2’s 4-axis positional scheme (text stays

static while spatial axes extrapolate).

Latent Upscale Advanced: A covariance-aware latent resampler inspired by

upscaling.md (optional global whitening / moment matching around the

spatial upscaler). Defaults to matching ComfyUI’s standard latent upscale

behavior unless you enable covariance processing.

The mesh drag and latent diagnostic nodes that previously shipped here now live

in the Skoogeer-Noise node pack.

Batch-oriented utility nodes (including Batch Filter Empty Images) now live in

the Skoogeer-Batch-Ops node pack.

Installation

Clone this repository inside the custom_nodes/ directory of your ComfyUI

installation.

Launch ComfyUI; the nodes will be registered under latent/upscaling and model_patches/unet categories.

(Mesh drag + latent debug moved to Skoogeer-Noise.)

Example Workflow

[](examples/Method-Comparison.json)

Part 1: Flow Matching Progressive Upscaler

How Flow Matching Upscaling Works

In traditional diffusion models (like SD1.5 or SDXL), generating an image is

often described as “removing noise” layer by layer. However, Flow Matching

models (like Qwen) work a bit differently: think of them as calculating a

direct, straight-line path (a “flow”) from a chaotic state (noise) to an

ordered state (your image).

When you use Flow Matching Upscaler nodes, you aren’t just denoising; you are

traveling along a specific trajectory.

Stage 1: Generates your base image by following this path at a lower resolution.

Stage 2 (The Upscale): Instead of just stretching the image and adding random noise (like standard “img2img”), the node physically resizes your latents and then mathematically “rewinds” the clock.

As shown in the center of the diagram, we step back along the timeline—for

example, rewinding from the finished state ($t=0.0$) back to a mid-point

($t=0.6$). This places the model back onto the flow trajectory but at a much

higher resolution. Because the model is resuming an existing journey rather

than starting a new one, it doesn’t hallucinate new objects or change your

composition; it simply “flows” forward again, filling in the missing

high-frequency details.

Key Concepts

The Role of the `Noise` Parameter

In Flow Matching, noise isn’t just “randomness”—it is mathematical validity.

Validating Time: A specific time on the flow trajectory (e.g., $t=0.6$) mathematically expects a specific mix of signal and noise. If you provide a clean upscaled image at $t=0.6$, the model gets confused. The noise parameter adds the necessary static back so the latent matches the statistical expectations of that timestep.

Texture Fuel: The noise provides a substrate for the model to “carve” details into. Without it, upscales can look waxy or plastic.

Preventing Burn-in: Noise dithers upscaling artifacts (like blocky edges), allowing the model to hallucinate cleaner, sharper edges in their place.

Dilated Sampling

Dilated sampling adds a coarse refinement lap immediately after the main sampler completes each stage.

Downscale: Shrinks the freshly denoised latent (acting as a low-pass filter).

Sample: Runs a short sampler pass in that reduced space.

Blend: Upscales and mixes it back into the main result.

*Use this to suppress high-frequency hallucinations (like extra pores or glittering artifacts).*

Dilated Blend Strategy

Dilated refinement now always uses a frequency-domain blend when recombining the low-pass latent with the original samples. The FFT-based mix pulls low frequencies from the dilated pass and preserves high-frequency detail from the base latent, which consistently avoids the grid artifacts that the alternative modes were designed to mitigate. With this default in place, no explicit blend selector is required.

Usage Tips

Upscaling is Optional: You don’t have to upscale! Set total_scale to 1.0 to use this node for “Refinement.” It gives the model a second chance to generate details on an existing latent without changing the resolution.

Memory Use: The upscaler samples the full latent. If VRAM is exhausted, the node automatically switches to a streaming fallback (LOW_VRAM mode), throttling the attention kernels. It will be slower, but it will not crash.

Mask-aware Upscaling: When the input latent ships with a noise_mask, the nodes upscale the mask alongside the latent so inpaint workflows keep their boundaries aligned at every stage.

Node Parameters: Progressive Upscaler

Required inputs

|——-|——|———|———|

| seed | INT | 0 | Base seed. |

| steps_per_stage | INT | 16 | Sampler steps per stage. |

| cfg | FLOAT | 4.5 | Guidance strength. |

| stages | INT | 2 | Number of progressive stages to reach total scale. |

Optional controls

|——-|——|———|———|

Modular Nodes

For complex workflows, you can use the individual components of the upscaler.

1. FlowMatchingStage

Chain these nodes manually for caching benefits.

Key Inputs:

scale_factor: How much to resize in this specific step.

noise_ratio: Amount of flow-noise to inject.

skip_blend: Blend factor between pre-sampler latent and denoised result.

next_seed: Connect this output to the seed of the next stage for deterministic chains.

Dilated refinement blends results in the frequency domain automatically—no manual method selection is required (FlowMatchingStage only).

Custom Sampler workflow (modular):

FlowMatchingStagePrep outputs skip_latent + presampler_latent (+ seed/next_seed).

Feed presampler_latent into ComfyUI’s SamplerCustom / SamplerCustomAdvanced (use seed as the noise seed).

FlowMatchingStageMerge blends the sampled latent with skip_latent via skip_blend.

This modular path intentionally omits the stage node’s low-VRAM fallback and dilated refinement.

2. Latent Upscale Advanced

Upscales latents like ComfyUI’s built-in latent upscale node by default, with optional covariance-aware whitening (PCA/eigenbasis) and an optional moment_match pass to restore mean/covariance after interpolation. Note: ComfyUI’s lanczos path is image/PIL-based and is treated as bicubic for latents here.

Part 2: DyPE for Qwen Image

How DyPE Works

Models like Qwen are trained on a specific “map” size (e.g., 1024×1024). When you generate 4K images, you force the model off the edge of its map, causing it to hallucinate repeating patterns (cloning artifacts).

DyPE (Dynamic Position Extrapolation) treats generation as a journey.

Early Steps: It keeps coordinates close to the native training size to secure solid composition.

Later Steps: It dynamically expands the grid (using an exponential ramp) as the sampler progresses.

This allows the model to “see” the massive 4K canvas clearly exactly when it needs to paint high-frequency details, preventing the “washed out” look of standard stretching.

Understanding the Methods

The method parameter determines the math used to handle coordinates outside the training resolution.

| :— | :— | :— | :— |

Node Parameters: DyPE for Qwen Image

Required inputs

|——-|——|———|———|

| width / height | INT | 1024 | Target render resolution (must match your latent). |

Outputs

model: The patched model ready for the KSampler.

Part 3: DyPE for Flux2

Flux2 models use a 4-axis RoPE layout (index, height, width, text). DyPE for

Flux2 extrapolates only the spatial axes (height/width) while keeping the text

axis static so prompt conditioning remains stable.

Node Parameters: DyPE for Flux2

Required inputs

|——-|——|———|———|

| width / height | INT | 1024 | Target render resolution (must match your latent). |

Outputs

model: The patched model ready for the KSampler.

Mesh Drag / Latent Debug Nodes

The mesh drag and latent debug nodes previously shipped in this repo were split

into the Skoogeer-Noise node pack so they can be installed independently.

Development

Running tests

Use pytest; the suite installs lightweight comfy and nodes stubs so it can

run outside of a live ComfyUI process:

pytest

Logging

The node uses Python’s logging subsystem. Enabling DEBUG level emits per-channel mean and standard deviation diagnostics, useful for probing latent space behavior.

ComfyUI-FlowMatching-Upscaler

ComfyUI Flow Matching Upscaler

Overview

Installation

Example Workflow

Part 1: Flow Matching Progressive Upscaler

How Flow Matching Upscaling Works

Key Concepts

The Role of the Noise Parameter

Dilated Sampling

Dilated Blend Strategy

Usage Tips

Node Parameters: Progressive Upscaler

Modular Nodes

1. FlowMatchingStage

2. Latent Upscale Advanced

Part 2: DyPE for Qwen Image

How DyPE Works

Understanding the Methods

Node Parameters: DyPE for Qwen Image

Part 3: DyPE for Flux2

Node Parameters: DyPE for Flux2

Mesh Drag / Latent Debug Nodes

Development

Running tests

Logging

The Role of the `Noise` Parameter