




Implementation of
as an extension for ComfyUI and SD WebUI (reForge).
Works with SD1.5 and SDXL.
You can either:
git clone https://github.com/pamparamm/sd-perturbed-attention.git into ComfyUI/custom-nodes/ folder.comfy node registry-install sd-perturbed-attentiongit clone https://github.com/pamparamm/sd-perturbed-attention.git into stable-diffusion-webui-forge/extensions/ folder.
As an alternative for A1111 WebUI you can use PAG implementation from sd-webui-incantations extension.
[!NOTE]
You can override
CFG ScaleandPAG Scale/SEG Scalefor Hires. fix by opening/enablingOverride for Hires. fixtab.
To disable PAG during Hires. fix, you can set
PAG Scaleunder Override to 0.
scale: Guidance scale, higher values can both increase structural coherence of an image and oversaturate/fry it entirely.adaptive_scale (PAG only): PAG dampening factor, it penalizes PAG during late denoising stages, resulting in overall speedup: 0.0 means no penalty and 1.0 completely removes PAG.blur_sigma (SEG only): Normal deviation of Gaussian blur, higher values increase “clarity” of an image. Negative values set blur_sigma to infinity.unet_block: Part of U-Net to which Guidance is applied, original paper suggests to use middle.unet_block_id: Id of U-Net layer in a selected block to which Guidance is applied. Guidance can be applied only to layers containing Self-attention blocks.sigma_start / sigma_end: Guidance will be active only between sigma_start and sigma_end. Set both values to negative to disable this feature.rescale: Acts similar to RescaleCFG node – it prevents over-exposure on high scale values. Based on Algorithm 2 from Common Diffusion Noise Schedules and Sample Steps are Flawed (Lin et al.). Set to 0 to disable this feature.rescale_mode:full – takes into account both CFG and Guidance.partial – depends only on Guidance.snf – Saliency-adaptive Noise Fusion from High-fidelity Person-centric Subject-to-Image Synthesis (Wang et al.). Should increase image quality on high guidance scales. Ignores rescale value.unet_block_list: Optional input, replaces both unet_block and unet_block_id and allows you to select multiple U-Net layers separated with commas. SDXL U-Net has multiple indices for layers, you can specify them by using dot symbol (if not specified, Guidance will be applied to the whole layer). Example value: m0,u0.4 (it applies Guidance to middle block 0 and to output block 0 with index 4)d means input, m means middle and u means output.d0–d5, m0, u0–u8.d0–d3, m0, u0–u5. In addition, each block except d0 and d1 has 0-9 index values (like m0.7 or u0.4). d0 and d1 have 0-1 index values.d0-d3 corresponds to d0,d1,d2,d3) and index value ranges (d2.2-9 corresponds to all index values of d2 with the exclusion of d2.0 and d2.1).Deprecated: ComfyUI_TensorRT is unmaintained.
@misc{ahn2025selfrectifyingdiffusionsamplingperturbedattention,
title={Self-Rectifying Diffusion Sampling with Perturbed-Attention Guidance},
author={Donghoon Ahn and Hyoungwon Cho and Jaewon Min and Wooseok Jang and Jungwoo Kim and SeonHwa Kim and Hyun Hee Park and Kyong Hwan Jin and Seungryong Kim},
year={2025},
eprint={2403.17377},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2403.17377},
}
@misc{hong2024smoothedenergyguidanceguiding,
title={Smoothed Energy Guidance: Guiding Diffusion Models with Reduced Energy Curvature of Attention},
author={Susung Hong},
year={2024},
eprint={2408.00760},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2408.00760},
}
@misc{adaloglou2025guidingdiffusionmodelusing,
title={Guiding a diffusion model using sliding windows},
author={Nikolas Adaloglou and Tim Kaiser and Damir Iagudin and Markus Kollmann},
year={2025},
eprint={2411.10257},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2411.10257},
}
@misc{kim2025pladispushinglimitsattention,
title={PLADIS: Pushing the Limits of Attention in Diffusion Models at Inference Time by Leveraging Sparsity},
author={Kwanyoung Kim and Byeongsu Sim},
year={2025},
eprint={2503.07677},
archivePrefix={arXiv},
primaryClass={cs.LG},
url={https://arxiv.org/abs/2503.07677},
}
@misc{chen2025normalizedattentionguidanceuniversal,
title={Normalized Attention Guidance: Universal Negative Guidance for Diffusion Models},
author={Dar-Yen Chen and Hmrishav Bandyopadhyay and Kai Zou and Yi-Zhe Song},
year={2025},
eprint={2505.21179},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2505.21179},
}
@misc{rajabi2025tokenperturbationguidancediffusion,
title={Token Perturbation Guidance for Diffusion Models},
author={Javad Rajabi and Soroush Mehraban and Seyedmorteza Sadat and Babak Taati},
year={2025},
eprint={2506.10036},
archivePrefix={arXiv},
primaryClass={cs.GR},
url={https://arxiv.org/abs/2506.10036},
}
@misc{sadat2025guidancefrequencydomainenables,
title={Guidance in the Frequency Domain Enables High-Fidelity Sampling at Low CFG Scales},
author={Seyedmorteza Sadat and Tobias Vontobel and Farnood Salehi and Romann M. Weber},
year={2025},
eprint={2506.19713},
archivePrefix={arXiv},
primaryClass={cs.LG},
url={https://arxiv.org/abs/2506.19713},
}
@misc{liao2026momentumguidanceplugandplayguidance,
title={Momentum Guidance: Plug-and-Play Guidance for Flow Models},
author={Runlong Liao and Jian Yu and Baiyu Su and Chi Zhang and Lizhang Chen and Qiang Liu},
year={2026},
eprint={2602.20360},
archivePrefix={arXiv},
primaryClass={cs.LG},
url={https://arxiv.org/abs/2602.20360},
}
@misc{wang2026cfgctrlcontrolbasedclassifierfreediffusion,
title={CFG-Ctrl: Control-Based Classifier-Free Diffusion Guidance},
author={Hanyang Wang and Yiyang Liu and Jiawei Chi and Fangfu Liu and Ran Xue and Yueqi Duan},
year={2026},
eprint={2603.03281},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2603.03281},
}