decord==0.6.0 einops==0.8.1 huggingface_hub==0.29.1 matplotlib==3.7.0 opencv_python==4.7.0.72 pyarrow==11.0.0 PyYAML==6.0.2 Requests==2.32.3 safetensors==0.4.5 scipy==1.10.1 sentencepiece==0.1.99 transformers==4.49.0 flash_attn==2.5.8 accelerate>=0.34.0 # wandb




A ComfyUI custom node package based on the BAGEL-7B-MoT multimodal model.
BAGEL is an open-source multimodal foundation model with 7B active parameters (14B total) that adopts a Mixture-of-Transformer-Experts (MoT) architecture. It is designed for multimodal understanding and generation tasks, outperforming top-tier open-source VLMs like Qwen2.5-VL and InternVL-2.5 on standard multimodal understanding leaderboards, and delivering text-to-image quality competitive with specialist generators such as SD3.
The ComfyUI-Bagel node provides a model selection dropdown with automatic downloading capabilities:
allow_auto_download is enabled, models are automatically downloaded to models/bagel/ on first usemodels/bagel/ directoryAvailable Models:
[!IMPORTANT]
DFloat11 version do not need quantization again!
Memory is automatically calculated based on your GPU specifications and selected quantization mode – no manual configuration needed!
For manual installation, you can download models to models/bagel/:
# Clone model using git lfs (recommended)
git lfs install
git clone https://huggingface.co/ByteDance-Seed/BAGEL-7B-MoT models/bagel/BAGEL-7B-MoT
# Or use huggingface_hub
pip install huggingface_hub
python -c "from huggingface_hub import snapshot_download; snapshot_download(repo_id='ByteDance-Seed/BAGEL-7B-MoT', local_dir='models/bagel/BAGEL-7B-MoT')"
# Clone DFloat11 quantized model
git clone https://huggingface.co/DFloat11/BAGEL-7B-MoT-DF11 models/bagel/BAGEL-7B-MoT-DF11
# Or use huggingface_hub
python -c "from huggingface_hub import snapshot_download; snapshot_download(repo_id='DFloat11/BAGEL-7B-MoT-DF11', local_dir='models/bagel/BAGEL-7B-MoT-DF11')"
Install the required dependencies:
pip install -r requirements.txt
For advanced quantization support (NF4/INT8 modes), also install:
pip install bitsandbytes
For DFloat11 quantized model support, also install:
pip install dfloat11
Note: bitsandbytes is required for NF4 and INT8 quantization modes on the standard ByteDance model. DFloat11 model works without additional quantization libraries.
Restart ComfyUI to load the new nodes.
Generate high-quality images from text descriptions. Suitable for creative design and content generation.
Edit existing images based on textual descriptions, supporting local modifications and style adjustments.
Analyze and answer questions about image content, suitable for content understanding and information extraction.
| Metric | BAGEL-7B-MoT (Standard Model) | BAGEL-7B-MoT (DFloat11 Quantized Model) |
|——–|——————————-|—————————————–|
| Model Size | 29.21 GB | 19.89 GB |
| Peak GPU Memory (1024×1024 image generation) | 30.07 GB | 21.76 GB |
| Generation Time (on an RTX4090 GPU) | 482.95 seconds | 154.39 seconds |
DFloat11 Quantized Model significantly reduces VRAM requirements and speeds up generation time, making it ideal for single GPU setups.
This project is licensed under the Apache 2.0 License. Please refer to the official license terms for the use of the BAGEL model.
Contributions are welcome! Please submit issue reports and feature requests. If you wish to contribute code, please create an issue to discuss your ideas first.
With the new automatic memory management and quantization options:
This issue is likely related to environment or dependency problems. You can install flash-attention and triton with the following links:
v2.7.4.post1_crossos00 flash-attention-windows-wheel triton-windows-builds
For more information, refer to this GitHub issue.