ComfyUI_Pic2Story

ComfyUI_Pic2Story
★ 10

图像转文本图像描述BLIP/Pic2StoryComfyUI 节点
ComfyUI 节点,基于 BLIP/Pic2Story 将图片生成文本描述或故事,便于自动生成提示词与标签。
💡 为图片生成故事或描述,用作提示词和自动标签。
🍴 4 Forks💻 Python🔄 2024-12-06
📦
网盘下载
复制链接后前往夸克网盘下载
https://pan.quark.cn/s/c1eafc754fbb
📄 README

ComfyUI_Pic2Story

ComfyUI simple node based on BLIP method, with the function of “Image to Txt ” .

Original model: link

Using model: link

1.Installation


1.1 In the .\ComfyUI \ custom_node directory, run the following:

“`

git clone https://github.com/smthemex/ComfyUI_Pic2Story.git

“`

1.2 using repo_id or offline

repo_id: abhijit2111/Pic2Story link

repo_id: google/paligemma2-3b-pt-896 link

2.Example


Prompt is not necessary! 提示词不是必须的,可以去掉.

4.Citation


@misc{https://doi.org/10.48550/arxiv.2201.12086,
  doi = {10.48550/ARXIV.2201.12086},
  
  url = {https://arxiv.org/abs/2201.12086},
  
  author = {Li, Junnan and Li, Dongxu and Xiong, Caiming and Hoi, Steven},
  
  keywords = {Computer Vision and Pattern Recognition (cs.CV), FOS: Computer and information sciences, FOS: Computer and information sciences},
  
  title = {BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation},
  
  publisher = {arXiv},
  
  year = {2022},
  
  copyright = {Creative Commons Attribution 4.0 International}
}