cyberdolphin

cyberdolphin
★ 15

ComfyUI扩展布线/连接节点套件Python依赖
为ComfyUI提供一组用于布线与连接的节点,简化复杂工作流与外部集成,提升流程可复用性。
💡 将仓库克隆到custom_nodes并安装依赖,快速添加布线功能。
🍴 6 Forks💻 Python🔄 2024-07-31
📦
网盘下载
复制链接后前往夸克网盘下载
https://pan.quark.cn/s/51137d50651f
📦 requirements.txt
openai
~=
1.2.3
gradio_client
numpy
pillow
cyberdolphin.png
img.png
📄 README

cyberdolphin

The dolphin is wiring up OpenAI and local LLMs. OpenAI v1.2.3 is required.

Installation

Git clone this repo into the custom_nodes folder.

If necessary, check the pip requirements. It will be necessary.

Examples

There are workflows in the examples folder


Nodes

The nodes all share a config file at settings.yaml. Provided with the repo is the

settings.yaml.example which can be copied to a new file called settings.yaml

for editing. The settings.yaml file is ignored by git.

OpenAI GPT Node

REQUIRES STRING user_prompt

The text is the user portion of the gpt prompt.

_Generates_ an engineered prompt

from a user-editable template config file with the user text embedded.

Dropdown select from available OpenAI models. Most of these will not work.

The models that DO work at the time of writing are at least, including but not limited to

gpt-3.5-turbo and gpt-4.

Runs the prompt gpt-3.5-turbo (or a user-selected alternative) with the text.

PRODUCES STRING.

OpenAI Compatible Node

REQUIRES STRING text

The text is embedded in the user prompt.

Generates an “engineered” prompt from template.

The user text is embedded in the engineered prompt.

Calls for completion of the prompt to the user-defined URL.

PRODUCES STRING

OpenAI DALL·E Node

REQUIRES STRING text

Calls OpenAI DALL·E with the text.

PRODUCES IMAGE


Pip requirements

This collection has some extra requirements that are not present in the ComfyUI distribution.

Things like openai, gradio-client and technologist tools.

Experimental

This is an experimental collection of nodes. This project needs validation on MacOS, Windows and Linux.

So far, it works on my machine which is a Linux distribution.

Contributions

Looking for participants, happy to work on PRs!

Guidelines for the Dolphin:

  • Keep it small – PRs should be quick and easy.
  • Large things must be compositions of smaller things.
  • Dependencies should be external – i.e. loaded by a node
  • For example:
  • _the Llava loader node passes the Llava model to the recogniser node which uses the Llava model to emit a list of
  • objects_

  • _and not, the “Llava node does everything”_
  • Keep it small

    In the spirit of “Keep it small”, I’m trying to make sure my big ideas for the dolphin

    stay within the realm of LLMs –

    Here are some big ideas that didn’t make it into the roadmap for CyberDolphin:

    Big Ideas I have for future things that are not the dolphin:

    Cam Nodes

  • Webcam Node for phone/laptop
  • Cam Node for HDMI type input devices
  • Live Stream Node to capture vision from a _Thing of the Internet_
  • Speech to Text

  • Microphone node Captures spoken instructions into audio node
  • Instructions are transcribed using
  • OpenAI-Whisper node or
  • _TTS model_ loaded by the TTS node
  • The Simple Storybook Production Kit

    Where “LLM-node” is short for “LLM powered node”:

    LLM-node dreams up the story type
    LLM-node dreams up the character names, their badge
    LLM-node dreams up the story title
    LLM-node dreams up chapter summaries
    
    LLM-node generates a page in "the story"
    
    LLM-node generates images of characters:
        id badge,
        smiling photo,
        frowning photo,
        'character' shot
    
    LLM-node generates prompt for page illustration
    LLM-node generates page text

    License

    GPL 3.