Omnigen V2/V1: All in one Image Generation and Editing model

Traditional diffusion models uses various mechanisms for image modification like ControlNet, IP-Adapter, Inpainting, Face detection, pose estimation, cropping etc. Omnigen released by Vector Space labs comes with all in one pack.

It uses arbitrarily multi-modal instructions like we use to do with ChatGPT (NLP technique). Interested people can refer their research paper for in-depth understanding. Its come with fine-tuning capability that is one of the great news for the community. They are working to release more optimized model in the coming future that can be from their hugging Face repository.

Let's see how to work in ComfyUI.

Table of Contents:

Installation

Install ComfyUI if have not installed yet. Update ComfyUI from the Manager tab.

OmniGenV2

1. Download OmniGenV2 Model (omnigen2_fp16.safetensors) from hugging face repository and save it into your ComfyUI/models/diffusion_models folder.

2. Next, download VAE (ae.safetensors) and put it into ComfyUI/models/VAE folder. If you are using Flux, SD3.5 then downloading this is not required as its using the same model.

3. At last, download Text Encoders (qwen_2.5_vl_fp16.safetensors) and save it into ComfyUI/models/text_encoders folder.

4. Restart your ComfyUI and refresh it.

OmniGenV1

1. Move to your ComfyUI Manager select "Custom Nodes Manager" and search for "ComfyUI-Omnigen" by author "1038lab" and click Install button.

Alternative:

You can also do the manual installation. Move to your "ComfyUI/custom_nodes" directory.

Open command prompt by typing "cmd" on the top of folder address bar. Then, clone the repository using command provided below:

~~git clone https://github.com/1038lab/ComfyUI-OmniGen.git~~

2. The related model will automatically downloaded in the background if you run the basic workflow for the first time. You can check the real-time downloading status in the ComfyUI's terminal.

Alternative:

Manually download the respective model(model.safetensors) having 15.5GB size from Hugging Face repository.

After downloading, rename it to anything relatable like- "omnigen-all-in-one.safetensors". Then, save it inside "Comfyui/models/LLM/OmniGen-v1" folder.

3. Restart ComfyUI to take effect.

Workflow

1. The workflows can be found inside your "ComfyUI/custom-nodes/ComfyUI-OmniGen/Examples" folder. Drag and drop to ComfyUI.

2. To work with the workflow first we need to understand how the basic things work. So, here we have provided with placeholder that are in the form of html tags.

These will be set as "<img><|image_*|></img>" where you need to put -

CFG=2.5

Image guidance=1.6

Inference Steps=50

Omnigen image 1 node

Omnigen image 2 node

Omnigen image combined node

You can make the model to understand better by putting right placeholder. For instance your prompt will be like -

Example 1 (For single subject)= "A woman holds a bouquet of flowers wearing gown and faces the camera in California streets. The woman is <|image_1|>."

Example 2 (For multiple subject)= "Two woman are enjoying party in the Cafeteria. A woman is <|image_1|>. Another woman is <|image_2|>."

Recommended tips

1. You should only use same image dimensions(width and height) while doing image editing and Controlnet processing.

2. Reduce the CFG value if the image generated with saturation. Default value is 2.5. Just be closer to it to get the better results.

3. Use detailed prompting that more relates to your generation will yielding better results.

4. Working with image editing, place the image prompt before the editing prompts. Like if you want to remove some thing like "book" from image, use prompt structure "<img><|image_1|></img> remove cup" an not like this - "remove cup <img><|image_1|></img>"

Omnigen V2/V1: All in one Image Generation and Editing model

Installation

OmniGenV2

OmniGenV1

Workflow

Recommended tips

Posted by Admin

Search This Blog

Trending

Wan 2.1: Install & Generate Videos locally with lower VRAM

Wan2.1 FusionX 14B: Consistent Fast VideoGen with Low VRAM

Easy Install ComfyUI Portable (Windows/Mac/Linux)

Train your WAN2.1 Lora model on Windows/Linux

Installing Stable Diffusion 3.5 Locally

19 Attractive Prompts for Cool Selfie Clicks

Our Social Pages

Recent Posts

Important pages

Contact form

Omnigen V2/V1: All in one Image Generation and Editing model

Installation

OmniGenV2

OmniGenV1

Workflow

Recommended tips

Posted by Admin

Related Posts

Search This Blog

Trending

Wan 2.1: Install & Generate Videos locally with lower VRAM

Wan2.1 FusionX 14B: Consistent Fast VideoGen with Low VRAM

Easy Install ComfyUI Portable (Windows/Mac/Linux)

Train your WAN2.1 Lora model on Windows/Linux

Installing Stable Diffusion 3.5 Locally

19 Attractive Prompts for Cool Selfie Clicks

Our Social Community

Our Social Pages

Recent Posts

Important pages

Contact form