Instruct-Pix2pix support #679

stduhpf · 2025-05-15T18:28:32Z

ref: #61

sd.exe -M img2img --model instruct-pix2pix-00-22000.safetensors -p "turn him into a cyborg" --color --strength 1 -i .\example.jpg --steps 50 --cfg-scale 7.5 --img-cfg-scale 1.2 --sampling-method euler_a

input	output

sd.exe -M img2img --model instruct-pix2pix-00-22000.safetensors -p "Make it a cat" --strength 1 -i input.png --steps 100 --cfg-scale 7.5 --img-cfg-scale1.5 --sampling-method euler_a --schedule karras

input	output

TODOs:

Classifier-free guidance (CFG) for two conditionings
Fix UX (probably best not to reuse distlled guidance for something completely different like img conditionning)
Check if implementation is correct

(rebased on top of #683 for CosXL edit support)

rmatif · 2025-05-15T21:13:24Z

Awesome! Could you please take a look at cosxl-edit as well? It acts as an ip2p, if I understood correctly. I think we're just missing the EDM VPred schedule

stduhpf · 2025-05-15T23:38:44Z

Awesome! Could you please take a look at cosxl-edit as well? It acts as an ip2p, if I understood correctly. I think we're just missing the EDM VPred schedule

I may take a look at it later.

stduhpf · 2025-05-16T02:00:03Z

For some reason, the "image CFG" (controlled by --guidance flag for now) needs to be very high (>10) to get anything ressembling the input image, this behavior does not match the HuggingFace Demo, or the example on their github. I can't figure out what I'm doing wrong.

stduhpf · 2025-05-16T17:25:26Z

Ah I think I found the issue. By default, the model samples the VAE distribution, bit pix2pix expects the mean of the distribution.

stduhpf · 2025-05-16T18:08:39Z

I'm pretty sure it's working properly now. I think inpaint might be slightly improved too, especially when strength is set to <1 and with higher CFG.

stduhpf · 2025-05-16T20:12:56Z

Awesome! Could you please take a look at cosxl-edit as well? It acts as an ip2p, if I understood correctly. I think we're just missing the EDM VPred schedule

thse ones might also be interesting, and they may be even easier to implement:
https://huggingface.co/diffusers/sdxl-instructpix2pix-768
https://huggingface.co/CaptainZZZ/sd3-instructpix2pix/tree/main

Edit: The SDXL one was pretty easy. Now, I can't figure out how to easily convert sd3.x models from diffusers format to the original format, so I cant test if it would work...

stduhpf · 2025-05-23T12:02:59Z

CosXL edit is now working properly.

cosxl: smol cleanup CosXL: fix schedule choice Rename EDMVDenoiser Avoid inf for EDMVDenoiser + discrete schedule make parametrization flags public Fix CosXL with empty negative prompts Instruct-p2p support support 2 conditionings cfg Do not re-encode the exact same image twice pix2pix: fixes for 2-cfg Fix pix2pix latent inputs + improve inpainting a bit + fix naming prepare for other pix2pix-like models Support sdxl ip2p CoxXL edit: fix reference image embeddings Support 2-cond cfg properly in cli fix typo in help

cosxl: smol cleanup CosXL: fix schedule choice Rename EDMVDenoiser Avoid inf for EDMVDenoiser + discrete schedule make parametrization flags public Fix CosXL with empty negative prompts

cosxl: smol cleanup CosXL: fix schedule choice Rename EDMVDenoiser Avoid inf for EDMVDenoiser + discrete schedule make parametrization flags public Fix CosXL with empty negative prompts Instruct-p2p support support 2 conditionings cfg Do not re-encode the exact same image twice pix2pix: fixes for 2-cfg Fix pix2pix latent inputs + improve inpainting a bit + fix naming prepare for other pix2pix-like models Support sdxl ip2p CoxXL edit: fix reference image embeddings Support 2-cond cfg properly in cli fix typo in help Support masks for ip2p models

stduhpf force-pushed the ip2p branch 2 times, most recently from 1e25a9b to 75af1bd Compare May 16, 2025 01:48

stduhpf force-pushed the ip2p branch from 6b0247b to 4024765 Compare May 16, 2025 10:37

stduhpf mentioned this pull request May 19, 2025

Add CosXL support #683

Open

stduhpf force-pushed the ip2p branch from 2b13fb1 to 333f6ed Compare May 23, 2025 11:26

stduhpf mentioned this pull request May 24, 2025

Photomaker error: "GGML_ASSERT(cgraph->n_nodes < cgraph->size) failed" #688

Open

stduhpf marked this pull request as ready for review May 25, 2025 22:20

stduhpf mentioned this pull request May 26, 2025

Support for Flux Controls + Flex.2 #692

Open

stduhpf added 11 commits May 26, 2025 19:39

Squash CosXL support(#leejet#683)

4744704

cosxl: smol cleanup CosXL: fix schedule choice Rename EDMVDenoiser Avoid inf for EDMVDenoiser + discrete schedule make parametrization flags public Fix CosXL with empty negative prompts

Instruct-p2p support

588c7c8

support 2 conditionings cfg

9cab077

Do not re-encode the exact same image twice

a460348

pix2pix: fixes for 2-cfg

6504211

Fix pix2pix latent inputs + improve inpainting a bit + fix naming

863e1db

prepare for other pix2pix-like models

6355146

Support sdxl ip2p

9731e78

CoxXL edit: fix reference image embeddings

94dea6c

Support 2-cond cfg properly in cli

37d5e27

fix typo in help

b20ccb6

stduhpf force-pushed the ip2p branch from 00fc2e9 to b20ccb6 Compare May 26, 2025 17:39

Support masks for ip2p models

b9f5a5b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Instruct-Pix2pix support #679

Instruct-Pix2pix support #679

Uh oh!

stduhpf commented May 15, 2025 •

edited

Loading

Uh oh!

rmatif commented May 15, 2025

Uh oh!

stduhpf commented May 15, 2025

Uh oh!

stduhpf commented May 16, 2025 •

edited

Loading

Uh oh!

stduhpf commented May 16, 2025

Uh oh!

stduhpf commented May 16, 2025 •

edited

Loading

Uh oh!

stduhpf commented May 16, 2025 •

edited

Loading

Uh oh!

stduhpf commented May 23, 2025

Uh oh!

Uh oh!

Instruct-Pix2pix support #679

Are you sure you want to change the base?

Instruct-Pix2pix support #679

Uh oh!

Conversation

stduhpf commented May 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

rmatif commented May 15, 2025

Uh oh!

stduhpf commented May 15, 2025

Uh oh!

stduhpf commented May 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

stduhpf commented May 16, 2025

Uh oh!

stduhpf commented May 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

stduhpf commented May 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

stduhpf commented May 23, 2025

Uh oh!

Uh oh!

stduhpf commented May 15, 2025 •

edited

Loading

stduhpf commented May 16, 2025 •

edited

Loading

stduhpf commented May 16, 2025 •

edited

Loading

stduhpf commented May 16, 2025 •

edited

Loading