Fix loading diffusers model (+support F64/I64 types) #681
Conversation
It would be nice to get it to work for DiT models too, but it looks like a lot of work, because the qkv matrices are split in the diffusers format. That would probably require a significant refactor of the model loading logic...
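For context, a rough sketch of what fusing those split projections would involve; the struct, the fused name, and the fuse_qkv helper below are hypothetical illustrations, not code from this repository:

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical sketch: diffusers checkpoints store the attention projections
// as three tensors (to_q, to_k, to_v); a loader for DiT models would have to
// concatenate them row-wise into a single qkv weight before creating the GGML
// tensor. Names and shapes here are illustrative only.
struct RawTensor {
    std::string name;
    std::vector<float> data;   // row-major [rows, cols]
    size_t rows = 0, cols = 0;
};

static RawTensor fuse_qkv(const RawTensor& q, const RawTensor& k, const RawTensor& v) {
    RawTensor qkv;
    qkv.name = "attn.qkv.weight";          // hypothetical fused name
    qkv.cols = q.cols;                     // all three share the input dimension
    qkv.rows = q.rows + k.rows + v.rows;   // output rows are stacked
    qkv.data.reserve(qkv.rows * qkv.cols);
    for (const RawTensor* t : {&q, &k, &v}) {
        qkv.data.insert(qkv.data.end(), t->data.begin(), t->data.end());
    }
    return qkv;
}
```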
model.cpp
Outdated
std::string new_name = prefix + name;
new_name = convert_tensor_name(new_name);

TensorStorage tensor_storage(new_name, type, ne, n_dims, file_index, ST_HEADER_SIZE_LEN + header_size_ + begin);
This is breaking some LoRAs.
Thank you for your contribution.
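For readers following the model.cpp snippet above: the prefix is applied before the name conversion, so the whole prefixed, diffusers-style key gets remapped at once. A minimal sketch of that order of operations, using a hypothetical stand-in mapping (the real convert_tensor_name handles far more cases):

```cpp
#include <map>
#include <string>

// Hypothetical stand-in for convert_tensor_name(): remap a prefixed
// diffusers-style key to an internal key. The single table entry below is
// illustrative only; it is not the actual mapping used by model.cpp.
static std::string convert_tensor_name_sketch(const std::string& name) {
    static const std::map<std::string, std::string> table = {
        {"unet.conv_in.weight", "model.diffusion_model.input_blocks.0.0.weight"},
    };
    auto it = table.find(name);
    return it != table.end() ? it->second : name;  // unknown names pass through
}

// Usage mirroring the snippet: prefix first, then convert.
// std::string new_name = convert_tensor_name_sketch(prefix + name);
```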
@stduhpf, I'm getting an 'f64 unsupported' error trying the model mentioned on #153 (https://civitai.com/models/7371/rev-animated?modelVersionId=425083, fp32 file):
This is on master-dafc32d.
Yeah, F64 is still not properly supported. I didn't realize there isn't a built-in "dequantization" (or rather quantization, in that case) function for F64 to F32 in GGML, so I just implemented it with #726. This fixes the crash at loading time, but I can't get inference to run with either the Vulkan or CPU backend when using models with F64 weights. Forcing it to use F32 works with the new changes I made (
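For reference, the conversion itself is just a narrowing copy; a minimal sketch (not the actual #726 code) of turning an F64 weight buffer into F32 at load time:

```cpp
#include <cstddef>
#include <vector>

// Minimal sketch (not the code from #726): narrow a raw F64 buffer, as read
// from a checkpoint file, into F32 so the rest of the loader can treat it as
// a regular float tensor.
static std::vector<float> f64_to_f32(const double* src, size_t n) {
    std::vector<float> dst(n);
    for (size_t i = 0; i < n; ++i) {
        dst[i] = static_cast<float>(src[i]);  // precision loss is acceptable for weights
    }
    return dst;
}
```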
Loading diffusers models wasn't working; this fixes it for me. I also added support for SDXL diffusers models (the second text encoder wasn't being handled).
During testing, I came across models with 64-bit types, so I added support for those too, since they are supported by GGML. (Fixes #153, #669)
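As a rough illustration of the second-text-encoder point: a diffusers checkpoint is a directory with unet/, vae/, text_encoder/, and (for SDXL) text_encoder_2/ subfolders, and the loader has to pick a different name prefix for each. The sketch below shows that idea only; the folder names are the standard diffusers layout, but the prefixes are placeholders, not the exact strings used by model.cpp:

```cpp
#include <string>

// Hypothetical sketch of choosing a name prefix from the diffusers directory
// layout. The prefixes are placeholders; SDXL's second CLIP encoder
// (text_encoder_2/) is the part that was previously not handled.
static std::string prefix_for_diffusers_path(const std::string& path) {
    if (path.find("unet/") != std::string::npos)           return "unet.";
    if (path.find("vae/") != std::string::npos)            return "vae.";
    if (path.find("text_encoder_2/") != std::string::npos) return "te2.";
    if (path.find("text_encoder/") != std::string::npos)   return "te1.";
    return "";
}
```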