
add progress callback, suppress pretty_progress #170


Merged: 9 commits into leejet:master, Mar 2, 2024

Conversation

fszontagh
Contributor

No description provided.

@leejet
Owner

leejet commented Feb 24, 2024

Is there a specific reason to increase the size of many graphs and some reserved buffer sizes by 2 to 4 times? Additionally, ggml_n_dims_t seems to serve no purpose, as ggml_n_dims performs the same function. Passing n_dim directly into the creation of ggml_tensor from tensors_storage does not seem to cause any issues, as the calculation of ggml_tensor dimensions does not directly use the passed n_dim but instead calculates it through ggml_n_dims.

@fszontagh
Contributor Author

Is there a specific reason to increase the size of many graphs and some reserved buffer sizes by 2 to 4 times? Additionally, ggml_n_dims_t seems to serve no purpose, as ggml_n_dims performs the same function. Passing n_dim directly into the creation of ggml_tensor from tensors_storage does not seem to cause any issues, as the calculation of ggml_tensor dimensions does not directly use the passed n_dim but instead calculates it through ggml_n_dims.

There it is: #178

But it's not just the controlnet models; these errors happened with larger LoRA model files too. I went through my "lora collection" and tested some of the LoRA models, and I adjusted these params until the larger LoRAs started working.

ggml_n_dims_t seems to serve no purpose, as ggml_n_dims

Sorry, I don't remember what the problem was there. But the new method is required, because the original ggml_n_dims parameter type was not compatible with the TensorStorage type.

If you want, I will try to reproduce the original problem. If I remember correctly, tensor_storage.n_dims was empty or corrupt.
Maybe it is no longer needed.
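
A plausible shape for such a helper, sketched here as a hypothetical reconstruction (the PR's actual ggml_n_dims_t and the TensorStorage field names are assumptions), would mirror ggml_n_dims but read the shape recorded in the model file rather than an already-created ggml_tensor:

```cpp
// Hypothetical sketch: compute the rank from a TensorStorage's shape array
// (field names assumed) instead of from a ggml_tensor, sidestepping a stale
// or corrupt tensor_storage.n_dims value.
static int ggml_n_dims_t(const TensorStorage& ts) {
    for (int i = GGML_MAX_DIMS - 1; i >= 1; --i) {
        if (ts.ne[i] > 1) {
            return i + 1;
        }
    }
    return 1;
}
```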

@fszontagh
Contributor Author

I reproduced the n_dims problem. If I added a LoRA to the prompt (a hair length slider LoRA), the following assertion happened:
[image: screenshot of the assertion failure]

@leejet
Owner

leejet commented Feb 25, 2024

I reproduced the n_dims problem. If I added a LoRA to the prompt (a hair length slider LoRA), the following assertion happened:

@fszontagh

I used the latest code from the master branch and did not encounter this issue. Can you try using the latest code from the master branch and see if the problem persists?

[INFO ] model.cpp:676  - load ..\models\hair_length_slider_v1.safetensors using safetensors format
[DEBUG] model.cpp:742  - init from '..\models\hair_length_slider_v1.safetensors'
[INFO ] lora.hpp:37   - loading LoRA from '..\models\hair_length_slider_v1.safetensors'
[DEBUG] model.cpp:1307 - loading tensors from ..\models\hair_length_slider_v1.safetensors
[DEBUG] ggml_extend.hpp:820  - lora params backend buffer size =   6.49 MB (10240 tensors)
[DEBUG] model.cpp:1307 - loading tensors from ..\models\hair_length_slider_v1.safetensors
[DEBUG] lora.hpp:67   - finished loaded lora
[DEBUG] ggml_extend.hpp:774  - lora compute buffer size: 100.02 MB
[INFO ] stable-diffusion.cpp:417  - lora 'hair_length_slider_v1' applied, taking 0.11s

@Cyberhan123
Contributor

I have a guess about this problem. This file: https://github.com/leejet/stable-diffusion.cpp/blob/4a8190405ac32930678ce030dff6289ed680b6fc/.gitmodules#L3C44-L3C45
points to the official ggml repository, but since PR #159 was merged it actually points to leejet/ggml, which is confusing for most people.

@fszontagh
Contributor Author

fszontagh commented Feb 25, 2024

@leejet
It's the latest; I just compiled it and tried to run sd(.exe), without success. Here is the full command history:
stable_diffusion_latest_full_cmd.txt

And here it is again, but from a completely fresh start:
stable_diffusion_latest_full_cmd_2try.txt

In WSL it works fine with LoRA too. Tested with a 256.6 MB LoRA :)

@fszontagh
Contributor Author

fszontagh commented Feb 25, 2024

Built with MSVC 2019 in VS Code, started the diffusion, but got:
Assertion failed: n_dims >= 1 && n_dims <= GGML_MAX_DIMS, file Z:\stable-diffusion.cpp_latest2\ggml\src\ggml.c, line 2678

Please see the full command log:
msvc2019.txt

Without LoRA it's fine.

@fszontagh
Contributor Author

The same thing happens here with the auto-built release.
I downloaded the CUDA version and tried to run it, but it just stopped when a LoRA was in the prompt.

@Cyberhan123

@fszontagh
Contributor Author

fszontagh commented Feb 25, 2024

Another test, with the latest code. I re-applied my LoRA changes (n_dims), then tried to reproduce an image with an embedding:

[DEBUG] model.cpp:1307 - loading tensors from D:\SD_MODELS\embeddings\ng_deepnegative_v1_75t.pt
ggml_new_object: not enough space in the context's memory pool (needed 115632, available 32768)
Assertion failed: false, file Z:\stable-diffusion.cpp_latest2\ggml\src\ggml.c, line 2643

Another shot:

[INFO ] model.cpp:679 - load D:\SD_MODELS\embeddings\badhandv4.pt using checkpoint format
[DEBUG] model.cpp:1191 - init from 'D:\SD_MODELS\embeddings\badhandv4.pt'
[DEBUG] model.cpp:1307 - loading tensors from D:\SD_MODELS\embeddings\badhandv4.pt
[DEBUG] clip.hpp:923 - embedding 'badhandv4' applied, custom embeddings: 6
[DEBUG] clip.hpp:311 - split prompt ", text, title, logo, signature,username,bad anatomy, badhandv4,blurry, multiple ears, sharp teeth," to tokens [",", "text", ",", "title", ",", "logo", ",", "signature", ",", "username", ",", "bad", "anatomy", ",", ",", "blurry", ",", "multiple", "ears", ",", "sharp", "teeth", ",", ]
[DEBUG] clip.hpp:511 - clip_skip 2
ggml_gallocr_reserve_n: reallocating CUDA0 buffer from size 0.00 MiB to 72.62 MiB
[DEBUG] ggml_extend.hpp:774 - clip compute buffer size: 72.62 MB
[DEBUG] clip.hpp:511 - clip_skip 2
GGML_ASSERT: Z:\stable-diffusion.cpp_latest2\ggml\src\ggml-cuda.cu:8583: src0->type == GGML_TYPE_F32
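
The "not enough space in the context's memory pool" failure above is ggml refusing to allocate a new object inside a fixed-size context. The usual remedy is to reserve a larger pool when the context is created; a minimal sketch (the numbers are illustrative, not the values this PR used):

```cpp
// Sketch: a ggml context gets a fixed memory pool up front. The log above
// shows an allocation needing 115632 bytes with only 32768 left, so the
// pool has to be sized larger at init time.
struct ggml_init_params params;
params.mem_size   = 4 * 115632;  // illustrative: comfortably above the need
params.mem_buffer = NULL;        // let ggml allocate the pool itself
params.no_alloc   = false;
struct ggml_context* ctx = ggml_init(params);
```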

@leejet
Owner

leejet commented Feb 26, 2024

Another test, with the latest code. I re-applied my LoRA changes (n_dims), then tried to reproduce an image with an embedding

Currently, support for very large embeddings is not available. I will add it later.

@leejet
Owner

leejet commented Feb 26, 2024

Built with MSVC 2019 in VS Code, started the diffusion, but got: Assertion failed: n_dims >= 1 && n_dims <= GGML_MAX_DIMS, file Z:\stable-diffusion.cpp_latest2\ggml\src\ggml.c, line 2678

Please see the full command log: msvc2019.txt

Without LoRA it's fine.

This issue is quite puzzling; I cannot replicate it in my local environment.

@fszontagh
Contributor Author

fszontagh commented Feb 26, 2024

I built on a "virgin" PC (AVX-512), and the same thing happened. Then I downloaded the prebuilt binary to the same PC (AVX-512); that works fine. But the downloaded CUDA version fails on my machine too.

Maybe the compiler is causing this? The CI uses an Enterprise edition, but I use the Community edition.

@fszontagh
Contributor Author

@leejet I'm giving up on this n_dims issue. But the 'progress callback' is a good feature; do you accept the PR as-is?
In summary, the ggml_n_dims_t method causes no harm, and it may become unnecessary after some future update, but currently it works here, at least.

@Cyberhan123
Contributor

The same thing happens here with the auto-built release. I downloaded the CUDA version and tried to run it, but it just stopped when a LoRA was in the prompt.

@Cyberhan123

I feel like your problem is that you didn't pull the correct git submodule.

@fszontagh
Contributor Author

@Cyberhan123 I tested with a fresh start too.

@leejet
Owner

leejet commented Feb 27, 2024

But the 'progress callback' feature is a good feature. Do you accept the pr as-is?

Certainly, I'm willing to merge this PR, or we can merge the progress callback first if you prefer.

@fszontagh
Contributor Author

@leejet

Certainly, I'm willing to merge this PR, or we can merge the progress callback first if you prefer.

Okay. I pushed it with the ggml_n_dims_t method removed. After all, the 'progress callback' was the main point of this PR.
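
For readers looking for the feature itself: current stable-diffusion.cpp exposes a progress callback along these lines (a sketch; the exact names and signature in the header may have evolved since this PR):

```cpp
#include "stable-diffusion.h" // assumed header name
#include <cstdio>

// Sketch: register a callback to receive per-step progress instead of the
// built-in pretty_progress output. Signature per recent headers; it may
// differ from what this PR originally introduced.
static void on_progress(int step, int steps, float time, void* data) {
    (void) data;
    fprintf(stderr, "step %d/%d, %.2fs/step\n", step, steps, time);
}

int main() {
    sd_set_progress_callback(on_progress, NULL);
    // ... create the sd context and run txt2img/img2img as usual ...
    return 0;
}
```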

@leejet
Owner

leejet commented Mar 2, 2024

Thank you for your contribution.

@leejet leejet merged commit 7be65fa into leejet:master Mar 2, 2024
rmatif pushed a commit to rmatif/stable-diffusion.cpp that referenced this pull request Apr 8, 2025