TST: Calculate RMS and diff image in C++ #29102

QuLogic · 2024-11-08T09:26:26Z

PR summary

The current implementation is not slow, but uses a lot of memory per image.

In compare_images, we have:

one actual and one expected image as uint8 (2×image)
both converted to int16 (though original is thrown away) (4×)

which adds up to 4× the image allocated in this function.

Then it calls calculate_rms, which has:

a difference between them as int16 (2×)
the difference cast to 64-bit float (8×)
the square of the difference as 64-bit float (though possibly the original difference was thrown away) (8×)

which at its peak has 16× the image allocated in parallel.

If the RMS is over the desired tolerance, then save_diff_image is called, which:

loads the actual and expected images again as uint8 (2× image)
converts both to 64-bit float (throwing away the original) (16×)
calculates the difference (8×)
calculates the absolute value (8×)
multiples that by 10 (in-place, so no allocation)
clips to 0-255 (8×)
casts to uint8 (1×)

which at peak uses 32× the image.

So at their peak, compare_images→calculate_rms will have 20× the image allocated, and then compare_images→save_diff_image will have 36× the image allocated. This is generally not a problem, but on resource-constrained places like WASM, it can sometimes run out of memory just in calculate_rms.

This implementation in C++ always allocates the diff image, even when not needed, but doesn't have all the temporaries, so it's a maximum of 3× the image size (plus a few scalar temporaries).

PR checklist

[n/a] "closes #0000" is in the body of the PR description to link the related issue
new and changed code is tested
[n/a] Plotting related features are demonstrated in an example
[n/a] New Features and API Changes are noted with a directive and release note
[n/a] Documentation complies with general and docstring guidelines

The current implementation is not slow, but uses a lot of memory per image. In `compare_images`, we have: - one actual and one expected image as uint8 (2×image) - both converted to int16 (though original is thrown away) (4×) which adds up to 4× the image allocated in this function. Then it calls `calculate_rms`, which has: - a difference between them as int16 (2×) - the difference cast to 64-bit float (8×) - the square of the difference as 64-bit float (though possibly the original difference was thrown away) (8×) which at its peak has 16× the image allocated in parallel. If the RMS is over the desired tolerance, then `save_diff_image` is called, which: - loads the actual and expected images _again_ as uint8 (2× image) - converts both to 64-bit float (throwing away the original) (16×) - calculates the difference (8×) - calculates the absolute value (8×) - multiples that by 10 (in-place, so no allocation) - clips to 0-255 (8×) - casts to uint8 (1×) which at peak uses 32× the image. So at their peak, `compare_images`→`calculate_rms` will have 20× the image allocated, and then `compare_images`→`save_diff_image` will have 36× the image allocated. This is generally not a problem, but on resource-constrained places like WASM, it can sometimes run out of memory just in `calculate_rms`. This implementation in C++ always allocates the diff image, even when not needed, but doesn't have all the temporaries, so it's a maximum of 3× the image size (plus a few scalar temporaries).

QuLogic · 2025-06-04T22:10:29Z

So I no longer have any memory-based skips on the PR adding WASM, but maybe we still want to do this to save memory in general?

oscargus · 2025-06-05T10:28:06Z

This seems to make sense!

Should we also use this in compare_rms? Or deprecate that?

story645 · 2025-06-09T22:31:17Z

lib/matplotlib/testing/compare.py

    PNG via the `.converter` dictionary. The underlying RMS is calculated
-    with the `.calculate_rms` function.
+    in a similar way to the `.calculate_rms` function.


I think what's important here is how these methods differ? (what's the takeaway supposed to be here?)

The only reason I wrote it vaguely is that I didn't want people to think they could monkeypatch calculate_rms and expect the image_comparison decorator / compare_images to use it. The algorithm is otherwise the same, I think.

story645 · 2025-06-09T22:32:44Z

src/_image_wrapper.cpp

+    if (expected_image.ndim() != 3) {
+        auto exceptions = py::module_::import("matplotlib.testing.exceptions");
+        auto ImageComparisonFailure = exceptions.attr("ImageComparisonFailure");
+        py::set_error(
+            ImageComparisonFailure,
+            "Expected image must be 3-dimensional, but is {ndim}-dimensional"_s.format(
+                "ndim"_a=expected_image.ndim()));
+        throw py::error_already_set();
+    }
+
+    if (actual_image.ndim() != 3) {
+        auto exceptions = py::module_::import("matplotlib.testing.exceptions");
+        auto ImageComparisonFailure = exceptions.attr("ImageComparisonFailure");
+        py::set_error(
+            ImageComparisonFailure,
+            "Actual image must be 3-dimensional, but is {ndim}-dimensional"_s.format(
+                "ndim"_a=actual_image.ndim()));
+        throw py::error_already_set();
+    }


Can this be done in a loop since it's the same test/error message?

src/_image_wrapper.cpp

story645 · 2025-06-13T20:15:38Z

src/_image_wrapper.cpp

+
+                if (k != 3) { // Hard-code a fully solid alpha channel by omitting it.
+                    diff(i, j, k) = static_cast<unsigned char>(std::clamp(
+                        abs(pixel_diff) * 10, // Expand differences in luminance domain.


why are you only doing this is for rgba?

The alpha channel is ignored, just as with compare_images.

QuLogic added topic: testing Performance labels Nov 8, 2024

github-actions bot added the topic: images label Nov 8, 2024

QuLogic mentioned this pull request Nov 8, 2024

Add wasm CI #29093

Open

4 tasks

github-actions bot added the status: needs rebase label Jan 4, 2025

oscargus approved these changes Jun 5, 2025

View reviewed changes

story645 reviewed Jun 13, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

TST: Calculate RMS and diff image in C++ #29102

TST: Calculate RMS and diff image in C++ #29102

QuLogic commented Nov 8, 2024

Uh oh!

QuLogic commented Jun 4, 2025

Uh oh!

oscargus commented Jun 5, 2025

Uh oh!

story645 Jun 9, 2025

Uh oh!

QuLogic Jun 13, 2025 •

edited

Loading

Uh oh!

story645 Jun 9, 2025

Uh oh!

Uh oh!

story645 Jun 13, 2025

Uh oh!

QuLogic Jun 13, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

TST: Calculate RMS and diff image in C++ #29102

Are you sure you want to change the base?

TST: Calculate RMS and diff image in C++ #29102

Conversation

QuLogic commented Nov 8, 2024

PR summary

PR checklist

Uh oh!

QuLogic commented Jun 4, 2025

Uh oh!

oscargus commented Jun 5, 2025

Uh oh!

story645 Jun 9, 2025

Choose a reason for hiding this comment

Uh oh!

QuLogic Jun 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

story645 Jun 9, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

story645 Jun 13, 2025

Choose a reason for hiding this comment

Uh oh!

QuLogic Jun 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

QuLogic Jun 13, 2025 •

edited

Loading

QuLogic Jun 13, 2025 •

edited

Loading