Any idea for better performance distortion transform ? #4459

jilen · 2025-04-08T12:55:38Z

I wrote a distortion transform, shown below, using the Java binding vips-ffm, but it is a bit slow.

Any ideas for improving performance?

Here are the timings over 1000 iterations:

--- Step Timings ---
GenText             : Total:    154.925 ms, Avg:      0.155 ms
addDisplacement     : Total:     67.943 ms, Avg:      0.068 ms
composite2          : Total:    279.342 ms, Avg:      0.279 ms
displacementMap     : Total:    127.110 ms, Avg:      0.127 ms
embed               : Total:     31.626 ms, Avg:      0.032 ms
identityMap         : Total:     19.997 ms, Avg:      0.020 ms
mapim               : Total:    417.110 ms, Avg:      0.417 ms
noiseMap            : Total:     20.438 ms, Avg:      0.020 ms
randomBg            : Total:    189.838 ms, Avg:      0.190 ms
text_processing     : Total:    104.905 ms, Avg:      0.105 ms

Code

def distortion(sourceTextImage: VImage)(using arena: Arena) = {

  // add padding
  val inputVImg = timeStep("embed") {
    sourceTextImage.embed(
      10, // Adjust padding as needed
      10,
      sourceTextImage.getWidth() + 20, // Use source image dimensions
      sourceTextImage.getHeight() + 20
    )
  }


  val textLayerMapped = timeStep("text_mapped") {
    val w = inputVImg.getWidth()
    val h = inputVImg.getHeight()

    // Create an identity map (R=x, G=y)
    val identityMap = timeStep("identityMap") {
      VImage.xyz(arena, w, h)
    }

    // Generate noise for displacement (adjust sigma for distortion strength)
    val noise = timeStep("noiseMap") {
      VImage.gaussnoise(
        arena,
        w,
        h,
        VipsOption.Int(
          "sigma",
          2
        ), // Controls distortion intensity (reduced from 5)
        VipsOption.Int("mean", 0)
      )
    }

    // Create a 2-band displacement map (noise for x, 0 for y)
    val displacementMap = timeStep("displacementMap") {
      val zero = VImage
        .black(arena, w, h)
      VImage.bandjoin(arena, JList.of(noise, zero))
    }

    // Add displacement to the identity map
    val finalMap = timeStep("addDisplacement") {
      identityMap.add(displacementMap)
    }

    // Apply the map
    timeStep("mapim") {
      inputVImg.mapim(finalMap)
    }
  }

  val bg = timeStep("randomBg") {
    randomBg(textLayerMapped.getWidth(), textLayerMapped.getHeight())
  }

  // Group text processing steps or time individually if needed
  val textLayerFinal = timeStep("text_processing") {
    textLayerMapped
      .linear(
        JList.of(1.0, 1.0, 1.0, 0.5),
        JList.of(0.0)
      ) // Make text semi-transparent
      .cast(VipsBandFormat.FORMAT_UCHAR)
  }

  val out = timeStep("composite2") {
    bg.composite2(
      textLayerFinal,
      VipsBlendMode.BLEND_MODE_OVER
    )
  }
  out
}

jcupitt (Maintainer) · 2025-04-08T16:15:52Z

Hi @jilen,

libvips is demand-driven, so you can't really time individual operations. Your program above is not actually processing any pixels -- it's constructing a large pipeline of operations. When you do the final writeToFile() or whatever, every operation in the pipeline executes in parallel and at the same time.

You can run the final save inside a profiler and see which operations are using the most CPU. That might be useful.
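To make the demand-driven point concrete, here is a toy sketch in plain Python. It is only an analogy, not libvips code: `LazyImage`, `constant`, `add` and `evaluate` are hypothetical names invented for this illustration. Building the pipeline touches no pixels, so a timer wrapped around a single step would only measure the cost of creating a node; all the work happens at the final sink.

```python
class LazyImage:
    """Toy stand-in for a demand-driven image: it stores a recipe, not pixels."""

    pixels_computed = 0  # class-wide count of pixels actually evaluated

    def __init__(self, width, height, pixel_fn):
        self.width = width
        self.height = height
        self._pixel_fn = pixel_fn  # only called when a pixel is requested

    @classmethod
    def constant(cls, width, height, value):
        return cls(width, height, lambda x, y: value)

    def add(self, other):
        # Creates a new pipeline node; no pixel is computed here.
        return LazyImage(
            self.width,
            self.height,
            lambda x, y: self._pixel_fn(x, y) + other._pixel_fn(x, y),
        )

    def evaluate(self):
        # The "sink": every upstream operation runs here, pixel by pixel.
        rows = []
        for y in range(self.height):
            row = []
            for x in range(self.width):
                row.append(self._pixel_fn(x, y))
                LazyImage.pixels_computed += 1
            rows.append(row)
        return rows


a = LazyImage.constant(200, 100, 1)
b = LazyImage.constant(200, 100, 2)
pipeline = a.add(b).add(a)                        # pipeline construction only
computed_after_build = LazyImage.pixels_computed  # nothing evaluated yet
pixels = pipeline.evaluate()                      # all the work happens here
```

In a real libvips program the equivalent of `evaluate()` is the final write, which is why timing the whole run, or profiling the final save, is the meaningful measurement.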

jcupitt (Maintainer) · 2025-04-08T16:44:33Z

I made a python version of your code (I hope) for benchmarking:

#!/usr/bin/env python3

import sys
import pyvips

def noise_warp(image):
    image = image.embed(10, 10, image.width + 20, image.height + 20)
    distort = pyvips.Image.xyz(image.width, image.height) + \
        pyvips.Image.gaussnoise(image.width, image.height, sigma=2, mean=0) \
        .bandjoin(0)
    return image.mapim(distort)

bg = pyvips.Image.new_from_file(sys.argv[1], access="sequential")
text = noise_warp(pyvips.Image.text(sys.argv[3], rgba=True, dpi=600))
image = bg.composite2(text * [1, 1, 1, 0.5], "over", x=20, y=20)
image.write_to_file(sys.argv[2])

Run with, e.g.:

$ time ./distort-noise.py ~/pics/k2.jpg x.jpg "hello world"

real	0m0.241s
user	0m0.238s
sys	0m0.065s

So roughly 240 ms to load a 1450 x 2048 pixel JPG, render the distorted text in the top-left corner, and save it back again. That's fast compared to something like ImageMagick, but slow compared to a GPU shader. It depends on your needs, of course.

You could make it go a bit quicker by only generating the distortion map once and reusing it 1000 times (save and reuse an image in python with image = image.copy_memory()), or by only generating the text once and reusing 1000 times. Again, it depends on your exact needs.

jilen (Author) · Apr 9, 2025

I am creating captcha images: I draw some text, then add distortion and a noisy background.

It takes about 4 ms on Zen 3 (about 20 ms on older hardware) to produce one captcha image. Any ideas on how to improve the performance?

jcupitt (Maintainer) · Apr 9, 2025

Ah OK.

libvips parallelism is based on regions, typically 128 x 128 pixels. If the images you are working with are small, you'll get little benefit. libvips is designed for processing large images quickly without using much memory.

For small images I'd suggest skia or opencv instead.
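As a rough back-of-the-envelope illustration of why small images see little benefit, the sketch below counts how many 128 x 128 tiles cover an image. The 200 x 80 captcha size is an assumption chosen for the example, and real libvips scheduling is more involved than a simple tile count, so treat this as an intuition aid only.

```python
import math

TILE = 128  # typical region size mentioned above; the real value can vary


def tile_count(width, height, tile=TILE):
    """Number of tile x tile regions needed to cover a width x height image."""
    return math.ceil(width / tile) * math.ceil(height / tile)


captcha_tiles = tile_count(200, 80)    # hypothetical captcha size
photo_tiles = tile_count(1450, 2048)   # the JPG from the benchmark above

print(f"captcha: {captcha_tiles} tiles, photo: {photo_tiles} tiles")
```

With only a couple of tiles there is very little work to spread across threads, so per-image setup cost dominates, whereas the large photo offers nearly two hundred independent units of work.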

Answer selected by jilen

jilen (Author) · Apr 9, 2025

@jcupitt Thanks for the advice
