Which benchmarks can help me compare GPUs and decide which one to buy for image processing?

Hi,

I’m running an AMD Ryzen 5 5600X CPU and an NVIDIA GTX 1060 6 GB card. The GPU’s power limiter is set to 60 W (down from the maximum of 120 W) – this does not seem to impact darktable export performance. For the CPU, I use the ondemand governor. I started up my machine almost 2 hours ago, but was away from it; CPU frequency stats indicate: 3.70 GHz:0.97%, 2.80 GHz:1.87%, 2.20 GHz:97.16%, so the governor is working (no surprise there).

What I found was that building darktable with the Release profile instead of RelWithDebInfo gave a huge performance boost for diffuse or sharpen when the GPU was not used. The GPU was still a lot faster, though, and even with 6 GB of GPU memory I needed to set a higher OpenCL memory headroom (in my case, 800 MB). See Pro Contrast Moose Peterson - #25 by kofa for details, but the summary is:

RelWithDebInfo, CPU path:

[dev_pixelpipe] took 106.707 secs (1127.128 CPU) processed `diffuse or sharpen' on CPU, blended on CPU [export]
[dev_pixelpipe] took 46.582 secs (446.771 CPU) processed `diffuse or sharpen 1' on CPU, blended on CPU [export]
[dev_pixelpipe] took 137.392 secs (1337.772 CPU) processed `diffuse or sharpen 2' on CPU, blended on CPU [export]

Release, CPU path:

[dev_pixelpipe] took 44.513 secs (495.844 CPU) processed `diffuse or sharpen' on CPU, blended on CPU [export]
[dev_pixelpipe] took 16.002 secs (180.493 CPU) processed `diffuse or sharpen 1' on CPU, blended on CPU [export]
[dev_pixelpipe] took 48.474 secs (551.746 CPU) processed `diffuse or sharpen 2' on CPU, blended on CPU [export]

OpenCL:

[dev_pixelpipe] took 23.684 secs (23.518 CPU) processed `diffuse or sharpen' on GPU with tiling, blended on CPU [export]
[dev_pixelpipe] took 7.059 secs (7.005 CPU) processed `diffuse or sharpen 1' on GPU with tiling, blended on CPU [export]
[dev_pixelpipe] took 18.559 secs (17.528 CPU) processed `diffuse or sharpen 2' on GPU with tiling, blended on CPU [export]
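
If you want to compare runs like these without eyeballing the log, a small script can total the per-module wall-clock times. This is only a rough sketch: the names `LOGS`, `LINE_RE` and `total_wall_seconds` are made up for illustration, the regular expression assumes the `[dev_pixelpipe] took ... secs (... CPU) processed` format shown above, and the log lines are the ones pasted in this post.

```python
import re

# Log excerpts copied from this post, grouped by configuration.
LOGS = {
    "RelWithDebInfo, CPU": """
[dev_pixelpipe] took 106.707 secs (1127.128 CPU) processed `diffuse or sharpen' on CPU, blended on CPU [export]
[dev_pixelpipe] took 46.582 secs (446.771 CPU) processed `diffuse or sharpen 1' on CPU, blended on CPU [export]
[dev_pixelpipe] took 137.392 secs (1337.772 CPU) processed `diffuse or sharpen 2' on CPU, blended on CPU [export]
""",
    "Release, CPU": """
[dev_pixelpipe] took 44.513 secs (495.844 CPU) processed `diffuse or sharpen' on CPU, blended on CPU [export]
[dev_pixelpipe] took 16.002 secs (180.493 CPU) processed `diffuse or sharpen 1' on CPU, blended on CPU [export]
[dev_pixelpipe] took 48.474 secs (551.746 CPU) processed `diffuse or sharpen 2' on CPU, blended on CPU [export]
""",
    "OpenCL": """
[dev_pixelpipe] took 23.684 secs (23.518 CPU) processed `diffuse or sharpen' on GPU with tiling, blended on CPU [export]
[dev_pixelpipe] took 7.059 secs (7.005 CPU) processed `diffuse or sharpen 1' on GPU with tiling, blended on CPU [export]
[dev_pixelpipe] took 18.559 secs (17.528 CPU) processed `diffuse or sharpen 2' on GPU with tiling, blended on CPU [export]
""",
}

# Captures wall-clock seconds, CPU seconds and the module name from one log line.
LINE_RE = re.compile(r"took (\d+\.\d+) secs \((\d+\.\d+) CPU\) processed `([^']+)'")


def total_wall_seconds(log_text):
    """Sum the wall-clock seconds of every pixelpipe line found in log_text."""
    return sum(float(m.group(1)) for m in LINE_RE.finditer(log_text))


totals = {name: total_wall_seconds(text) for name, text in LOGS.items()}
baseline = totals["RelWithDebInfo, CPU"]

for name, secs in totals.items():
    # Speedup is relative to the slowest configuration (RelWithDebInfo on CPU).
    print(f"{name}: {secs:.3f} s total, {baseline / secs:.2f}x vs RelWithDebInfo")
```

Summed over the three diffuse or sharpen instances, that works out to roughly 291 s for RelWithDebInfo, 109 s for Release and 49 s with OpenCL, so the Release build is about 2.7 times faster than RelWithDebInfo on the CPU, and the GPU path is a bit more than twice as fast again.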