Does OpenCL improve speed of setting parametric masks?

Lunchtime procrastination…

Used the source RAW and *.xmp from here: https://math.dartmouth.edu/~sarunas/darktable_bench.html

Old Desktop, 6-7 year-old Intel NUC:

Intel(R) Core(TM) i7-6770HQ CPU @ 2.60GHz w/ 32GB RAM
/usr/bin/flatpak run --branch=stable --arch=x86_64 --command=/app/bin/darktable-cli --file-forwarding org.darktable.Darktable setubal.orf setubal.orf.xmp test.jpg --core --disable-opencl -d perf
44.8865 [dev_process_export] pixel pipeline processing took 42.476 secs (249.832 CPU)
45.3970 [dev_process_export] pixel pipeline processing took 42.565 secs (250.119 CPU)
46.9705 [dev_process_export] pixel pipeline processing took 44.434 secs (254.009 CPU)

AVERAGES:

  • pixel pipeline processing = 43.158 sec
  • CPU took = 251.320 sec
  • Baseline value.
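
For anyone who wants to repeat this without a calculator, here is a minimal sketch of how the averages could be computed from the `-d perf` output. The regex is my own assumption, based only on the three log lines pasted above; other darktable versions may word the line differently.

```python
import re
from statistics import mean

# Log lines copied from the old-desktop run above.
LOG = """\
44.8865 [dev_process_export] pixel pipeline processing took 42.476 secs (249.832 CPU)
45.3970 [dev_process_export] pixel pipeline processing took 42.565 secs (250.119 CPU)
46.9705 [dev_process_export] pixel pipeline processing took 44.434 secs (254.009 CPU)
"""

# Assumed line format: "... took <wall> secs (<cpu> CPU)".
PATTERN = re.compile(r"pixel pipeline processing took ([\d.]+) secs \(([\d.]+) CPU\)")


def average_timings(log_text):
    """Return (mean wall-clock secs, mean CPU secs) over all matching lines."""
    pairs = [(float(w), float(c)) for w, c in PATTERN.findall(log_text)]
    if not pairs:
        raise ValueError("no 'pixel pipeline processing' lines found")
    wall, cpu = zip(*pairs)
    return mean(wall), mean(cpu)


wall_avg, cpu_avg = average_timings(LOG)
print(f"pixel pipeline processing = {wall_avg:.3f} sec")
print(f"CPU took = {cpu_avg:.3f} sec")
```

Paste the log of any other run into `LOG` to get its averages the same way.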

New Desktop, spec as in post above: AMD Ryzen 7 5700G only, w/ OpenCL disabled.

AMD Ryzen 7 5700G with Radeon Graphics w/ 64GB RAM
/usr/bin/flatpak run --branch=stable --arch=x86_64 --command=/app/bin/darktable-cli --file-forwarding org.darktable.Darktable setubal.orf setubal.orf.xmp test.jpg --core --disable-opencl -d perf
12.9354 [dev_process_export] pixel pipeline processing took 12.101 secs (160.745 CPU)
12.9902 [dev_process_export] pixel pipeline processing took 12.156 secs (161.078 CPU)
13.0178 [dev_process_export] pixel pipeline processing took 12.181 secs (162.105 CPU)

AVERAGES:

  • pixel pipeline processing = 12.146 sec
  • CPU took = 161.309 sec
  • Speedup = 3.55 times faster than the old desktop !

New Desktop, spec as in post above: AMD Ryzen 7 5700G w/ NVIDIA GeForce RTX 3060, OpenCL enabled.

AMD Ryzen 7 5700G with Radeon Graphics w/ 64GB RAM
/usr/bin/flatpak run --branch=stable --arch=x86_64 --command=/app/bin/darktable-cli --file-forwarding org.darktable.Darktable setubal.orf setubal.orf.xmp test.jpg --core -d opencl -d perf
3.0812 [dev_process_export] pixel pipeline processing took 2.135 secs (3.174 CPU)
3.0812 [dev_process_export] pixel pipeline processing took 2.137 secs (3.241 CPU)
3.0896 [dev_process_export] pixel pipeline processing took 2.132 secs (3.212 CPU)

[dt_opencl_device_init]
   DEVICE:                   0: 'NVIDIA GeForce RTX 3060'
   PLATFORM NAME & VENDOR:   NVIDIA CUDA, NVIDIA Corporation
   CANONICAL NAME:           nvidiacudanvidiageforcertx3060
   DRIVER VERSION:           535.86.05
   DEVICE VERSION:           OpenCL 3.0 CUDA, SM_20 SUPPORT
   DEVICE_TYPE:              GPU
   GLOBAL MEM SIZE:          12044 MB

AVERAGES:

  • pixel pipeline processing = 2.135 sec
  • CPU took = 3.209 sec
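  • Speedup = 20.22 times faster than the old desktop !!!
  • Against the same machine with OpenCL disabled (12.146 sec), that is about 5.7 times faster.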
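
The speedup figures are just ratios of the average wall-clock times. A rough sketch of that arithmetic, using the nine timings from the logs above (the dictionary labels and the `speedup` helper are my own naming, not anything from darktable):

```python
from statistics import mean

# Wall-clock "pixel pipeline processing" times (seconds) from the three runs above.
runs = {
    "old NUC, CPU only": [42.476, 42.565, 44.434],
    "5700G, CPU only": [12.101, 12.156, 12.181],
    "5700G + RTX 3060, OpenCL": [2.135, 2.137, 2.132],
}

averages = {name: mean(times) for name, times in runs.items()}


def speedup(reference, candidate):
    """How many times faster `candidate` is than `reference` (ratio of averages)."""
    return averages[reference] / averages[candidate]


for name in runs:
    print(f"{name}: {averages[name]:.3f} s, "
          f"{speedup('old NUC, CPU only', name):.2f}x vs old NUC")

# The comparison that isolates OpenCL: same machine, GPU on vs off.
print(f"OpenCL vs CPU only on the 5700G: "
      f"{speedup('5700G, CPU only', '5700G + RTX 3060, OpenCL'):.2f}x")
```

The last line is the one that speaks most directly to the OpenCL question, since the CPU, RAM and everything else stay the same.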