Performance issues

In my experience from trying to move my friends to DT, it is not really usable on Windows, crashes a lot and has major theming issues etc. Maybe it’s an easy fix, idk.
On Linux it’s really ok experience. Crashes every now and then, image rendering breaks from time to time but it’s pretty good experience overall. Exporting images is painfully slow even on very powerful hardware. Like, it’s reaaally slow.
Thankfully OpenCL helps a lot with most performance issues and exporting is around 45% faster which still isn’t lightning fast but at least you won’t have to wait for hours.

DaVinci Resolve works much better on Linux than on Windows. Imo you shouldn’t have any install issues with your hardware. One note is that you loose some output codecs like h.264, h.265 and some others. And you can forget the reading .mp3 files, only .wav. So prepare yourself for some transcoding.
Not working with delivery codecs in post processing also means a greater performance so using Resolve on Linux is well worth it.
Remember to install Nvidia CUDA in order for Resolve to work on your hardware.

1 Like

DT works fairly well on my Windows machine, but after a while it begins to bog down and I have to reboot. I’ve given up on OpenCL because it locks up my PC… no doubt it’s a hardware issue that’s specific to my suboptimal system

@Dave22152 I’ve given up on Windows long time ago, I still keep an instance for Lightroom but I wouldn’t know the first thing about how to go about making OpenCL work on Windows.

Morrn, da!

If I find the time to setup up Davinci resolve on linux, I may move.

Do you have a specific Linux distro in mind?
There are distros that are especially made for your CPU — but they do not necessarily accept your GFX. You have to find a nice balance :slight_smile:

In my case, Manjaro/Gnome works fine with daVinci Resolve (albeit just the free version, but still) as well as with my Ryzen 3900X and my Nvidia GFX.

Have fun!
Claes in Lund, Sweden

Can you give an example, just how slow? Have you tried running with -d perf -d opencl?
On my ancient machine (Core2 Duo @ 2.33 GHz, 4 GB RAM) + cheap low-mid-range(?) card (Nvidia 1060 with 6GB RAM), a simple edit of a 16 MPixel raw (with profiled denoise, colour balance, tone EQ, filmic and local contrast) ran in 12 seconds:

45322.597044 [dev] took 0.000 secs (0.000 CPU) to load the image.
45322.887927 [export] creating pixelpipe took 0.219 secs (0.262 CPU)
45322.887997 [pixelpipe_process] [export] using device 0
45322.931893 [dev_pixelpipe] took 0.042 secs (0.025 CPU) initing base buffer [export]
45322.952401 [dev_pixelpipe] took 0.020 secs (0.018 CPU) processed `raw black/white point' on GPU, blended on GPU [export]
45322.955184 [dev_pixelpipe] took 0.003 secs (0.002 CPU) processed `white balance' on GPU, blended on GPU [export]
45322.959135 [dev_pixelpipe] took 0.004 secs (0.000 CPU) processed `highlight reconstruction' on GPU, blended on GPU [export]
45323.152441 [dev_pixelpipe] took 0.193 secs (0.149 CPU) processed `hot pixels' on CPU, blended on CPU [export]
45326.014697 [dev_pixelpipe] took 2.861 secs (3.753 CPU) processed `demosaic' on CPU with tiling, blended on CPU [export]
45326.436091 [dev_pixelpipe] took 0.420 secs (0.332 CPU) processed `denoise (profiled)' on GPU, blended on GPU [export]
45330.208649 [dev_pixelpipe] took 3.770 secs (1.175 CPU) processed `lens correction' on GPU, blended on GPU [export]
45330.224098 [dev_pixelpipe] took 0.014 secs (0.010 CPU) processed `crop and rotate' on GPU, blended on GPU [export]
45330.233940 [dev_pixelpipe] took 0.009 secs (0.003 CPU) processed `exposure' on GPU, blended on GPU [export]
45331.515568 [dev_pixelpipe] took 1.281 secs (1.891 CPU) processed `tone equalizer' on CPU, blended on CPU [export]
45331.640680 [dev_pixelpipe] took 0.124 secs (0.116 CPU) processed `input color profile' on GPU, blended on GPU [export]
45331.650515 [dev_pixelpipe] took 0.009 secs (0.004 CPU) processed `color balance' on GPU, blended on GPU [export]
image colorspace transform Lab-->RGB took 0.259 secs (0.471 CPU) [filmicrgb ]
45333.868513 [dev_pixelpipe] took 2.217 secs (3.876 CPU) processed `filmic rgb' on CPU, blended on CPU [export]
image colorspace transform RGB-->Lab took 0.008 secs (0.000 GPU) [bilat ]
45334.117169 [dev_pixelpipe] took 0.248 secs (0.169 CPU) processed `local contrast' on GPU, blended on GPU [export]
45334.132672 [dev_pixelpipe] took 0.015 secs (0.009 CPU) processed `output color profile' on GPU, blended on GPU [export]
45334.734361 [dev_pixelpipe] took 0.601 secs (0.716 CPU) processed `dithering' on CPU, blended on CPU [export]
45334.962629 [dev_pixelpipe] took 0.227 secs (0.421 CPU) processed `display encoding' on CPU, blended on CPU [export]
45334.962709 [opencl_profiling] profiling device 0 ('GeForce GTX 1060 6GB'):
45334.962722 [opencl_profiling] spent  0.3528 seconds in [Write Image (from host to device)]
45334.962731 [opencl_profiling] spent  0.0010 seconds in rawprepare_1f
45334.962738 [opencl_profiling] spent  0.0010 seconds in whitebalance_1f
45334.962744 [opencl_profiling] spent  0.0019 seconds in highlights_1f_lch_bayer
45334.962750 [opencl_profiling] spent  0.5592 seconds in [Read Image (from device to host)]
45334.962757 [opencl_profiling] spent  0.0044 seconds in denoiseprofile_precondition_Y0U0V0
45334.962763 [opencl_profiling] spent  0.1665 seconds in denoiseprofile_decompose
45334.962769 [opencl_profiling] spent  0.0153 seconds in denoiseprofile_reduce_first
45334.962776 [opencl_profiling] spent  0.0001 seconds in denoiseprofile_reduce_second
45334.962782 [opencl_profiling] spent  0.0000 seconds in [Read Buffer (from device to host)]
45334.962788 [opencl_profiling] spent  0.0381 seconds in denoiseprofile_synthesize
45334.962794 [opencl_profiling] spent  0.0068 seconds in [Copy Image (on device)]
45334.962801 [opencl_profiling] spent  0.0047 seconds in denoiseprofile_backtransform_Y0U0V0
45334.962807 [opencl_profiling] spent  0.3726 seconds in [Write Buffer (from host to device)]
45334.962817 [opencl_profiling] spent  0.0051 seconds in lens_vignette
45334.962824 [opencl_profiling] spent  0.0222 seconds in lens_distort_lanczos3
45334.962830 [opencl_profiling] spent  0.0089 seconds in clip_rotate_lanczos3
45334.962837 [opencl_profiling] spent  0.0037 seconds in exposure
45334.962843 [opencl_profiling] spent  0.0038 seconds in colorin_unbound
45334.962849 [opencl_profiling] spent  0.0034 seconds in colorbalance_cdl
45334.962855 [opencl_profiling] spent  0.0034 seconds in colorspaces_transform_rgb_matrix_to_lab
45334.962861 [opencl_profiling] spent  0.0030 seconds in pad_input
45334.962866 [opencl_profiling] spent  0.0192 seconds in gauss_reduce
45334.962872 [opencl_profiling] spent  0.0151 seconds in process_curve
45334.962879 [opencl_profiling] spent  0.0208 seconds in laplacian_assemble
45334.962885 [opencl_profiling] spent  0.0038 seconds in write_back
45334.962891 [opencl_profiling] spent  0.0073 seconds in colorout
45334.962897 [opencl_profiling] spent  1.6442 seconds totally in command queue (with 0 events missing)
45334.962935 [dev_process_export] pixel pipeline processing took 12.075 secs (12.675 CPU)
[export_job] exported to `aaaaaaaaaaaa/darktable_exported/DSC_0032_01.jpg'

Test done with Darktable master branch
Used image from this thread
And here is the sidecar file: IMG_8336.cr2.xmp (29.7 KB)

neofetch
OS: Ubuntu 20.04.1 LTS x86_64
Host: Inspiron 13-7359
Kernel: 5.4.0-52-generic
Uptime: 18 hours, 18 mins
Packages: 2222 (dpkg), 93 (flatpak), 41 (snap)
Shell: bash 5.0.17
Resolution: 1920x1080
DE: GNOME
WM: Mutter
WM Theme: Adwaita
Theme: Adwaita-dark [GTK2/3]
Icons: Yaru [GTK2/3]
Terminal: gnome-terminal
CPU: Intel i7-6500U (4) @ 3.100GHz
GPU: Intel Skylake GT2 [HD Graphics 520]
Memory: 4972MiB / 7818MiB

darktable -perf

12,548250 [dev] took 0,180 secs (0,142 CPU) to load the image.
12,672475 [export] creating pixelpipe took 0,099 secs (0,100 CPU)
12,674143 [dev_pixelpipe] took 0,000 secs (0,000 CPU) initing base buffer [export]
12,697233 [dev_pixelpipe] took 0,023 secs (0,030 CPU) processed `raw black/white point' on CPU, blended on CPU [export]
12,715051 [dev_pixelpipe] took 0,018 secs (0,016 CPU) processed `white balance' on CPU, blended on CPU [export]
12,725346 [dev_pixelpipe] took 0,010 secs (0,026 CPU) processed `highlight reconstruction' on CPU, blended on CPU [export]
13,007639 [dev_pixelpipe] took 0,282 secs (0,707 CPU) processed `demosaic' on CPU, blended on CPU [export]
21,806221 [dev_pixelpipe] took 8,799 secs (29,037 CPU) processed `denoise (profiled)' on CPU with tiling, blended on CPU [export]
21,845410 [dev_pixelpipe] took 0,039 secs (0,039 CPU) processed `lens correction' on CPU, blended on CPU [export]
23,422701 [dev_pixelpipe] took 1,577 secs (5,013 CPU) processed `haze removal' on CPU, blended on CPU [export]
23,609436 [dev_pixelpipe] took 0,187 secs (0,237 CPU) processed `retouch' on CPU, blended on CPU [export]
23,667196 [dev_pixelpipe] took 0,058 secs (0,175 CPU) processed `exposure' on CPU, blended on CPU [export]
23,707185 [dev_pixelpipe] took 0,040 secs (0,040 CPU) processed `mask manager' on CPU, blended on CPU [export]
24,528202 [dev_pixelpipe] took 0,821 secs (2,945 CPU) processed `tone equalizer' on CPU, blended on CPU [export]
24,586095 [dev_pixelpipe] took 0,058 secs (0,211 CPU) processed `input color profile' on CPU, blended on CPU [export]
25,324933 [dev_pixelpipe] took 0,739 secs (2,538 CPU) processed `defringe' on CPU, blended on CPU [export]
32,678015 [dev_pixelpipe] took 7,353 secs (25,312 CPU) processed `contrast equalizer' on CPU with tiling, blended on CPU [export]
32,835905 [dev_pixelpipe] took 0,158 secs (0,565 CPU) processed `sharpen' on CPU, blended on CPU [export]
33,084041 [dev_pixelpipe] took 0,248 secs (0,955 CPU) processed `color balance' on CPU, blended on CPU [export]
image colorspace transform Lab-->RGB took 0,063 secs (0,208 CPU) [filmicrgb ]
33,575602 [dev_pixelpipe] took 0,492 secs (1,778 CPU) processed `filmic rgb' on CPU, blended on CPU [export]
image colorspace transform RGB-->Lab took 0,090 secs (0,334 CPU) [bilat ]
34,911570 [dev_pixelpipe] took 1,336 secs (4,237 CPU) processed `local contrast' on CPU, blended on CPU [export]
35,361036 [dev_pixelpipe] took 0,449 secs (1,452 CPU) processed `color zones' on CPU, blended on CPU [export]
35,478764 [dev_pixelpipe] took 0,118 secs (0,425 CPU) processed `output color profile' on CPU, blended on CPU [export]
35,562140 [dev_pixelpipe] took 0,083 secs (0,306 CPU) processed `display encoding' on CPU, blended on CPU [export]
35,562170 [dev_process_export] pixel pipeline processing took 22,890 secs (76,045 CPU)
[export_job] exported to `/home/aresminos/Pictures/Darktable_Master/PlayRaw/darktable_exported/IMG_8336_03.jpg'

Darktable -d perf -d opencl

36,738248 [dev] took 0,152 secs (0,125 CPU) to load the image.
36,858381 [export] creating pixelpipe took 0,098 secs (0,104 CPU)
36,858448 [pixelpipe_process] [export] using device 0
36,860144 [dev_pixelpipe] took 0,000 secs (0,000 CPU) initing base buffer [export]
36,914518 [dev_pixelpipe] took 0,054 secs (0,015 CPU) processed `raw black/white point' on GPU, blended on GPU [export]
36,943555 [dev_pixelpipe] took 0,029 secs (0,012 CPU) processed `white balance' on GPU, blended on GPU [export]
36,978403 [dev_pixelpipe] took 0,035 secs (0,004 CPU) processed `highlight reconstruction' on GPU, blended on GPU [export]
37,184043 [dev_pixelpipe] took 0,206 secs (0,044 CPU) processed `demosaic' on GPU, blended on GPU [export]
40,092893 [dev_pixelpipe] took 2,909 secs (0,816 CPU) processed `denoise (profiled)' on GPU, blended on GPU [export]
40,277341 [dev_pixelpipe] took 0,184 secs (0,024 CPU) processed `lens correction' on GPU, blended on GPU [export]
42,261476 [dev_pixelpipe] took 1,984 secs (1,133 CPU) processed `haze removal' on GPU, blended on GPU [export]
42,596569 [dev_pixelpipe] took 0,335 secs (0,232 CPU) processed `retouch' on GPU, blended on GPU [export]
42,713671 [dev_pixelpipe] took 0,117 secs (0,034 CPU) processed `exposure' on GPU, blended on GPU [export]
42,867763 [dev_pixelpipe] took 0,153 secs (0,033 CPU) processed `mask manager' on GPU, blended on GPU [export]
43,883179 [dev_pixelpipe] took 1,015 secs (2,975 CPU) processed `tone equalizer' on CPU, blended on CPU [export]
44,140016 [dev_pixelpipe] took 0,257 secs (0,064 CPU) processed `input color profile' on GPU, blended on GPU [export]
44,979822 [dev_pixelpipe] took 0,840 secs (2,565 CPU) processed `defringe' on CPU, blended on CPU [export]
47,909443 [dev_pixelpipe] took 2,930 secs (0,285 CPU) processed `contrast equalizer' on GPU, blended on GPU [export]
48,344479 [dev_pixelpipe] took 0,435 secs (0,000 CPU) processed `sharpen' on GPU, blended on GPU [export]
48,557411 [dev_pixelpipe] took 0,213 secs (0,021 CPU) processed `color balance' on GPU, blended on GPU [export]
image colorspace transform Lab-->RGB took 0,095 secs (0,230 CPU) [filmicrgb ]
49,154413 [dev_pixelpipe] took 0,597 secs (1,824 CPU) processed `filmic rgb' on CPU, blended on CPU [export]
image colorspace transform RGB-->Lab took 0,170 secs (0,003 GPU) [bilat ]
50,505172 [dev_pixelpipe] took 1,351 secs (0,188 CPU) processed `local contrast' on GPU, blended on GPU [export]
50,795170 [dev_pixelpipe] took 0,290 secs (0,067 CPU) processed `color zones' on GPU, blended on GPU [export]
50,918578 [dev_pixelpipe] took 0,123 secs (0,067 CPU) processed `output color profile' on GPU, blended on GPU [export]
51,093450 [dev_pixelpipe] took 0,175 secs (0,340 CPU) processed `display encoding' on CPU, blended on CPU [export]
51,093556 [opencl_profiling] profiling device 0 ('Intel(R) Gen9 HD Graphics NEO'):
51,093574 [opencl_profiling] spent  0,2172 seconds in [Write Image (from host to device)]
51,093578 [opencl_profiling] spent  0,0163 seconds in rawprepare_1f
51,093582 [opencl_profiling] spent  0,0154 seconds in whitebalance_1f
51,093585 [opencl_profiling] spent  0,0147 seconds in highlights_1f_clip
51,093590 [opencl_profiling] spent  0,0502 seconds in ppg_demosaic_green
51,093594 [opencl_profiling] spent  0,0671 seconds in ppg_demosaic_redblue
51,093598 [opencl_profiling] spent  0,0038 seconds in border_interpolate
51,093603 [opencl_profiling] spent  0,0671 seconds in denoiseprofile_precondition_Y0U0V0
51,093608 [opencl_profiling] spent  1,2958 seconds in denoiseprofile_decompose
51,093612 [opencl_profiling] spent  0,1463 seconds in denoiseprofile_reduce_first
51,093616 [opencl_profiling] spent  0,0006 seconds in denoiseprofile_reduce_second
51,093632 [opencl_profiling] spent  0,0005 seconds in [Read Buffer (from device to host)]
51,093636 [opencl_profiling] spent  0,6024 seconds in denoiseprofile_synthesize
51,093639 [opencl_profiling] spent  0,4559 seconds in [Copy Image (on device)]
51,093642 [opencl_profiling] spent  0,0674 seconds in denoiseprofile_backtransform_Y0U0V0
51,093646 [opencl_profiling] spent  0,0127 seconds in blendop_set_mask
51,093650 [opencl_profiling] spent  0,0921 seconds in blendop_rgb
51,093654 [opencl_profiling] spent  0,3036 seconds in [Read Image (from device to host)]
51,093658 [opencl_profiling] spent  0,0350 seconds in hazeremoval_transision_map
51,093663 [opencl_profiling] spent  0,0214 seconds in hazeremoval_box_max_x
51,093667 [opencl_profiling] spent  0,0185 seconds in hazeremoval_box_max_y
51,093671 [opencl_profiling] spent  0,0222 seconds in hazeremoval_box_min_x
51,093676 [opencl_profiling] spent  0,0195 seconds in hazeremoval_box_min_y
51,093680 [opencl_profiling] spent  0,0605 seconds in guided_filter_split_rgb_image
51,093684 [opencl_profiling] spent  0,2798 seconds in guided_filter_box_mean_x
51,093688 [opencl_profiling] spent  0,2773 seconds in guided_filter_box_mean_y
51,093692 [opencl_profiling] spent  0,0661 seconds in guided_filter_covariances
51,093696 [opencl_profiling] spent  0,0916 seconds in guided_filter_variances
51,093700 [opencl_profiling] spent  0,2274 seconds in guided_filter_update_covariance
51,093705 [opencl_profiling] spent  0,1133 seconds in guided_filter_solve
51,093709 [opencl_profiling] spent  0,0589 seconds in guided_filter_generate_result
51,093712 [opencl_profiling] spent  0,0723 seconds in hazeremoval_dehaze
51,093716 [opencl_profiling] spent  0,0593 seconds in [Copy Image to Buffer (on device)]
51,093720 [opencl_profiling] spent  0,0002 seconds in [Write Buffer (from host to device)]
51,093724 [opencl_profiling] spent  0,0006 seconds in retouch_copy_buffer_to_buffer
51,093729 [opencl_profiling] spent  0,0002 seconds in retouch_copy_buffer_to_buffer_masked
51,093732 [opencl_profiling] spent  0,0631 seconds in retouch_copy_buffer_to_image
51,093736 [opencl_profiling] spent  0,0630 seconds in exposure
51,093740 [opencl_profiling] spent  0,0590 seconds in colorin_unbound
51,093744 [opencl_profiling] spent  1,4052 seconds in eaw_decompose
51,093747 [opencl_profiling] spent  0,6328 seconds in eaw_synthesize
51,093751 [opencl_profiling] spent  0,0180 seconds in blendop_mask_Lab
51,093755 [opencl_profiling] spent  0,0955 seconds in blendop_Lab
51,093759 [opencl_profiling] spent  0,1379 seconds in sharpen_hblur
51,093763 [opencl_profiling] spent  0,0416 seconds in sharpen_vblur
51,093767 [opencl_profiling] spent  0,0788 seconds in sharpen_mix
51,093771 [opencl_profiling] spent  0,0649 seconds in colorbalance_cdl
51,093774 [opencl_profiling] spent  0,0653 seconds in colorspaces_transform_rgb_matrix_to_lab
51,093778 [opencl_profiling] spent  0,0522 seconds in pad_input
51,093782 [opencl_profiling] spent  0,2516 seconds in gauss_reduce
51,093785 [opencl_profiling] spent  0,2518 seconds in process_curve
51,093789 [opencl_profiling] spent  0,2549 seconds in laplacian_assemble
51,093793 [opencl_profiling] spent  0,0663 seconds in write_back
51,093797 [opencl_profiling] spent  0,0673 seconds in colorzones_v3
51,093801 [opencl_profiling] spent  0,0661 seconds in colorout
51,093805 [opencl_profiling] spent  8,5886 seconds totally in command queue (with 0 events missing)
51,093831 [dev_process_export] pixel pipeline processing took 14,235 secs (10,745 CPU)
[export_job] exported to `/home/aresminos/Pictures/Darktable_Master/PlayRaw/darktable_exported/IMG_8336_02.jpg'

Now imagine if you had to export like 600 images. Thankfully my desktop machine has Threadripper cpu and Vega 64 gpu so it’s faster but it’s still a long export time considering the hardware.

Yowza

Thanks for the reply and the hard work. Much appreciated. For a DT newbie like me the guides, forum replies and videos you have put out are invaluable. If your even in Amsterdam I’d buy you a beer.

I think my problems are more than just Tone Eq. It wasn’t this bad at first. I will make the move to Linux. Just need to choose a distro now.

Looking forward for the next release :slight_smile: Not regretting my move to DT for a second. Just need windows detox now.

1 Like

Tjena Claes,

Ahh, you run Resolve on Manjaro, awesome. I am more familiar with the Debian based distros so Ubuntu would be the easy choice. I was sort of settled to try Centos due to Resolve. But I see allot people like Arch/Manjaro. I do like hearing that you run Resolve on Manjaro.

I want something I can run reasonably stable that doesn’t break after updates, but also have up to date sw availability. Something between Ubuntus convenience and Debian Stable stability. I guess thats probably asking to much.

I do have the Studio dongle for Resolve. I’m definitely not a Linux expert, but my dayjob is sysadmin so I should be able wrap my head around Arch based distro. But been using Windows for too long. Been a few years since I ran Linux as my main OS. Then it was Mint I believe. I do manage a couple of Ubuntu servers at work, but not very advanced stuff.

I will set up for dual boot at first, I have a few requirements but it does look like most my stuff is available on Linux. My goal is to ditch the dual boot as well, and get to a point where i can use a VM if I absolutely have to use windows. Or my company laptop.

I’m excited about moving to Linux now. New project haha. And hyped up on caffeine now.

Other than more effort in initial setup, are there any major disadvantage to going for Arch over Manjaro?

Thanks for the reply :slight_smile: I have the studio version (dongle) so I codecs shouldn’t be a problem?

What distro do you use? I’m fairly IT competent, but don’t want to spend days to get Resolve to work.

It does appear my performnace issues may also be due to Nvidia driver issue, had a ton of explorer.exe crashes due to a nvidia dll. Trying other drivers, but planning disk space for a linux distro.

This is the slowest step; from the 23 seconds, it took over a third of the time.
Maybe you can adjust memory settings to prevent tiling? See “Special topics” in the darktable usermanual
Or the equivalent sections here:
https://elstoc.github.io/dtdocs/special-topics/memory/
https://elstoc.github.io/dtdocs/preferences-settings/cpu-gpu-memory/

I also have the studio version on an SD card (version 16), it’s still a problem with codecs.
I always use Ubuntu LTS or my main work machine but Blackmagic only officially supports CentOS.
I think you have an option to install proprietary Nvidia drivers right from the Ubuntu gui installer but I don’t know if that includes CUDA maybe someone else here can answer that?

If you go with Ubuntu take a look at makeresolvedeb script:

Also take every opportunity to report any bugs or complain about any missing features to Blackmagic Design. They are really neglecting Linux users compared to Windows and MacOS and we must be heard.

Installation is basically this:

  1. Install Ubuntu
  2. Install CUDA
  3. Make a Resolve deb package with makeresolve deb script
  4. Install Resolve
  5. Profit

You might wanna check out DaVinci Resolve Linux install issues thread on BMD forums if you find any issues with installing DR (beware, it’s a rabbit hole):
https://forum.blackmagicdesign.com/viewtopic.php?f=21&t=56878&start=1400

I’ve increased the host memory limit for tiling to 4000MB and decreased the minimum amount of memory for a single buffer in tiling to 8 and got this:

15,580759 [dev_pixelpipe] took 7,879 secs (27,380 CPU) processed `denoise (profiled)' on CPU, blended on CPU [export]

Not much of an improvement. But then again if I export a lot of images 1 second per image is a huge improvement.

It does not, has to be installed separately, if needed. Can be downloaded from the nvidia web site.

1 Like

8 MB for tiling is the minimum, for memory-constrained systems. But if your raised host memory limit helped you avoid tiling, it won’t make a difference. Also, 1 s of difference could come from measurement errors. Sorry it didn’t help much. :frowning: Compared to the editing time of a few hundred images, saving a few hundred seconds won’t be noticeable.

I just increased the memory to 6000MB and 64MB (It doesn’t let me set more than 64MB).
The results are back to 8 seconds. So basically it stays the same however memory I give it.

Hi’ @Tore_Valberg, @kofa and others

In my experience, and I run DT often, DT runs fairly stable on windows. You are not likely to run into crashes, but you will run into errors in various modules especially if you press wrong/unexpected buttons or sequence of buttons (because you are new to the module or have forgotten how to use it). You will then have to reset the module and start again.

Tone equalizer is very, very slow, so I’m looking forward to 3.4. Hopefully next Christmas is this Christmas?

Exporting images (raw to jpg) is also very slow, but you can take a brake and have some coffee.

I will try to run some exporting tests after the weekend……

I think that DT has some very nice features, a lot of development is taking place and you can be a member of an active forum of users. My bottom line is that I absolutely recommend others to use DT on windows (if that is your it environment).

@Tore_Valberg
Tjena, asså!

I was sort of settled to try Centos due to Resolve.

Yes, I follow your line of reasoning — daVinci even offers a downloadable CentOS ISO with Resolve built-in. But as far as I know, it does not like Nvidia GFX at all :frowning:

There are many interesting distros, a few are even tailor-made for Ryzen CPUs.
On Phoronix you can read more about them, as well as study speed graphs.
Clear Linux surprisingly gets good marks [surprisingly because of who is behind it :-)]. But it does not like Nvidia either.

If you plan a double boot with Windows, Manjaro’s installer is a good bet…

Have fun!
Claes in Lund, Sweden

Just for curiosity, I run the same export on my machine (which I wouldn’t call ‘lighting fast’ for today’s standards), using darktable 3.2.1:

OS: Ubuntu 20.04.1 LTS x86_64
Host: Aspire VN7-591G V1.15 
Kernel: 5.4.0-52-generic 
Uptime: 15 hours, 43 mins 
Packages: 3583 (dpkg), 11 (flatpak) 
Shell: bash 5.0.17 
Resolution: 1920x1080 
DE: Plasma 
WM: KWin 
WM Theme: Materia-Light 
Theme: Breeze [Plasma], Breeze [GTK2/3] 
Icons: Papirus-Light [Plasma], Papirus-Light [GTK2/3] 
Terminal: konsole 
Terminal Font: Hack 11 
CPU: Intel i7-4720HQ (8) @ 3.600GHz 
GPU: Intel 4th Gen Core Processor 
GPU: NVIDIA GeForce GTX 960M 
Memory: 6020MiB / 15935MiB

CPU-only:

0.575855 [dev] took 0.099 secs (0.112 CPU) to load the image.
0.610138 [export] creating pixelpipe took 0.028 secs (0.166 CPU)
0.611408 [dev_pixelpipe] took 0.000 secs (0.000 CPU) initing base buffer [export]
0.619871 [dev_pixelpipe] took 0.008 secs (0.063 CPU) processed `raw black/white point' on CPU, blended on CPU [export]
0.627932 [dev_pixelpipe] took 0.008 secs (0.041 CPU) processed `white balance' on CPU, blended on CPU [export]
0.633258 [dev_pixelpipe] took 0.005 secs (0.032 CPU) processed `highlight reconstruction' on CPU, blended on CPU [export]
0.771329 [dev_pixelpipe] took 0.138 secs (0.724 CPU) processed `demosaic' on CPU, blended on CPU [export]
4.660575 [dev_pixelpipe] took 3.889 secs (26.570 CPU) processed `denoise (profiled)' on CPU, blended on CPU [export]
4.684545 [dev_pixelpipe] took 0.024 secs (0.024 CPU) processed `lens correction' on CPU, blended on CPU [export]
6.055441 [dev_pixelpipe] took 1.371 secs (9.217 CPU) processed `haze removal' on CPU, blended on CPU [export]
6.187599 [dev_pixelpipe] took 0.132 secs (0.332 CPU) processed `retouch' on CPU, blended on CPU [export]
6.219955 [dev_pixelpipe] took 0.032 secs (0.223 CPU) processed `exposure' on CPU, blended on CPU [export]
6.245422 [dev_pixelpipe] took 0.025 secs (0.037 CPU) processed `mask manager' on CPU, blended on CPU [export]
6.547148 [dev_pixelpipe] took 0.302 secs (2.315 CPU) processed `tone equalizer' on CPU, blended on CPU [export]
6.582799 [dev_pixelpipe] took 0.036 secs (0.266 CPU) processed `input color profile' on CPU, blended on CPU [export]
7.065598 [dev_pixelpipe] took 0.483 secs (3.437 CPU) processed `defringe' on CPU, blended on CPU [export]
10.201755 [dev_pixelpipe] took 3.136 secs (21.815 CPU) processed `contrast equalizer' on CPU, blended on CPU [export]
10.281415 [dev_pixelpipe] took 0.080 secs (0.576 CPU) processed `sharpen' on CPU, blended on CPU [export]
10.400381 [dev_pixelpipe] took 0.119 secs (0.905 CPU) processed `color balance' on CPU, blended on CPU [export]
image colorspace transform Lab-->RGB took 0.039 secs (0.269 CPU) [filmicrgb ]
10.663881 [dev_pixelpipe] took 0.263 secs (1.948 CPU) processed `filmic rgb' on CPU, blended on CPU [export]
image colorspace transform RGB-->Lab took 0.046 secs (0.342 CPU) [bilat ]
11.427388 [dev_pixelpipe] took 0.763 secs (4.642 CPU) processed `local contrast' on CPU, blended on CPU [export]
11.645072 [dev_pixelpipe] took 0.218 secs (1.582 CPU) processed `color zones' on CPU, blended on CPU [export]
12.419915 [dev_pixelpipe] took 0.775 secs (5.607 CPU) processed `output color profile' on CPU, blended on CPU [export]
12.460058 [dev_pixelpipe] took 0.040 secs (0.299 CPU) processed `display encoding' on CPU, blended on CPU [export]
12.460200 [dev_process_export] pixel pipeline processing took 11.850 secs (80.670 CPU)

GPU+CPU:

1.129415 [dev] took 0.095 secs (0.094 CPU) to load the image.
1.162579 [export] creating pixelpipe took 0.027 secs (0.158 CPU)
1.162611 [pixelpipe_process] [export] using device 0
1.163878 [dev_pixelpipe] took 0.000 secs (0.000 CPU) initing base buffer [export]
1.172080 [dev_pixelpipe] took 0.008 secs (0.007 CPU) processed `raw black/white point' on GPU, blended on GPU [export]
1.176384 [dev_pixelpipe] took 0.004 secs (0.003 CPU) processed `white balance' on GPU, blended on GPU [export]
1.182154 [dev_pixelpipe] took 0.006 secs (0.005 CPU) processed `highlight reconstruction' on GPU, blended on GPU [export]
1.218059 [dev_pixelpipe] took 0.036 secs (0.016 CPU) processed `demosaic' on GPU, blended on GPU [export]
1.780975 [dev_pixelpipe] took 0.563 secs (0.412 CPU) processed `denoise (profiled)' on GPU, blended on GPU [export]
1.801260 [dev_pixelpipe] took 0.020 secs (0.007 CPU) processed `lens correction' on GPU, blended on GPU [export]
3.097789 [dev_pixelpipe] took 1.297 secs (1.886 CPU) processed `haze removal' on GPU, blended on GPU [export]
3.170938 [dev_pixelpipe] took 0.073 secs (0.282 CPU) processed `retouch' on GPU, blended on GPU [export]
3.191366 [dev_pixelpipe] took 0.020 secs (0.004 CPU) processed `exposure' on GPU, blended on GPU [export]
3.211700 [dev_pixelpipe] took 0.020 secs (0.003 CPU) processed `mask manager' on GPU, blended on GPU [export]
3.613545 [dev_pixelpipe] took 0.402 secs (2.268 CPU) processed `tone equalizer' on CPU, blended on CPU [export]
3.658883 [dev_pixelpipe] took 0.045 secs (0.044 CPU) processed `input color profile' on GPU, blended on GPU [export]
4.166169 [dev_pixelpipe] took 0.507 secs (3.306 CPU) processed `defringe' on CPU, blended on CPU [export]
4.853289 [dev_pixelpipe] took 0.687 secs (0.685 CPU) processed `contrast equalizer' on GPU, blended on GPU [export]
4.907485 [dev_pixelpipe] took 0.054 secs (0.029 CPU) processed `sharpen' on GPU, blended on GPU [export]
4.928029 [dev_pixelpipe] took 0.021 secs (0.007 CPU) processed `color balance' on GPU, blended on GPU [export]
image colorspace transform Lab-->RGB took 0.032 secs (0.248 CPU) [filmicrgb ]
5.212033 [dev_pixelpipe] took 0.284 secs (1.893 CPU) processed `filmic rgb' on CPU, blended on CPU [export]
image colorspace transform RGB-->Lab took 0.008 secs (0.004 GPU) [bilat ]
5.552033 [dev_pixelpipe] took 0.340 secs (0.245 CPU) processed `local contrast' on GPU, blended on GPU [export]
5.579511 [dev_pixelpipe] took 0.027 secs (0.014 CPU) processed `color zones' on GPU, blended on GPU [export]
6.333490 [dev_pixelpipe] took 0.754 secs (5.514 CPU) processed `output color profile' on CPU, blended on CPU [export]
6.376401 [dev_pixelpipe] took 0.043 secs (0.316 CPU) processed `display encoding' on CPU, blended on CPU [export]
6.376648 [opencl_profiling] profiling device 0 ('GeForce GTX 960M'):
6.376710 [opencl_profiling] spent  0.1123 seconds in [Write Image (from host to device)]
6.376763 [opencl_profiling] spent  0.0019 seconds in rawprepare_1f
6.376812 [opencl_profiling] spent  0.0019 seconds in whitebalance_1f
6.376883 [opencl_profiling] spent  0.0019 seconds in highlights_1f_clip
6.376932 [opencl_profiling] spent  0.0074 seconds in ppg_demosaic_green
6.376978 [opencl_profiling] spent  0.0084 seconds in ppg_demosaic_redblue
6.377025 [opencl_profiling] spent  0.0011 seconds in border_interpolate
6.377072 [opencl_profiling] spent  0.0074 seconds in denoiseprofile_precondition_Y0U0V0
6.377118 [opencl_profiling] spent  0.2821 seconds in denoiseprofile_decompose
6.377163 [opencl_profiling] spent  0.0391 seconds in denoiseprofile_reduce_first
6.377209 [opencl_profiling] spent  0.0002 seconds in denoiseprofile_reduce_second
6.377259 [opencl_profiling] spent  0.0003 seconds in [Read Buffer (from device to host)]
6.377306 [opencl_profiling] spent  0.0672 seconds in denoiseprofile_synthesize
6.377351 [opencl_profiling] spent  0.0421 seconds in [Copy Image (on device)]
6.377397 [opencl_profiling] spent  0.0071 seconds in denoiseprofile_backtransform_Y0U0V0
6.377443 [opencl_profiling] spent  0.0010 seconds in blendop_set_mask
6.377489 [opencl_profiling] spent  0.0104 seconds in blendop_rgb
6.377533 [opencl_profiling] spent  0.2773 seconds in [Read Image (from device to host)]
6.377580 [opencl_profiling] spent  0.0038 seconds in hazeremoval_transision_map
6.377626 [opencl_profiling] spent  0.0693 seconds in hazeremoval_box_max_x
6.377674 [opencl_profiling] spent  0.0050 seconds in hazeremoval_box_max_y
6.377719 [opencl_profiling] spent  0.0957 seconds in hazeremoval_box_min_x
6.377765 [opencl_profiling] spent  0.0061 seconds in hazeremoval_box_min_y
6.377810 [opencl_profiling] spent  0.0057 seconds in guided_filter_split_rgb_image
6.377854 [opencl_profiling] spent  0.6577 seconds in guided_filter_box_mean_x
6.377899 [opencl_profiling] spent  0.0340 seconds in guided_filter_box_mean_y
6.377944 [opencl_profiling] spent  0.0063 seconds in guided_filter_covariances
6.377989 [opencl_profiling] spent  0.0080 seconds in guided_filter_variances
6.378033 [opencl_profiling] spent  0.0278 seconds in guided_filter_update_covariance
6.378081 [opencl_profiling] spent  0.0142 seconds in guided_filter_solve
6.378126 [opencl_profiling] spent  0.0070 seconds in guided_filter_generate_result
6.378132 [opencl_profiling] spent  0.0069 seconds in hazeremoval_dehaze
6.378143 [opencl_profiling] spent  0.0067 seconds in [Copy Image to Buffer (on device)]
6.378148 [opencl_profiling] spent  0.0002 seconds in [Write Buffer (from host to device)]
6.378151 [opencl_profiling] spent  0.0004 seconds in retouch_copy_buffer_to_buffer
6.378155 [opencl_profiling] spent  0.0002 seconds in retouch_copy_buffer_to_buffer_masked
6.378159 [opencl_profiling] spent  0.0062 seconds in retouch_copy_buffer_to_image
6.378162 [opencl_profiling] spent  0.0061 seconds in exposure
6.378165 [opencl_profiling] spent  0.0072 seconds in colorin_unbound
6.378168 [opencl_profiling] spent  0.3216 seconds in eaw_decompose
6.378170 [opencl_profiling] spent  0.0755 seconds in eaw_synthesize
6.378173 [opencl_profiling] spent  0.0019 seconds in blendop_mask_Lab
6.378176 [opencl_profiling] spent  0.0104 seconds in blendop_Lab
6.378179 [opencl_profiling] spent  0.0085 seconds in sharpen_hblur
6.378182 [opencl_profiling] spent  0.0073 seconds in sharpen_vblur
6.378184 [opencl_profiling] spent  0.0098 seconds in sharpen_mix
6.378187 [opencl_profiling] spent  0.0062 seconds in colorbalance_cdl
6.378190 [opencl_profiling] spent  0.0062 seconds in colorspaces_transform_rgb_matrix_to_lab
6.378193 [opencl_profiling] spent  0.0061 seconds in pad_input
6.378195 [opencl_profiling] spent  0.0738 seconds in gauss_reduce
6.378198 [opencl_profiling] spent  0.0399 seconds in process_curve
6.378201 [opencl_profiling] spent  0.0516 seconds in laplacian_assemble
6.378204 [opencl_profiling] spent  0.0069 seconds in write_back
6.378206 [opencl_profiling] spent  0.0127 seconds in colorzones_v3
6.378209 [opencl_profiling] spent  2.4817 seconds totally in command queue (with 0 events missing)
6.378229 [dev_process_export] pixel pipeline processing took 5.216 secs (16.973 CPU)

Some interesting points:

  • your i7-6500U is more or less matched to my i7-4720hq in benchmarks for single thread loads, but here it takes twice the time to process the image. I guess the difference here is 4 threads vs. 8 threads running in parallel.
  • despite being quite an old GPU, the 960M trounces the integrated Intel GPU. When people here points to the OpenCL speed-up in darktable, this is specifficaly related to discrete GPUs. Even if the Intel NEO driver can give you more speed than the CPU alone, it is no match for an AMD or NVidia card.
1 Like