If you need to dive into SSE coding, you may also have a look at this files:
https://github.com/Beep6581/RawTherapee/blob/dev/rtengine/sleefsseavx.c