I’m not a dt dev or user, but I’ve been here long enough to know that if you have a cogent patch to dt, it will be considered. You’re sparring with a person who has demonstrated just that in his efforts to re-orient dt, and others’ thinking, toward a linear processing pipeline. A lot of the thinking behind that has occurred in threads on this forum; you’ve jumped into that context with a few specific assertions without a real understanding that context. That goes for the module performance, and very likely for the OpenCL consideration.
One thing that needs to happen is that this thread should be teased apart into its separate topics.