Thanks for your feedback!
I'll test again on Linux and come back to you (with some numbers).
Nick2021 wrote: Isn't R3D from Red decoded on the GPU?
The issue might be Canon keeping things closed more than the need for new task-specific hardware.
You've got a point: while a GPU implementation would be cool, a good CPU implementation would be fine.
Let's do the math with some lame SSE implementation:
You need to weight adjacent pixels, let's say 4 cycles; accumulate, let's say 2 cycles; and scale, let's say 2 more cycles. Let's add 8 more cycles just because we want to underestimate performance, so that's 16 cycles per pixel (the cache works well, since you work with nearby data), and let's add 16 more again because I'm a moron. That makes 32 cycles per pixel. At 4 GHz, that's 125 million pixels per second per thread. With 8 threads you have 1 GPixel per second, so about 22 FPS at 45 MPixel per frame.
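For anyone who wants to recheck the estimate, here is a quick sketch of the arithmetic in Python. The function name and parameters are made up for illustration, and it assumes perfect scaling across threads with no memory bottleneck, which is optimistic.

```python
def estimate_fps(cycles_per_pixel, clock_hz=4e9, threads=8, pixels_per_frame=45e6):
    """Back-of-the-envelope decode rate (hypothetical helper, not a benchmark)."""
    # One thread retires clock_hz cycles per second, so this many pixels per second.
    pixels_per_sec_per_thread = clock_hz / cycles_per_pixel
    # Assumption: threads scale linearly.
    total_pixels_per_sec = pixels_per_sec_per_thread * threads
    fps = total_pixels_per_sec / pixels_per_frame
    return pixels_per_sec_per_thread, total_pixels_per_sec, fps


# 4 + 2 + 2 cycles of work, plus 8 and 16 cycles of safety margin = 32 cycles per pixel.
per_thread, total, fps = estimate_fps(4 + 2 + 2 + 8 + 16)
print(f"{per_thread / 1e6:.0f} Mpixel/s per thread, "
      f"{total / 1e9:.2f} Gpixel/s total, {fps:.1f} FPS")
# prints "125 Mpixel/s per thread, 1.00 Gpixel/s total, 22.2 FPS"
```

Calling `estimate_fps(24)` instead gives roughly 167 Mpixel/s per thread, so the result is quite sensitive to how much margin you add.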
So with something like a proper AVX implementation, I'm sure we can reach 60 FPS at 8K with a decent and rather generic CPU implementation.
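To put a number on that target, here is a rough sketch of the cycle budget per pixel for 60 FPS. It assumes "8K" means 7680x4320 (a raw frame may be somewhat larger) and reuses the same 4 GHz, 8-thread machine as above; the function name is hypothetical.

```python
def cycle_budget_per_pixel(width, height, fps, clock_hz=4e9, threads=8):
    """Cycles available per pixel across all threads (assumes linear scaling)."""
    pixels_per_sec = width * height * fps
    return clock_hz * threads / pixels_per_sec


# Assumption: 8K UHD frame size.
budget = cycle_budget_per_pixel(7680, 4320, 60)
print(f"{budget:.1f} cycles per pixel")
# prints "16.1 cycles per pixel"
```

So the target works out to about 16 cycles per pixel, roughly half of the padded 32-cycle figure, which is the kind of gain a wider vector unit could plausibly deliver.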