OpenCL: optimize NVIDIA pass
Create a special pass for NVIDIA GPUs to load memory chunks first into the shared memory.
Co-authored-by: SChernykh <sergey.v.chernykh@gmail.com>
Please register or sign in to comment
Create a special pass for NVIDIA GPUs to load memory chunks first into the shared memory.
Co-authored-by: SChernykh <sergey.v.chernykh@gmail.com>