CUDA: optimize cn-heavy div
port OpenCl optimized division to CUDA
Co-authored-by: SChernykh <sergey.v.chernykh@gmail.com>
Please register or sign in to comment
port OpenCl optimized division to CUDA
Co-authored-by: SChernykh <sergey.v.chernykh@gmail.com>