Commit fd27561b authored Sep 19, 2018 by psychocrypt

NVIDIA: optimize v8

- fix shared memory for fast div always being used, even when an algorithm does not use it
- optimize fast div algo
- store `division_result` (64-bit) per thread instead of shuffling it around and storing it as 32-bit

parent 659918f2
