- 03 Dec, 2018 3 commits
-
-
fireice-uk authored
OpenCL: auto tuning option
-
fireice-uk authored
OpenCl: fix NVIDIA
-
psychocrypt authored
NVIDIA is using clang as device compiler so the reciprocal optimizations was disabled with #2104. - re-enable optimized reciprocal calculation
-
- 02 Dec, 2018 7 commits
-
-
psychocrypt authored
Add an option to brute force intensity settings and lock in at the intensity with the highest hashrate. - update decumentation of the `interleave` option to mention the side effect with `auto-tune` - disable `interleave` auto adjustment if `auto-tune` is enabled - jconf: add `auto-tune` as optional option
-
psychocrypt authored
- fix broken compile: change used `ULL` to `UL` because `UL` is defined as 64bit - fix memory copy to shared memory via vload8 (somehow it create wrong access)
-
fireice-uk authored
add interleave documentation
-
fireice-uk authored
OpenCL: auto config two threads per GPU
-
fireice-uk authored
fix clamp implementation
-
psychocrypt authored
The auto config generates for AMD devices now by default two threads per GPU. - remove the savety 128MiB memory now only from the max available GPU memory not from the avaialble memory for one alloc call - extend the memory documentation in amd.txt
-
psychocrypt authored
Due to a wrong implementation clamp was not working.
-
- 01 Dec, 2018 4 commits
-
-
psychocrypt authored
- add image to describe interleave better - add tuning description
-
fireice-uk authored
OpenCL: opimize reciprocal calculation
-
fireice-uk authored
Please add Cryptonight-Superfast
-
fireice-uk authored
OpenCL: comp mode optimization
-
- 30 Nov, 2018 2 commits
-
-
psychocrypt authored
use for non clang (Rocm) OpenCL a optimized reciprocal calculation without lookup table. Co-authored-by: SChernykh <sergey.v.chernykh@gmail.com>
-
psychocrypt authored
Disable compatibility mode if intensity is a multiple of worksize. In that case enabled compaibility mode will only slow down the miner.
-
- 29 Nov, 2018 2 commits
-
-
LPHuynh authored
-
fireice-uk authored
update currencies
-
- 28 Nov, 2018 1 commit
-
-
fireice-uk authored
OpenCL: thread interleaving
-
- 27 Nov, 2018 2 commits
-
-
psychocrypt authored
- `monero` - remove fork from cn-v7 to cn-v8 - remove dev pool fork from cn-v7 to cn-8
-
psychocrypt authored
If two threads are using the same GPU device the start time of each hash round is optimized based on the average time needed to calculate a bunch of hashes. This way to optimize the hash rate was first introduced by @SChernykh. This implementation based on the implementation in xmrig but differen in the details. - introduce a new config option `interleave` - implement thread interleaving
-
- 22 Nov, 2018 1 commit
-
-
fireice-uk authored
OpenCl: optimize strided index 1
-
- 21 Nov, 2018 8 commits
-
-
fireice-uk authored
OpenCL: add strided_index 3
-
psychocrypt authored
Use `mul24` to speedup the scratchpad index calculation. Co-authored-by: SChernykh <sergey.v.chernykh@gmail.com>
-
psychocrypt authored
Add new striding index where the memory is chunked by the size of the work group (worksize). Co-authored-by: SChernykh <sergey.v.chernykh@gmail.com>
-
fireice-uk authored
OpenCL: cnv8 optimization
-
fireice-uk authored
OpenCl: optimize cn-v8 div
-
fireice-uk authored
AMD: use more 32bit operations
-
fireice-uk authored
OpenCL reduce API overhead
-
psychocrypt authored
small optimization for non cryptonight_v8 algorithms
-
- 20 Nov, 2018 8 commits
-
-
fireice-uk authored
OpenCL: optimize cn-heavy div
-
SChernykh authored
- optimize division
-
fireice-uk authored
CUDA: reduce cn-v8 shared mem footprint
-
fireice-uk authored
OpenCL: reduce local mem footprint
-
fireice-uk authored
CUDA: optimize cn-v8 div
-
SChernykh authored
optimize cryptonight_heavy diff
-
psychocrypt authored
- change a few 64bit variables into 32bit. - provide defines type quallified
-
fireice-uk authored
CUDA: optimize cn-heavy div
-
- 19 Nov, 2018 2 commits
-
-
psychocrypt authored
- remove useless `clFinish` - avoid download num threads for skein&co and start always as much threads as in all other kernel (terminate useless threads)
-
psychocrypt authored
Reduce local memory foot print to increase the occupancy. Co-authored-by: SChernykh <sergey.v.chernykh@gmail.com>
-