- 03 Dec, 2018 1 commit
-
-
psychocrypt authored
The default value for interleave was wrongly set to 50. Remove the value and take the default from the default constructor instead of side-channeling it through the JSON parser.
-
- 02 Dec, 2018 5 commits
-
-
fireice-uk authored
add interleave documentation
-
fireice-uk authored
OpenCL: auto config two threads per GPU
-
fireice-uk authored
fix clamp implementation
-
psychocrypt authored
The auto config now generates two threads per GPU by default for AMD devices. - subtract the 128 MiB safety margin only from the maximum available GPU memory, not from the memory available for a single alloc call - extend the memory documentation in amd.txt
-
psychocrypt authored
Due to a wrong implementation clamp was not working.
-
- 01 Dec, 2018 4 commits
-
-
psychocrypt authored
- add image to describe interleave better - add tuning description
-
fireice-uk authored
OpenCL: optimize reciprocal calculation
-
fireice-uk authored
Please add Cryptonight-Superfast
-
fireice-uk authored
OpenCL: comp mode optimization
-
- 30 Nov, 2018 2 commits
-
-
psychocrypt authored
Use an optimized reciprocal calculation without a lookup table for non-clang (ROCm) OpenCL. Co-authored-by: SChernykh <sergey.v.chernykh@gmail.com>
-
psychocrypt authored
Disable compatibility mode if intensity is a multiple of worksize. In that case, an enabled compatibility mode only slows down the miner.
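The rule in this commit message can be sketched as follows. This is a minimal illustration, not the miner's actual code: the function name `needs_comp_mode` and its parameters are hypothetical, and it assumes that compatibility mode exists only to guard work groups that are not fully populated.

```cpp
#include <cstddef>

// Hypothetical helper (name and signature are assumptions, not from the source).
// Returns whether compatibility mode should stay enabled for a GPU thread.
bool needs_comp_mode(std::size_t intensity, std::size_t worksize, bool requested)
{
    // Guard against a zero worksize; keep whatever the user asked for.
    if (worksize == 0)
        return requested;

    // If intensity is a multiple of worksize, every work group is full,
    // so the extra checks of compatibility mode would only cost time.
    if (intensity % worksize == 0)
        return false;

    return requested;
}
```

For example, `needs_comp_mode(1024, 8, true)` yields `false`, while `needs_comp_mode(1000, 16, true)` keeps the mode enabled because 1000 is not a multiple of 16.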
-
- 29 Nov, 2018 2 commits
-
-
LPHuynh authored
-
fireice-uk authored
update currencies
-
- 28 Nov, 2018 1 commit
-
-
fireice-uk authored
OpenCL: thread interleaving
-
- 27 Nov, 2018 2 commits
-
-
psychocrypt authored
- `monero` - remove fork from cn-v7 to cn-v8 - remove dev pool fork from cn-v7 to cn-v8
-
psychocrypt authored
If two threads use the same GPU device, the start time of each hash round is optimized based on the average time needed to calculate a batch of hashes. This way of optimizing the hash rate was first introduced by @SChernykh. This implementation is based on the implementation in xmrig but differs in the details. - introduce a new config option `interleave` - implement thread interleaving
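The idea of spacing out round starts can be sketched as below. This is only an illustration under stated assumptions: the function `interleave_delay_ms`, its parameters, and the even-spacing rule are hypothetical, and the sketch does not model the `interleave` config option or the details of the xmrig or xmr-stak implementations.

```cpp
#include <algorithm>

// Hypothetical sketch (names are assumptions, not from the source).
// All times are in milliseconds.
//   now_ms          current time
//   other_start_ms  when the other thread on the same GPU started its last round
//   avg_round_ms    running average of the time one hash round takes
//   threads_per_gpu number of threads sharing the device
// Returns how long this thread should wait before starting its next round.
double interleave_delay_ms(double now_ms, double other_start_ms,
                           double avg_round_ms, unsigned threads_per_gpu)
{
    // Nothing to interleave with a single thread or without timing data.
    if (threads_per_gpu < 2 || avg_round_ms <= 0.0)
        return 0.0;

    // Assumed target: spread the starts evenly across one average round.
    const double target_gap = avg_round_ms / threads_per_gpu;
    const double elapsed = now_ms - other_start_ms;

    // Wait only if the other thread started too recently.
    return std::max(0.0, target_gap - elapsed);
}
```

With two threads and an average round of 400 ms, a thread that wants to start 100 ms after its neighbor would wait another 100 ms; one that arrives 300 ms later starts immediately.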
-
- 22 Nov, 2018 1 commit
-
-
fireice-uk authored
OpenCL: optimize strided index 1
-
- 21 Nov, 2018 8 commits
-
-
fireice-uk authored
OpenCL: add strided_index 3
-
psychocrypt authored
Use `mul24` to speedup the scratchpad index calculation. Co-authored-by: SChernykh <sergey.v.chernykh@gmail.com>
-
psychocrypt authored
Add new striding index where the memory is chunked by the size of the work group (worksize). Co-authored-by: SChernykh <sergey.v.chernykh@gmail.com>
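One way to picture a scratchpad layout chunked by work group size is the address mapping below. It is a hypothetical sketch, not the kernel code from this commit: the function `chunked_offset` and the exact layout (one contiguous chunk per work group, with the threads of a group interleaved element by element) are assumptions made for illustration.

```cpp
#include <cstddef>

// Hypothetical mapping (not taken from the miner's kernels).
//   tid               global thread id
//   elem              index of the element inside that thread's scratchpad
//   worksize          number of threads per work group
//   elems_per_thread  scratchpad size of one thread, in elements
// Returns the element offset inside the shared scratchpad buffer.
std::size_t chunked_offset(std::size_t tid, std::size_t elem,
                           std::size_t worksize, std::size_t elems_per_thread)
{
    const std::size_t group = tid / worksize; // which work group
    const std::size_t lane = tid % worksize;  // position inside the group

    // Each work group owns one contiguous chunk of the buffer.
    const std::size_t group_base = group * worksize * elems_per_thread;

    // Inside the chunk, the same element of neighboring threads is adjacent.
    return group_base + elem * worksize + lane;
}
```

With `worksize = 4` and `elems_per_thread = 8`, threads 0 to 3 share the first 32 elements of the buffer and thread 4 starts the next chunk at offset 32.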
-
fireice-uk authored
OpenCL: cnv8 optimization
-
fireice-uk authored
OpenCL: optimize cn-v8 div
-
fireice-uk authored
AMD: use more 32bit operations
-
fireice-uk authored
OpenCL reduce API overhead
-
psychocrypt authored
small optimization for non cryptonight_v8 algorithms
-
- 20 Nov, 2018 8 commits
-
-
fireice-uk authored
OpenCL: optimize cn-heavy div
-
SChernykh authored
- optimize division
-
fireice-uk authored
CUDA: reduce cn-v8 shared mem footprint
-
fireice-uk authored
OpenCL: reduce local mem footprint
-
fireice-uk authored
CUDA: optimize cn-v8 div
-
SChernykh authored
optimize cryptonight_heavy diff
-
psychocrypt authored
- change a few 64-bit variables to 32-bit. - provide type-qualified defines
-
fireice-uk authored
CUDA: optimize cn-heavy div
-
- 19 Nov, 2018 5 commits
-
-
psychocrypt authored
- remove useless `clFinish` - avoid downloading the number of threads for skein & co. and always start as many threads as in all other kernels (terminate useless threads)
-
psychocrypt authored
Reduce the local memory footprint to increase occupancy. Co-authored-by: SChernykh <sergey.v.chernykh@gmail.com>
-
psychocrypt authored
port optimizations from OpenCL. Co-authored-by: SChernykh <sergey.v.chernykh@gmail.com>
-
psychocrypt authored
Use only half of the AES matrix and compute the other half in place. This PR increases the possible occupancy.
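The reason half of the tables can be dropped is that the four AES round lookup tables are byte rotations of one another. The sketch below illustrates that property under assumptions: it uses the common big-endian table convention, the helper names are hypothetical, and the choice to keep tables 0 and 2 and derive tables 1 and 3 is an illustration, not necessarily the split this commit uses.

```cpp
#include <cstdint>

// Rotate a 32-bit word right by n bits (n must be between 1 and 31).
inline std::uint32_t rotr32(std::uint32_t v, unsigned n)
{
    return (v >> n) | (v << (32 - n));
}

// Multiply by 2 in GF(2^8) with the AES reduction polynomial.
inline std::uint8_t xtime(std::uint8_t s)
{
    return static_cast<std::uint8_t>((s << 1) ^ ((s & 0x80) ? 0x1b : 0x00));
}

// Table 0 entry for an S-box output byte s: bytes (2s, s, s, 3s).
inline std::uint32_t te0_entry(std::uint8_t s)
{
    const std::uint32_t s2 = xtime(s);
    const std::uint32_t s3 = s2 ^ s;
    return (s2 << 24) | (std::uint32_t(s) << 16) | (std::uint32_t(s) << 8) | s3;
}

// Table 2 is table 0 rotated by 16 bits; assume this one is stored as well.
inline std::uint32_t te2_entry(std::uint8_t s) { return rotr32(te0_entry(s), 16); }

// Tables 1 and 3 are not stored: one 8-bit rotate of a stored entry recovers them.
inline std::uint32_t te1_from_te0(std::uint32_t te0) { return rotr32(te0, 8); }
inline std::uint32_t te3_from_te2(std::uint32_t te2) { return rotr32(te2, 8); }
```

Storing two tables instead of four halves the on-chip memory the tables need, at the cost of one rotate per derived lookup, which is how a smaller footprint can translate into higher occupancy.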
-
psychocrypt authored
Port the optimized OpenCL division to CUDA. Co-authored-by: SChernykh <sergey.v.chernykh@gmail.com>
-
- 17 Nov, 2018 1 commit
-
-
fireice-uk authored
change load order for backends
-