- Sep 19, 2018
-
-
psychocrypt authored
Create a special pass for NVIDIA GPUs to load memory chunks first into the shared memory. Co-authored-by:
SChernykh <sergey.v.chernykh@gmail.com>
-
psychocrypt authored
- implement cryptonight_v8 - update auto adjust to fit the special requirements of `cryptonight_v8` - add fast math integer implementation for `sqrt`, `reciprocal` and `division` Co-authored-by:
SChernykh <sergey.v.chernykh@gmail.com>
-
psychocrypt authored
If the first bit of the nonce is `1` (which happens very often with a nicehash pool), some OpenCL implementations may handle the 64bit representation of the 32bit nonce on the device side as a signed integer. A right bitshift then pulls wrong one bits from the upper part of the 64bit nonce representation into the 32bit part of the nonce, and the computed share is invalid. - explicitly cast the nonce on the device to `uint` to avoid any side effects
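The effect described in this commit message can be reproduced on the host in plain C. This is a minimal sketch, not the kernel code from the commit: all function names are hypothetical, and the signed widening is modelled by hand so the example does not depend on implementation-defined shift behavior.

```c
#include <assert.h>
#include <stdint.h>

/* Models a device that treats the 32-bit nonce as a signed integer when
 * widening it to 64 bits: the top bit is copied into the upper half. */
uint64_t widen_as_signed(uint32_t nonce)
{
    return (nonce & 0x80000000u) ? (0xFFFFFFFF00000000ull | nonce)
                                 : (uint64_t)nonce;
}

/* Models the fix: the nonce is handled as an unsigned 32-bit value,
 * so the upper half of the 64-bit representation stays zero. */
uint64_t widen_as_unsigned(uint32_t nonce)
{
    return (uint64_t)nonce;
}

/* Right-shifts the 64-bit representation and keeps the low 32 bits. */
uint32_t shifted_low_word(uint64_t wide, unsigned shift)
{
    assert(shift < 64); /* shifting by 64 or more is undefined in C */
    return (uint32_t)(wide >> shift);
}
```

With the nonce `0x80000001` and a shift of 8, the signed model yields `0xFF800000` while the unsigned model yields `0x00800000`. For a nonce whose first bit is `0` both models agree.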
-
- Jul 14, 2018
-
-
psychocrypt authored
- add cryptonight_heavy derivative cryptonight_bittube2 - add coin bittube - remove coin ipbc because this coin is now called bittube
-
- Jul 08, 2018
-
-
psychocrypt authored
- explicit loop unrolling based on changes in @imperdin fork https://github.com/imperdin/xmr-stak/blob/master/xmrstak/backend/amd/amd_gpu/opencl/cryptonight.cl
-
- Jun 10, 2018
-
-
havenprotocol authored
- update pools.txt - add new algorithm `cryptonight_haven` - update all backends
-
- Jun 07, 2018
-
-
psychocrypt authored
- rename cryptonight_fast to cryptonight_masari - set dev pool to cryptonight_monero
-
- Jun 05, 2018
-
-
gnock authored
-
- May 03, 2018
-
-
Tony Butler authored
-
- May 01, 2018
-
-
psychocrypt authored
solve #1494 - add algorithm `cryptonight_v7_stellite` (internally named `cryptonight_stellite`)
-
- Apr 22, 2018
-
-
psychocrypt authored
- add algorithm `cryptonight_lite_v7_xor` - update documentation
-
- Apr 08, 2018
-
-
psychocrypt authored
- remove version numbers within the kernel - create separate program context for each mining algorithm - remove kernel `cn1_monero`, which is now integrated in `cn1` - rename `cnX` kernel to `cnX + algorithmNumber`
-
- Apr 01, 2018
-
-
psychocrypt authored
fix #1218 - remove inline function with ugly macro :-(
-
- Mar 25, 2018
-
-
xmr-stak-devs authored
Co-authored-by:
psychocrypt <psychocryptHPC@gmail.com> Co-authored-by:
fireice-uk <fireice-uk@users.noreply.github.com> Co-authored-by:
Lee Clagett <code@leeclagett.com> Co-authored-by:
curie-kief <curie-kief@users.noreply.github.com>
-
- Feb 21, 2018
-
-
psychocrypt authored
fix double definition of define `MEM_CHUNK`
-
- Feb 19, 2018
-
-
psychocrypt authored
- add new option `comp_mode` to the amd config - disable `if guards` within opencl kernel if `comp_mode : false`
-
psychocrypt authored
allow usage of Mesa OpenCL
-
- Feb 17, 2018
-
-
psychocrypt authored
- add new option for `strided_index` - add additional option if `strided_index == 2` to control the memory chunk width
-
- Feb 14, 2018
-
-
psychocrypt authored
reduce usage of registers: based on the suggestion of @enerc77
-
- Dec 30, 2017
-
-
psychocrypt authored
remove branch divergence in AMD OpenCL code based on #454
-
- Dec 02, 2017
-
-
psychocrypt authored
Allow changing the indexing used to address the hash scratchpad memory. - add option `strided_index` for each gpu
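The commit message does not spell out the two index mappings. As an illustration only, the sketch below contrasts a contiguous layout (each thread owns one block of the scratchpad) with a strided layout (elements of all threads are interleaved). The function names and both formulas are assumptions, not taken from the kernel.

```c
#include <assert.h>
#include <stddef.h>

/* Contiguous layout (assumed): thread t owns elements
 * [t * elems_per_thread, (t + 1) * elems_per_thread). */
size_t contiguous_offset(size_t thread, size_t elem, size_t elems_per_thread)
{
    assert(elem < elems_per_thread);
    return thread * elems_per_thread + elem;
}

/* Strided layout (assumed): element i of every thread is stored side by
 * side, so neighbouring threads touch neighbouring addresses. */
size_t strided_offset(size_t thread, size_t elem, size_t num_threads)
{
    assert(thread < num_threads);
    return elem * num_threads + thread;
}
```

For thread 2 and element 3, the contiguous mapping with 8 elements per thread gives offset 19, and the strided mapping with 4 threads gives offset 14.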
-
- Oct 27, 2017
-
-
psychocrypt authored
- add compile parameter to support aeon and xmr - update auto suggestion to handle aeon
-
- Oct 24, 2017
-
-
psychocrypt authored
- remove 32bit target value - always use 64bit target for amd backend
-
- Oct 10, 2017
-
-
psychocrypt authored
Early returns within a kernel can have unexpected behavior; this strongly depends on the OpenCL runtime compiler. To avoid errors, all early returns are removed. There is no negative effect (e.g. on performance) if the threads stay alive up to the end of the kernel. One source on the early return problem is: http://al-key-opencl.blogspot.de/2014/09/be-careful-not-to-mix-early-return-and.html This pull request also fixes a race condition where all threads fill the shared memory. A bug introduced with #16 is fixed as well: because of the early return, it was possible that the last block worked with wrongly initialized shared memory (the result would be a wrong hash if the result target size is valid).
-
- Oct 06, 2017
-
-
psychocrypt authored
It is possible that the number of results is greater than the size of the result output array; in this case invalid memory can be accessed within the device and on the host side.
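A common guard against this kind of overflow is to keep counting results but only store those that fit, and to clamp the count when reading. The sketch below is a single-threaded host-side model with hypothetical names and a hypothetical capacity; the real kernel would need an atomic counter, and the commit's actual fix may differ.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#define MAX_RESULTS 4 /* hypothetical capacity of the output array */

typedef struct {
    uint32_t nonces[MAX_RESULTS];
    uint32_t count; /* results found so far, may exceed MAX_RESULTS */
} result_buffer;

/* Writer side: store the nonce only if its slot is inside the array.
 * Returns 1 if the nonce was stored, 0 if it was dropped. */
int push_result(result_buffer *buf, uint32_t nonce)
{
    assert(buf != NULL);
    uint32_t idx = buf->count++;
    if (idx >= MAX_RESULTS)
        return 0;
    buf->nonces[idx] = nonce;
    return 1;
}

/* Reader side: never trust the raw counter as an array length. */
uint32_t stored_results(const result_buffer *buf)
{
    assert(buf != NULL);
    return buf->count < MAX_RESULTS ? buf->count : MAX_RESULTS;
}
```

Pushing six nonces into this buffer leaves `count` at 6, but `stored_results` reports 4, so neither side reads or writes past the end of `nonces`.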
-
- Oct 04, 2017
-
-
psychocrypt authored
The number of threads within a kernel must be a multiple of the worksize. If not it can crash on some systems.
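One common way to satisfy this constraint is to round the requested thread count up to the next multiple of the worksize before launching the kernel. The helper below is a sketch with a hypothetical name. Whether the commit rounds up, rounds down, or rejects the configuration is not stated in the message.

```c
#include <assert.h>
#include <stddef.h>

/* Rounds threads up to the next multiple of worksize.
 * A value that is already a multiple is returned unchanged. */
size_t round_up_to_worksize(size_t threads, size_t worksize)
{
    assert(worksize > 0);
    return ((threads + worksize - 1) / worksize) * worksize;
}
```

For a worksize of 8, a request for 1001 threads becomes 1008, while 1000 stays 1000. If the launch is padded this way, the extra threads must not produce results, for example by guarding their work with an index check.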
-
- Sep 30, 2017
-
-
psychocrypt authored
- string literal can only be 16kbyte
-
psychocrypt authored
- move source code to `src` - categorize files and move to group folder - change upper case class files to lower case - change C++ header to `*.hpp`
-
psychocrypt authored
- split literal into smaller chunks (Windows only supports 16k literals) - remove copy command for opencl folder
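One way to work around a per-literal size limit is to keep the kernel source as several short literals and join them at runtime. The sketch below uses hypothetical names and plain C. It illustrates the idea only and is not the mechanism used in this commit.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/* Joins n NUL-terminated chunks into one newly allocated string.
 * The caller owns the result and must free() it.
 * Returns NULL if the allocation fails. */
char *join_chunks(const char *const *chunks, size_t n)
{
    assert(chunks != NULL || n == 0);

    size_t total = 0;
    for (size_t i = 0; i < n; ++i)
        total += strlen(chunks[i]);

    char *out = malloc(total + 1);
    if (out == NULL)
        return NULL;

    size_t pos = 0;
    for (size_t i = 0; i < n; ++i) {
        size_t len = strlen(chunks[i]);
        memcpy(out + pos, chunks[i], len);
        pos += len;
    }
    out[pos] = '\0';
    return out;
}
```

Each chunk can then stay well below the compiler's limit, and the joined string is what gets passed to the OpenCL program build.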
-
psychocrypt authored
- include opencl source code during compile
-
psychocrypt authored
-