Commits · d322ee4fc4a4b5d56bec6fb0f3f8f61b15e860ff · Recolic / azure-cloud-mining-script

Feb 07, 2019

psychocrypt authored 6 years ago

@xmrig provided the information that the driver 19.2.1 for vega also
create invalid results if pragma unroll is used for the groestl algo.

d322ee4f

Feb 02, 2019
- OpenCL: fix Blake hashing · e274dbcc
  psychocrypt authored 6 years ago
```
Windows driver creates wrong code if unroll is used.
```
  e274dbcc
Feb 01, 2019
- OpenCL: use algorithm names instead of number · 88ea7f36
  psychocrypt authored 6 years ago
```
Use the algorithm names from `cryptonight.hpp` instead if number within the OpenCL kernel.
```
  88ea7f36
Jan 30, 2019

fix compile · 17f3aef0
psychocrypt authored 6 years ago
```
- fix broken trutle coin
- fix non cn_gpu algorithms
```
17f3aef0

Implement CN-GPU Proof-of-Work Algo · 346933d1

fireice-uk authored 6 years ago


Co-authored-by: psychocrypt <psychocryptHPC@gmail.com>
Co-authored-by: fireice-uk <fireice-uk@users.noreply.github.com>

346933d1

Jan 25, 2019
- Add CryptoNight Turtle Support. Special thanks to @DaveLong for his hard work in getting this done. · 749751e3
  Brandon Lehmann authored 6 years ago
  
  Unverified
  
  749751e3
Dec 06, 2018

fix bittube2 · e01eebc2

psychocrypt authored 6 years ago

Since #2080 bittube2 is broken.

- reintroduce special AES function for bittube2

e01eebc2

Dec 03, 2018

OpenCL: enable cn_v8 optimization for NVIDIA · ab19d370

psychocrypt authored 6 years ago

NVIDIA is using clang as device compiler so the reciprocal optimizations was disabled with #2104.

- re-enable optimized reciprocal calculation

ab19d370

Dec 02, 2018

OpenCl: fix NVIDIA · 1b27f0f3

psychocrypt authored 6 years ago

- fix broken compile: change used `ULL` to `UL` because `UL` is defined as 64bit
- fix memory copy to shared memory via vload8 (somehow it create wrong access)

1b27f0f3

Nov 30, 2018

OpenCL: opimize reciprocal calculation · bc91088a

psychocrypt authored 6 years ago


use for non clang (Rocm) OpenCL a optimized reciprocal calculation without lookup table.

Co-authored-by: SChernykh <sergey.v.chernykh@gmail.com>

bc91088a

Nov 29, 2018
- Added Cryptonight-Superfast · 053190bb
  LPHuynh authored 6 years ago
  
  053190bb
Nov 21, 2018

OpenCl: optimize strided index 1 · 39fa7c62

psychocrypt authored 6 years ago


Use `mul24` to speedup the scratchpad index calculation.

Co-authored-by: SChernykh <sergey.v.chernykh@gmail.com>

39fa7c62

OpenCL: add strided_index 3 · 3c9442ce

psychocrypt authored 6 years ago


Add new striding index where the memory is chunked by the size of the work group (worksize).

Co-authored-by: SChernykh <sergey.v.chernykh@gmail.com>

3c9442ce

OpenCL: cn1 optimization · 33e5825c
psychocrypt authored 6 years ago
```
small optimization for non cryptonight_v8 algorithms
```
33e5825c

Nov 20, 2018
- OpenCl: optimize cn-v8 div · bff5b000
  SChernykh authored 6 years ago
```
- optimize division
```
  bff5b000
- AMD: use more 32bit operations · f40c54e3
  psychocrypt authored 6 years ago
```
- change a few 64bit variables into 32bit.
- provide defines type quallified
```
  f40c54e3
Nov 19, 2018

OpenCL reduce API overhead · 6c563c9d

psychocrypt authored 6 years ago

- remove useless `clFinish`
- avoid download num threads for skein&co and start always as much threads as in all other kernel (terminate useless threads)

6c563c9d

OpenCL: reduce local mem footprint · 6f283928

psychocrypt authored 6 years ago


Reduce local memory foot print to increase the occupancy.

Co-authored-by: SChernykh <sergey.v.chernykh@gmail.com>

6f283928

Nov 16, 2018

fix ROCm compile · 18dbff68
psychocrypt authored 6 years ago
```
define shared memory in the outer scope
```
18dbff68

Optimize OpenCl · 28ef8e3d

SChernykh authored 6 years ago


- optimize kernel cn0 and cn2
- optimize vast int math
- use more 32bit variables

Co-authored-by: psychocrypt <psychocryptHPC@gmail.com>

28ef8e3d

Nov 06, 2018

AMD: speedup cryptonight_heavy division · bfb3243c

SChernykh authored 6 years ago

optimize the devision in cryptonight_heavy and cryptonight_haven

import of https://github.com/xmrig/xmrig-amd/pull/185/commits/5d9b9334654df25cea7707f667990fd1577ed290

bfb3243c

Oct 10, 2018

fix right bitshift in `amd_bitalign` · b4387ac0

psychocrypt authored 6 years ago

In the current implementation the bit align is using signed integer which results in pulling in
ones in the case the sign bit is set.

- cast to unsigned integer before using bitshift

b4387ac0

Oct 05, 2018

fix invalid shares · 8e1e7447

psychocrypt authored 6 years ago

With rocm we fighted very long with invalid shares. This is now solved with rocm 1.9 and
this tiny fix.
It is not fully clear where a memory optimization is kicking in and break the kernel `Groestl` if the variables `M` and `H` are not `volatile`.
The performance ill not change with this fix.

The fix is tested with rocm 1.9 with a VEGA64 and a RX570

8e1e7447

Oct 04, 2018
- whitespace trims · 17e0b06e
  Tony Butler authored 6 years ago
  
  17e0b06e
Sep 30, 2018
- iadd cryptonight_v8 tweak 2.2 · cac26b96
  psychocrypt authored 6 years ago
```
add cpu implementation for the final monero POW
```
  cac26b96
Sep 19, 2018

asm, style and spelling fixes · 1692c543

psychocrypt authored 6 years ago

- fix code style issues
- fix spelling issue
- fix asm to support newer clang versions

1692c543

AMD: add unroll option · 28f41a6e

psychocrypt authored 6 years ago

add option `unroll` for OpenCL to allow better tuning the main POW kernel.

28f41a6e

OpenCL: optimize NVIDIA pass · df1a4200

psychocrypt authored 6 years ago


Create a special pass for NVIDIA GPUs to load memory chunks first into the shared memory.

Co-authored-by: SChernykh <sergey.v.chernykh@gmail.com>

df1a4200

OpenCl: cryptonight_v8 · 5608f8df

psychocrypt authored 6 years ago


- implement cryptonight_v8
- update auto adjust to fit the special requirements of `cryptonight_v8`
- add fast math integer implementation for `sqrt`, `reciprocal`  and `division`

Co-authored-by: SChernykh <sergey.v.chernykh@gmail.com>

5608f8df

fix nicehash `invalid results` · 77160cf1

psychocrypt authored 6 years ago

If the first bit of the nonce is `1` (this is very often if we use a nicehash pool)
than it could be that some OpenCL implementations handle the 64bit representation of the 32bit
nonce on the device side as signed integer.
During a right bitshift we pull wrong ones from the wrong higher part of the 64bit
nonce representation into the 32bit part of the nonce.
The result will be that the computed share is invalid.

- explicit cast the nonce on the device to `uint` to avoid any side effects

77160cf1

Jul 14, 2018

cryptonight_bittube2 · 12575794

psychocrypt authored 7 years ago

- add cryptonight_heavy derivate cryptonight_bittube2
- add coin bittube
- remove coin ipbc because this coin is now called bittube

12575794

Jul 08, 2018

optimize cn-heavy AMD · f6f4070c

psychocrypt authored 7 years ago

- explicit loop unrolling

based on changes in @imperdin fork https://github.com/imperdin/xmr-stak/blob/master/xmrstak/backend/amd/amd_gpu/opencl/cryptonight.cl

f6f4070c

Jun 10, 2018
- Add support for CryptoNight Haven (small Heavy tweak) · b55acb71
  havenprotocol authored 7 years ago
```
- update pools.txt
- add new algorithm `cryptonight_haven`
- update all backends
```
  b55acb71
Jun 07, 2018

masari updates new algorithm · adfbeb4c

psychocrypt authored 7 years ago

- rename cryptonight_fast to cryptonight_masari
- set dev pool to cryptonight_monero

adfbeb4c

Jun 05, 2018
- Add Cryptonight-fast · 81437eb0
  gnock authored 7 years ago
  
  81437eb0
May 03, 2018
- Spell check · 3cd0bd95
  Tony Butler authored 7 years ago
  
  3cd0bd95
May 01, 2018

support stellite v4 fork · 624b4ca8

psychocrypt authored 7 years ago

solve #1494

- add algorithm `cryptonight_v7_stellite` (internal named: `cryptonight_stellite`)

624b4ca8

Apr 22, 2018
- add support for IPBC coin · 7e2dbaf9
  psychocrypt authored 7 years ago
```
- add algorithm `cryptonight_lite_v7_xor`
- update documentation
```
  7e2dbaf9
Apr 08, 2018

amd simplify kernel for different algorithms · a5797643

psychocrypt authored 7 years ago

- remove version numbers within the kernel
- create seperate program context for each mining algorithm
- remove kernel `cn1_monero` is now integrated in `cn1`
- remname `cnX` kernel in `cnX + algorithmNumber`

a5797643

Apr 01, 2018
- fix OpenCl AMD on OSX · a832fdf3
  psychocrypt authored 7 years ago
```
fix #1218

- remove inline function with ugly macro :-(
```
  a832fdf3