  1. Dec 29, 2018
    • OpenCL: allow more than two algorithms · a39ee088
      psychocrypt authored
      In the current implementation, the POW algorithm in the dev pool section of a
      currency is not taken into account when the binaries are created.
      This PR changes that behavior and allows creating binaries for more than two POW algorithms.
  2. Dec 06, 2018
    • fix bittube2 · e01eebc2
      psychocrypt authored
      bittube2 has been broken since #2080.

      - reintroduce the special AES function for bittube2
  3. Dec 04, 2018
  4. Dec 03, 2018
    • fix default interleave value · 05b4976d
      psychocrypt authored
      The default value for interleave was wrongly set to 50.

      Remove the value and take the default from the default constructor instead of side-channeling it through the JSON parser.
    • OpenCL: enable cn_v8 optimization for NVIDIA · ab19d370
      psychocrypt authored
      NVIDIA uses clang as its device compiler, so the reciprocal optimization was disabled by #2104.

      - re-enable the optimized reciprocal calculation
  5. Dec 02, 2018
    • OpenCL: auto tuning option · af87b408
      psychocrypt authored
      Add an option to brute-force the intensity settings and lock in the intensity with the highest hash rate.

      - update the documentation of the `interleave` option to mention the side effect with `auto-tune`
      - disable `interleave` auto adjustment if `auto-tune` is enabled
      - jconf: add `auto-tune` as an optional option
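The brute-force search described in this commit can be pictured roughly as follows. This is only a minimal sketch, not the code from the commit: the function name `pick_best_intensity`, its parameters, and the `measure_hashrate` callback are all hypothetical, and the real option presumably measures the hash rate on the GPU instead of calling a callback.

```cpp
#include <cstddef>
#include <functional>

// Hypothetical sketch: try every intensity in [min_i, max_i] in increments of
// `step` (which must be greater than zero) and return the intensity for which
// the supplied callback reported the highest hash rate.
inline std::size_t pick_best_intensity(
    std::size_t min_i, std::size_t max_i, std::size_t step,
    const std::function<double(std::size_t)>& measure_hashrate)
{
    std::size_t best = min_i;
    double best_rate = measure_hashrate(min_i);

    for(std::size_t i = min_i + step; i <= max_i; i += step)
    {
        const double rate = measure_hashrate(i);
        if(rate > best_rate)
        {
            best_rate = rate;
            best = i; // lock in the best intensity seen so far
        }
    }
    return best;
}
```

Passing the measurement as a callback keeps the sketch testable without a GPU; any function that maps an intensity to a hash rate can be plugged in.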
    • OpenCL: fix NVIDIA · 1b27f0f3
      psychocrypt authored
      - fix broken compile: use the `UL` suffix instead of `ULL`, because `UL` is defined as 64-bit
      - fix the memory copy to shared memory via vload8 (it somehow created wrong accesses)
    • OpenCL: auto config two threads per GPU · e46226fa
      psychocrypt authored
      The auto config now generates two threads per GPU by default for AMD devices.

      - subtract the 128 MiB safety margin only from the maximum available GPU memory, not from the memory available for a single alloc call
      - extend the memory documentation in amd.txt
    • fix clamp implementation · b606304b
      psychocrypt authored
      Due to an incorrect implementation, clamp was not working.
  6. Nov 30, 2018
  7. Nov 29, 2018
  8. Nov 27, 2018
    • OpenCL: thread interleaving · d8316f7d
      psychocrypt authored
      If two threads use the same GPU device, the start time of each hash round is optimized based on the average time needed to calculate a batch of hashes.

      This way of optimizing the hash rate was first introduced by @SChernykh. This implementation is based on the implementation in xmrig but differs in the details.

      - introduce a new config option `interleave`
      - implement thread interleaving
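To illustrate the idea of shifting the start of a hash round based on the average round time, here is a minimal sketch. It is not the implementation from this commit or from xmrig: the function name `interleave_wait_ms` is hypothetical, and the choice of starting half an average round after the other thread is an assumption made purely for illustration.

```cpp
#include <algorithm>

// Hypothetical sketch: two threads share one GPU. Given the current time, the
// time the other thread started its round, and the average duration of a round
// (all in milliseconds), return how long this thread should wait so that it
// starts roughly half an average round after the other thread.
inline double interleave_wait_ms(double now_ms, double other_start_ms, double avg_round_ms)
{
    // Assumed target offset: half of the average round time.
    const double target_start_ms = other_start_ms + avg_round_ms / 2.0;

    // Never return a negative wait; if the target has passed, start immediately.
    return std::max(0.0, target_start_ms - now_ms);
}
```

For example, if the other thread started at 90 ms, a round takes 40 ms on average, and it is now 100 ms, the sketch asks this thread to wait 10 ms.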
  9. Nov 21, 2018
  10. Nov 20, 2018
  11. Nov 19, 2018
  12. Nov 17, 2018
    • change load order for backends · cf959a1c
      psychocrypt authored
      If CUDA is loaded before AMD but no CUDA is available, it can happen that the embedded OpenCL code is empty.
      This is only an issue if the binary is built statically on a different system.
  13. Nov 16, 2018
  14. Nov 06, 2018
  15. Oct 24, 2018
    • NVIDIA: fix wrong number of threads · 954296ed
      psychocrypt authored
      In the CUDA backend for monero we always start twice as many threads as needed.
      Those threads are then removed after the AES matrix is copied to shared memory.
      Nevertheless, it is the result of a copy-paste bug.

      - start the correct number of threads for `monero`
  16. Oct 16, 2018
  17. Oct 15, 2018
    • fix broken AMD OpenCL compile · 2a0d565b
      psychocrypt authored
      The AMD compiler for OpenCL shipped with driver 14XX is broken
      and cannot compile xmr-stak since the monero v8 changes were introduced.

      - work around a simple compare
      - add new device define `OPENCL_DRIVER_MAJOR`
  18. Oct 11, 2018
    • NVIDIA: support for multiple CUDA libs · 732b0e41
      psychocrypt authored
      Allow shipping the miner with multiple CUDA backends that depend on different driver versions.
      This makes it possible to support Turing/Volta and old Fermi GPUs within one release.

      - add support to search for the first working CUDA backend
      - add some more messages to support better debugging (if a user has some issues)
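The "first working backend" search mentioned in this commit could look roughly like the sketch below. The names `first_working_backend` and `try_load` are hypothetical, and the loader is passed in as a callback because the commit message does not say how a backend library is actually probed.

```cpp
#include <functional>
#include <string>
#include <vector>

// Hypothetical sketch: walk the candidate backend libraries in order of
// preference and return the name of the first one the loader accepts.
// Returns an empty string if no candidate could be loaded.
inline std::string first_working_backend(
    const std::vector<std::string>& candidates,
    const std::function<bool(const std::string&)>& try_load)
{
    for(const auto& name : candidates)
    {
        if(try_load(name))
            return name; // stop at the first backend that works
    }
    return std::string();
}
```

Ordering the candidate list from the newest to the oldest driver requirement would let a single release prefer the newest backend and fall back to older ones.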
  19. Oct 10, 2018
    • NVIDIA: tweak `get_reciprocal` · b1504b36
      SChernykh authored
      - remove helper array to perform division
      - tweak `get_reciprocal`
    • NVIDIA: rename config option `comp_mode` · bd4a4c94
      psychocrypt authored
      The name `comp_mode` is a badly chosen name for a memory load pattern.
      Therefore I changed it to `mem_mode`, which also gives us the possibility
      to add new modes later if needed.

      - rename `comp_mode` to `mem_mode`
      - fix documentation
    • fix right bitshift in `amd_bitalign` · b4387ac0
      psychocrypt authored
      In the current implementation the bit align uses a signed integer, which results in pulling in
      ones when the sign bit is set.

      - cast to unsigned integer before using the bitshift
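The effect described in this commit can be reproduced in plain C++. The sketch below is not the OpenCL code from the repository; the helper names are hypothetical. `bitalign_sketch` models a funnel shift on unsigned values, and the two small helpers contrast a right shift on a signed value with the same shift after a cast to unsigned.

```cpp
#include <cstdint>

// Hypothetical sketch of a bit-align (funnel shift): concatenate hi:lo into
// 64 bits, shift right by (shift mod 32), and keep the low 32 bits.
// All arithmetic is unsigned, so zeros are shifted in from the top.
inline std::uint32_t bitalign_sketch(std::uint32_t hi, std::uint32_t lo, std::uint32_t shift)
{
    const std::uint64_t wide = (static_cast<std::uint64_t>(hi) << 32) | lo;
    return static_cast<std::uint32_t>(wide >> (shift & 31u));
}

// Right shift on a signed value: with an arithmetic shift (guaranteed since
// C++20, and what common compilers do before that) the sign bit is replicated,
// so ones are pulled in when the value is negative.
inline std::uint32_t shift_signed(std::int32_t v, std::uint32_t shift)
{
    return static_cast<std::uint32_t>(v >> shift);
}

// The fix pattern: cast to unsigned first, so zeros are shifted in.
inline std::uint32_t shift_unsigned(std::int32_t v, std::uint32_t shift)
{
    return static_cast<std::uint32_t>(v) >> shift;
}
```

With the sign bit set (for example `INT32_MIN`) and a shift of 4, `shift_unsigned` yields `0x08000000`, while `shift_signed` yields `0xF8000000` on compilers that shift arithmetically.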
    • CUDA: fix invalid results · ed2168b4
      psychocrypt authored
      If `comp_mode` is false, the results on a Windows platform will be invalid.
      The reason is that `ulong4` is 16 bytes on Windows and 32 bytes on Linux.

      thx @xmrig for finding and solving the issue

      fix #1873
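The size difference comes from `unsigned long`, which is 4 bytes on 64-bit Windows (LLP64) and 8 bytes on 64-bit Linux (LP64). The sketch below shows the general pattern in plain C++ with two hypothetical structs; it is not the actual fix from the commit, only an illustration of why a fixed-width type avoids the problem.

```cpp
#include <cstddef>
#include <cstdint>

// Hypothetical stand-in for a vector type built from `unsigned long`:
// its size follows the platform's data model (16 bytes under LLP64,
// 32 bytes under LP64).
struct long_based4
{
    unsigned long x, y, z, w;
};

// Hypothetical fixed-width alternative: always four 64-bit lanes.
struct fixed_width4
{
    std::uint64_t x, y, z, w;
};

// Code that assumes 32-byte loads can check the assumption at compile time.
static_assert(sizeof(fixed_width4) == 32, "expected four 64-bit lanes");
```

A `static_assert` like this turns a silent platform difference into a compile error on the platform where the assumption does not hold.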
  20. Oct 08, 2018
    • improve error message · 58b7c66c
      psychocrypt authored
      Add a suggestion to a common line that is shown in the event of a crash under Windows.