| Commit message | Author | Age | Files | Lines |
| |
|
|
|
|
|
|
|
|
| |
sumokoin is broken if `bfactor >= 5` is used (the default on Windows)
sumokoin for `sm_20` is broken due to missing extern shared memory
- call phase3 kernel two times if sumokoin is enabled
- create extern shared memory for phase3 kernel
|
|
|
|
|
|
| |
fix #1297
If sm_20 is mixed with other architectures, the detection of the minimal supported architecture is broken.
|
|
|
|
| |
- fix that version argument was not passed to extended kernel parameters
|
|
|
|
| |
- rename `mining_fork_height` to `*_version`
|
|
|
|
|
| |
- add `fork_height` to currency
- refactor algorithm selection
|
|
|
|
|
|
| |
fix #1218
- remove inline function with ugly macro :-(
|
| |
|
|
|
|
|
|
| |
- documentation: change `aeon` to `aeon7`
- change `case` order in `switch` statements
- update README.txt
|
|
|
|
|
| |
- add new pow for AEON
- fix missing cryptonight-heavy selection for multi hashes
|
|
|
|
|
|
| |
allow disabling the OpenCL cache
- useful for read-only systems
- useful for unknown errors during cache reading
|
|
|
|
|
|
|
|
|
| |
fix warning
```
/Users/user/xmr-stak/xmrstak/backend/amd/amd_gpu/gpu.cpp:481:11: warning: using the result of an assignment as a condition without parentheses [-Wparentheses]
if( ret = clGetProgramInfo(ctx->Program, CL_PROGRAM_BINARIES, num_devices * sizeof(char*), all_programs.data(),NULL) != CL_SUCCESS)
```
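The warning points at an operator precedence problem: `!=` binds tighter than `=`, so `ret` receives the result of the comparison instead of the OpenCL status code. The following is a minimal sketch of that effect, not the code from this commit. `fake_query`, `kSuccess`, `status_as_parsed` and `status_with_parentheses` are hypothetical names that stand in for the OpenCL call and `CL_SUCCESS`.

```cpp
// Hypothetical stand-ins for an OpenCL call and CL_SUCCESS.
constexpr int kSuccess = 0;
int fake_query(int status) { return status; }

// How the compiler parses `if (ret = call() != kSuccess)`:
// the comparison runs first, so ret only ever holds 0 or 1.
int status_as_parsed(int status) {
    int ret;
    if ((ret = (fake_query(status) != kSuccess))) {
        return ret;  // 1 for any failure, the real error code is lost
    }
    return ret;      // 0 on success
}

// With explicit parentheses the assignment runs first,
// so ret keeps the original status code.
int status_with_parentheses(int status) {
    int ret;
    if ((ret = fake_query(status)) != kSuccess) {
        return ret;  // the original error code
    }
    return ret;      // kSuccess
}
```

With a failing status such as `-5`, the first function reports `1` while the second reports `-5`.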
|
|
|
|
| |
- remove call `IsCurrencyMonero` with `cn_select_memory`
|
| |
|
| |
|
| |
|
|\ |
|
| |
| |
| | |
Fix for gcc 7 behavior
|
|/ |
|
| |
|
|\
| |
| | |
revert input size change
|
| |
| |
| |
| | |
revert #1198, the block size is limited to 84 bytes
|
|/
|
|
| |
change warning when precompiled OpenCL code is not found
|
| |
|
| |
|
|
|
|
|
|
|
| |
Co-authored-by: psychocrypt <psychocryptHPC@gmail.com>
Co-authored-by: fireice-uk <fireice-uk@users.noreply.github.com>
Co-authored-by: Lee Clagett <code@leeclagett.com>
Co-authored-by: curie-kief <curie-kief@users.noreply.github.com>
|
|
|
|
| |
A redefinition of a variable in a local scope prevented the intensity from being rounded to a multiple of the work size.
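A minimal sketch of this kind of shadowing bug, under stated assumptions: all names here (`round_to_work_size`, `intensity_shadowed`, `intensity_fixed`) are hypothetical, and rounding down is assumed because the commit message does not give the rounding direction.

```cpp
#include <cstddef>

// Hypothetical helper: round intensity down to a multiple of work_size.
// Assumes work_size > 0.
std::size_t round_to_work_size(std::size_t intensity, std::size_t work_size) {
    return intensity - (intensity % work_size);
}

// Bug pattern: the inner declaration creates a second variable,
// so the rounded value never reaches the outer one.
std::size_t intensity_shadowed(std::size_t intensity, std::size_t work_size) {
    std::size_t result = intensity;
    {
        std::size_t result = round_to_work_size(intensity, work_size);
        (void)result;  // the inner variable goes out of scope unused
    }
    return result;     // still the unrounded intensity
}

// Fix pattern: assign to the existing variable instead of redeclaring it.
std::size_t intensity_fixed(std::size_t intensity, std::size_t work_size) {
    std::size_t result = intensity;
    {
        result = round_to_work_size(intensity, work_size);
    }
    return result;     // a multiple of work_size
}
```

Compiling with `-Wshadow` (gcc, clang) flags the inner declaration in the first function.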
|
|\
| |
| | |
CUDA: reduce startup time
|
| |
| |
| |
| |
| | |
- reduce startup time for multi gpu systems
- initialize the GPU memory non-concurrently
|
|\ \
| | |
| | | |
OpenCL compile cache
|
| |/
| |
| |
| |
| |
| |
| | |
Reduce OpenCL start time by using a self-made compiler cache.
- store compiled OpenCL binary
- load OpenCL binary if available
|
|/ |
|
|
|
|
| |
fix duplicate definition of the define `MEM_CHUNK`
|
|\
| |
| | |
fix compile with OpenCL 1.1
|
| |
| |
| |
| | |
guard error types that are only defined in OpenCL > 1.1
|
|\ \
| | |
| | | |
add Mesa OpenCL support
|
| |/
| |
| |
| | |
allow usage of Mesa OpenCL
|
|/
|
|
|
| |
- add new option `comp_mode` to the amd config
- disable `if guards` within the OpenCL kernel if `comp_mode : false`
|
|\
| |
| | |
AMD: option `mem_chunk` and new `strided_index`
|
| |
| |
| |
| |
| | |
- add new option for `strided_index`
- add additional option if `strided_index == 2` to control the memory chunk width
|
|/
|
|
| |
reduce usage of registers: based on the suggestion of @enerc77
|
|\
| |
| | |
improve AMD auto suggestion for AEON
|
| |
| |
| |
| | |
increase the intensity limit for AEON
|
|\ \
| | |
| | | |
speedup Volta
|
| |/
| |
| |
| |
| |
| |
| | |
- enable L1 cache for Nvidia Volta GPUs and newer
- remove explicit cache control for Volta GPUs and newer
This pull request increases the hash rate for Volta GPUs by ~5%
|
|\ \
| | |
| | | |
fix output of gpu name
|
| |/
| |
| |
| | |
fix that the GPU name is printed before the name is queried from OpenCL
|
|/
|
|
|
| |
The AMD jconf.cpp would only accept an array. The config sample, as well as the nvidia and cpu configs, work with the value 'null' because they accept a 'kNullType'. As a result, AMD GPUs could not be turned off: the config file would fail to parse on load.
This change makes AMD consistent with the others, so the AMD backend can now be disabled.
|
|\
| |
| | |
avoid nonce overlapping
|
| |
| |
| |
| |
| | |
The cpu miner backend uses the wrong range of nonces: instead of `[startNonce, startNonce + nonce_chunk)`,
`(startNonce, startNonce + nonce_chunk]` is used. This results in an overlap with nonces used by the gpu back-ends.
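The off-by-one between the two intervals can be sketched as follows. This is an illustration only, not the backend code: `nonces_half_open` and `nonces_shifted` are hypothetical helpers, and the values 100 and 4 are arbitrary.

```cpp
#include <cstdint>
#include <vector>

// Intended range: [start, start + chunk), the chunk owns `start` itself.
std::vector<uint32_t> nonces_half_open(uint32_t start, uint32_t chunk) {
    std::vector<uint32_t> out;
    for (uint32_t n = start; n < start + chunk; ++n) {
        out.push_back(n);
    }
    return out;
}

// Faulty range: (start, start + chunk], shifted up by one.
// It skips `start` and ends on the first nonce of the next chunk.
std::vector<uint32_t> nonces_shifted(uint32_t start, uint32_t chunk) {
    std::vector<uint32_t> out;
    for (uint32_t n = start + 1; n <= start + chunk; ++n) {
        out.push_back(n);
    }
    return out;
}
```

For a chunk starting at 100 with size 4, the half-open range covers 100 to 103, while the shifted range covers 101 to 104. Nonce 104 is also the first nonce of the chunk that starts at 104, which is the overlap described above.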
|