| Commit message (Collapse) | Author | Age | Files | Lines |
... | |
| |/
|/|
| |
| |
| |
| | |
remove branch differgences in AMD OpenCl code based on #454
a Please enter the commit message for your changes. Lines starting
|
| | |
|
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| | |
Add messages
Missing include
1
|
|/ |
|
| |
|
|\
| |
| | |
handle cuda error codes
|
| |
| |
| |
| | |
handle all error codes from the cuda api calls.
|
|/
|
|
|
| |
- fix cuda9.1 compile (remove includ eof device_functions.hpp/ removed with cuda9.1)
- remove NVIDIA Volta gpus for MAC OSX
|
|
|
|
| |
Makes casting more explicit.
|
|
|
|
|
| |
When compiled with VS2017, the negative applied to the uint wait
time is ignored. Fixed by casting first.
|
|
|
|
| |
- fix indention
|
|
|
|
|
|
|
|
| |
Be more conservative with the auto suggestion.
- increase bfactor if `smx <= 6`
- limit memory for pascal < GTX1070 to 2GiB
- limt memory for pascal <= GTX1080 to 4GiB
|
|\
| |
| | |
Extend low_power_mode to do up to 5 cn hashes at a time.
|
| | |
|
| | |
|
| |
| |
| |
| |
| |
| | |
The "low_power_mode" option in config.txt can be set to numeral values
between 1 and 5. A value of 5 seems optimal on certain processors
with large L4 cache.
|
|\ \
| | |
| | | |
add message to `CUDA_CHECK...` macros
|
| |/
| |
| |
| |
| | |
- add macro `CUDA_CHECK_MSG_KERNEL` and `CUDA_CHECK_MSG`
- add suggestion of typicle errors can be solved
|
|\ \
| |/
|/| |
option to controll gpu synchronization
|
| |
| |
| |
| |
| | |
- add option `sync_mode`
- update auto suggestion and jconf
|
| | |
|
| | |
|
|\ \
| | |
| | | |
optimize aes round command queue
|
| |/
| |
| |
| | |
unroll AES round by hand to help the compiler to optimize the command queue
|
| |
| |
| |
| | |
Change the default for strided index. Th euser feedback shows that strided_index = true will give a view hashes more per second.
|
|/
|
|
|
|
| |
Allow to change the indexing used to address the hash scratchpad memory.
- add option `strided_index` for each gpu
|
|\
| |
| | |
fix CUDA launch bounds usage
|
| |
| |
| |
| |
| |
| | |
fix #191
lauch bounds must be placed before the return type but after the template paramater
|
|\ \
| |/
|/| |
fix auto suggestion for low end devices
|
| |
| |
| |
| | |
Increase bfactor for all devices with lesser than 6 multi processors.
|
| |
| |
| |
| | |
Fix for AMD GPU detection on MacOS X
On MacOS X the platform is reported as Apple, and the vendor is reported as AMD.
|
|/
|
| |
Fixing the compile error on MacOS X
|
|\
| |
| | |
fix max work size shown to user
|
| |
| |
| |
| | |
The max work size shown to the user was the hardware maximum but not take in account thet some kernel spwning 8 times more threads per work group than configured by the user.
|
|\ \
| | |
| | | |
handle OpenCL errors
|
| |/
| |
| |
| |
| | |
- handle opencl errors to avoid unexpected behaviors
- remove code duplicatio nwithin the some method
|
|/
|
|
|
|
| |
- enforce that only host side OpenCl 1.2 functions are used if the miner is compiled with `XMR-STAK_COMPILE=generic`
- update documentation
- add CMake option `XMR-STAK_COMPILE` to windows
|
|\
| |
| | |
fix wrong cuda binary arch detection
|
| |
| |
| |
| | |
fix wrong arch comparsion
|
|/
|
|
|
| |
The lmem is still incalculably and crash the miner very often.
Increase the potential lmem usage to 16kiB to respect lmem alignments, ...
|
| |
|
| |
|
|
|
|
| |
- removed all space indention in `*.hpp` and `*.cpp` files
|
|\
| |
| | |
Network extensions
|
| | |
|
| | |
|
|\ \
| | |
| | | |
check gpu architecture
|
| | |
| | |
| | |
| | |
| | |
| | | |
- check if the gpu architecture is supported by the compiled miner binary
- remove not supported gpus from the auto suggestion
- disallow the selection of a not supported gpu by hand tuning the config
|
|\ \ \
| |/ /
| | | |
fix wrong memory detection
|
| |/
| |
| |
| |
| |
| |
| |
| |
| | |
Free and total memory is only evaluated on the first device.
To detect the gpu memory the gpu must be selected.
- create context on the gpu before the memory is checked
- add smx to the auto detection
- change the result code of `cuda_get_deviceinfo()`
|