summaryrefslogtreecommitdiffstats
path: root/xmrstak/backend/nvidia
Commit message (Collapse)AuthorAgeFilesLines
* revert input size changepsychocrypt2018-03-251-2/+3
| | | | revert #1198, the block size is limited to 84byte
* fix input size on devicepsychocrypt2018-03-251-1/+1
|
* Fixing allocation issueJuan Leni2018-03-251-1/+1
|
* XMR-Stak 2.3.0 RCxmr-stak-devs2018-03-254-97/+338
| | | | | | | Co-authored-by: psychocrypt <psychocryptHPC@gmail.com> Co-authored-by: fireice-uk <fireice-uk@users.noreply.github.com> Co-authored-by: Lee Clagett <code@leeclagett.com> Co-authored-by: curie-kief <curie-kief@users.noreply.github.com>
* CUDA: reduce startup timepsychocrypt2018-02-282-16/+35
| | | | | - reduce startup time for multi gpu systems - initialize the GPU memory non concurrent
* speedup Voltapsychocrypt2018-01-302-1/+19
| | | | | | | - enable L1 cache for Nvidia Volta GPUs and newer - remove explicit cache controll for Volta GPU and newer This pull request increases the hash rate for Volta GPUs by ~5%
* reduce memory usage for low end gpuspsychocrypt2018-01-221-0/+6
| | | | reduce memory usage to 1GiB for NVIDIA devices with <=6 SMX
* Merge pull request #697 from dangrabbits/masterpsychocrypt2018-01-091-0/+3
|\ | | | | Updated config.tpl comments to include how to exclude CPU/GPUs
| * Updated config.tpl comments to include how to exclude CPU/GPUsdangrabbits2017-12-251-0/+3
| |
* | Merge pull request #778 from b-/patch-2fireice-uk2018-01-081-1/+1
|\ \ | | | | | | Rename "MacOS" to "macOS"
| * | Update minethd.cppBrian Recchia2018-01-041-1/+1
| |/ | | | | | | | | | | Changed capitalization of "macOS" Squashed the commit
* | Modify invalid result report to show GPU idDoug Johnson2017-12-301-1/+1
|/
* Make sure we are using a steady clockfireice-uk2017-12-221-1/+1
|
* Merge pull request #464 from psychocrypt/topic-handleCudaErrorCodesfireice-uk2017-12-151-8/+8
|\ | | | | handle cuda error codes
| * handle cuda error codespsychocrypt2017-12-101-8/+8
| | | | | | | | handle all error codes from the cuda api calls.
* | fix cuda9.1 compilepsychocrypt2017-12-141-1/+0
|/ | | | | - fix cuda9.1 compile (remove includ eof device_functions.hpp/ removed with cuda9.1) - remove NVIDIA Volta gpus for MAC OSX
* Beautification edit as per fireice-uk's suggestionAndrew Whittle2017-12-091-1/+1
| | | | Makes casting more explicit.
* Fix compat_usleep for WIN32Andrew Whittle2017-12-091-1/+1
| | | | | When compiled with VS2017, the negative applied to the uint wait time is ignored. Fixed by casting first.
* fix intentionpsychocrypt2017-12-091-1/+3
| | | | - fix indention
* conservative NVIDIA auto suggestionpsychocrypt2017-12-081-1/+12
| | | | | | | | Be more conservative with the auto suggestion. - increase bfactor if `smx <= 6` - limit memory for pascal < GTX1070 to 2GiB - limt memory for pascal <= GTX1080 to 4GiB
* Merge pull request #399 from psychocrypt/topic-nvidiaErrorWithMessagefireice-uk2017-12-083-23/+53
|\ | | | | add message to `CUDA_CHECK...` macros
| * add message to `CUDA_CHECK...` macrospsychocrypt2017-12-073-23/+53
| | | | | | | | | | - add macro `CUDA_CHECK_MSG_KERNEL` and `CUDA_CHECK_MSG` - add suggestion of typicle errors can be solved
* | option to controll gpu synchronizationpsychocrypt2017-12-017-6/+39
|/ | | | | - add option `sync_mode` - update auto suggestion and jconf
* Merge pull request #221 from psychocrypt/fix-cudaLaunchBoundsfireice-uk2017-11-241-1/+1
|\ | | | | fix CUDA launch bounds usage
| * fix CUDA launch bounds usagepsychocrypt2017-11-231-1/+1
| | | | | | | | | | | | fix #191 lauch bounds must be placed before the return type but after the template paramater
* | fix auto suggestion for low end devicespsychocrypt2017-11-201-0/+4
|/ | | | Increase bfactor for all devices with lesser than 6 multi processors.
* Merge pull request #133 from psychocrypt/fix-cudaArchBinaryDetectionfireice-uk2017-11-171-1/+1
|\ | | | | fix wrong cuda binary arch detection
| * fix wrong cuda binary arch detectionpsychocrypt2017-11-171-1/+1
| | | | | | | | fix wrong arch comparsion
* | fix nvidia auto suggestionpsychocrypt2017-11-171-2/+2
|/ | | | | The lmem is still incalculably and crash the miner very often. Increase the potential lmem usage to 16kiB to respect lmem alignments, ...
* Remove whitespace linesUnknown2017-11-164-7/+7
|
* fix wrong indentions by hand after auto conversionpsychocrypt2017-11-162-6/+6
|
* remove space indention with tabspsychocrypt2017-11-162-53/+53
| | | | - removed all space indention in `*.hpp` and `*.cpp` files
* Merge pull request #115 from fireice-uk/topic-net-extpsychocrypt2017-11-161-1/+1
|\ | | | | Network extensions
| * Pass threadno up the food chainUnknown2017-11-141-1/+1
| |
* | Merge pull request #116 from psychocrypt/topic-checkIfBinarySupportsGPUArchfireice-uk2017-11-152-1/+49
|\ \ | | | | | | check gpu architecture
| * | check gpu architecturepsychocrypt2017-11-152-1/+49
| | | | | | | | | | | | | | | | | | - check if the gpu architecture is supported by the compiled miner binary - remove not supported gpus from the auto suggestion - disallow the selection of a not supported gpu by hand tuning the config
* | | Merge pull request #114 from psychocrypt/fix-wrongMemoryDetectionfireice-uk2017-11-152-13/+43
|\ \ \ | |/ / | | | fix wrong memory detection
| * | fix wrong memory detectionpsychocrypt2017-11-152-13/+43
| |/ | | | | | | | | | | | | | | | | Free and total memory is only evaluated on the first device. To detect the gpu memory the gpu must be selected. - create context on the gpu before the memory is checked - add smx to the auto detection - change the result code of `cuda_get_deviceinfo()`
* | Fix misc bugs (#117)fireice-uk2017-11-152-1/+6
|/ | | | | | * CONF_NO_TLS fix * gpu error fix + always return to sched after pin
* add backend typepsychocrypt2017-11-101-0/+1
| | | | | - add type of the backend to each backend-plugin - add `gteName` to `iBackend` to get the name of the backend
* optimize NVIDIA autosuggestionpsychocrypt2017-11-032-9/+30
| | | | | - avoid creation of a config with zero threads or blocks - WINDOWS: reduce the used memory for the auto suggestion by the amount of already used memory
* fix windows compile and broken aeonpsychocrypt2017-10-284-22/+28
| | | | | - fix windows linker error during compile - fix wrong parameter to call aeon (nvidia-backend)
* increase safety memory for autosuggestionpsychocrypt2017-10-271-8/+8
| | | | | - increase safty memory from 64 to 128 MiB - NVIDIA: increase lmem reserve per thread to 1kiB
* rename `xmr` to `monero`psychocrypt2017-10-273-14/+15
| | | | | | - rename all `xmr` to `monero` - be insensitive while check for set currency - add function to compate two strings insensitive
* add aeon support to backend nvidiapsychocrypt2017-10-275-22/+63
| | | | | - add template parameter to kernel to support aeon and xmr - update auto suggestion
* check if affinity must be setpsychocrypt2017-10-271-2/+3
| | | | - check if affinity vale before calling `thd_setaffinity`
* thread affinity for non cpu backendspsychocrypt2017-10-262-7/+24
| | | | | | | | This is a follow up of #43 and use the some mechanism to set the thread affinity for non cpu backends correct. - use cpu affinity workflow for nvidia and amd - cpu: move messages of thread spawning before thread creation
* Merge pull request #45 from fireice-uk/topic-nonce-allocfireice-uk2017-10-231-9/+15
|\ | | | | Implement pool-controlled nonce allocation
| * small fixfireice-uk2017-10-221-1/+1
| |
| * Fix GPU nicehash supportfireice-uk2017-10-221-1/+4
| |
OpenPOWER on IntegriCloud