path: root/xmrstak/backend/amd/amd_gpu
Commit message | Author | Age | Files | Lines
* Fix condition on stellite check | Jason Rhinelander | 2018-06-04 | 1 | -1/+1
|
* Spell check | Tony Butler | 2018-06-04 | 5 | -241/+241
|
* support stellite v4 fork | psychocrypt | 2018-06-04 | 2 | -8/+16
|   solve #1494
|   - add algorithm `cryptonight_v7_stellite` (internally named `cryptonight_stellite`)
* fix that cli option `--noAMDCache` | psychocrypt | 2018-06-04 | 1 | -6/+6
|   - fix that too many OpenCL calls were used if the cache is disabled
* add support for IPBC coin | psychocrypt | 2018-06-04 | 2 | -7/+16
|   - add algorithm `cryptonight_lite_v7_xor`
|   - update documentation
* remove fork for sumokoin and monero | psychocrypt | 2018-06-04 | 1 | -1/+1
|   remove fork version for sumokoin and monero7
* add independent dev pool coin description | psychocrypt | 2018-06-04 | 1 | -6/+6
|   - allow the dev pool to fork on a different block version than the user-described coin
|
|   All algorithms are centered around the user coin description. The user coin description may contain two different coin algorithms. The dev pool coin description may only use algorithms that are also used in the user coin description.
|
|   There are two ways to define a non-forking coin:
|   - set both algorithm descriptions of the user coin to the same algorithm and set the version to zero
|   - set the first algorithm in the user coin description to the one you would like to use in the dev pool, set the second algorithm to the correct representation of the coin, and set the version to 255. This allows the dev pool to mine on a different coin algorithm than the non-forking user coin.
|
|   Do not use an algorithm with a different scratchpad size for the dev pool.
* allow non AMD OpenCL driver and devices | psychocrypt | 2018-06-04 | 2 | -8/+23
|   - add CLI flag to explicitly use non AMD OpenCL and devices
|   - adjust OpenCL output (use OpenCL instead of AMD if --altOpenCL is used)
|   - optimize NVIDIA OpenCL auto suggestion
* refactor scratchpad creation | psychocrypt | 2018-06-04 | 1 | -4/+6
|   Use the maximum scratchpad size from before and after the fork.
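The idea in this commit message can be sketched as follows. This is a minimal illustration only: the function name `scratchpad_alloc_size` and its parameters are hypothetical and are not taken from the xmr-stak sources. The assumption is that one buffer must serve both the pre-fork and the post-fork algorithm, so it is sized for the larger of the two.

```cpp
#include <algorithm>
#include <cstddef>

// Hypothetical helper: a single scratchpad allocation has to be large
// enough for the algorithm used before the fork and the one used after it,
// so the larger of the two sizes is chosen.
inline std::size_t scratchpad_alloc_size(std::size_t size_before_fork,
                                         std::size_t size_after_fork)
{
    return std::max(size_before_fork, size_after_fork);
}
```

For example, with illustrative sizes of 2 MiB before and 4 MiB after the fork, the helper returns 4 MiB, so no reallocation is needed when the algorithm switches.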
* amd simplify kernel for different algorithms | psychocrypt | 2018-06-04 | 3 | -429/+293
|   - remove version numbers within the kernel
|   - create separate program context for each mining algorithm
|   - remove kernel `cn1_monero`; it is now integrated in `cn1`
|   - rename `cnX` kernel to `cnX + algorithmNumber`
* Fix spelling mistake in gpu.cpp | Takeshi Suzuki | 2018-06-04 | 1 | -2/+2
|
* AMD OpenCL: fix sumokoin | psychocrypt | 2018-06-04 | 1 | -3/+12
|   - fix that version argument was not passed to extended kernel parameters
* refactor mining algo selection | psychocrypt | 2018-06-04 | 1 | -3/+3
|   - add `fork_height` to currency
|   - refactor algorithm selection
* fix OpenCl AMD on OSX | psychocrypt | 2018-06-04 | 1 | -4/+1
|   fix #1218
|   - remove inline function with ugly macro :-(
* POW AEON v7 | psychocrypt | 2018-06-04 | 1 | -3/+3
|   - add new pow for AEON
|   - fix missing cryptonight-heavy selection for multi hashes
* add cli option `--noAMDCache` | psychocrypt | 2018-06-04 | 1 | -21/+25
|   allow disabling the OpenCL cache
|   - useful for read-only systems
|   - useful for unknown errors during cache reading
* fix parentheses warning | psychocrypt | 2018-06-04 | 1 | -1/+1
|   fix warning
```
/Users/user/xmr-stak/xmrstak/backend/amd/amd_gpu/gpu.cpp:481:11: warning: using the result of an assignment as a condition without parentheses [-Wparentheses]
if( ret = clGetProgramInfo(ctx->Program, CL_PROGRAM_BINARIES, num_devices * sizeof(char*), all_programs.data(),NULL) != CL_SUCCESS)
```
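The warning above points at an operator precedence issue: `!=` binds tighter than `=`, so the variable receives the result of the comparison rather than the status code. A minimal sketch of the difference, assuming a hypothetical `fake_cl_call` that stands in for a real OpenCL call and a hypothetical `STATUS_OK` that plays the role of `CL_SUCCESS`:

```cpp
#include <cstdint>

// Hypothetical stand-ins, not part of OpenCL or xmr-stak.
constexpr std::int32_t STATUS_OK = 0;               // plays the role of CL_SUCCESS
inline std::int32_t fake_cl_call() { return -30; }  // simulates a failing call

// Form flagged by the compiler: the comparison runs first, so `ret`
// stores its boolean result (converted to 1) instead of the status code.
inline std::int32_t status_without_inner_parentheses()
{
    std::int32_t ret = 0;
    if((ret = fake_cl_call() != STATUS_OK)) { /* error path */ }
    return ret;
}

// Parenthesized form: the assignment runs first, so `ret` keeps the
// real status code and can be reported to the user.
inline std::int32_t status_with_inner_parentheses()
{
    std::int32_t ret = 0;
    if((ret = fake_cl_call()) != STATUS_OK) { /* error path */ }
    return ret;
}
```

Both forms take the error path here, but only the second one preserves the error code for logging.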
* OpenCL precompiled code message | psychocrypt | 2018-03-25 | 1 | -3/+3
|   change warning when precompiled OpenCL code is not found
* XMR-Stak 2.3.0 RC | xmr-stak-devs | 2018-03-25 | 3 | -44/+334
|   Co-authored-by: psychocrypt <psychocryptHPC@gmail.com>
|   Co-authored-by: fireice-uk <fireice-uk@users.noreply.github.com>
|   Co-authored-by: Lee Clagett <code@leeclagett.com>
|   Co-authored-by: curie-kief <curie-kief@users.noreply.github.com>
* fix shadowed variable | psychocrypt | 2018-03-13 | 1 | -1/+1
|   A redefinition of a variable in a local scope prevented the intensity from being rounded to a multiple of the work size.
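A sketch of how such a shadowing bug can look, under stated assumptions: the function names are hypothetical, and rounding down to the nearest multiple is an assumption made for illustration, since the commit message does not state the rounding direction.

```cpp
#include <cstddef>

// Buggy shape: the inner declaration creates a second variable with the
// same name, so the rounded value is lost when the inner scope ends.
inline std::size_t intensity_shadowed(std::size_t requested, std::size_t worksize)
{
    std::size_t intensity = requested;
    if(requested % worksize != 0)
    {
        std::size_t intensity = (requested / worksize) * worksize; // shadows outer
        (void)intensity; // value is never seen outside this block
    }
    return intensity; // still the unrounded request
}

// Fixed shape: assign to the existing variable instead of redeclaring it.
inline std::size_t intensity_fixed(std::size_t requested, std::size_t worksize)
{
    std::size_t intensity = requested;
    if(requested % worksize != 0)
        intensity = (requested / worksize) * worksize;
    return intensity;
}
```

With a requested intensity of 1000 and a work size of 16, the shadowed variant returns 1000 unchanged, while the fixed variant returns 992.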
* add OpenCL compiler cache | psychocrypt | 2018-03-03 | 1 | -29/+182
|   Reduce OpenCL start time by using a self-made compiler cache.
|   - store compiled OpenCL binary
|   - load OpenCL binary if available
* fix broken memchunk feature | psychocrypt | 2018-02-21 | 2 | -2/+2
|   fix double definition of the define `MEM_CHUNK`
* Merge pull request #1102 from psychocrypt/topic-opencl_1_1 | fireice-uk | 2018-02-21 | 1 | -0/+4
|\
| |   fix compile with OpenCL 1.1
| * fix compile with OpenCL 1.1 | psychocrypt | 2018-02-19 | 1 | -0/+4
| |   guard error types that are only defined in OpenCL >1.1
* | Merge pull request #1101 from psychocrypt/topic-mesaSupport | fireice-uk | 2018-02-21 | 2 | -3/+26
|\ \
| | |   add Mesa OpenCL support
| * | add Mesa OpenCL support | psychocrypt | 2018-02-19 | 2 | -3/+26
| |/
| |   allow usage of Mesa OpenCL
* | add OpenCL compatibility mode | psychocrypt | 2018-02-19 | 3 | -9/+28
|/
|   - add new option `comp_mode` to the amd config
|   - disable `if guards` within the OpenCL kernel if `comp_mode : false`
* Merge pull request #1087 from psychocrypt/topic-blockedStride3 | fireice-uk | 2018-02-19 | 3 | -11/+36
|\
| |   AMD: option `mem_chunk`and new `strided_index`
| * AMD: `mem_chunk`and new `strided_index` | psychocrypt | 2018-02-17 | 3 | -11/+36
| |   - add new option for `strided_index`
| |   - add additional option if `strided_index == 2` to control the memory chunk width
* | AMD: reduce register usage | psychocrypt | 2018-02-14 | 1 | -4/+4
|/
|   reduce usage of registers: based on the suggestion of @enerc77
* fix output of gpu name | psychocrypt | 2018-01-30 | 1 | -1/+1
|   fix that the GPU name is printed before the name is queried from OpenCL
* remove usage of `rotate` | psychocrypt | 2018-01-24 | 1 | -1/+10
|   revert the change that the OpenCL function `rotate` is used instead of `SKEIN_ROT`
* author update | Grant Galitz | 2018-01-24 | 1 | -0/+1
|
* Optimize Skein | Grant Galitz | 2018-01-24 | 1 | -30/+48
|   - Eliminate modulus math (It runs slow inside microcode).
|   - Convert whatever the hell was going on into a rotate op. Like... Someone kinda reinvented the wheel in order to do a simple rotate.
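For readers unfamiliar with the operation, a left rotate moves bits that fall off the top back in at the bottom. OpenCL C provides a built-in `rotate` for this; the following is a plain C++ sketch of the same operation for 64-bit words, with the hypothetical name `rotl64`. It is not the kernel code from this commit.

```cpp
#include <cstdint>

// Rotate a 64-bit word left by n bits. The shift count is masked so that
// n == 0 and n >= 64 are well defined and no modulus operation is needed.
inline std::uint64_t rotl64(std::uint64_t x, unsigned n)
{
    n &= 63u;
    return (x << n) | (x >> ((64u - n) & 63u));
}
```

Masking with `& 63u` keeps both shift counts in the range 0 to 63, which avoids undefined behavior from shifting by the full word width.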
* differgence in OpenCL code | psychocrypt | 2017-12-30 | 1 | -56/+48
|   remove branch divergences in AMD OpenCL code based on #454
* Make k unsigned. | Unknown | 2017-12-06 | 1 | -1/+1
|
* Clean up the x-mas tree | Unknown | 2017-12-06 | 1 | -52/+69
|
* Merge pull request #332 from psychocrypt/topic-tuneAMD | fireice-uk | 2017-12-04 | 1 | -8/+22
|\
| |   optimize aes round command queue
| * optimize aes round command queue | psychocrypt | 2017-12-03 | 1 | -8/+22
| |   unroll the AES round by hand to help the compiler optimize the command queue
* | add AMD scratchpad indexing option | psychocrypt | 2017-12-02 | 3 | -2/+21
|/
|   Allow changing the indexing used to address the hash scratchpad memory.
|   - add option `strided_index` for each gpu
* Update gpu.cpp | ExceptionallyGreat | 2017-11-21 | 1 | -2/+2
|   Fix for AMD GPU detection on MacOS X
|   On MacOS X the platform is reported as Apple, and the vendor is reported as AMD.
* Update gpu.hpp | ExceptionallyGreat | 2017-11-21 | 1 | -0/+1
|   Fixing the compile error on MacOS X
* Merge pull request #138 from psychocrypt/fix-amdMaxWorkSize | fireice-uk | 2017-11-18 | 1 | -0/+4
|\
| |   fix max work size shown to user
| * fix max work size shown to user | psychocrypt | 2017-11-18 | 1 | -0/+4
| |   The max work size shown to the user was the hardware maximum and did not take into account that some kernels spawn 8 times more threads per work group than configured by the user.
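The arithmetic behind this fix can be sketched as follows. The function name and the default factor parameter are hypothetical; only the factor of 8 comes from the commit message, and the actual computation in the source may differ.

```cpp
#include <cstddef>

// Hypothetical helper: if some kernels launch 8 threads for every thread
// the user configures, the largest work size a user may choose is the
// hardware maximum divided by that factor.
inline std::size_t max_user_worksize(std::size_t hardware_max,
                                     std::size_t threads_per_configured = 8)
{
    return hardware_max / threads_per_configured;
}
```

For a device that reports a hardware maximum of 256 threads per work group, the value shown to the user would be 32 under this assumption.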
* | Merge pull request #137 from psychocrypt/topic-handleOpenClErrors | fireice-uk | 2017-11-18 | 1 | -64/+82
|\ \
| | |   handle OpenCL errors
| * | handle OpenCL errors | psychocrypt | 2017-11-18 | 1 | -64/+82
| |/
| |   - handle OpenCL errors to avoid unexpected behaviors
| |   - remove code duplication within the same method
* | fix OpenCL compatibility | psychocrypt | 2017-11-18 | 1 | -2/+2
|/
|   - enforce that only host side OpenCL 1.2 functions are used if the miner is compiled with `XMR-STAK_COMPILE=generic`
|   - update documentation
|   - add CMake option `XMR-STAK_COMPILE` to windows
* Remove whitespace lines | Unknown | 2017-11-16 | 1 | -3/+3
|
* remove space indention with tabs | psychocrypt | 2017-11-16 | 1 | -31/+31
|   - removed all space indentation in `*.hpp` and `*.cpp` files
* rename `xmr` to `monero` | psychocrypt | 2017-10-27 | 1 | -6/+6
|   - rename all `xmr` to `monero`
|   - be case-insensitive when checking the set currency
|   - add function to compare two strings case-insensitively
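A case-insensitive comparison of the kind mentioned in this commit could look like the sketch below. The name `equal_insensitive` is hypothetical and the code is not taken from the xmr-stak sources; it handles ASCII-style names such as currency identifiers, not full Unicode case folding.

```cpp
#include <algorithm>
#include <cctype>
#include <string>

// Compare two strings while ignoring letter case. Characters are passed
// as unsigned char because std::tolower requires a value representable
// as unsigned char (or EOF).
inline bool equal_insensitive(const std::string& a, const std::string& b)
{
    return a.size() == b.size() &&
           std::equal(a.begin(), a.end(), b.begin(),
                      [](unsigned char x, unsigned char y) {
                          return std::tolower(x) == std::tolower(y);
                      });
}
```

With this helper, a configured currency of `Monero` or `MONERO` matches `monero`, while `monero7` does not because the lengths differ.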