path: root/xmrstak/backend/amd/amd_gpu/opencl
Commit message | Author | Age | Files | Lines
* Spell check | Tony Butler | 2018-06-04 | 4 | -236/+236
|
* support stellite v4 fork | psychocrypt | 2018-06-04 | 1 | -7/+15
|   solve #1494
|   - add algorithm `cryptonight_v7_stellite` (internally named `cryptonight_stellite`)
* add support for IPBC coin | psychocrypt | 2018-06-04 | 1 | -6/+15
|   - add algorithm `cryptonight_lite_v7_xor`
|   - update documentation
* amd simplify kernel for different algorithms | psychocrypt | 2018-06-04 | 1 | -210/+91
|   - remove version numbers within the kernel
|   - create a separate program context for each mining algorithm
|   - remove kernel `cn1_monero`; it is now integrated in `cn1`
|   - rename `cnX` kernels to `cnX + algorithmNumber`
* fix OpenCL AMD on OSX | psychocrypt | 2018-06-04 | 1 | -4/+1
|   fix #1218
|   - remove inline function with ugly macro :-(
* XMR-Stak 2.3.0 RC | xmr-stak-devs | 2018-03-25 | 1 | -13/+260
|   Co-authored-by: psychocrypt <psychocryptHPC@gmail.com>
|   Co-authored-by: fireice-uk <fireice-uk@users.noreply.github.com>
|   Co-authored-by: Lee Clagett <code@leeclagett.com>
|   Co-authored-by: curie-kief <curie-kief@users.noreply.github.com>
* fix broken memchunk feature | psychocrypt | 2018-02-21 | 1 | -1/+1
|   fix double definition of the define `MEM_CHUNK`
* Merge pull request #1101 from psychocrypt/topic-mesaSupport | fireice-uk | 2018-02-21 | 1 | -0/+5
|\   add Mesa OpenCL support
| * add Mesa OpenCL support | psychocrypt | 2018-02-19 | 1 | -0/+5
| |   allow usage of Mesa OpenCL
* | add OpenCL compatibility mode | psychocrypt | 2018-02-19 | 1 | -3/+16
|/   - add new option `comp_mode` to the amd config
|    - disable `if guards` within the OpenCL kernel if `comp_mode : false`
* Merge pull request #1087 from psychocrypt/topic-blockedStride3 | fireice-uk | 2018-02-19 | 1 | -9/+26
|\   AMD: option `mem_chunk` and new `strided_index`
| * AMD: `mem_chunk` and new `strided_index` | psychocrypt | 2018-02-17 | 1 | -9/+26
| |   - add new option for `strided_index`
| |   - add additional option if `strided_index == 2` to control the memory chunk width
* | AMD: reduce register usage | psychocrypt | 2018-02-14 | 1 | -4/+4
|/   reduce usage of registers: based on the suggestion of @enerc77
* remove usage of `rotate` | psychocrypt | 2018-01-24 | 1 | -1/+10
|   revert the change that replaced `SKEIN_ROT` with the OpenCL function `rotate`
* author update | Grant Galitz | 2018-01-24 | 1 | -0/+1
|
* Optimize Skein | Grant Galitz | 2018-01-24 | 1 | -30/+48
|   - Eliminate modulus math (it runs slowly inside microcode).
|   - Convert whatever the hell was going on into a rotate op. Like... Someone kinda reinvented the wheel in order to do a simple rotate.
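For readers unfamiliar with the operation this commit refers to, here is a minimal sketch of a plain 64-bit left rotate in C. It is not taken from the Skein code in this directory; the function name `rotl64` is hypothetical, and the sketch only shows what "a simple rotate" means.

```c
#include <assert.h>
#include <stdint.h>

/* Rotate a 64-bit word left by n bits (hypothetical helper, 0 <= n < 64). */
static uint64_t rotl64(uint64_t x, unsigned n)
{
    assert(n < 64);
    /* n == 0 is handled separately because shifting by 64 is undefined in C. */
    return n == 0 ? x : (x << n) | (x >> (64 - n));
}
```

Bits shifted out at the top re-enter at the bottom, so the operation needs only two shifts and an OR and no modulus arithmetic.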
* divergence in OpenCL code | psychocrypt | 2017-12-30 | 1 | -56/+48
|   remove branch divergences in AMD OpenCL code based on #454
* Merge pull request #332 from psychocrypt/topic-tuneAMD | fireice-uk | 2017-12-04 | 1 | -8/+22
|\   optimize aes round command queue
| * optimize aes round command queue | psychocrypt | 2017-12-03 | 1 | -8/+22
| |   unroll the AES round by hand to help the compiler optimize the command queue
* | add AMD scratchpad indexing option | psychocrypt | 2017-12-02 | 1 | -1/+18
|/   Allow changing the indexing used to address the hash scratchpad memory.
|    - add option `strided_index` for each gpu
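As a generic illustration of what a scratchpad indexing option can switch between, the following C sketch contrasts a contiguous per-thread layout with an interleaved (strided) one. This is an assumption for illustration only, not the addressing code of the kernel in this directory; both function names and all parameters are hypothetical.

```c
#include <assert.h>
#include <stddef.h>

/* Contiguous layout: each thread owns one unbroken region of
 * `words_per_thread` elements. */
static size_t contiguous_offset(size_t thread, size_t word, size_t words_per_thread)
{
    assert(word < words_per_thread);
    return thread * words_per_thread + word;
}

/* Interleaved layout: element `word` of every thread is stored side by side,
 * so neighbouring threads touch neighbouring addresses. */
static size_t interleaved_offset(size_t thread, size_t word, size_t num_threads)
{
    assert(thread < num_threads);
    return word * num_threads + thread;
}
```

Which layout is faster depends on the hardware and driver, which is presumably why the choice is exposed as a per-GPU option.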
* add aeon support to amd backend | psychocrypt | 2017-10-27 | 1 | -11/+11
|   - add compile parameter to support aeon and xmr
|   - update auto suggestion to handle aeon
* amd: use 64bit target | psychocrypt | 2017-10-24 | 1 | -8/+17
|   - remove 32bit target value
|   - always use the 64bit target for the amd backend
* remove early returns | psychocrypt | 2017-10-10 | 1 | -335/+376
|   Early returns within a kernel can have unexpected behavior; this strongly depends on the OpenCL runtime compiler. To avoid errors, all early returns are removed. There is no negative effect (e.g. on performance) if the threads stay alive until the end of the kernel.
|   One source on early returns is: http://al-key-opencl.blogspot.de/2014/09/be-careful-not-to-mix-early-return-and.html
|   This pull request also fixes a race condition where all threads fill the shared memory. A bug introduced with #16 is fixed as well: because of the early return, it was possible for the last block to work with wrongly initialized shared memory (the result would be a wrong hash if the result target size is valid).
* avoid possible illegal memory access | psychocrypt | 2017-10-06 | 1 | -8/+27
|   It is possible that the number of results is greater than the size of the result output array; in this case invalid memory can be accessed on the device and on the host side.
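The kind of guard this message describes can be sketched on the host side as follows. This is a minimal illustration, not the code from the commit; the function name `copy_results`, the flat `uint32_t` buffer, and the parameter names are all hypothetical.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Copy at most `capacity` results, even if the reported count is larger.
 * Returns the number of results actually copied. */
static uint32_t copy_results(const uint32_t *src, uint32_t reported,
                             uint32_t *dst, uint32_t capacity)
{
    assert(src != NULL && dst != NULL);
    /* Clamp the count so neither buffer is read or written out of bounds. */
    uint32_t n = reported < capacity ? reported : capacity;
    for (uint32_t i = 0; i < n; ++i)
        dst[i] = src[i];
    return n;
}
```

The same clamp is needed wherever results are written, since a counter that exceeds the array size would otherwise index past its end.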
* fix invalid kernel call parameter | psychocrypt | 2017-10-05 | 1 | -16/+25
|   The number of threads within a kernel must be a multiple of the worksize. If not, it can crash on some systems.
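A common way to satisfy this constraint is to round the requested thread count up to the next multiple of the work-group size before launching the kernel. The sketch below shows only that arithmetic; the function name `round_up_global_size` is hypothetical, and it is an assumption that the commit uses this approach.

```c
#include <assert.h>
#include <stddef.h>

/* Round `threads` up to the next multiple of `worksize`. */
static size_t round_up_global_size(size_t threads, size_t worksize)
{
    assert(worksize > 0);
    size_t remainder = threads % worksize;
    return remainder == 0 ? threads : threads + (worksize - remainder);
}
```

For example, 1000 threads with a worksize of 64 become 1024. The kernel then has to make sure the 24 padding threads do no real work, for instance by comparing the global id against the original thread count.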
* fix windows compile | psychocrypt | 2017-09-30 | 1 | -1/+2
|   - a string literal can only be 16 kbyte
* group files | psychocrypt | 2017-09-30 | 6 | -0/+1725
    - move source code to `src`
    - categorize files and move to group folder
    - change upper case class files to lower case
    - change C++ header to `*.hpp`