| Commit message (Collapse) | Author | Age | Files | Lines |
| |
|
|
|
|
|
|
| |
solve #1494
- add algorithm `cryptonight_v7_stellite` (internal named: `cryptonight_stellite`)
|
|
|
|
|
| |
- add algorithm `cryptonight_lite_v7_xor`
- update documentation
|
|
|
|
|
|
|
| |
- remove version numbers within the kernel
- create seperate program context for each mining algorithm
- remove kernel `cn1_monero` is now integrated in `cn1`
- remname `cnX` kernel in `cnX + algorithmNumber`
|
|
|
|
|
|
| |
fix #1218
- remove inline function with ugly macro :-(
|
|
|
|
|
|
|
| |
Co-authored-by: psychocrypt <psychocryptHPC@gmail.com>
Co-authored-by: fireice-uk <fireice-uk@users.noreply.github.com>
Co-authored-by: Lee Clagett <code@leeclagett.com>
Co-authored-by: curie-kief <curie-kief@users.noreply.github.com>
|
|
|
|
| |
fix double definition of define `MEM_CHUNK`
|
|\
| |
| | |
add Mesa OpenCL support
|
| |
| |
| |
| | |
allow usage of Mesa OpenCl
|
|/
|
|
|
| |
- add new option `comp_mode` to the amd config
- disable `if guards` within opencl kernel if `comp_mode : false`
|
|\
| |
| | |
AMD: option `mem_chunk`and new `strided_index`
|
| |
| |
| |
| |
| | |
- add new option for `strided_index`
- add additional option if `strided_index == 2` to controll the memory chunk with
|
|/
|
|
| |
reduce usage of registers: based on the suggestion of @enerc77
|
|
|
|
| |
revert the change that the OpenCl function `rotate` is used instead of `SKEIN_ROT`
|
| |
|
|
|
|
|
| |
- Eliminate modulus math (It runs slow inside microcode).
- Convert whatever the hell was going on into a rotate op. Like... Someone kinda reinvented the wheel in order to do a simple rotate.
|
|
|
|
|
|
| |
remove branch differgences in AMD OpenCl code based on #454
a Please enter the commit message for your changes. Lines starting
|
|\
| |
| | |
optimize aes round command queue
|
| |
| |
| |
| | |
unroll AES round by hand to help the compiler to optimize the command queue
|
|/
|
|
|
|
| |
Allow to change the indexing used to address the hash scratchpad memory.
- add option `strided_index` for each gpu
|
|
|
|
|
| |
- add compile parameter to support aeon and xmr
- update auto suggestion to handle aeon
|
|
|
|
|
| |
- remove 32bit target value
- use always 64bit target for amd backend
|
|
|
|
|
|
|
|
|
|
|
|
| |
Early returns within a kernel can have unexpected behavior, this strongly depends on the opencl
runtime compiler. To avoid errors all early returns are removed.
There is no negative effect(e.g.performance) if the threads stay alive up to the end of the kernel.
One source for the early return is: http://al-key-opencl.blogspot.de/2014/09/be-careful-not-to-mix-early-return-and.html
This pull request also fix a race condition where all fill the shared memory.
Also a bug introduced with #16 is fixed, because of the early return not was possible that
the last block works with an wrong initilized shared memory (result should be a wrong hash if the result target size is valid).
|
|
|
|
|
| |
Is is possible that the number of results are greater than the result output array,
in this case invalid memory can be access within the device and on the host side.
|
|
|
|
|
| |
The number of threads within a kernel must be a multiple of the worksize.
If not it can crash on some systems.
|
|
|
|
| |
- string literal can only be 16kbyte
|
|
- move source code to `src`
- categorize files and move to group folder
- change upper case class files to lower case
- change C++ header to `*.hpp`
|