summaryrefslogtreecommitdiffstats
path: root/src/ffts_nd.c
Commit message (Collapse)AuthorAgeFilesLines
* Combine ffts_tranpose_scalar and ffts_transpose, and use ↵Jukka Ojanen2016-04-051-111/+1
| | | | ffts_transpose_scalar as native C fallback
* Fix ffts_init_nd() for 3 or higher rank complex FFTsJukka Ojanen2016-04-051-25/+30
|
* Fix neon_transpose8 for non-square matrices, move loops to assembly side, ↵Jukka Ojanen2016-03-291-7/+1
| | | | about 5% faster
* Rename neon_transpose to neon_transpose4, 4x4 tiled matrix transpose.Jukka Ojanen2016-03-281-2/+2
| | | | Rename neon_transpose_to_buf to neon_transpose8, 8x8 tiled matrix transpose.
* "transpose_buf" is not usedJukka Ojanen2016-03-281-14/+9
|
* Improve performance of small complex 2D Neon transform by 15%Jukka Ojanen2016-03-281-63/+6
|
* Enable building shared library and start version numbering from 0.9.0. On ↵Jukka Ojanen2015-11-301-2/+4
| | | | Windows when using FFTS as a DLL, define FFTS_SHARED. This is not mandatory, but it offers a little performance increase. Hide symbols when possible to improve compiler optimization and sizeof binary. Use CMake target alias "ffts" to choose between static and shared library, preferring static
* ffts_nd.c is using SSE2 intrinsics, detect and include emmintrin.h instead ↵Jukka Ojanen2015-03-191-2/+2
| | | | xmmintrin.h, and fix GCC error: inlining failed in call to always_inline '_mm_load_pd': target specific option mismatch by adding "-msse2" instead of "-msse"
* Improve header logicJukka Ojanen2015-03-101-3/+1
|
* Fix warning "comparison between signed and unsigned integer expressions"Jukka Ojanen2014-12-091-3/+4
|
* Fix redefinition of ffts_plan_tJukka Ojanen2014-11-171-1/+1
|
* Add forgotten "ffts.h" headerJukka Ojanen2014-11-161-0/+1
|
* Cleaning and reorganizingJukka Ojanen2014-10-291-250/+331
|
* Adding in Vim modelines to all .c and .h files.Robert Massaioli2013-12-051-0/+1
|
* Fixed transpose -- thanks to Sowon Kim <mogyak@gmail.com>Anthony Blake2013-07-161-1/+1
|
* Fixed up the smaller VFP transforms. Inverse VFP and real/nd VFP still not ↵Anthony Blake2013-04-221-2/+4
| | | | working yet.
* Changes to ND codeAnthony Blake2013-01-251-36/+69
|
* miscAnthony Blake2012-12-131-1/+37
|
* Slight speed improvement for ARM NDAnthony Blake2012-12-121-2/+2
|
* Fixed ND segfaultAnthony Blake2012-12-121-4/+4
|
* 2D real-valued transforms workAnthony Blake2012-11-151-7/+7
|
* Fixed some compiler warnings, generalized transform interface (in prep for ↵Anthony Blake2012-11-151-1/+1
| | | | double etc)
* Uses slightly faster call to internal transformsAnthony Blake2012-11-141-2/+2
|
* Multi-dimensional saves some planningAnthony Blake2012-11-141-2/+17
|
* Slightly better transpose for NEONAnthony Blake2012-11-081-34/+87
|
* Build warningsAnthony Blake2012-11-071-4/+4
|
* Fixed compile warningsAnthony Blake2012-11-071-1/+1
|
* 2D has SIMD transpose (still needs work though)Anthony Blake2012-11-061-4/+32
|
* Multi-dimensional works (slow transpose tho)Anthony Blake2012-11-021-42/+15
|
* 2D transforms workAnthony Blake2012-10-311-19/+23
|
* Square 2D worksAnthony Blake2012-10-311-4/+76
|
* Initial multi-dimensional structureAnthony Blake2012-10-311-0/+66
OpenPOWER on IntegriCloud