summaryrefslogtreecommitdiffstats
Commit message (Collapse)AuthorAgeFilesLines
...
* Unroll loops to process 64 byte cache line per iterationJukka Ojanen2015-07-091-39/+205
|
* Add new attributes to control/improve branch predictionsJukka Ojanen2015-07-091-0/+12
|
* Half the number of calls to sin/cos functions in ffts_init_1d_realJukka Ojanen2015-07-081-12/+68
|
* Add SSE3 optimized version of ffts_execute_1d_real_invJukka Ojanen2015-07-071-20/+78
|
* Add SSE3 optimized version of ffts_execute_1d_realJukka Ojanen2015-07-071-13/+80
|
* Add detection for SSE3 intrinsicsJukka Ojanen2015-07-071-2/+20
|
* To silence warning 'possible loss of data', use explicit casting to floatJukka Ojanen2015-07-061-8/+8
|
* SSE optimized versions of ffts_execute_1d_real and ffts_execute_1d_real_invJukka Ojanen2015-07-061-4/+100
|
* Add new attributes to help auto-vectorizationJukka Ojanen2015-07-062-19/+54
|
* Avoid allocating array of single pointerJukka Ojanen2015-07-061-10/+8
|
* Fix ffts_aligned_free MinGW crashJukka Ojanen2015-07-061-1/+1
|
* Incorrect stride with GCC flags "-march=native -ffast-math"Jukka Ojanen2015-07-022-3/+5
| | | | Note that N/leaf_N is always a multiply of 2
* Fix assertion failed in ffts_compare_offsetsJukka Ojanen2015-07-021-4/+5
|
* Generate cosine and sine table without using C math library. About 100 times ↵Jukka Ojanen2015-03-312-15/+54
| | | | faster on ARM and 15 times faster on x86.
* ffts_nd.c is using SSE2 intrinsics, detect and include emmintrin.h instead ↵Jukka Ojanen2015-03-192-4/+12
| | | | xmmintrin.h, and fix GCC error: inlining failed in call to always_inline '_mm_load_pd': target specific option mismatch by adding "-msse2" instead of "-msse"
* To support building for Windows with MinGW, don't assume MSVC to be the compilerJukka Ojanen2015-03-191-1/+1
|
* Minimize sin/cos calculations by calculating all factors ones and generate ↵Jukka Ojanen2015-03-182-17/+31
| | | | lookup tables by mapping
* Remove unused sse.sJukka Ojanen2015-03-182-895/+0
|
* Always run-time generate x64 dynamic codeJukka Ojanen2015-03-181-841/+1335
|
* Remove dependency on YASM as Windows dynamic code is run-time generatedJukka Ojanen2015-03-172-854/+0
|
* Determinate lookup table size using closed-form expressionJukka Ojanen2015-03-161-22/+4
|
* Remove dead codeJukka Ojanen2015-03-161-98/+47
|
* Don't generate lookup tables when size is less than 32Jukka Ojanen2015-03-161-5/+5
|
* Merge ffts_small with ffts_static, and define small transforms "fully" constantJukka Ojanen2015-03-166-517/+585
|
* Add string.h to fix implicit declaration of function 'memcpy'Jukka Ojanen2015-03-131-0/+4
|
* One more macro fixJukka Ojanen2015-03-131-2/+2
|
* Forgot to rename some V macrosJukka Ojanen2015-03-134-26/+28
|
* Rename vector V as V4SF; vector of 4 single precision floats. Rename all ↵Jukka Ojanen2015-03-128-673/+796
| | | | | | vector V macros accordingly. Redefine ffts_constants as ffts_constants_32f and ffts_constants_64f.
* Replace data_t with floatJukka Ojanen2015-03-121-31/+53
|
* Remove unused neon_float.h headerJukka Ojanen2015-03-121-1127/+0
|
* Remove unused variable 'i' from 'ffts_generate_func_code'Jukka Ojanen2015-03-121-4/+0
|
* Initial steps to support double precision. Replace data_t with float, and ↵Jukka Ojanen2015-03-125-145/+413
| | | | cdata_t with ffts_cpx_32f.
* Try to execute detection quietly without messagesJukka Ojanen2015-03-111-1/+5
|
* Fix conflicting typesJukka Ojanen2015-03-111-0/+2
|
* Automatically detect ARM FPU and float ABIJukka Ojanen2015-03-111-18/+73
|
* Fix conflicting types for 'temp0'Jukka Ojanen2015-03-111-0/+2
|
* Add explicit '.fpu' directiveJukka Ojanen2015-03-113-0/+3
|
* Use FFTS_INLINE instead of __INLINE and make functions staticJukka Ojanen2015-03-111-17/+14
|
* Added "-mfloat-abi=softfp" as default for ARMJukka Ojanen2015-03-111-1/+5
|
* Now ARM VFP building works with "CMake -DENABLE_VFP=ON". Performance when ↵Jukka Ojanen2015-03-111-0/+2
| | | | "-DDYNAMIC_CODE_DISABLED=ON" is poor, but it can be improved with gcc switches.
* Don't use CMake option nameJukka Ojanen2015-03-101-7/+7
|
* ARM compile shall use generic C if dynamic code is disabledJukka Ojanen2015-03-101-7/+7
|
* Don't add 'vfp.s' if dynamic code is disabledJukka Ojanen2015-03-101-3/+0
|
* Remove duplicate codeJukka Ojanen2015-03-101-7/+1
|
* Remove redefinitions and fix namingJukka Ojanen2015-03-101-7/+4
|
* Dereference pointerJukka Ojanen2015-03-101-1/+1
|
* For the moment assume HAVE_VFP as final fallbackJukka Ojanen2015-03-101-0/+2
|
* Removal of 'transforms' broke dynamic codeJukka Ojanen2015-03-101-0/+2
|
* Remove redefinition of 'fw' in function 'ffts_generate_luts'Jukka Ojanen2015-03-101-2/+0
|
* Add explicit '.fpu' directiveJukka Ojanen2015-03-101-50/+49
|
OpenPOWER on IntegriCloud