summaryrefslogtreecommitdiffstats
Commit message (Collapse)AuthorAgeFilesLines
...
| * Detect presence of malloc.h, fixes anthonix/ffts#40Jukka Ojanen2015-07-302-0/+8
| |
| * Define [pa] and [pb] as constant input variables, not writable outputsJukka Ojanen2015-07-162-8/+5
| |
| * Remove unreferenced headerJukka Ojanen2015-07-151-2/+0
| |
| * Improve compiler optimization by turning "patterns.c" to "patterns.h"Jukka Ojanen2015-07-153-233/+506
| |
| * Remove some dead codeJukka Ojanen2015-07-151-19/+0
| |
| * FFTS is no longer depended on any other math library, and this should help ↵Jukka Ojanen2015-07-145-82/+135
| | | | | | | | to verify its numerical accuracy.
| * Move trigonometric stuff to separate file.Jukka Ojanen2015-07-145-54/+274
| | | | | | | | Implemented Oscar Buneman's method for generating a sequence of sines and cosines.
| * SSE3 detection failed with MSVC 2005 x64Jukka Ojanen2015-07-101-3/+12
| |
| * Unroll loops to process 64 byte cache line per iterationJukka Ojanen2015-07-091-39/+205
| |
| * Add new attributes to control/improve branch predictionsJukka Ojanen2015-07-091-0/+12
| |
| * Half the number of calls to sin/cos functions in ffts_init_1d_realJukka Ojanen2015-07-081-12/+68
| |
| * Add SSE3 optimized version of ffts_execute_1d_real_invJukka Ojanen2015-07-071-20/+78
| |
| * Add SSE3 optimized version of ffts_execute_1d_realJukka Ojanen2015-07-071-13/+80
| |
| * Add detection for SSE3 intrinsicsJukka Ojanen2015-07-071-2/+20
| |
| * To silence warning 'possible loss of data', use explicit casting to floatJukka Ojanen2015-07-061-8/+8
| |
| * SSE optimized versions of ffts_execute_1d_real and ffts_execute_1d_real_invJukka Ojanen2015-07-061-4/+100
| |
| * Add new attributes to help auto-vectorizationJukka Ojanen2015-07-062-19/+54
| |
| * Avoid allocating array of single pointerJukka Ojanen2015-07-061-10/+8
| |
| * Fix ffts_aligned_free MinGW crashJukka Ojanen2015-07-061-1/+1
| |
| * Incorrect stride with GCC flags "-march=native -ffast-math"Jukka Ojanen2015-07-022-3/+5
| | | | | | | | Note that N/leaf_N is always a multiply of 2
| * Fix assertion failed in ffts_compare_offsetsJukka Ojanen2015-07-021-4/+5
| |
| * Generate cosine and sine table without using C math library. About 100 times ↵Jukka Ojanen2015-03-312-15/+54
| | | | | | | | faster on ARM and 15 times faster on x86.
| * ffts_nd.c is using SSE2 intrinsics, detect and include emmintrin.h instead ↵Jukka Ojanen2015-03-192-4/+12
| | | | | | | | xmmintrin.h, and fix GCC error: inlining failed in call to always_inline '_mm_load_pd': target specific option mismatch by adding "-msse2" instead of "-msse"
| * To support building for Windows with MinGW, don't assume MSVC to be the compilerJukka Ojanen2015-03-191-1/+1
| |
| * Minimize sin/cos calculations by calculating all factors ones and generate ↵Jukka Ojanen2015-03-182-17/+31
| | | | | | | | lookup tables by mapping
| * Remove unused sse.sJukka Ojanen2015-03-182-895/+0
| |
| * Always run-time generate x64 dynamic codeJukka Ojanen2015-03-181-841/+1335
| |
| * Remove dependency on YASM as Windows dynamic code is run-time generatedJukka Ojanen2015-03-172-854/+0
| |
| * Determinate lookup table size using closed-form expressionJukka Ojanen2015-03-161-22/+4
| |
| * Remove dead codeJukka Ojanen2015-03-161-98/+47
| |
| * Don't generate lookup tables when size is less than 32Jukka Ojanen2015-03-161-5/+5
| |
| * Merge ffts_small with ffts_static, and define small transforms "fully" constantJukka Ojanen2015-03-166-517/+585
| |
| * Add string.h to fix implicit declaration of function 'memcpy'Jukka Ojanen2015-03-131-0/+4
| |
| * One more macro fixJukka Ojanen2015-03-131-2/+2
| |
| * Forgot to rename some V macrosJukka Ojanen2015-03-134-26/+28
| |
| * Rename vector V as V4SF; vector of 4 single precision floats. Rename all ↵Jukka Ojanen2015-03-128-673/+796
| | | | | | | | | | | | vector V macros accordingly. Redefine ffts_constants as ffts_constants_32f and ffts_constants_64f.
| * Replace data_t with floatJukka Ojanen2015-03-121-31/+53
| |
| * Remove unused neon_float.h headerJukka Ojanen2015-03-121-1127/+0
| |
| * Remove unused variable 'i' from 'ffts_generate_func_code'Jukka Ojanen2015-03-121-4/+0
| |
| * Initial steps to support double precision. Replace data_t with float, and ↵Jukka Ojanen2015-03-125-145/+413
| | | | | | | | cdata_t with ffts_cpx_32f.
| * Try to execute detection quietly without messagesJukka Ojanen2015-03-111-1/+5
| |
| * Fix conflicting typesJukka Ojanen2015-03-111-0/+2
| |
| * Automatically detect ARM FPU and float ABIJukka Ojanen2015-03-111-18/+73
| |
| * Fix conflicting types for 'temp0'Jukka Ojanen2015-03-111-0/+2
| |
| * Add explicit '.fpu' directiveJukka Ojanen2015-03-113-0/+3
| |
| * Use FFTS_INLINE instead of __INLINE and make functions staticJukka Ojanen2015-03-111-17/+14
| |
| * Added "-mfloat-abi=softfp" as default for ARMJukka Ojanen2015-03-111-1/+5
| |
| * Now ARM VFP building works with "CMake -DENABLE_VFP=ON". Performance when ↵Jukka Ojanen2015-03-111-0/+2
| | | | | | | | "-DDYNAMIC_CODE_DISABLED=ON" is poor, but it can be improved with gcc switches.
| * Don't use CMake option nameJukka Ojanen2015-03-101-7/+7
| |
| * ARM compile shall use generic C if dynamic code is disabledJukka Ojanen2015-03-101-7/+7
| |
OpenPOWER on IntegriCloud