vecmathlib (branch: master)
vecmathlib forked from https://bitbucket.org/eschnett/vecmathlib for POWER8 improvements
Raptor Engineering, LLC
| Commit message | Author | Age | Files | Lines |
|---|---|---|---|---|
| Update build instructions | Erik Schnetter | 2013-11-25 | 1 | -1/+2 |
| Change order of functions (asin before acos) | Erik Schnetter | 2013-09-10 | 1 | -6/+6 |
| Acknowledge SLEEF in asin and friends | Erik Schnetter | 2013-09-10 | 1 | -0/+6 |
| Use SLEEF's algorithms for asin, acos, atan, atan2 | Erik Schnetter | 2013-09-10 | 1 | -105/+158 |
| Use SLEEF's algorithms for sin, cos, tan | Erik Schnetter | 2013-09-10 | 1 | -63/+198 |
| Add copy of SLEEF library for reference | Erik Schnetter | 2013-09-10 | 29 | -0/+24342 |
| Use mad() instead of fma() | Erik Schnetter | 2013-09-10 | 3 | -81/+81 |
| Introduce mad(), as low-accuracy version of fma() | Erik Schnetter | 2013-09-10 | 19 | -7/+109 |
| Use __builtin_clzs if available | Erik Schnetter | 2013-09-09 | 1 | -2/+4 |
| Provide vector types implemented via (scalar) compiler builtins | Erik Schnetter | 2013-09-09 | 4 | -282/+866 |
| Prefer "long" over "long long" if both types are the same | Erik Schnetter | 2013-09-09 | 1 | -0/+8 |
| SSE/AVX: Use nicer type names | Erik Schnetter | 2013-09-02 | 6 | -626/+674 |
| Use nicer type names in vec_pseudo | Erik Schnetter | 2013-09-02 | 1 | -129/+129 |
| Use nicer type names in vec_test | Erik Schnetter | 2013-09-02 | 1 | -150/+164 |
| Add build instruction for Pandora (Power7 at LSU) | Erik Schnetter | 2013-09-02 | 1 | -1/+4 |
| interp.cc: Use best double precision vector type | Erik Schnetter | 2013-09-02 | 1 | -1/+1 |
| rsqrt: Do not require strictly positive arguments | Erik Schnetter | 2013-09-02 | 1 | -1/+1 |
| SSE: Check whether abs/max/min are available as intrinsics before using them | Erik Schnetter | 2013-09-02 | 2 | -4/+39 |
| Use strictly positive arguments or testing rsqrt | Erik Schnetter | 2013-09-02 | 1 | -1/+1 |
| Improve white space in test.cc random() | Erik Schnetter | 2013-09-02 | 1 | -3/+2 |
| Support Altivec with Clang | Erik Schnetter | 2013-09-03 | 1 | -3/+34 |
| Altivec, VSX: Make build with IBM compiler | Erik Schnetter | 2013-09-02 | 2 | -121/+90 |
| Remove invalid comment | Erik Schnetter | 2013-09-02 | 1 | -4/+0 |
| Protect __builtin_clz and __builtin_popcount with #ifdefs | Erik Schnetter | 2013-09-02 | 1 | -0/+8 |
| Allow (again) larger errors for rounding functions | Erik Schnetter | 2013-09-02 | 1 | -25/+26 |
| Provide clean set of libm elemental functions in namespace vml_std. Different C++ standard versions and different compilers provide different sets of elemental functions in std. Thus, we create a clean set in a new namespace vml_std. | Erik Schnetter | 2013-09-02 | 15 | -243/+308 |
| Correct auto-detecting VSX | Erik Schnetter | 2013-09-02 | 1 | -1/+1 |
| Implement new integer functions for NEON | Erik Schnetter | 2013-09-02 | 2 | -13/+65 |
| Correct new integer functions for Altivec/VSX | Erik Schnetter | 2013-09-02 | 2 | -13/+69 |
| SSE: Rename function argument | Erik Schnetter | 2013-09-02 | 2 | -6/+7 |
| QPX: Support additional integer | Erik Schnetter | 2013-09-02 | 1 | -8/+61 |
| Correct SSE frexp() | Erik Schnetter | 2013-09-02 | 1 | -4/+5 |
| Reduce accuracy requirement for fma | Erik Schnetter | 2013-09-02 | 1 | -1/+1 |
| Add build instructions for Compute (at Perimeter) | Erik Schnetter | 2013-09-01 | 1 | -0/+3 |
| Correct error in debug instantiations | Erik Schnetter | 2013-09-01 | 1 | -1/+1 |
| Implement more integer functions: abs bitifthen clz isignbit max min popcount rotate. Rename integer signbit to isignbit to avoid name conflicts. | Erik Schnetter | 2013-09-01 | 22 | -110/+896 |
| NEON: Describe idea for prod() improvement | Erik Schnetter | 2013-08-26 | 1 | -0/+1 |
| Describe how to build on Stampede (MIC) and Raspberry Pi (ARM) | Erik Schnetter | 2013-08-26 | 1 | -0/+9 |
| Provide a string describing the configuration options | Erik Schnetter | 2013-08-26 | 2 | -38/+74 |
| Make frexp take a pointer argument instead of a reference | Erik Schnetter | 2013-08-21 | 20 | -31/+34 |
| Add float example to build script | Erik Schnetter | 2013-08-20 | 1 | -0/+1 |
| Use better timers for benchmarking loops | Erik Schnetter | 2013-08-20 | 1 | -4/+9 |
| Benchmark more math functions | Erik Schnetter | 2013-08-20 | 1 | -55/+151 |
| Correct detecting ARM NEON intrinsics | Erik Schnetter | 2013-08-20 | 1 | -2/+2 |
| Add single-precision example | Erik Schnetter | 2013-08-20 | 2 | -0/+41 |
| Provide trivial fallback "best vector" if no "best vector" is available | Erik Schnetter | 2013-08-20 | 1 | -14/+22 |
| Neon: Implement unaligned load/store operations | Erik Schnetter | 2013-08-20 | 2 | -10/+16 |
| Explicitly align allocated memory | Erik Schnetter | 2013-08-19 | 1 | -1/+6 |
| Add potential optimization for SSE/AVX all/any functions | Erik Schnetter | 2013-08-19 | 5 | -5/+5 |
| SSE: Correct float4 all/any | Erik Schnetter | 2013-08-19 | 1 | -2/+2 |
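
The log above introduces mad() as a "low-accuracy version of fma()" and then switches callers over to it. The commit titles do not say how mad() is defined, so the following is only a minimal sketch of one common reading: fma() rounds once, after the exact product and sum, while an unfused multiply-add rounds the product first. The names `fma_sketch`, `mad_sketch`, `demo_x`, `demo_z`, and `demo_check` are hypothetical and are not taken from vecmathlib.

```cpp
#include <cassert>
#include <cmath>

// Hypothetical stand-ins for illustration; NOT vecmathlib's definitions.

// Fused multiply-add: x*y + z is computed exactly and rounded once.
inline double fma_sketch(double x, double y, double z) {
  return std::fma(x, y, z);
}

// Unfused multiply-add (assumed meaning of "low-accuracy"): the product is
// rounded to double before the addition. The volatile store keeps the
// compiler from contracting the expression into a hardware FMA.
inline double mad_sketch(double x, double y, double z) {
  volatile double product = x * y;
  return product + z;
}

// Inputs chosen so that the two variants differ:
// x*x = 1 + 2^-26 + 2^-54 exactly, which rounds to 1 + 2^-26 in double.
inline double demo_x() { return 1.0 + std::ldexp(1.0, -27); }
inline double demo_z() { return -(1.0 + std::ldexp(1.0, -26)); }

// Usage: the fused version keeps the 2^-54 term, the unfused one loses it.
inline void demo_check() {
  const double x = demo_x();
  const double z = demo_z();
  assert(fma_sketch(x, x, z) == std::ldexp(1.0, -54));
  assert(mad_sketch(x, x, z) == 0.0);
}
```

Under this reading, the trade-off is speed against the last bit of accuracy: on hardware without a fused multiply-add instruction, a correctly rounded fma() typically has to be emulated in software, whereas a plain multiply followed by an add is cheap everywhere.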