summaryrefslogtreecommitdiffstats
Commit message (Collapse)AuthorAgeFilesLines
...
* Optimize atan and acosErik Schnetter2013-06-201-10/+58
|
* Do not benchmark multi-element test vectors (as they are very slow)Erik Schnetter2013-06-201-5/+5
|
* Choose required accuracy depending on FP_CONTRACTErik Schnetter2013-06-201-1/+6
|
* Improve efficiency of roundErik Schnetter2013-06-201-1/+2
|
* Describe in comment how NEON implements rcpErik Schnetter2013-06-201-0/+2
|
* Reduce accuracy of sin when no FP_CONTRACT is requiredErik Schnetter2013-06-201-0/+19
|
* Implement efficient rsqrt. Implement sqrt in terms of rsqrt.Erik Schnetter2013-06-201-19/+40
|
* Explain what VML_HAVE_FP_CONTRACT meansErik Schnetter2013-06-201-1/+7
|
* Correct case in commentErik Schnetter2013-06-191-1/+1
|
* Test floating point roundingErik Schnetter2013-06-181-2/+11
|
* Test barrier functionErik Schnetter2013-06-181-0/+6
|
* Implement barrier for vec_pseudoErik Schnetter2013-06-181-0/+27
|
* Offer VML_BROKEN_STL to decide whether the STL has C++11 featuresErik Schnetter2013-06-172-4/+4
|
* Support Intel compilerErik Schnetter2013-06-171-1/+7
|
* Explain why vec_builtin doesn't workErik Schnetter2013-06-171-0/+1
|
* Introduce VML_HAVE_FP_CONTRACT (still unused)Erik Schnetter2013-06-171-0/+1
|
* Add missing std:: namespace qualifierErik Schnetter2013-06-172-4/+4
|
* Correct NEON implementationErik Schnetter2013-06-172-12/+26
|
* Do not use constructor forwardingErik Schnetter2013-06-171-1/+8
| | | | The Intel compiler does not support it.
* Correct syntax error in include guardErik Schnetter2013-06-171-2/+2
|
* Test iota() functionErik Schnetter2013-06-171-0/+2
|
* Support NEON float32x4_tErik Schnetter2013-06-172-1/+559
|
* Rename implementation files to indicate vector sizeErik Schnetter2013-06-1713-49/+52
|
* Automatically use a "good" vector type for loop code quality testsErik Schnetter2013-06-131-53/+65
|
* Implement fall-back timer if no architecture-specific high-resolution timer ↵Erik Schnetter2013-06-131-7/+16
| | | | is available
* Beautify screen outputErik Schnetter2013-06-131-3/+3
|
* Allow wrap-around when testing fmod and remainderErik Schnetter2013-06-131-5/+5
|
* Benchmark NEON as wellErik Schnetter2013-06-121-0/+6
|
* Correct fmaErik Schnetter2013-06-121-1/+5
|
* Correct NEON barrierErik Schnetter2013-06-121-1/+1
|
* Correct generic barrier on ARMErik Schnetter2013-06-121-1/+1
|
* Test NEON as well (first float2 architecture)Erik Schnetter2013-06-121-0/+9
|
* Correct operator&Erik Schnetter2013-06-121-1/+1
|
* Correct last (?) syntax errorErik Schnetter2013-06-121-1/+1
|
* Correct more errorsErik Schnetter2013-06-121-5/+12
|
* Correct various errorsErik Schnetter2013-06-121-26/+36
|
* Correct NEON detectionErik Schnetter2013-06-121-1/+1
|
* Implement barrier for ARMErik Schnetter2013-06-121-0/+2
|
* Auto-detect support for unaligned memory accessErik Schnetter2013-06-121-0/+8
|
* Begin to implement ARM NEONErik Schnetter2013-06-122-0/+534
|
* Suggest some additional vector architecturesErik Schnetter2013-06-121-1/+4
|
* Use #if instead of comments to deactive non-C++11 fallback codeErik Schnetter2013-06-121-67/+75
|
* Some QPX corrections/optimizationsErik Schnetter2013-06-121-3/+7
|
* Declare some functions without auto keywordErik Schnetter2013-06-123-5/+5
|
* Add more tests for loop code qualityErik Schnetter2013-06-121-47/+66
|
* Optimize AVX all/any functionsErik Schnetter2013-06-124-22/+20
|
* Use integer signbit instead of integer comparisons in mask classErik Schnetter2013-06-121-11/+25
|
* Implement signbit function for integer vectorsErik Schnetter2013-06-1213-0/+76
|
* Test code quality for loopsErik Schnetter2013-06-111-0/+126
|
* Implement nextafterErik Schnetter2013-06-0915-1/+59
|
OpenPOWER on IntegriCloud