Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | aarch64: NEON vorbis_inverse_coupling | Janne Grunau | 2014-04-22 | 3 | -0/+119 |
| | | | | | From the ARMv7 NEON version. 16 times faster as the C version, overall more than 12% faster vorbis decoding on Apple's A7. | ||||
* | aarch64: NEON fixed/floating point MPADSP apply_window | Janne Grunau | 2014-04-22 | 3 | -0/+267 |
| | | | | | 30%/25% (fixed/float) faster mp3 decoding on Apple's A7. The floating point decoder is approximately 7% faster. | ||||
* | aarch64: NEON float (i)MDCT | Janne Grunau | 2014-04-22 | 3 | -0/+334 |
| | | | | Approximately as fast as the ARM NEON version on Apple's A7. | ||||
* | aarch64: NEON float FFT | Janne Grunau | 2014-04-22 | 3 | -0/+481 |
| | | | | Approximately as fast as the ARM NEON version on Apple's A7. | ||||
* | aarch64: implement videodsp.prefetch | Janne Grunau | 2014-04-06 | 3 | -0/+63 |
| | | | | 8% faster h264 decoding on Apple A7. | ||||
* | build: Group general components separate from de/encoders in arch Makefiles | Diego Biurrun | 2014-03-20 | 1 | -0/+1 |
| | | | | This is in line with how the top-level libavcodec Makefile is structured. | ||||
* | aarch64: get_cabac inline asm | Janne Grunau | 2014-03-09 | 1 | -0/+104 |
| | | | | | | | Based on the x86 branchless get_cabac asm. get_cabac_noinline() gets approximately 20% faster (no cycle counts available) compared to clang from Xcode 5.1 beta5. More than 6% faster overall. A part of the overall speedup might be explained by additional inlining of get_cabac(). | ||||
* | aarch64: use EXTERN_ASM consistently for exported symbols | Janne Grunau | 2014-02-20 | 1 | -8/+8 |
| | | | | Based on e3fec3f095ab5ea08ee662942d98526aaf5e3635 for arm. | ||||
* | aarch64: port neon clobber test from arm | Janne Grunau | 2014-01-15 | 2 | -0/+80 |
| | |||||
* | aarch64: h264 (bi)weight NEON optimizations | Janne Grunau | 2014-01-15 | 2 | -0/+264 |
| | | | | Ported from ARMv7 NEON. | ||||
* | aarch64: h264 loop filter NEON optimizations | Janne Grunau | 2014-01-15 | 4 | -1/+299 |
| | | | | Ported from ARMv7 NEON. | ||||
* | aarch64: hpeldsp NEON optimizations | Janne Grunau | 2014-01-15 | 4 | -5/+528 |
| | | | | Ported from ARMv7 NEON. | ||||
* | aarch64: h264 qpel NEON optimizations | Janne Grunau | 2014-01-15 | 4 | -0/+1172 |
| | | | | Ported from ARMv7 NEON. | ||||
* | aarch64: h264 idct NEON assembler optimizations | Janne Grunau | 2014-01-15 | 4 | -0/+533 |
| | | | | Ported from ARMv7 NEON. | ||||
* | aarch64: h264 chroma motion compensation NEON optimizations | Janne Grunau | 2014-01-15 | 5 | -0/+561 |
Since RV40 and VC-1 use almost the same algorithm so optimizations for those two decoders are easy to do and included. |