Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | avfilter/x86/vf_blend : add SIMD for 16 bit version of | Martin Vignali | 2018-04-05 | 1 | -60/+108 |
| | | | | | | | | grainextract grainmerge average extremity negation | ||||
* | avfilter/x86/vf_blend : reorganize DIFFERENCE macro to reduce line ↵ | Martin Vignali | 2018-04-05 | 1 | -22/+16 |
| | | | | duplication between 8bit and 16 bit version | ||||
* | avfilter/x86/vf_blend : add 16 bit version for BLEND_SIMPLE, phoenix, ↵ | Martin Vignali | 2018-02-24 | 1 | -13/+62 |
| | | | | difference for SSE and AVX2 (x86_64) | ||||
* | avfilter/x86/vf_blend : avfilter/x86/vf_blend : add AVX2 version for each ↵ | Martin Vignali | 2018-01-28 | 1 | -84/+145 |
| | | | | | | func except divide and optimize average, grainextract, multiply, screen, grain merge | ||||
* | avfilter/vf_blend: rename addition128 and difference128 to grainmerge and ↵ | Paul B Mahol | 2017-08-24 | 1 | -2/+2 |
| | | | | grainextract | ||||
* | x86/vf_blend: use ABS2 macro | James Almer | 2017-06-27 | 1 | -6/+3 |
| | |||||
* | x86/vf_blend: optimize difference and negation functions | James Almer | 2017-06-27 | 1 | -16/+24 |
| | | | | | | | Process more pixels per loop. Reviewed-by: Paul B Mahol <onemda@gmail.com> Signed-off-by: James Almer <jamrial@gmail.com> | ||||
* | x86/vf_blend: add sse and ssse3 extremity functions | James Almer | 2017-06-27 | 1 | -0/+25 |
| | | | | | Reviewed-by: Paul B Mahol <onemda@gmail.com> Signed-off-by: James Almer <jamrial@gmail.com> | ||||
* | x86/vf_blend: Add SSE2 optimization for divide | Timothy Gu | 2016-02-28 | 1 | -0/+30 |
| | | | | | | 4.5x faster than C float version with autovectorization 10 x faster than C int version 25 x faster than C float version without autovectorization | ||||
* | vf_blend: Reduce number of arguments for kernel function | Timothy Gu | 2016-02-14 | 1 | -2/+1 |
| | |||||
* | x86/vf_blend: Add SSE2 optimization for screen | Timothy Gu | 2016-02-10 | 1 | -0/+29 |
| | | | | | | 10x faster than C. Reviewed-by: Paul B Mahol <onemda@gmail.com> | ||||
* | x86/vf_blend: Move multiplying to a macro | Timothy Gu | 2016-02-10 | 1 | -6/+10 |
| | | | | Reviewed-by: Paul B Mahol <onemda@gmail.com> | ||||
* | vf_blend: Add SSE2 optimization for multiply | Timothy Gu | 2016-02-08 | 1 | -0/+29 |
| | | | | 5 times faster than C, 3 times overall. | ||||
* | x86/vf_blend: add sse2 versions of blend_difference and blend_negation | James Almer | 2015-12-24 | 1 | -3/+9 |
| | | | | | Reviewed-by: Paul B Mahol <onemda@gmail.com> Signed-off-by: James Almer <jamrial@gmail.com> | ||||
* | x86/vf_blend: make all functions work on x86_32 | James Almer | 2015-12-24 | 1 | -53/+50 |
| | | | | | Reviewed-by: Paul B Mahol <onemda@gmail.com> Signed-off-by: James Almer <jamrial@gmail.com> | ||||
* | x86/vf_blend: simplify using macros | James Almer | 2015-12-24 | 1 | -243/+33 |
| | | | | | Reviewed-by: Paul B Mahol <onemda@gmail.com> Signed-off-by: James Almer <jamrial@gmail.com> | ||||
* | avfilter/x86/vf_blend.asm: hardmix: do same with two pxor instructions less | Paul B Mahol | 2015-10-07 | 1 | -3/+4 |
| | | | | Signed-off-by: Paul B Mahol <onemda@gmail.com> | ||||
* | avfilter/x86/vf_blend.asm: 11th register is used, update functions | Paul B Mahol | 2015-10-07 | 1 | -14/+14 |
| | | | | Signed-off-by: Paul B Mahol <onemda@gmail.com> | ||||
* | avfilter/x86/vf_blend.asm: add hardmix and phoenix sse2 SIMD | Paul B Mahol | 2015-10-07 | 1 | -0/+64 |
| | | | | Signed-off-by: Paul B Mahol <onemda@gmail.com> | ||||
* | avfilter/vf_blend: add x86 SIMD for some modes | Paul B Mahol | 2015-10-03 | 1 | -0/+367 |
Signed-off-by: Paul B Mahol <onemda@gmail.com> |