summaryrefslogtreecommitdiffstats
path: root/libavcodec/x86/vp8dsp.asm
Commit message (Expand)AuthorAgeFilesLines
* Merge remote-tracking branch 'qatar/master'Michael Niedermayer2011-05-181-5/+5
|\
| * Modify x86util.asm to ease transitioning to 10-bit H.264 assembly.Daniel Kang2011-05-171-5/+5
* | Merge remote branch 'qatar/master'Michael Niedermayer2011-05-151-1/+1
|\ \ | |/
| * Fix FSF address copy paste error in some license headers.Diego Biurrun2011-05-141-1/+1
| * Replace FFmpeg with Libav in licence headersMans Rullgard2011-03-191-4/+4
|/
* Use "d" suffix for general-purpose registers used with movd.Reimar Döffinger2010-09-051-13/+13
* Mark xmm registers as clobbered in simple loopfilter. Should fix the lastRonald S. Bultje2010-08-241-11/+11
* Fix segfaults in VP8 SIMD code on Win64 (and FATE/win64 failures).Ronald S. Bultje2010-08-231-14/+14
* VP8: move zeroing of luma DC block into the WHTJason Garrett-Glaser2010-08-021-2/+18
* Use word-writing instead of dword-writing (with two cached but otherwiseRonald S. Bultje2010-07-311-104/+95
* Use pmaddubsw for the mbedge_filter (>=ssse3), 6-10 cycles faster.Ronald S. Bultje2010-07-261-2/+78
* VP8: Much faster SSE2 MCJason Garrett-Glaser2010-07-261-88/+78
* Enable no-loop memory/register saving for ssse3/sse4 also.Ronald S. Bultje2010-07-261-2/+2
* Save a register (or regsize of stackspace for x86-32) for the no-loopRonald S. Bultje2010-07-261-16/+24
* Use nested ifs instead of &&, which appears to not work with %ifidn (i.e. thisRonald S. Bultje2010-07-261-3/+9
* Split pextrw macro-spaghetti into several opt-specific macros, this will makeRonald S. Bultje2010-07-261-30/+49
* Fix obvious bug in assignment. Somehow, the test vectors don't test this...Ronald S. Bultje2010-07-251-1/+1
* Fix SPLATB_REG mess. Used to be a if/elseif/elseif/elseif spaghetti, so thisRonald S. Bultje2010-07-241-33/+52
* VP8: optimize DC-only chroma case in the same way as luma.Jason Garrett-Glaser2010-07-231-3/+44
* VP8 asm: cosmetics (spacing)Jason Garrett-Glaser2010-07-231-2/+2
* VP8: 30% faster idct_mbJason Garrett-Glaser2010-07-231-54/+127
* VP8: clear DCT blocks in iDCT instead of using clear_blocks.Jason Garrett-Glaser2010-07-231-4/+22
* Use pextrw for SSE4 mbedge filter result writing, speedup 5-10cycles onRonald S. Bultje2010-07-221-5/+30
* Fix and enable horizontal >=SSE2 mbedge loopfilter.Ronald S. Bultje2010-07-221-2/+2
* Eliminate one instruction in VP8 dc_add_sse4Jason Garrett-Glaser2010-07-211-2/+1
* Various VP8 x86 deblocking speedupsJason Garrett-Glaser2010-07-211-32/+67
* Make mmx VP8 WHT fasterJason Garrett-Glaser2010-07-211-17/+22
* VP8 MBedge loopfilter MMX/MMX2/SSE2 functions for both luma (width=16)Ronald S. Bultje2010-07-201-0/+641
* Chroma (width=8) inner loopfilter MMX/MMX2/SSE2 for VP8 decoder.Ronald S. Bultje2010-07-201-77/+131
* Revert r24339 (it causes fate failures on x86-64) - I'll figure out what'sRonald S. Bultje2010-07-191-108/+32
* Implement chroma (width=8) inner loopfilter MMX/MMX2/SSE2 functions.Ronald S. Bultje2010-07-191-32/+108
* Be more efficient with registers or stack memory. Saves 8/16 bytes stackRonald S. Bultje2010-07-191-16/+16
* Change function prototypes for width=8 inner and mbedge loopfilter functionsRonald S. Bultje2010-07-191-1/+1
* Attempt to fix x86-64 testsuite on fate.Ronald S. Bultje2010-07-161-1/+1
* Remove duplicate define.Ronald S. Bultje2010-07-161-1/+0
* Revert 24270, it contained some stuff that shouldn't have been in there.Ronald S. Bultje2010-07-161-1/+2
* Remove duplicate define.Ronald S. Bultje2010-07-161-2/+1
* Give x86 r%d registers names, this will simplify implementation of the chromaRonald S. Bultje2010-07-161-58/+81
* Change return statement, the REP_RET is a mistake since the else case (x86-64,Ronald S. Bultje2010-07-161-3/+1
* VP8 H/V inner loopfilter MMX/MMXEXT/SSE2 optimizations.Ronald S. Bultje2010-07-151-13/+464
* Simple H/V loopfilter for VP8 in MMX, MMX2 and SSE2 (yay for yasm macros).Ronald S. Bultje2010-07-031-0/+306
* SSSE3 versions of vp8 width4 bilinear MC functionsJason Garrett-Glaser2010-07-031-2/+23
* SSSE3 versions of width4 VP8 6-tap MC functionsJason Garrett-Glaser2010-07-021-161/+174
* Use add instead of lshift in mmxext vp8 idctJason Garrett-Glaser2010-06-291-2/+2
* Remove unused macros (duplicates from the now-LGPL x86util.asm).Ronald S. Bultje2010-06-291-26/+0
* MMX idct_add for VP8.Ronald S. Bultje2010-06-291-0/+89
* Add mmxext version of VP8 DC Hadamard transformJason Garrett-Glaser2010-06-291-0/+46
* Fix VP8 bilinear mc on x86_64Jason Garrett-Glaser2010-06-281-6/+6
* Add x86 asm functions for VP8 put_pixelsJason Garrett-Glaser2010-06-281-0/+40
* Add MMX, SSE2, SSSE3 asm for VP8 bilinear MCJason Garrett-Glaser2010-06-281-16/+192
OpenPOWER on IntegriCloud