path: root/libavcodec/arm
Commit message (Author, Age, Files, Lines)
* arm: Add VFP-accelerated version of fft16 (Martin Storsjö, 2013-07-22, 3 files, -3/+301)
                   Before           After
                   Mean     StdDev  Mean     StdDev  Change
    This function  1389.3   4.2     967.8    35.1    +43.6%
    Overall        15577.5  83.2    15400.0  336.4   +1.2%
    Signed-off-by: Martin Storsjö <martin@martin.st>
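The "Change" column in these benchmark tables appears to be the relative speedup computed from the two mean timings. The sketch below reproduces that arithmetic under the assumption that Change = (Before / After - 1) * 100; this formula is inferred from the listed numbers and is not stated in the commits, and the function name is hypothetical. Some rows differ by 0.1 in the last digit, presumably because the original figures were computed from unrounded means.

```c
#include <assert.h>

/* Relative speedup in percent from two mean timings.
 * Assumed formula (inferred from the table values, not from the
 * benchmark tooling): (before / after - 1) * 100. */
double speedup_percent(double before_mean, double after_mean)
{
    assert(after_mean > 0.0); /* guard against division by zero */
    return (before_mean / after_mean - 1.0) * 100.0;
}
```

For the fft16 row, `speedup_percent(1389.3, 967.8)` gives a value close to the +43.6% listed above.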
* arm: Add VFP-accelerated version of dca_lfe_fir (Martin Storsjö, 2013-07-22, 3 files, -1/+226)
                   Before           After
                   Mean     StdDev  Mean     StdDev  Change
    This function  868.2    33.5    436.0    27.0    +99.1%
    Overall        15973.0  223.2   15577.5  83.2    +2.5%
    Signed-off-by: Martin Storsjö <martin@martin.st>
* arm: Add VFP-accelerated version of imdct_half (Martin Storsjö, 2013-07-22, 4 files, -1/+217)
                   Before           After
                   Mean     StdDev  Mean     StdDev  Change
    This function  2653.0   28.5    1108.8   51.4    +139.3%
    Overall        17049.5  408.2   15973.0  223.2   +6.7%
    Signed-off-by: Martin Storsjö <martin@martin.st>
* arm: Add VFP-accelerated version of int32_to_float_fmul_array8 (Ben Avison, 2013-07-22, 2 files, -1/+167)
                   Before           After
                   Mean     StdDev  Mean     StdDev  Change
    This function  366.2    18.3    277.8    13.7    +31.9%
    Overall        18420.5  489.1   17049.5  408.2   +8.0%
    Signed-off-by: Martin Storsjö <martin@martin.st>
* arm: Add VFP-accelerated version of int32_to_float_fmul_scalar (Ben Avison, 2013-07-22, 2 files, -0/+48)
                   Before           After
                   Mean     StdDev  Mean     StdDev  Change
    This function  1175.0   4.4     366.2    18.3    +220.8%
    Overall        19285.5  292.0   18420.5  489.1   +4.7%
    Signed-off-by: Martin Storsjö <martin@martin.st>
* arm: Add VFP-accelerated version of synth_filter_float (Ben Avison, 2013-07-22, 3 files, -0/+252)
                   Before           After
                   Mean     StdDev  Mean     StdDev  Change
    This function  9295.0   114.9   4853.2   83.5    +91.5%
    Overall        23699.8  397.6   19285.5  292.0   +22.9%
    Signed-off-by: Martin Storsjö <martin@martin.st>
* fmtconvert: Explicitly use int32_t instead of int (Christophe Gisquet, 2013-07-17, 2 files, -3/+3)
    Signed-off-by: Martin Storsjö <martin@martin.st>
* arm: Include hpeldsp_neon.o if h264qpel is enabled (Martin Storsjö, 2013-05-30, 1 file, -1/+2)
    A few of the h264qpel neon functions are shared with other hpeldsp functions in this file.
    This fixes standalone compilation of the h264 decoder on arm.
    Signed-off-by: Martin Storsjö <martin@martin.st>
* arm: Don't unconditionally build dsputil files (Martin Storsjö, 2013-05-30, 1 file, -14/+13)
    Signed-off-by: Martin Storsjö <martin@martin.st>
* arm: Only build the FFT init files if FFT is enabled (Martin Storsjö, 2013-05-30, 1 file, -2/+2)
    This fixes build errors in cases where FFT is disabled.
    Signed-off-by: Martin Storsjö <martin@martin.st>
* build: cosmetics: Place unconditional before conditional OBJS lines (Diego Biurrun, 2013-05-30, 1 file, -19/+19)
    Signed-off-by: Martin Storsjö <martin@martin.st>
* build: arm: cosmetics: Place all OBJS declarations in alphabetical order (Diego Biurrun, 2013-05-30, 1 file, -61/+36)
    Signed-off-by: Martin Storsjö <martin@martin.st>
* arm: Drop unnecessary ff_ name prefixes from static functions (Diego Biurrun, 2013-04-30, 3 files, -9/+9)
* arm: hpeldsp: Move half-pel assembly from dsputil to hpeldsp (Ronald S. Bultje, 2013-04-19, 14 files, -1349/+1541)
    Signed-off-by: Martin Storsjö <martin@martin.st>
* vp3: Use full transpose for all IDCTs (Ronald S. Bultje, 2013-04-15, 1 file, -1/+0)
    This way, the special IDCT permutations are no longer needed. This is similar to how H264 does it, and removes the dsputil dependency imposed by the scantable code.
    Also remove the unused type == 0 cases from the plain C version of the idct.
    Signed-off-by: Martin Storsjö <martin@martin.st>
* h264: Integrate clear_blocks calls with IDCT (Ronald S. Bultje, 2013-04-10, 1 file, -6/+23)
    The non-intra-pcm branch in hl_decode_mb (simple, 8bpp) goes from 700 to 672 cycles, and the complete loop of decode_mb_cabac and hl_decode_mb (in the decode_slice loop) goes from 1759 to 1733 cycles on the clip tested (cathedral), i.e. almost 30 cycles per mb faster.
    Signed-off-by: Martin Storsjö <martin@martin.st>
* dsputil: convert remaining functions to use ptrdiff_t strides (Luca Barbato, 2013-03-12, 2 files, -65/+65)
    Signed-off-by: Luca Barbato <lu_zero@gentoo.org>
* Remove unnecessary dsputil.h #includes (Diego Biurrun, 2013-02-26, 7 files, -7/+0)
* Fix a number of incorrect intmath.h #includes. (Diego Biurrun, 2013-02-26, 1 file, -1/+2)
* arm: vp8: Add missing #includes for header to compile standalone (Diego Biurrun, 2013-02-20, 1 file, -0/+4)
* rv34: Drop now unnecessary dsputil dependencies (Diego Biurrun, 2013-02-06, 2 files, -2/+2)
* dsputil: Separate h264chroma (Diego Biurrun, 2013-02-06, 3 files, -20/+53)
* Add av_cold attributes to arch-specific init functions (Diego Biurrun, 2013-02-05, 15 files, -18/+36)
* Use ptrdiff_t instead of int for {avg, put}_pixels line_size parameter. (Diego Biurrun, 2013-02-05, 3 files, -46/+46)
    This avoids SIMD-optimized functions having to sign-extend their line size argument manually to be able to do pointer arithmetic.
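To illustrate the point about line sizes, here is a minimal C sketch of a put_pixels-style routine that takes its stride as `ptrdiff_t`. The function name and the fixed 8-byte row width are hypothetical and chosen for illustration; this is not the libavcodec implementation. Because `ptrdiff_t` is already pointer-width, the stride can be added to the pointers directly, including when it is negative, whereas a 32-bit `int` argument on a 64-bit target would first have to be sign-extended by hand-written assembly.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical stand-in for a put_pixels-style routine: copies h rows of
 * 8 bytes, advancing both pointers by line_size after each row. */
void put_pixels8_sketch(uint8_t *dst, const uint8_t *src,
                        ptrdiff_t line_size, int h)
{
    for (int i = 0; i < h; i++) {
        for (int x = 0; x < 8; x++)
            dst[x] = src[x];
        /* line_size is pointer-width, so no widening is needed here;
         * a negative value simply walks the rows in reverse order. */
        dst += line_size;
        src += line_size;
    }
}
```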
* Use proper "" quotes for local header #includes (Diego Biurrun, 2013-02-01, 1 file, -1/+1)
* arm: vp8: Fix the plain-armv6 version of vp8_luma_dc_wht (Martin Storsjö, 2013-01-27, 1 file, -8/+8)
    This makes the plain-armv6 version use the same registers as the armv6t2 version above. This fixes fate-vp8 on plain-armv6 devices.
    Signed-off-by: Martin Storsjö <martin@martin.st>
* arm: Add mathops.h to ARCH_HEADERS list (Diego Biurrun, 2013-01-24, 1 file, -0/+2)
    It is an arch-specific header not suitable for standalone compilation.
* arm: h264qpel: use neon h264 qpel functions only if supported (Janne Grunau, 2013-01-24, 1 file, -3/+3)
* dsputil: Separate h264 qpel (Mans Rullgard, 2013-01-24, 5 files, -1069/+1128)
    The sh4 optimizations are removed, because the code is 100% identical to the C code, so it is unlikely to provide any real practical benefit.
    Signed-off-by: Diego Biurrun <diego@biurrun.de>
    Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>
    Signed-off-by: Luca Barbato <lu_zero@gentoo.org>
* dsputil: remove one array dimension from avg_no_rnd_pixels_tab. (Ronald S. Bultje, 2013-01-22, 1 file, -4/+4)
* dsputil: remove avg_no_rnd_pixels8. (Ronald S. Bultje, 2013-01-22, 2 files, -10/+3)
    This is never used.
* Drop DCTELEM typedef (Diego Biurrun, 2013-01-22, 17 files, -74/+74)
    It does not help as an abstraction and adds dsputil dependencies.
    Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>
* arm: Add some missing header #includes (Diego Biurrun, 2013-01-22, 5 files, -0/+7)
* floatdsp: move scalarproduct_float from dsputil to avfloatdsp. (Ronald S. Bultje, 2013-01-22, 2 files, -16/+0)
    This makes the aac decoder and all voice codecs independent of dsputil.
* floatdsp: move butterflies_float from dsputil to avfloatdsp. (Ronald S. Bultje, 2013-01-22, 2 files, -14/+0)
    This makes wmadec/enc, twinvq and mpegaudiodec (i.e. mp2/mp3) independent of dsputil.
* floatdsp: move vector_fmul_reverse from dsputil to avfloatdsp. (Ronald S. Bultje, 2013-01-22, 6 files, -167/+0)
    Now, nellymoserenc and aacenc no longer depends on dsputil. Independent of this patch, wmaprodec also does not depend on dsputil, so I removed it from there also.
* floatdsp: move vector_fmul_add from dsputil to avfloatdsp. (Ronald S. Bultje, 2013-01-22, 2 files, -30/+0)
* vorbisdsp: change block_size type from int to intptr_t. (Ronald S. Bultje, 2013-01-20, 1 file, -1/+2)
    This saves one instruction in the x86-64 assembly.
* videodsp_armv5te: remove #if HAVE_ARMV5TE_EXTERNAL (Janne Grunau, 2013-01-20, 1 file, -2/+0)
    libavutil/arm/asm.S sets '.arch' depending on HAVE_ARMV5TE so that assembling armv5te code will always succeed even if the default -march flag does not support it. HAVE_ARMV5TE_EXTERNAL tests assembling code with the default arch.
    Fixes the missing symbol ff_prefetch_arm with --cpu= not including armv5te.
    CC: libav-stable@libav.org
* Move vorbis_inverse_coupling from dsputil to vorbisdspcontext. (Ronald S. Bultje, 2013-01-19, 5 files, -69/+122)
    Conveniently (together with Justin's earlier patches), this makes our vorbis decoder entirely independent of dsputil.
* vp3: integrate clear_blocks with idct of previous block. (Ronald S. Bultje, 2013-01-19, 1 file, -7/+15)
    This is identical to what e.g. vp8 does, and prevents the function call overhead (plus dependency on dsputil for this particular function).
    Arm asm updated by Janne Grunau <janne-libav@jannau.net>.
    Signed-off-by: Janne Grunau <janne-libav@jannau.net>
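The idea of folding the block clearing into the transform can be sketched in C as follows. This is only an illustration under stated assumptions: the function name is hypothetical, and the real inverse transform is replaced by a trivial DC-only placeholder so that the example stays short. What it shows is the structure described above: the routine that consumes the coefficients also zeroes them, so the caller needs no separate clear_blocks call.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

static uint8_t clip_u8_sketch(int v)
{
    return v < 0 ? 0 : v > 255 ? 255 : (uint8_t)v;
}

/* Hypothetical 8x8 "transform and add" that clears its input afterwards.
 * The DC-only addition is a placeholder for a real inverse transform. */
void idct_add_and_clear_sketch(uint8_t *dst, ptrdiff_t stride,
                               int16_t block[64])
{
    int dc = block[0]; /* placeholder: real code would transform all 64 coefficients */

    for (int y = 0; y < 8; y++)
        for (int x = 0; x < 8; x++)
            dst[y * stride + x] = clip_u8_sketch(dst[y * stride + x] + dc);

    /* The coefficients are consumed, so leave the block zeroed for the next use. */
    memset(block, 0, 64 * sizeof(*block));
}
```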
* lavc: Move vector_fmul_window to AVFloatDSPContext (Justin Ruggles, 2013-01-16, 2 files, -50/+0)
    Signed-off-by: Luca Barbato <lu_zero@gentoo.org>
* lavc: add missing files for arm (Luca Barbato, 2012-12-20, 2 files, -0/+59)
    Across the many retouches those did not make the main commit.
* lavc: introduce VideoDSPContext (Ronald S. Bultje, 2012-12-20, 5 files, -13/+68)
    Move some functions from dsputil. The idea is that videodsp contains functions that are useful for a large and varied set of video decoders. Currently, it contains emulated_edge_mc() and prefetch().
    Signed-off-by: Luca Barbato <lu_zero@gentoo.org>
* misc typo, style and wording fixes (Diego Biurrun, 2012-12-18, 1 file, -1/+1)
* arm: rename ARMVFP config symbol to VFP (Mans Rullgard, 2012-12-07, 1 file, -2/+2)
    This is consistent with usual ARM nomenclature as well as with the VFPV3 and NEON symbols which both lack the ARM prefix.
    Signed-off-by: Mans Rullgard <mans@mansr.com>
* arm: use HAVE*_INLINE/EXTERNAL macros for conditional compilation (Mans Rullgard, 2012-12-07, 7 files, -13/+13)
    These macros reflect the actual capabilities required here.
    Signed-off-by: Mans Rullgard <mans@mansr.com>
* arm: fix use of uninitialised value in ff_fft_fixed_init_arm() (Mans Rullgard, 2012-12-07, 1 file, -1/+1)
    When initialising an FFTContext for a plain FFT, mdct_bits is not set and can contain a garbage value. Since nbits is always valid and for MDCT operation is mdct_bits - 2 checking this instead avoids using an uninitialised value while having the same effect.
    Signed-off-by: Mans Rullgard <mans@mansr.com>
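The reasoning in this fix can be sketched with a reduced, hypothetical context structure. Only the relation nbits = mdct_bits - 2 for MDCT operation comes from the commit message; the struct, the function names and the threshold value are assumptions made for illustration and are not the libavcodec code. The two predicates agree whenever mdct_bits is valid, but only the second one is safe for a plain FFT, where mdct_bits was never written.

```c
#include <stdbool.h>

/* Hypothetical, reduced context: only the two fields discussed above. */
struct fft_ctx_sketch {
    int nbits;     /* always set at initialisation */
    int mdct_bits; /* only set when the context is used for an MDCT */
};

/* Hypothetical threshold, chosen for illustration only. */
enum { MIN_MDCT_BITS_SKETCH = 5 };

/* Fragile form: reads mdct_bits, which is garbage for a plain FFT. */
bool fast_path_by_mdct_bits(const struct fft_ctx_sketch *s)
{
    return s->mdct_bits >= MIN_MDCT_BITS_SKETCH;
}

/* Robust form: the same test expressed through nbits (= mdct_bits - 2). */
bool fast_path_by_nbits(const struct fft_ctx_sketch *s)
{
    return s->nbits >= MIN_MDCT_BITS_SKETCH - 2;
}
```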
* dsputil: move vector_fmul_scalar() to AVFloatDSPContext in libavutil (Justin Ruggles, 2012-11-26, 2 files, -41/+0)
* Use ptrdiff_t instead of int for intra pred "stride" function parameter. (Ronald S. Bultje, 2012-10-29, 1 file, -18/+18)
    This way, SIMD-optimized functions don't have to sign-extend their stride argument manually to be able to do pointer arithmetic.