optimisation: merged the init loop of vq_quant().
fix minor compilation error/warning
fixed three declaration-after-statement issues
fixed a few warnings, no real change
Reworked the static modes. Now, if all static modes have the same frame size,
some index work (simplifications for dumb compilers) on IMDCT
Added a missing RESTORE_STACK in intra_prediction()
Not all compilers are equal -- making it clearer how the MDCT indexing is done
Defining IMUL32 for 32x32=>32 int multiplications and using it in the range
Simplified indexing in intra_prediction()
fixed ordering of the channels in the intra prediction.
Defining DISABLE_STEREO now optimises for the mono case
Fixed a stereo regression introduced in e28f25f0d14959d521fda0cdb8f1220995bc50e8
Fixed rsqrt testcase for float
Rework CWRS code.
Changed the rules for using the pulse spreading. It should be used less often
Revert ABS16/32 on C55 -- ended up being slower
ABS16 and ABS32 for the C55
Making the pulsesAtOnce code 16-bit safe.
Just commenting -- nothing to see.
Optimisation: got rid of about 10% of the 32-bit divisions by using ec_enc_uint
Removed a few int divisions from the intra prediction code.
fixed-point: using MULT16_16 instead of * in compute_band_energies()
Making a few functions static inline
Trying to clean up celt_ilog2() vs. EC_ILOG a bit.
making {next|prev}_cwrs* inline
optimisation: changed some for() loops to do-while() to give the compiler
Making it obvious to the compiler how to generate a dual-MAC in
mix_pitch_and_residual() no longer computing Ryp twice
optimisation: defined a reciprocal square root (celt_rsqrt) for use in
Fixed the rcp() testcase for new assumptions (x is positive)
optimisation: shaving a few cycles off prev_cwrs* by not computed the values
optimisations: faster handling of the zero for compute_band_energies() and
changed 1*rcp(x) to just rcp(x)
optimisation: intra_prediction() no longer needs to divide inside the search
optimisation: The "simple" Rxy/sqrt(Ryy) case in alg_quant no longer requires
Decision on whether to use pitch is now taken only based on energy in the
properly defined EPSILON for the float case
A bunch of pointers marked as "restrict" to ease the job of the compiler
optimisation: spreading_func now in-place with no branch in the loop and half
oops. find_max32() now uses VERY_LARGE32 (instead of VERY_LARGE16)
optimisations: Another bunch of simplifications to alg_quant(), mainly to
optimisation: Making use of restrict in find_spectral_pitch() to disambiguate
optimisations: caching sign of x in alg_quant(), changed celt_div()/celt_rcp()
Optimised intra prediction a bit -- removed a conditional branch and replaced
Removed support for more than one MDCT blocks per frame. I don't think there's
Removed the "pitch compression" in the residual quantisation. Also, removed
Unrolled the inner loop in vq_index() so that the codebook unpacking doesn't
Making bits2pulses() use a fixed number of iterations to allow further
include "dsplib.h" in fixed_c5x.h
replaced divisions by recoprocals in intra prediction and folding
defined find_max16 and overrode it for C55x
Made a second version of ec_{en|de}code optimised for encoding bits (no div
No longer trying to save bits when encoding integers near the upper limit
fixed-point: added cheap celt_div() division using a reciprocal
Using restrict to make it clear there's no aliasing issues in the mdct.
Added a few "restrict" keywords and changed some divisions to shifts
fixed TI fft code -- again
Removed potentially unused var in MDCT init
local var name maxval was shadowing the TI function used to compute it
make sure TRIG_UPSCALE is properly defined
fix for TI version of celt_maxabs16()
fixed-point: fix for 32-bit TI FFT
fixed-point: Wrapper for the 32-bit complex FFT used in the MDCT so we can use
fixed-point: defined celt_maxabs16() as basic operator
fixed-point: MULT16_32_Q15 for TI DSP (not entirely happy with it)
fixed-point: using TI intrinsic for celt_ilog2() if available.
Wrapper for the TI dsplib FFT
Making the real/single FFT easier to replace
Random numbers should work on 16-bit archs.
fixed-point: more TI macros. Comments on the existing ones.
Removed useless functions that caused warnings in skeleton.c
Making sure not to use the C library calls directly
Some work on assertions.
fixed-point: Fixed a "shift too large" warning by using a 16-bit mul instead
fixed-point: playing it safe. SHL32() now automatically casts input to 32-bit
fixed-point: fixed a 16-bit issue in normalise_bands() where Q15ONE was directly
Removed unnecessary header inclusions
Adjusting/fixing warnings
fixed-point: Using reciproqual instead of DIV32_16 for find_spectral_pitch()
Enabling warnings and restricting symbol visibility
fixed-point: simplification of the gain in mix_pitch_and_residual()
fixed-point: simplifying the arithmetic in alg_quant()
More accurate sqrt approximation using MULT16_16_Q15() instead of Q14.
Increased accuracy of the reciprocal approximation and made it use
Testcases should now work even when symbols aren't visible in the dso
Added info for properly exporting symbols
Changed definition of VARDECL to make it more flexible. No actual code change.
Using reciprocal approximation instead of full 32-bit division in alg_quant()
configure update (MIXED_PRECISION and STATIC_MODES can now be used)
Defined the celt_rcp() reciprocal approximation
__GNUC_PREREQ should be causing problems anymore.
Added macro definitions for the TI C5x family (untested)