checkasm: Improve cdef_filter test
Avoid redundant calls to CDEF DSP functions
x86: Bump nasm version requirement to 2.14
CI: Use a newer image to build snap packages
x86: add prep_8tap AVX512 asm
x86: replace "mov hb, Xb" by "movzx hd, Xb" in MC
x86inc: save xmm_regs_used in spill_xmm for non-win64
arm64: itx: Fix overflow/clipping in negation in idct16
x86: Fix overflows in SSSE3 idct
x86: Fix missing saturations in inverse identity asm
SSSE3 implementations of film grain
Reduce scope of NO_SANITIZE usage
Add a workaround for -fsanitize=cfi + dlsym() issue
x86: add prep_bilin AVX512 asm
x86: add avx512icl cpu flag to x86inc.asm
checkasm: x86: ensure all SIMD lanes are turned on at all times
Add misc. inverse transform C optimizations
Skip clipping in the inverse wht transform C implementation
x86: Fix SSSE3 inverse identity transform overflow/clipping
x86: Fix AVX2 inverse identity transform overflow/clipping
Fix building as a meson subproject
Fix missing include for limits.h
arm64: msac: Avoid 32 bit intermediates in symbol_adapt
arm64: itx: Use sqrdmulh in the preexisting identity transform functions
arm64: itx: Specialcase transforms with identity in the first pass with downshift
arm64: itx: Adjust .irp in the 4x16/16x4/8x16/16x8 functions
Don't interleave the skip mode index finding loops
Take lossless into account when assigning loopfilter strength
Deal with chroma coefficients that are exactly 0x100000
x86: Fix inverse (flip)adst 8x4 clipping
Fix C inverse ADST clipping
av1: use chroma txtp inference over default DCT_DCT if qidx=0
av1: skip super-resolution upscaling if width < 16
av1: do C inverse transforms in int32_t precision
Allow skip mode
Disable warping on scaled references
Check skip flag before setting tx context to TX_4X4 in lossless frames
Don't assume dlsym exists on linux
Linux: Add a workaround for a glibc stack size issue
x86: Disable AVX-512 by default
Fix potential race condition in dav1d_get_cpu_flags()
x86: Remove unused cpu flags
Add support for Ice Lake AVX-512 cpu flags
Correct the edge buffer documentation
Move lvl variable in cdef_brow to outer loop
Simplify shifting in generate_grain_y/uv AVX2
Avoid adding offsets to NULL chroma pointers in 4:0:0
arm: 32: ipred: Fix assembling with older binutils
Avoid excessive L2 collisions with certain frame widths
build: do not error out if clock_gettime is not found on darwin
Specify inlining decisions for common ipred_z C functions
cli: Return only EXIT_SUCCESS or EXIT_FAILURE from main
tools: simplify get_seed on macOS
checkasm: replace gettimeofday with clock_gettime
tools: prefer mach_absolute_time on macOS for forward compatibility
tools: mach_absolute_time is not portable, so stop checking at configure
Avoid adding offsets to NULL pointers
snap: bump grade to stable
input: Use EIO instead of ENODATA
annexb: Cast size_t to int before adding to an int
Fix accidental capital letter in variable name
section5: use signed type for seeking backwards
Add section-5 raw OBU demuxer
Make OBU_* types public
Increase DAV1D_PICTURE_ALIGNMENT to 64
Don't run film grain generation beyond visible luma border
Explicitly specify rodata section alignment
NEAREST, NEARER, NEAR, NEARISH
arm: ipred: NEON implementation of dc/h/v prediction functions
arm: 64: loopfilter: Avoid nested ifdefs where easily possible
arm: 64: loopfilter: Fix a typo in a macro parameter condition
arm64: loopfilter: Reorder instructions and tweak register use to match the arm32 port
arm64: loopfilter: Remove a stray double newline
arm: 32: Port the arm64 NEON loopfilter to arm32
arm: 32: Use more unique temporary labels within movrel_local
Tiny improvements to generate_grain_uv_420
Update README.md section on Roadmap
build: Add a workaround for Xcode 11 -fstack-check bug
Update NEWS for 0.5.1
x86: Fix overflows in inverse identity SSSE3 transforms
x86: Fix overflows in inverse identity AVX2 transforms
x86: adapt SSSE3 wiener filter to SSE2
x86: adapt SSSE3 warp_affine_8x8{,t} to SSE2
arm: looprestoration: Fix register names in a comment
arm64: looprestoration: Minimal scheduling improvements
arm64: looprestoration: Fix a typo
arm64: looprestoration: Fix register references in comments
arm64: looprestoration: Use ld2r instead of ld1+dup+dup
arm64: looprestoration: Pass a correct height parameter to sgr_box3_h_neon for the top slice
arm: looprestoration: Port the ARM64 SGR NEON assembly to 32 bit arm
x86: Add minor ipred_z AVX2 optimizations
Shrink some stack buffers in the C versions of ipred_z
Don't backup pixel if next block not "CDEFed"
x86inc: fix LOAD_MM_PERMUTATION for AVX512
x86: adapt SSSE3 cdef_filter_{4x4,4x8,8x8} to SSE2
tools: fix SSE2 cpu masking