FFmpeg/libavcodec/loongarch
jinbo 1f642b99af
avcodec/hevc: Add epel_uni_w_hv4/6/8/12/16/24/32/48/64 asm opt
tests/checkasm/checkasm:           C       LSX     LASX
put_hevc_epel_uni_w_hv4_8_c:       9.5     2.2
put_hevc_epel_uni_w_hv6_8_c:       18.5    5.0     3.7
put_hevc_epel_uni_w_hv8_8_c:       30.7    6.0     4.5
put_hevc_epel_uni_w_hv12_8_c:      63.7    14.0    10.7
put_hevc_epel_uni_w_hv16_8_c:      107.5   22.7    17.0
put_hevc_epel_uni_w_hv24_8_c:      236.7   50.2    31.7
put_hevc_epel_uni_w_hv32_8_c:      414.5   88.0    53.0
put_hevc_epel_uni_w_hv48_8_c:      917.5   197.7   118.5
put_hevc_epel_uni_w_hv64_8_c:      1617.0  349.5   203.0

After this patch, the peformance of decoding H265 4K 30FPS 30Mbps
on 3A6000 with 8 threads improves 3fps (52fps-->55fsp).

Change-Id: If067e394cec4685c62193e7adb829ac93ba4804d
Reviewed-by: yinshiyou-hf@loongson.cn
Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
2024-01-12 23:35:40 +01:00
..
cabac.h
h264_cabac.c
h264_deblock_lasx.c
h264_intrapred_init_loongarch.c
h264_intrapred_loongarch.h
h264chroma.S
h264chroma_init_loongarch.c
h264chroma_loongarch.h
h264dsp.S
h264dsp_init_loongarch.c
h264dsp_lasx.c
h264dsp_loongarch.h
h264idct.S
h264idct_loongarch.c
h264intrapred.S
h264qpel.S
h264qpel_init_loongarch.c
h264qpel_lasx.c
h264qpel_loongarch.h
h264qpel_lsx.c
hevc_add_res.S
hevc_idct_lsx.c
hevc_lpf_sao_lsx.c
hevc_mc.S avcodec/hevc: Add epel_uni_w_hv4/6/8/12/16/24/32/48/64 asm opt 2024-01-12 23:35:40 +01:00
hevc_mc_bi_lsx.c
hevc_mc_uni_lsx.c
hevc_mc_uniw_lsx.c
hevcdsp_init_loongarch.c avcodec/hevc: Add epel_uni_w_hv4/6/8/12/16/24/32/48/64 asm opt 2024-01-12 23:35:40 +01:00
hevcdsp_lasx.h avcodec/hevc: Add epel_uni_w_hv4/6/8/12/16/24/32/48/64 asm opt 2024-01-12 23:35:40 +01:00
hevcdsp_lsx.c
hevcdsp_lsx.h avcodec/hevc: Add epel_uni_w_hv4/6/8/12/16/24/32/48/64 asm opt 2024-01-12 23:35:40 +01:00
hpeldsp_init_loongarch.c
hpeldsp_lasx.c
hpeldsp_lasx.h
idctdsp_init_loongarch.c
idctdsp_lasx.c
idctdsp_loongarch.h
loongson_asm.S
Makefile avcodec/hevc: Add pel_uni_w_pixels4/6/8/12/16/24/32/48/64 asm opt 2024-01-12 23:35:40 +01:00
simple_idct_lasx.c
vc1dsp_init_loongarch.c
vc1dsp_lasx.c
vc1dsp_loongarch.h
videodsp_init.c
vp8_lpf_lsx.c
vp8_mc_lsx.c
vp8dsp_init_loongarch.c
vp8dsp_loongarch.h
vp9_idct_lsx.c
vp9_intra_lsx.c
vp9_lpf_lsx.c
vp9_mc_lsx.c
vp9dsp_init_loongarch.c
vp9dsp_loongarch.h