whisper.cpp / ggml /src /ggml-cpu /ggml-cpu-aarch64.cpp

Commit History

ggml : fix MUL_MAT_ID repack with Q8_K (llama/12544)
a13f78c

ggerganov commited on

ggml : block interleaving support for Q4_K quantization for x86 AVX2 architecture (llama/12332)
0729506

Srihari-mcw commited on

ggml : upgrade init_tensor API to return a ggml_status (llama/11854)
d6b6852

William Tambellini slaren commited on

ggml-backend : only offload from host buffers (fix) (llama/11124)
9ac3c7e

Diego Devesa commited on

ggml : fixes for AVXVNNI instruction set with MSVC and Clang (llama/11027)
d13ac16

Srihari-mcw slaren commited on

ggml-cpu: replace NEON asm with intrinsics in ggml_gemv_q4_0_4x8_q8_0() (llama/10874)
21f8a02

Adrien Gallouët commited on

ggml : disable iq4_nl interleave size 8 (llama/10709)
a5294e7

ggerganov commited on

ggml : refactor online repacking (llama/10446)
163128e

Djip007 ggerganov commited on