Spaces:
Running
Running
Commit History
ggml : block interleaving support for Q4_K quantization for x86 AVX2 architecture (llama/12332)
0729506
Srihari-mcw
commited on
ggml : upgrade init_tensor API to return a ggml_status (llama/11854)
d6b6852
William Tambellini
slaren
commited on
ggml-backend : only offload from host buffers (fix) (llama/11124)
9ac3c7e
Diego Devesa
commited on
ggml : fixes for AVXVNNI instruction set with MSVC and Clang (llama/11027)
d13ac16
Srihari-mcw
slaren
commited on
ggml-cpu: replace NEON asm with intrinsics in ggml_gemv_q4_0_4x8_q8_0() (llama/10874)
21f8a02
Adrien Gallouët
commited on