ggml : add IQ2 to test-backend-ops + refactoring (llama/4990) 227f2ae unverified ggerganov commited on Jan 17, 2024
ggml : importance matrix support for legacy quants (llama/4969) d8bb9d8 unverified Kawrakow ikawrakow commited on Jan 16, 2024
Add ability to use importance matrix for all k-quants (llama/4930) 7032309 unverified Kawrakow ikawrakow commited on Jan 14, 2024
ggml : SOTA 2-bit quants (add IQ2_XS) (llama/4856) 5e827d5 unverified Kawrakow ikawrakow commited on Jan 11, 2024
sync : ggml (backend v2, k-quants, CUDA opts, Metal opts, etc.) (#1422) 7006035 unverified ggerganov Chris Raethke commited on Nov 3, 2023