whisper.cpp / ggml-quants.h

Commit History

ggml : add IQ2 to test-backend-ops + refactoring (llama/4990)
227f2ae
unverified

ggerganov commited on

ggml : importance matrix support for legacy quants (llama/4969)
d8bb9d8
unverified

Kawrakow ikawrakow commited on

Add ability to use importance matrix for all k-quants (llama/4930)
7032309
unverified

Kawrakow ikawrakow commited on

2-bit quantizations (llama/4897)
8a399ab
unverified

Kawrakow ikawrakow commited on

ggml : SOTA 2-bit quants (add IQ2_XS) (llama/4856)
5e827d5
unverified

Kawrakow ikawrakow commited on

SOTA 2-bit quants (llama/4773)
75de5bf
unverified

Kawrakow ikawrakow commited on

ggml : fix q2_k bpw in comments (ggml/680)
269f9a0
unverified

ggerganov commited on

sync : ggml (backend v2, k-quants, CUDA opts, Metal opts, etc.) (#1422)
7006035
unverified

ggerganov Chris Raethke commited on