ggml : update softmax n_task calculation (llama/5126) 3a3eb8e unverified snadampal commited on Jan 26, 2024
minor : clean-up some warnings and style (llama/5094) 7df090b unverified ggerganov commited on Jan 23, 2024
ggml : parallelize FP32 conversion when using BLAS (llama/5045) 7bf2c87 unverified reinforce20001 ggerganov commited on Jan 22, 2024
llava : MobileVLM support (llama/4954) dc8f956 unverified cxt123 Chenxiaotao03 commited on Jan 22, 2024
ggml : add IQ2 to test-backend-ops + refactoring (llama/4990) 227f2ae unverified ggerganov commited on Jan 17, 2024
ggml : importance matrix support for legacy quants (llama/4969) d8bb9d8 unverified Kawrakow ikawrakow commited on Jan 16, 2024
ggml : introduce GGML_CALL function annotation (llama/4850) 7815f68 unverified jartine commited on Jan 16, 2024
Add ability to use importance matrix for all k-quants (llama/4930) 7032309 unverified Kawrakow ikawrakow commited on Jan 14, 2024
ggml: cache sin/cos for RoPE (llama/4908) c315fbf unverified JohannesGaessler commited on Jan 13, 2024
gguf : fix potential infinite for-loop (llama/4600) 0e93179 unverified texmex76 Bernhard Gstrein commited on Jan 13, 2024
llama : ggml-backend integration (llama/4766) 362430b unverified slaren ggerganov JohannesGaessler commited on Jan 12, 2024
Importance Matrix calculation (llama/4861) c0b17f1 unverified Kawrakow ikawrakow ggerganov commited on Jan 12, 2024
ggml : SOTA 2-bit quants (add IQ2_XS) (llama/4856) 5e827d5 unverified Kawrakow ikawrakow commited on Jan 11, 2024
ggml : remove ggml_cpy_inplace and ggml_cont_inplace (ggml/693) 6469bfe unverified Timothy Cronin commited on Jan 11, 2024
ggml : do not sched_yield when calling BLAS (llama/4761) 5d1dffc unverified ggerganov commited on Jan 5, 2024
ggml : extend ggml_get_rows, ggml_repeat, ggml_concat (ggml/639) f17d170 Guillaume Wenzek ggerganov commited on Dec 29, 2023
sync : ggml (VMM, sync-ggml-am, dotprod ARM fixes, CUDA fixes) (#1691) 919a447 unverified ggerganov commited on Dec 29, 2023
sync : ggml (ggml_scale, ggml_row_size, etc.) (#1677) aa86ade unverified ggerganov commited on Dec 22, 2023
sync : ggml (Metal fixes, new ops, tests) (#1633) a0d4b48 unverified ggerganov commited on Dec 13, 2023
sync : ggml (ggml-alloc + linker + gguf fixes) (#1501) 58507b9 unverified ggerganov commited on Nov 17, 2023
whisper : add full CUDA and Metal offloading (#1472) da4acca unverified ggerganov commited on Nov 12, 2023
sync : ggml (backend v2, k-quants, CUDA opts, Metal opts, etc.) (#1422) 7006035 unverified ggerganov Chris Raethke commited on Nov 3, 2023
whisper : fix bench regression + fix performance when using CPU BLAS (#1275) abbf5f2 unverified ggerganov commited on Sep 12, 2023
build : do not use _GNU_SOURCE gratuitously (#1129) beefa34 unverified Przemysław Pawełczyk commited on Sep 7, 2023
ggml : sync latest llama.cpp (view_src + alloc improvements) (#1247) 8bb66c1 unverified ggerganov commited on Sep 5, 2023
ggml : sync (ggml-alloc, GPU, eps, etc.) (#1220) d41ba35 unverified ggerganov commited on Sep 5, 2023
ggml : fix compilation errors incurred by -Werror (#1227) 45ef7b5 unverified ChangSeok Oh commited on Aug 30, 2023
ggml : fix compiling when SSE3 is available but not SSSE3 (#1210) b7995b7 unverified Przemysław Pawełczyk commited on Aug 27, 2023
ci : more platforms coverage (#1101) c4448fa unverified alonfaraj Alon Faraj commited on Jul 16, 2023
Revert "ggml : do not use _GNU_SOURCE gratuitously (#1027)" 1e5ddb0 unverified ggerganov commited on Jul 2, 2023
ggml : sync latest repo (mostly refactoring changes) d97fd69 unverified ggerganov commited on Jul 2, 2023
ggml : do not use _GNU_SOURCE gratuitously (#1027) 3a69cdf unverified Przemysław Pawełczyk commited on Jun 25, 2023