cmake : fix compile assumptions for power9/etc (#2777) 4683df3 unverified midnight midnight commited on Feb 5, 2025
cmake: Add ability to pass in GGML_BUILD_NUMBER (ggml/1096) 729db34 unverified Christian Kastner commited on Feb 3, 2025
coreml : always convert to "neuralnetwork" (#2770) e351248 unverified mgrachten commited on Feb 3, 2025
HIP: fix flash_attn_stream_k_fixup warning (llama/11604) acfd94f JohannesGaessler commited on Feb 2, 2025
CUDA/HIP: add support for selectable warp size to mmv (llama/11519) ed08269 uvos commited on Feb 2, 2025
HIP: add GGML_CUDA_CC_IS_* for amd familys as increasing cc archtectures for amd gpus are not supersets of eatch other (llama/11601) 4850c24 uvos commited on Feb 2, 2025
CUDA: use mma PTX instructions for FlashAttention (llama/11583) f328957 JohannesGaessler Diego Devesa commited on Feb 2, 2025
`ci`: use sccache on windows instead of ccache (llama/11545) 9ed1962 Olivier Chafik commited on Jan 31, 2025
vulkan: implement initial support for IQ2 and IQ3 quantizations (llama/11360) bd93c1b Rémy Oudompheng jeffbolznv commited on Jan 29, 2025
vulkan: Catch pipeline creation failure and print an error message (llama/11436) d4f6b2c jeffbolznv commited on Jan 29, 2025
HIP: Only call rocblas_initialize on rocblas versions with the multiple instantation bug (llama/11080) 82bb7f3 Nikita Sarychev commited on Jan 28, 2025
SYCL : SOFTMAX F16 mask support and other fixes (llama/11261) 8aaf0c8 qnixsynapse commited on Jan 28, 2025
AMD: parse the architecture as supplied by gcnArchName (llama/11244) 04b01d8 Haus1 commited on Jan 27, 2025
metal: Handle null returned from MTLCreateSystemDefaultDevice() (llama/11441) 4e38ed4 Ihar Hrachyshka commited on Jan 27, 2025
Hip: disable VMM on hip as it seams that it dosent work in some configurations (llama/11420) 2cc4df4 uvos commited on Jan 25, 2025
rocBLAS: Avoid fp32->fp16->fp32 conversion on cdna (llama/11356) 6f5687a uvos commited on Jan 24, 2025
CPU/CUDA: fix (GQA) mul mat back, add CUDA support (llama/11380) 855a9fe JohannesGaessler commited on Jan 24, 2025
cmake : avoid -march=native when reproducible build is wanted (llama/11366) 3cae2d9 Bernhard M. Wiedemann commited on Jan 24, 2025
vulkan: sort shaders for more deterministic binary (llama/11315) d7c0046 jeffbolznv commited on Jan 23, 2025
rpc : better caching of the base buffer pointer (llama/11331) 81a6cae rgerganov commited on Jan 21, 2025
cmake : add sanitizer flags for llama.cpp (llama/11279) 3547979 ggerganov JohannesGaessler commited on Jan 18, 2025
vulkan: fix coopmat2 flash attention for non-contiguous inputs (llama/11281) e0e73fa jeffbolznv commited on Jan 18, 2025