Fixed dequant precision issues in Q4_1 and Q5_1 (llama/9711) 5239c28 Ouadie EL FAROUKI committed on Oct 3, 2024
ggml-backend : add device and backend reg interfaces (llama/9707) 1bdb50a Diego Devesa JohannesGaessler committed on Oct 2, 2024
Initial cmake support of SYCL for AMD GPUs (llama/9658) 7d7ac98 Alberto Cabrera Pérez committed on Oct 2, 2024
ggml/ex: calculate accuracy in graph, adapt MNIST (ggml/980) 52069b8 JohannesGaessler committed on Oct 3, 2024
ggml: refactor cross entropy loss CPU impl. (ggml/976) 2a0805f JohannesGaessler committed on Oct 2, 2024
examples : update dr_wav.h to newer version (#2449) d678325 Rahul Vadhyar committed on Oct 4, 2024
test: fix OPT_STEP_ADAMW for test-backend-ops (ggml/974) 76aa810 JohannesGaessler committed on Sep 30, 2024
ggml : define missing HWCAP flags (llama/9684) 1d52105 ggerganov Willy Tarreau committed on Sep 29, 2024
ggml : add run-time detection of neon, i8mm and sve (llama/9331) 12c0e23 Dan Johansson committed on Sep 28, 2024
Enable use to the rebar feature to upload buffers to the device. (llama/9251) 760f8c2 Markus Tavenrath committed on Sep 28, 2024
ggml : remove assert for AArch64 GEMV and GEMM Q4 kernels (llama/9217) 50395aa Charles Xu committed on Sep 25, 2024
cann: fix crash when llama-bench is running on multiple cann devices (llama/9627) 068c697 dou112 committed on Sep 25, 2024
vulkan : fix build for GGML_VULKAN_RUN_TESTS, add TFLOPS to log (ggml/961) 85e2387 jeffbolznv committed on Sep 27, 2024
vulkan : argsort barriers must be under uniform control flow (ggml/951) b2602d7 smeso committed on Sep 26, 2024
ggml : fix GGML_MAX_N_THREADS + improve formatting (ggml/969) ad34655 ggerganov committed on Sep 24, 2024
server : ffmpeg overwrite leftover temp file (#2431) 2dafb8e dynafire committed on Oct 2, 2024
readme : fix references to download-ggml-model.sh (#2427) 3d92452 Hugo committed on Sep 24, 2024
ggml : add AVX512DQ requirement for AVX512 builds (llama/9622) 14b5848 Eric Zhang committed on Sep 24, 2024
log : add CONT level for continuing previous log entry (llama/9610) a29a4c5 ggerganov committed on Sep 24, 2024
threads: improve ggml_barrier scaling with large number of threads (llama/9598) aca04d5 Max Krasnyansky committed on Sep 23, 2024
Revert "[SYCL] fallback mmvq (ggml/9088)" (llama/9579) 5aceb3d Akarshan Biswas committed on Sep 23, 2024
musa: enable building fat binaries, enable unified memory, and disable Flash Attention on QY1 (MTT S80) (llama/9526) 8ec75c3 R0CKSTAR committed on Sep 22, 2024
ggml-alloc : fix list of allocated tensors with GGML_ALLOCATOR_DEBUG (llama/9573) 673df39 slaren committed on Sep 21, 2024
Update CUDA graph on scale change plus clear nodes/params (llama/9550) 6b63eb1 agray3 committed on Sep 21, 2024