Commit History
HIP: add cmake option to enable compiler output of kernel resource usage metrics (llama/15103) 577f7e4
cmake: Add GGML_BACKEND_DIR option (llama/15074) 6e460b6
Christian Kastner committed on
HIP: add GGML_HIP_MMQ_MFMA option to allow disableing the MFMA path. (llama/14930) f9dbd96
ggml-cpu : disable GGML_NNPA by default due to instability (llama/14880) cac085c
musa: upgrade musa sdk to rc4.2.0 (llama/14498) a687ec3
ggml: Add initial WebGPU backend (llama/14521) 0dd208f
Reese Levine committed on
sync : resolve conflicts (#0) 5ec49ef
sync : resolve conflicts (ggml/0) 497add0
ggml : support bcast ggml_soft_max_ext, ggml_flash_attn_ext (llama/14435) ebacb3e
ggml-cpu: enable IBM NNPA Vector Intrinsics (llama/14317) fea8f94
HIP: disable rocwmma on gfx12 by default until rocm 7.0 (llama/14202) f95736f
uvos committed on
ggml : disable warnings for tests when using MSVC (ggml/1273) 1669c07
examples : include examples in msvc disable warn (ggml/1270) 0c191be
ggml-cpu : split arch-specific implementations (llama/13892) 8c833e9
llama : allow using mmap without PrefetchVirtualMemory, apply GGML_WIN_VER to llama.cpp sources (llama/14013) f0a0ac8
Diego Devesa committed on
vulkan: use timestamp queries for GGML_VULKAN_PERF (llama/13817) 56ddc5b
ggml : riscv: add xtheadvector support (llama/13720) 4790d5d
xctan committed on
sycl: use oneDNN for matrices multiplication (llama/12972) 2008e08
Łukasz Ślusarczyk committed on
whisper: remove MSVC warnings pragmas (#3090) e0d130c unverified
whisper : add check that target name exists (#3103) 60ff3ed unverified
ggml : suppress Windows compiler warnings (#3075) 887f7a2 unverified
ggml : add SSE 4.2 and x64 base variant for CPUs without AVX (llama/12871) f8795d3
Diego Devesa committed on
CUDA/HIP: Share the same unified memory allocation logic. (llama/12934) 143cb70
David Huang committed on
ggml : sync/merge cmake,riscv,powerpc, add common.cmake (ggml/0) f695cbf
SYCL: using graphs is configurable by environment variable and compile option (llama/12371) c18969f
Łukasz Ślusarczyk Romain Biessy committed on
opencl: use OpenCL C standard supported by the device (llama/12221) 57028a7
Henry Linjamäki committed on
cmake: Enable specifying exact PowerPC CPU architecture (ggml/1138) aac4d16
Christian Kastner committed on
ggml : add logging for native build options/vars (#2935) aaf8a91 unverified
ggml-cpu: Faster IQ1 mul_mat_vec on AVX2 using BMI2 instructions (llama/12154) 05466a9
Rémy O committed on
HIP: implement FlashAttention via rocWMMA for CDNA and RDNA3+ (llama/12032) a027c1d
David Huang committed on
CUDA: compress mode option and default to size (llama/12029) 4ec988a
Erik Scholz committed on
cmake: Fix ggml backend dependencies and installation (llama/11818) c6c2a2c
Vladimir Vuksanovic committed on
Told cmake to install ggml-cpp.h as a public header file. (ggml/1126) 3d4f29c
petterreinholdtsen Petter Reinholdtsen committed on
ggml-cpu: Support s390x SIMD Instruction Set (llama/12019) 4aa54ec
Aaron Teo Jinyang He junchao-zhao committed on
CUDA: app option to compile without FlashAttention (llama/12025) fbc5f16
ggml-cpu: Add CPU backend support for KleidiAI library (llama/11390) 9de6d81
Charles Xu committed on
cmake: Add ability to pass in GGML_BUILD_NUMBER (ggml/1096) 729db34 unverified
Christian Kastner committed on
cmake: add ggml find package (llama/11369) ca6577f
Hip: disable VMM on hip as it seams that it dosent work in some configurations (llama/11420) 2cc4df4
uvos committed on
hip : Add hipGraph and VMM support to ROCM (llama/11362) 089afa0
uvos committed on
cmake : avoid -march=native when reproducible build is wanted (llama/11366) 3cae2d9
Bernhard M. Wiedemann committed on
fix: ggml: fix vulkan-shaders-gen build (llama/10448) ad8f031
GGUF: C++ refactor, backend support, misc fixes (llama/11030) 21c5b64
ggml : do not install metal source when embed library (ggml/1054) 9615cf2
ggml : fix arm build (llama/10890) e58e7a9
Diego Devesa Adrien Gallouët committed on
Introducing experimental OpenCL backend with support for Qualcomm Adreno GPUs (llama/10693) 83a0899
lhez Skyler Szot Shangqing Gu Alexander Angus Hongqiang Wang Max Krasnyansky committed on
remove CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS (llama/10797) b38cecf
Diego Devesa committed on
ggml : add predefined list of CPU backend variants to build (llama/10626) 1794b43
Diego Devesa committed on