whisper.cpp / ggml.h

Commit History

metal : support FA without mask + add asserts (llama/7278)
98ce302
unverified

ggerganov commited on

ggml : restore sigmoid decl order (ggml/0)
67c5387

ggerganov commited on

ggml : full ALiBi support (llama/7192)
192bda4

ggerganov commited on

ggml : introduce bfloat16 support (llama/6412)
81ec961

Justine Tunney commited on

add basic tensor data validation function (llama/6884)
71e001c

slaren commited on

ggml : group all experts in a single ggml_mul_mat_id (llama/6505)
f0b5c67

slaren ggerganov commited on

llama : add gguf_remove_key + remove split meta during quantize (llama/6591)
1706870

jiez z5269887 commited on

feat: implemented sigmoid function (ggml/806)
cd0c122

Justina Cho commited on

llama : add Command R Plus support (llama/6491)
8cf7097
unverified

Carolinabanana S S slaren ggerganov commited on

ggml : mul_mat_id use the same tensor for all the experts (llama/6387)
26fdc9f
unverified

slaren ggerganov commited on

sync : ggml (#2001)
cbbfa9e
unverified

ggerganov commited on

ggml : designate enum vals for integer types (llama/6050)
0bd0c7a
unverified

ggerganov commited on

ggml : remove old quantization functions (llama/5942)
11a2545
unverified

ggerganov commited on

llama : support Mamba Selective State Space Models (llama/5328)
224fbc2
unverified

compilade commited on

ggml : introduce ggml_status (ggml/750)
151c676
unverified

Michael Podvitskiy slaren ggerganov commited on

add some new ops, fix some operators and add batch operations to certain operators. (ggml/747)
dd8e3f9
unverified

leejet ggerganov slaren commited on

IQ4_XS: a 4.25 bpw quantization (llama/5747)
0ee1bfb
unverified

Kawrakow ikawrakow commited on

Adding IQ2_S and IQ2_M to complete coverage of the 2-3 bit quantization range (llama/5721)
2b9bb9e
unverified

Kawrakow ikawrakow ggerganov commited on

code : normalize enum names (llama/5697)
93e0830
unverified

ggerganov commited on

IQ3_S: a much better alternative to Q3_K (llama/5676)
32589c9
unverified

Kawrakow ikawrakow commited on

Introduce backend GUIDs (ggml/743)
a7eb9f6
unverified

UEXTM.com slaren commited on

ggml : always define ggml_fp16_t as uint16_t (llama/5666)
bc567d3
unverified

ggerganov commited on

sync : llama.cpp (ggml/0)
f8e8d34
unverified

ggerganov commited on

1.5 bit quantization (llama/5453)
9c3aa6a
unverified

Kawrakow ikawrakow commited on

ggml : add ALiBi support for ggml_soft_max_ext (llama/5488)
26c019a
unverified

ggerganov commited on

ggml : add mmla kernels for quantized GEMM (llama/4966)
0d50a29
unverified

snadampal commited on

ggml-alloc : v3 (ggml/727)
5cffd6f
unverified

slaren commited on

llava : add MobileVLM support (llama/5132)
f17a416
unverified

JidongZhang-THU slaren commited on

kompute : llama-bench support and ggml_cpu_has_kompute() (llama/5226)
0c9c434
unverified

Cebtenzzre commited on

ggml : add abort_callback for cpu backend (ggml/725)
a8ea91b
unverified

Michael Podvitskiy commited on

SOTA 3-bit quants (llama/5196)
4649943
unverified

Kawrakow ikawrakow commited on

ggml : add unified SYCL backend for Intel GPUs (llama/2690)
01169e0
unverified

Abhilash Majumder jianyuzh KevinLy hengyu ggerganov commited on

minor : clean-up some warnings and style (llama/5094)
7df090b
unverified

ggerganov commited on

llava : MobileVLM support (llama/4954)
dc8f956
unverified

cxt123 Chenxiaotao03 commited on

ggml : add IQ2 to test-backend-ops + refactoring (llama/4990)
227f2ae
unverified

ggerganov commited on

imatrix : offload to GPU support (llama/4957)
6490f98
unverified

ggerganov commited on

ggml : introduce GGML_CALL function annotation (llama/4850)
7815f68
unverified

jartine commited on

2-bit quantizations (llama/4897)
8a399ab
unverified

Kawrakow ikawrakow commited on

Importance Matrix calculation (llama/4861)
c0b17f1
unverified

Kawrakow ikawrakow ggerganov commited on

ggml : SOTA 2-bit quants (add IQ2_XS) (llama/4856)
5e827d5
unverified

Kawrakow ikawrakow commited on

ggml : remove ggml_cpy_inplace and ggml_cont_inplace (ggml/693)
6469bfe
unverified

Timothy Cronin commited on

ggml : change GGML_MAX_NAME at compile time (ggml/682)
ded2b1a
unverified

leejet commited on

SOTA 2-bit quants (llama/4773)
75de5bf
unverified

Kawrakow ikawrakow commited on

ggml : add ggml_cpu_has_avx_vnni() (llama/4589)
b10cbfd

alandao ggerganov commited on

sync : ggml (VMM, sync-ggml-am, dotprod ARM fixes, CUDA fixes) (#1691)
919a447
unverified

ggerganov commited on

sync : ggml (ggml_scale, ggml_row_size, etc.) (#1677)
aa86ade
unverified

ggerganov commited on