whisper.cpp / ggml-metal.m

Commit History

metal : switch back to default.metallib (ggml/681)
b945a8f
unverified

ggerganov committed on

ggml : add error handling to graph_compute (#1714)
92f24ee
unverified

finnvoorhees committed on

metal : add kernel_get_rows_i32
459dd87

ggerganov committed on

metal : optimize ggml_mul_mat_id (faster Mixtral PP) (llama/4725)
8bc6274

ggerganov committed on

metal : enable shader debugging (cmake option) (llama/4705)
7dd37dc

ggerganov committed on

sync : ggml (ggml_scale, ggml_row_size, etc.) (#1677)
aa86ade
unverified

ggerganov committed on

sync : ggml (Metal fixes, new ops, tests) (#1633)
a0d4b48
unverified

ggerganov committed on

metal : fix `ggml_metal_log` vargs (#1606)
b3cea90
unverified

finnvoorhees committed on

metal : fix soft_max kernel src1 argument (#1602)
5692844
unverified

ggerganov committed on

sync : ggml (new ops, new backend, etc) (#1602)
895e87a
unverified

ggerganov committed on

metal : add backend function to check device family support (#1547)
c95e649
unverified

ggerganov committed on

metal : fix build (#1544)
02dbf1a
unverified

sandrohanea committed on

whisper : make large version explicit + fix data size units (#1493)
03a3210
unverified

ggerganov committed on

ggml : fix some compile warnings
ad6c9c1
unverified

ggerganov committed on

whisper : add full CUDA and Metal offloading (#1472)
da4acca
unverified

ggerganov committed on

metal : fix asserts for setThreadgroupMemoryLength (close #1435)
b42b45f
unverified

ggerganov committed on

sync : ggml (backend v2, k-quants, CUDA opts, Metal opts, etc.) (#1422)
7006035
unverified

ggerganov, Chris Raethke committed on

metal : restore matrix x vector f16_f32 kernels for now
2dd8c56
unverified

ggerganov committed on

metal : add F32 support + update bench output
02d7878
unverified

ggerganov committed on

whisper : Metal and ggml-alloc support (#1270)
714ee6b
unverified

ggerganov committed on

sync : ggml (HBM + Metal + style) (#1264)
88deeba
unverified

ggerganov committed on

ggml : posixify pagesize (#1251)
4902c26
unverified

Przemysław Pawełczyk committed on

ggml : sync latest llama.cpp (view_src + alloc improvements) (#1247)
8bb66c1
unverified

ggerganov committed on

ggml : sync (ggml-alloc, GPU, eps, etc.) (#1220)
d41ba35
unverified

ggerganov committed on

ggml : sync latest repo (mostly refactoring changes)
d97fd69
unverified

ggerganov committed on

metal : sync ggml-metal (ref #1047)
799974c
unverified

ggerganov committed on