Commit History
ggml: aarch64: implement SVE kernels for q4_K_q8_K vector dot (llama/11227)
bf3dc93
vulkan: scale caching for k quants + misc fixes (llama/11081)
03ab36f
Eve
fix: ggml: fix vulkan-shaders-gen build (llama/10448)
ad8f031
RoPE: fix back, CUDA support for back + noncont. (llama/11240)
131a21e
SYCL: Add gated linear attention kernel (llama/11175)
fdb1fe5
ggml : add option to not print stack on abort (ggml/1081)
9b2706e
William Tambellini
Diego Devesa
ggml-cpu : fix ggml_graph_compute_thread did not terminate on abort. (ggml/1065)
8e57313
issixx
ci : dummy commit to trigger CI
600a548
ruby : Make context accept initial parameters, API to retrieve a segment and more (#2749)
7cb9a0e
whisper.objc : fix build and CI
9cbd99a
Corey Earwood
talk-llama : sync llama.cpp
16d40d7
sync : ggml
d50f71a
GGUF: C++ refactor, backend support, misc fixes (skip) (llama/11030)
92311a3
ggml : add opencl backend (skip) (llama/10693)
226358f
lhez
Skyler Szot
Shangqing Gu
Alexander Angus
Hongqiang Wang
Max Krasnyansky
cuda : CUDA Graph Compute Function Refactor (precursor for performance improvements) (llama/11042)
25882f6
Andreas Kieslinger
slaren
ggml : do not define GGML_USE_CUDA when building with GGML_BACKEND_DL (llama/11211)
79f750d
Vulkan: Fix float16 use on devices without float16 support + fix subgroup_size_control validation error (llama/11161)
5ad3f1d
SYCL: Refactor ggml_sycl_compute_forward (llama/11121)
fa23a38
fix: add missing msg in static_assert (llama/11143)
8c60d6a
llamafile : ppc64le MMA INT8 implementation (llama/10912)
6f18eed
amritahs-ibm
Disable GL_KHR_cooperative_matrix Vulkan extension if not available. (llama/11117)
623b74d
fix: Vulkan shader gen binary path when Cross-compiling (llama/11096)
966a7bb
ag2s20150909
GGUF: C++ refactor, backend support, misc fixes (llama/11030)
21c5b64
ggml-backend : only offload from host buffers (fix) (llama/11124)
9ac3c7e
Diego Devesa
ggml-backend : only offload from host buffers (llama/11120)
1ca87a8
Diego Devesa
rpc : code cleanup (llama/11107)
a0fb22d
SYCL: Use get_multi_ptr instead of deprecated get_pointer in wkv6 (llama/11087)
4ed93cc
CUDA: add BF16 support (llama/11093)
961ef57
Vulkan: Add device-specific blacklist for coopmat for the AMD proprietary driver (llama/11074)
4d90c3d
Support for models with non-512-aligned tensors over RPC. (llama/11047)
895a3a2
fix: Vulkan shader gen binary path (llama/11037)
7008fb8
Gilad S.
ggml : allow loading backend with env variable (ggml/1059)
48aa6d0
scripts : sync opencl, gguf
f751550
whisper : fix gpu device selection (#2728)
87b427e
server : fix build (#2718)
7925ae3
talk-llama : sync llama.cpp (#2709)
b462700
server : generate unique tmp filenames (#2718)
89d94b1
NETZkultur GmbH
whisper : add whisper_full_get_segment_no_speech_prob_from_state (#2716)
cb32a92
Sandro Hanea
readme : add docker instructions (#2711)
28257a6
docs: Fix main -> whisper-cli in download scripts (#2707)
4abfe5a
Adam Jones
release : v1.7.4
c775ca4
ci : cont
6331634
ci : fix ubuntu runner names
9a3c061
cli : fix segfault on missing argument (#2700)
245a91f
Yusuf Redžić