whisper.cpp

Running

App Files Files Community

whisper.cpp / ggml /CMakeLists.txt

Commit History

ggml: initial IBM zDNN backend (llama/14975)

449e1a4

taronaeo commited on Aug 15, 2025

HIP: add cmake option to enable compiler output of kernel resource usage metrics (llama/15103)

577f7e4

uvos commited on Aug 7, 2025

cmake: Add GGML_BACKEND_DIR option (llama/15074)

6e460b6

Christian Kastner commited on Aug 4, 2025

HIP: add GGML_HIP_MMQ_MFMA option to allow disableing the MFMA path. (llama/14930)

f9dbd96

uvos commited on Jul 29, 2025

ggml-cpu : disable GGML_NNPA by default due to instability (llama/14880)

cac085c

taronaeo commited on Jul 25, 2025

musa: upgrade musa sdk to rc4.2.0 (llama/14498)

a687ec3

yeahdongcn commited on Jul 24, 2025

ggml: Add initial WebGPU backend (llama/14521)

0dd208f

Reese Levine commited on Jul 16, 2025

sync : resolve conflicts (#0)

5ec49ef

ggerganov HF Staff commited on Jul 12, 2025

sync : resolve conflicts (ggml/0)

497add0

ggerganov HF Staff commited on Jul 12, 2025

ggml : support bcast ggml_soft_max_ext, ggml_flash_attn_ext (llama/14435)

ebacb3e

ggerganov HF Staff commited on Jul 12, 2025

ggml : add version function to get lib version (ggml/1286)

880f633

danbev

ggerganov HF Staff commited on Jul 2, 2025

ggml-cpu: enable IBM NNPA Vector Intrinsics (llama/14317)

fea8f94

taronaeo slaren commited on Jun 25, 2025

HIP: disable rocwmma on gfx12 by default until rocm 7.0 (llama/14202)

f95736f

uvos commited on Jun 16, 2025

ggml : disable warnings for tests when using MSVC (ggml/1273)

1669c07

danbev commited on Jun 13, 2025

examples : include examples in msvc disable warn (ggml/1270)

0c191be

danbev commited on Jun 12, 2025

ggml-cpu : split arch-specific implementations (llama/13892)

8c833e9

xctan

ggerganov HF Staff commited on Jun 9, 2025

llama : allow using mmap without PrefetchVirtualMemory, apply GGML_WIN_VER to llama.cpp sources (llama/14013)

f0a0ac8

Diego Devesa commited on Jun 5, 2025

vulkan: use timestamp queries for GGML_VULKAN_PERF (llama/13817)

56ddc5b

jeffbolznv commited on May 27, 2025

ggml : riscv: add xtheadvector support (llama/13720)

4790d5d

xctan commited on May 27, 2025

sycl: use oneDNN for matrices multiplication (llama/12972)

2008e08

Łukasz Ślusarczyk commited on May 15, 2025

whisper: remove MSVC warnings pragmas (#3090)

e0d130c
unverified

danbev commited on May 5, 2025

whisper : add check that target name exists (#3103)

60ff3ed
unverified

danbev commited on May 1, 2025

ggml : suppress Windows compiler warnings (#3075)

887f7a2
unverified

danbev commited on Apr 29, 2025

ggml : add SSE 4.2 and x64 base variant for CPUs without AVX (llama/12871)

f8795d3

Diego Devesa commited on Apr 21, 2025

CUDA/HIP: Share the same unified memory allocation logic. (llama/12934)

143cb70

David Huang commited on Apr 15, 2025

ggml : sync/merge cmake,riscv,powerpc, add common.cmake (ggml/0)

f695cbf

ggerganov HF Staff commited on Mar 27, 2025

SYCL: using graphs is configurable by environment variable and compile option (llama/12371)

c18969f

Łukasz Ślusarczyk Romain Biessy commited on Mar 18, 2025

opencl: use OpenCL C standard supported by the device (llama/12221)

57028a7

Henry Linjamäki commited on Mar 10, 2025

cmake: Enable specifying exact PowerPC CPU architecture (ggml/1138)

aac4d16

Christian Kastner commited on Mar 10, 2025

ggml : add logging for native build options/vars (#2935)

aaf8a91
unverified

danbev commited on Mar 24, 2025

ggml-cpu: Faster IQ1 mul_mat_vec on AVX2 using BMI2 instructions (llama/12154)

05466a9

Rémy O commited on Mar 6, 2025

HIP: implement FlashAttention via rocWMMA for CDNA and RDNA3+ (llama/12032)

a027c1d

David Huang commited on Mar 3, 2025

CUDA: compress mode option and default to size (llama/12029)

4ec988a

Erik Scholz commited on Mar 1, 2025

cmake: Fix ggml backend dependencies and installation (llama/11818)

c6c2a2c

Vladimir Vuksanovic commited on Feb 27, 2025

Told cmake to install ggml-cpp.h as a public header file. (ggml/1126)

3d4f29c

petterreinholdtsen Petter Reinholdtsen commited on Feb 26, 2025

ggml-cpu: Support s390x SIMD Instruction Set (llama/12019)

4aa54ec

Aaron Teo Jinyang He junchao-zhao commited on Feb 22, 2025

CUDA: app option to compile without FlashAttention (llama/12025)

fbc5f16

JohannesGaessler commited on Feb 22, 2025

ggml-cpu: Add CPU backend support for KleidiAI library (llama/11390)

9de6d81

Charles Xu commited on Feb 20, 2025

cmake: Add ability to pass in GGML_BUILD_NUMBER (ggml/1096)

729db34
unverified

Christian Kastner commited on Feb 3, 2025

cmake: add ggml find package (llama/11369)

ca6577f

bandoti

ggerganov HF Staff commited on Jan 26, 2025

Hip: disable VMM on hip as it seams that it dosent work in some configurations (llama/11420)

2cc4df4

uvos commited on Jan 25, 2025

hip : Add hipGraph and VMM support to ROCM (llama/11362)

089afa0

uvos commited on Jan 24, 2025

cmake : avoid -march=native when reproducible build is wanted (llama/11366)

3cae2d9

Bernhard M. Wiedemann commited on Jan 24, 2025

fix: ggml: fix vulkan-shaders-gen build (llama/10448)

ad8f031

Sparkleholic commited on Jan 15, 2025

GGUF: C++ refactor, backend support, misc fixes (llama/11030)

21c5b64

JohannesGaessler commited on Jan 7, 2025

ggml : do not install metal source when embed library (ggml/1054)

9615cf2

ggerganov HF Staff commited on Jan 3, 2025

ggml : fix arm build (llama/10890)

e58e7a9

Diego Devesa Adrien Gallouët commited on Dec 18, 2024

Introducing experimental OpenCL backend with support for Qualcomm Adreno GPUs (llama/10693)

83a0899

lhez Skyler Szot Shangqing Gu Alexander Angus Hongqiang Wang Max Krasnyansky commited on Dec 13, 2024

remove CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS (llama/10797)

b38cecf

Diego Devesa commited on Dec 12, 2024

ggml : add predefined list of CPU backend variants to build (llama/10626)

1794b43

Diego Devesa commited on Dec 4, 2024

Commit History

ggml: initial IBM zDNN backend (llama/14975) 449e1a4

HIP: add cmake option to enable compiler output of kernel resource usage metrics (llama/15103) 577f7e4

cmake: Add GGML_BACKEND_DIR option (llama/15074) 6e460b6

HIP: add GGML_HIP_MMQ_MFMA option to allow disableing the MFMA path. (llama/14930) f9dbd96

ggml-cpu : disable GGML_NNPA by default due to instability (llama/14880) cac085c

musa: upgrade musa sdk to rc4.2.0 (llama/14498) a687ec3

ggml: Add initial WebGPU backend (llama/14521) 0dd208f

sync : resolve conflicts (#0) 5ec49ef

sync : resolve conflicts (ggml/0) 497add0

ggml : support bcast ggml_soft_max_ext, ggml_flash_attn_ext (llama/14435) ebacb3e

ggml : add version function to get lib version (ggml/1286) 880f633

ggml-cpu: enable IBM NNPA Vector Intrinsics (llama/14317) fea8f94

HIP: disable rocwmma on gfx12 by default until rocm 7.0 (llama/14202) f95736f

ggml : disable warnings for tests when using MSVC (ggml/1273) 1669c07

examples : include examples in msvc disable warn (ggml/1270) 0c191be

ggml-cpu : split arch-specific implementations (llama/13892) 8c833e9

llama : allow using mmap without PrefetchVirtualMemory, apply GGML_WIN_VER to llama.cpp sources (llama/14013) f0a0ac8

vulkan: use timestamp queries for GGML_VULKAN_PERF (llama/13817) 56ddc5b

ggml : riscv: add xtheadvector support (llama/13720) 4790d5d

sycl: use oneDNN for matrices multiplication (llama/12972) 2008e08

whisper: remove MSVC warnings pragmas (#3090) e0d130c unverified

whisper : add check that target name exists (#3103) 60ff3ed unverified

ggml : suppress Windows compiler warnings (#3075) 887f7a2 unverified

ggml : add SSE 4.2 and x64 base variant for CPUs without AVX (llama/12871) f8795d3

CUDA/HIP: Share the same unified memory allocation logic. (llama/12934) 143cb70

ggml : sync/merge cmake,riscv,powerpc, add common.cmake (ggml/0) f695cbf

SYCL: using graphs is configurable by environment variable and compile option (llama/12371) c18969f

opencl: use OpenCL C standard supported by the device (llama/12221) 57028a7

cmake: Enable specifying exact PowerPC CPU architecture (ggml/1138) aac4d16

ggml : add logging for native build options/vars (#2935) aaf8a91 unverified

ggml-cpu: Faster IQ1 mul_mat_vec on AVX2 using BMI2 instructions (llama/12154) 05466a9

HIP: implement FlashAttention via rocWMMA for CDNA and RDNA3+ (llama/12032) a027c1d

CUDA: compress mode option and default to size (llama/12029) 4ec988a

cmake: Fix ggml backend dependencies and installation (llama/11818) c6c2a2c

Told cmake to install ggml-cpp.h as a public header file. (ggml/1126) 3d4f29c

ggml-cpu: Support s390x SIMD Instruction Set (llama/12019) 4aa54ec

CUDA: app option to compile without FlashAttention (llama/12025) fbc5f16

ggml-cpu: Add CPU backend support for KleidiAI library (llama/11390) 9de6d81

cmake: Add ability to pass in GGML_BUILD_NUMBER (ggml/1096) 729db34 unverified

cmake: add ggml find package (llama/11369) ca6577f

Hip: disable VMM on hip as it seams that it dosent work in some configurations (llama/11420) 2cc4df4

hip : Add hipGraph and VMM support to ROCM (llama/11362) 089afa0

cmake : avoid -march=native when reproducible build is wanted (llama/11366) 3cae2d9

fix: ggml: fix vulkan-shaders-gen build (llama/10448) ad8f031

GGUF: C++ refactor, backend support, misc fixes (llama/11030) 21c5b64

ggml : do not install metal source when embed library (ggml/1054) 9615cf2

ggml : fix arm build (llama/10890) e58e7a9

Introducing experimental OpenCL backend with support for Qualcomm Adreno GPUs (llama/10693) 83a0899

remove CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS (llama/10797) b38cecf

ggml : add predefined list of CPU backend variants to build (llama/10626) 1794b43

ggml: initial IBM zDNN backend (llama/14975)

449e1a4

HIP: add cmake option to enable compiler output of kernel resource usage metrics (llama/15103)

577f7e4

cmake: Add GGML_BACKEND_DIR option (llama/15074)

6e460b6

HIP: add GGML_HIP_MMQ_MFMA option to allow disableing the MFMA path. (llama/14930)

f9dbd96

ggml-cpu : disable GGML_NNPA by default due to instability (llama/14880)

cac085c

musa: upgrade musa sdk to rc4.2.0 (llama/14498)

a687ec3

ggml: Add initial WebGPU backend (llama/14521)

0dd208f

sync : resolve conflicts (#0)

5ec49ef

sync : resolve conflicts (ggml/0)

497add0

ggml : support bcast ggml_soft_max_ext, ggml_flash_attn_ext (llama/14435)

ebacb3e

ggml : add version function to get lib version (ggml/1286)

880f633

ggml-cpu: enable IBM NNPA Vector Intrinsics (llama/14317)

fea8f94

HIP: disable rocwmma on gfx12 by default until rocm 7.0 (llama/14202)

f95736f

ggml : disable warnings for tests when using MSVC (ggml/1273)

1669c07

examples : include examples in msvc disable warn (ggml/1270)

0c191be

ggml-cpu : split arch-specific implementations (llama/13892)

8c833e9

llama : allow using mmap without PrefetchVirtualMemory, apply GGML_WIN_VER to llama.cpp sources (llama/14013)

f0a0ac8

vulkan: use timestamp queries for GGML_VULKAN_PERF (llama/13817)

56ddc5b

ggml : riscv: add xtheadvector support (llama/13720)

4790d5d

sycl: use oneDNN for matrices multiplication (llama/12972)

2008e08

whisper: remove MSVC warnings pragmas (#3090)

e0d130c
unverified

whisper : add check that target name exists (#3103)

60ff3ed
unverified

ggml : suppress Windows compiler warnings (#3075)

887f7a2
unverified

ggml : add SSE 4.2 and x64 base variant for CPUs without AVX (llama/12871)

f8795d3

CUDA/HIP: Share the same unified memory allocation logic. (llama/12934)

143cb70

ggml : sync/merge cmake,riscv,powerpc, add common.cmake (ggml/0)

f695cbf

SYCL: using graphs is configurable by environment variable and compile option (llama/12371)

c18969f

opencl: use OpenCL C standard supported by the device (llama/12221)

57028a7

cmake: Enable specifying exact PowerPC CPU architecture (ggml/1138)

aac4d16

ggml : add logging for native build options/vars (#2935)

aaf8a91
unverified

ggml-cpu: Faster IQ1 mul_mat_vec on AVX2 using BMI2 instructions (llama/12154)

05466a9

HIP: implement FlashAttention via rocWMMA for CDNA and RDNA3+ (llama/12032)

a027c1d

CUDA: compress mode option and default to size (llama/12029)

4ec988a

cmake: Fix ggml backend dependencies and installation (llama/11818)

c6c2a2c

Told cmake to install ggml-cpp.h as a public header file. (ggml/1126)

3d4f29c

ggml-cpu: Support s390x SIMD Instruction Set (llama/12019)

4aa54ec

CUDA: app option to compile without FlashAttention (llama/12025)

fbc5f16

ggml-cpu: Add CPU backend support for KleidiAI library (llama/11390)

9de6d81

cmake: Add ability to pass in GGML_BUILD_NUMBER (ggml/1096)

729db34
unverified

cmake: add ggml find package (llama/11369)

ca6577f

Hip: disable VMM on hip as it seams that it dosent work in some configurations (llama/11420)

2cc4df4

hip : Add hipGraph and VMM support to ROCM (llama/11362)

089afa0

cmake : avoid -march=native when reproducible build is wanted (llama/11366)

3cae2d9

fix: ggml: fix vulkan-shaders-gen build (llama/10448)

ad8f031

GGUF: C++ refactor, backend support, misc fixes (llama/11030)

21c5b64

ggml : do not install metal source when embed library (ggml/1054)

9615cf2

ggml : fix arm build (llama/10890)

e58e7a9

Introducing experimental OpenCL backend with support for Qualcomm Adreno GPUs (llama/10693)

83a0899

remove CMAKE_WINDOWS_EXPORT_ALL_SYMBOLS (llama/10797)

b38cecf

ggml : add predefined list of CPU backend variants to build (llama/10626)

1794b43