CUDA: fix partial offloading for ne0 % 256 != 0 (llama/8572) afc137c JohannesGaessler committed on Jul 18, 2024
whisper : reorganize source code + improve CMake (#2256) f75c2e3 ggerganov committed on Jun 26, 2024