JohannesGaessler's picture
CUDA: revise q8_1 data layout for mul_mat_q (llama/7824)
fcfd59e