JohannesGaessler
CUDA: use tensor cores for MMQ (llama/7676)
78a5b67